Sgr022910 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr022910
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: SAGA-Tad1 domain-containing protein
Location: tig00000729: 981262 .. 982524 (+)
RNA-Seq Expression: Sgr022910
Synteny: Sgr022910
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCACAGATAGTGAAGAAGCTCGGAACCGATCGGTCAAAATGGTATTTCTTTTACTTGAATAGGTTCTTGTGTCAGAAGCTGAGTAAGAATGAGTTTGATAAGATATGTTGTCGTGTGCTTGGAAGGGAGAATCTTCGGCTGCATAATCAATTGATACAGTCAATCTTGAAAAATGCATGCCAAGCTAAGGCTGCACCACCGATGCCTGTAGCAGGCTATCCGAAAACTTCAACACAGTCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGACGGTGGAGCTGTTTTTCCTACTTCCACTCAAGGTATTCCCATTTGGTCTAATGGAGGTTTTCCAGTGTCCCCAAGAAAGAGCAGGTCTGGGATACACGACCGCAAACTCAAAGACAGACCGAGTCCTCTAGGTCCAAACGGGAAGGTTGAATGTATCTCACATCAATCAGCAGGCAAGGAAGATGGCAGCTGTAAAATCATGATGGATAATGGTGATGCAACTCTGTGTGACTATCAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTCAAAACAATATTGAGTCTAGAGTTCAGCAACCAGCAGGAAAGCAAGTTCTACACAGTAAGATCCAGGTTGAAGGAACTAAAGTTGAGGACAGGGAAGAAGCGGGACAGTCAAACCATTCGAGTTTACTTCGAAGTCTCTTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATTGGTGGGGCCCGCAAAGCGAGGCCTGTGGATTGTGGGGGTGATTTTGTTAGCTTTAGCGATATTGGTCATCTGTCAGATACCGAGTCGTTGAGACGCCGCATGGAACAAATTGCTGCAGTACAGGGGCTTGGTAGCGTCTCTGCAGATTGTGCTAATATCTTAAATAAAGTGTTGGATGTATATGTGAAGCAGTTAATTAGGTCCTGCGTTGACTTGGTTGGAGCATGGCCTACATGTGAGCTTGAGAAGCCTCTTGCTCATAAGCAGCAGATTCAAGGGAAGGTTATGAATGGCATGTTGCCGAATAATCAATTACACATGCGACATGGCAATGGAAACGGAGAAATTATGCACGAGCACAGATTACGTTGCTCGGTATCATTGCTTGATTTCAAAGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCTACTGGAGAAAATCCGTATGCGTGCATTTGAGGAATAA

mRNA sequence

ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCACAGATAGTGAAGAAGCTCGGAACCGATCGGTCAAAATGGTATTTCTTTTACTTGAATAGGTTCTTGTGTCAGAAGCTGAGTAAGAATGAGTTTGATAAGATATGTTGTCGTGTGCTTGGAAGGGAGAATCTTCGGCTGCATAATCAATTGATACAGTCAATCTTGAAAAATGCATGCCAAGCTAAGGCTGCACCACCGATGCCTGTAGCAGGCTATCCGAAAACTTCAACACAGTCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGACGGTGGAGCTGTTTTTCCTACTTCCACTCAAGGTATTCCCATTTGGTCTAATGGAGGTTTTCCAGTGTCCCCAAGAAAGAGCAGGTCTGGGATACACGACCGCAAACTCAAAGACAGACCGAGTCCTCTAGGTCCAAACGGGAAGGTTGAATGTATCTCACATCAATCAGCAGGCAAGGAAGATGGCAGCTGTAAAATCATGATGGATAATGGTGATGCAACTCTGTGTGACTATCAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTCAAAACAATATTGAGTCTAGAGTTCAGCAACCAGCAGGAAAGCAAGTTCTACACAGTAAGATCCAGGTTGAAGGAACTAAAGTTGAGGACAGGGAAGAAGCGGGACAGTCAAACCATTCGAGTTTACTTCGAAGTCTCTTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATTGGTGGGGCCCGCAAAGCGAGGCCTGTGGATTGTGGGGGTGATTTTGTTAGCTTTAGCGATATTGGTCATCTGTCAGATACCGAGTCGTTGAGACGCCGCATGGAACAAATTGCTGCAGTACAGGGGCTTGGTAGCGTCTCTGCAGATTGTGCTAATATCTTAAATAAAGTGTTGGATGTATATGTGAAGCAGTTAATTAGGTCCTGCGTTGACTTGGTTGGAGCATGGCCTACATGTGAGCTTGAGAAGCCTCTTGCTCATAAGCAGCAGATTCAAGGGAAGGTTATGAATGGCATGTTGCCGAATAATCAATTACACATGCGACATGGCAATGGAAACGGAGAAATTATGCACGAGCACAGATTACGTTGCTCGGTATCATTGCTTGATTTCAAAGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCTACTGGAGAAAATCCGTATGCGTGCATTTGAGGAATAA

Coding sequence (CDS)

ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCACAGATAGTGAAGAAGCTCGGAACCGATCGGTCAAAATGGTATTTCTTTTACTTGAATAGGTTCTTGTGTCAGAAGCTGAGTAAGAATGAGTTTGATAAGATATGTTGTCGTGTGCTTGGAAGGGAGAATCTTCGGCTGCATAATCAATTGATACAGTCAATCTTGAAAAATGCATGCCAAGCTAAGGCTGCACCACCGATGCCTGTAGCAGGCTATCCGAAAACTTCAACACAGTCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGACGGTGGAGCTGTTTTTCCTACTTCCACTCAAGGTATTCCCATTTGGTCTAATGGAGGTTTTCCAGTGTCCCCAAGAAAGAGCAGGTCTGGGATACACGACCGCAAACTCAAAGACAGACCGAGTCCTCTAGGTCCAAACGGGAAGGTTGAATGTATCTCACATCAATCAGCAGGCAAGGAAGATGGCAGCTGTAAAATCATGATGGATAATGGTGATGCAACTCTGTGTGACTATCAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTCAAAACAATATTGAGTCTAGAGTTCAGCAACCAGCAGGAAAGCAAGTTCTACACAGTAAGATCCAGGTTGAAGGAACTAAAGTTGAGGACAGGGAAGAAGCGGGACAGTCAAACCATTCGAGTTTACTTCGAAGTCTCTTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATTGGTGGGGCCCGCAAAGCGAGGCCTGTGGATTGTGGGGGTGATTTTGTTAGCTTTAGCGATATTGGTCATCTGTCAGATACCGAGTCGTTGAGACGCCGCATGGAACAAATTGCTGCAGTACAGGGGCTTGGTAGCGTCTCTGCAGATTGTGCTAATATCTTAAATAAAGTGTTGGATGTATATGTGAAGCAGTTAATTAGGTCCTGCGTTGACTTGGTTGGAGCATGGCCTACATGTGAGCTTGAGAAGCCTCTTGCTCATAAGCAGCAGATTCAAGGGAAGGTTATGAATGGCATGTTGCCGAATAATCAATTACACATGCGACATGGCAATGGAAACGGAGAAATTATGCACGAGCACAGATTACGTTGCTCGGTATCATTGCTTGATTTCAAAGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCTACTGGAGAAAATCCGTATGCGTGCATTTGAGGAATAA

Protein sequence

MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENLRLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGIPIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGDATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHSSLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAVQGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGMLPNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE
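The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the function name `translate` and the choice to report unknown codons as `X` are illustrative assumptions, not part of the database record.

```python
# Standard genetic code (NCBI translation table 1), built from the
# conventional TCAG ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T are reported as 'X'
    (an assumption made for this sketch).
    """
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
cds_start = "ATGCAACCTCAGCAGAGCTTGAGAATTGAC"
cds_end = "GCATTTGAGGAATAA"

print(translate(cds_start))  # MQPQQSLRID
print(translate(cds_end))    # AFEE
```

Passing the full CDS string to `translate` should reproduce the protein sequence above if the record is internally consistent; the two fragments shown here match the first ten and last four residues.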
Homology
BLAST of Sgr022910 vs. NCBI nr
Match: XP_022132327.1 (uncharacterized protein LOC111005206 [Momordica charantia])

HSP 1 Score: 730.7 bits (1885), Expect = 7.1e-207
Identity = 369/421 (87.65%), Positives = 388/421 (92.16%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDLGELKSQIVKKLGTDRSK YFFYLNRFL QKLSKNEFDK+C RVLGR+NL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SPVIEDGNED GAV+PTSTQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNEDTGAVYPTSTQSI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           PIWSNGGFP SPRKSRSGI DRKLKDRPSPLGPNGKVECISHQSAGK+DGSCK+MM NGD
Sbjct: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           ATLCDYQRPVQHLQGVAELP+NNIE+R+ +PAGKQVL++KI  EGTKV DREEAG S HS
Sbjct: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LL+S LLAPLGIPFCSASIGGARKARP D GGDFVSFSDIGHLSDTESLRRRMEQIAAV
Sbjct: 241 GLLQSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWP-TCELEKPLAHKQQIQGKVMNGM 360
            GLGSVSAD ANILNKVLDVY+KQLIRSCV LVG  P  CE EKPL  K Q+QGKV+NGM
Sbjct: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTCPMPCEPEKPLTDKLQVQGKVINGM 360

Query: 361 LPNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLH RH NG+ E+MHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE
Sbjct: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
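
The percentages in an HSP summary line are the match counts divided by the alignment length (gaps included). The sketch below, assuming the line layout used in this report, parses such a line and recomputes both percentages; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical choices for illustration.

```python
import re

# Matches the summary line of an HSP as it appears in this report.
# The optional "i" tolerates the "Postives" misspelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Parse an HSP summary line and recompute its percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, ident_pct, positive, pos_len, pos_pct = match.groups()
    stats = {
        "identical": int(identical),
        "positive": int(positive),
        "alignment_length": int(ident_len),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100 * int(identical) / int(ident_len), 2),
        "positives_pct": round(100 * int(positive) / int(pos_len), 2),
    }
    # Flag whether the reported percentages agree with the recomputed ones.
    stats["consistent"] = (
        abs(stats["identity_pct"] - float(ident_pct)) < 0.005
        and abs(stats["positives_pct"] - float(pos_pct)) < 0.005
    )
    return stats


line = "Identity = 369/421 (87.65%), Positives = 388/421 (92.16%), Query Frame = 0"
print(parse_hsp_stats(line))
```

For the Momordica charantia hit above, 369 identical positions over an alignment length of 421 gives 87.65%, and 388 positive positions gives 92.16%, in agreement with the reported values.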

BLAST of Sgr022910 vs. NCBI nr
Match: XP_023546134.1 (uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 724.2 bits (1868), Expect = 6.6e-205
Identity = 366/420 (87.14%), Positives = 385/420 (91.67%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSK YFFYLNRFL QKLSKNEFDK+CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AKISPVIEDGNEDGGAVFPTSTQGI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           PIWSN GFPVSPRK RSGI DRKLKDRPS L PN KVECIS QSA KEDGSC+IM+DNG+
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMLDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           AT CDYQRPVQHLQGV ELP+NNIE+RVQ+P+GKQVL  ++QVEGTKVEDREEA QSN S
Sbjct: 181 ATSCDYQRPVQHLQGVYELPENNIEARVQRPSGKQVL--QMQVEGTKVEDREEARQSNRS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
           SLLRS LLAPLGIPFCSASIGGA K RPVDCGG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGML 360
           QGLGSVSADCANILNKVLDVY+KQLIRSCVDLVGAWP  E EKPLAH QQIQGKV+NGML
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGML 360

Query: 361 PNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLH  H NGNGE++HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF E
Sbjct: 361 PNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 417

BLAST of Sgr022910 vs. NCBI nr
Match: XP_022997521.1 (uncharacterized protein LOC111492414 [Cucurbita maxima])

HSP 1 Score: 713.0 bits (1839), Expect = 1.5e-201
Identity = 362/420 (86.19%), Positives = 381/420 (90.71%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSK YFFYLNRFL QKLSKNEFDK+CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AKISPVIEDGNEDGGAVF TSTQGI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFATSTQGI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           PIWSN GF +SPRK RSGI DRKLKDRPS L PN KVECIS QSA KEDGSC+IMMDNG+
Sbjct: 121 PIWSNEGFSMSPRKCRSGIRDRKLKDRPSLLAPNLKVECISAQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           AT CDYQRPVQHLQGV ELP+NNIE+RVQ+P+GKQVL  ++QVEGTKVEDREEA QSN S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVL--QMQVEGTKVEDREEARQSNRS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
           SLLRS LLAPLGIPFCSASIGGA K RPVDCGG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGML 360
           QGLGSVSADCANILNKVLDVY+KQLIRSCVDLVG WP  E EKPLAH QQIQGKV+NGML
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGPWPVFEPEKPLAHNQQIQGKVINGML 360

Query: 361 PNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLH  H NGN E++HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF E
Sbjct: 361 PNNQLHRLHSNGNREVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 417

BLAST of Sgr022910 vs. NCBI nr
Match: KAG7029751.1 (hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 712.6 bits (1838), Expect = 2.0e-201
Identity = 362/420 (86.19%), Positives = 378/420 (90.00%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSK YFFYLNRFL QKLSKNEFDK+CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AKISPVIEDGNEDGGAVFPTSTQGI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           PIWSN GFPVSPRK RSGI DRKLKDRPS L PN KVECIS QSA KEDGSC+IMMDNG+
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           AT CDYQRPVQHLQGV ELP+NNIE+RVQ+PAGKQVL         +VEDREEA QSN S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
           SLLRS LLAPLGIPFCSASIGGA K RPVDCGG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGML 360
           QGLGSVSADCANILNKVLDVY+KQLIRSCVDLVGAWP  E EKPLAH QQIQGKV+NGML
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGML 360

Query: 361 PNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLH  H NGNGE++HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF E
Sbjct: 361 PNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 412

BLAST of Sgr022910 vs. NCBI nr
Match: XP_022962598.1 (uncharacterized protein LOC111463000 [Cucurbita moschata] >KAG6598811.1 hypothetical protein SDJN03_08589, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 711.4 bits (1835), Expect = 4.4e-201
Identity = 361/420 (85.95%), Positives = 378/420 (90.00%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSK YFFYLNRFL QKLSKNEFDK+CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AKISPVIEDGNEDGGAVFPTSTQGI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           PIWSN GFPVSPRK RSGI DRKLKDRPS L PN KVECIS QSA KEDGSC+IMMDNG+
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           AT CDYQRPVQHLQGV ELP+NNIE+RVQ+P+GKQVL         +VEDREEA QSN S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVLQ-------MQVEDREEARQSNRS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
           SLLRS LLAPLGIPFCSASIGGA K RPVDCGG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGML 360
           QGLGSVSADCANILNKVLDVY+KQLIRSCVDLVGAWP  E EKPLAH QQIQGKV+NGML
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGML 360

Query: 361 PNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLH  H NGNGE++HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF E
Sbjct: 361 PNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 412

BLAST of Sgr022910 vs. ExPASy TrEMBL
Match: A0A6J1BTJ5 (uncharacterized protein LOC111005206 OS=Momordica charantia OX=3673 GN=LOC111005206 PE=4 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 3.4e-207
Identity = 369/421 (87.65%), Positives = 388/421 (92.16%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDLGELKSQIVKKLGTDRSK YFFYLNRFL QKLSKNEFDK+C RVLGR+NL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SPVIEDGNED GAV+PTSTQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNEDTGAVYPTSTQSI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           PIWSNGGFP SPRKSRSGI DRKLKDRPSPLGPNGKVECISHQSAGK+DGSCK+MM NGD
Sbjct: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           ATLCDYQRPVQHLQGVAELP+NNIE+R+ +PAGKQVL++KI  EGTKV DREEAG S HS
Sbjct: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LL+S LLAPLGIPFCSASIGGARKARP D GGDFVSFSDIGHLSDTESLRRRMEQIAAV
Sbjct: 241 GLLQSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWP-TCELEKPLAHKQQIQGKVMNGM 360
            GLGSVSAD ANILNKVLDVY+KQLIRSCV LVG  P  CE EKPL  K Q+QGKV+NGM
Sbjct: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTCPMPCEPEKPLTDKLQVQGKVINGM 360

Query: 361 LPNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLH RH NG+ E+MHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE
Sbjct: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420

BLAST of Sgr022910 vs. ExPASy TrEMBL
Match: A0A6J1K7Q1 (uncharacterized protein LOC111492414 OS=Cucurbita maxima OX=3661 GN=LOC111492414 PE=4 SV=1)

HSP 1 Score: 713.0 bits (1839), Expect = 7.4e-202
Identity = 362/420 (86.19%), Positives = 381/420 (90.71%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSK YFFYLNRFL QKLSKNEFDK+CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AKISPVIEDGNEDGGAVF TSTQGI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFATSTQGI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           PIWSN GF +SPRK RSGI DRKLKDRPS L PN KVECIS QSA KEDGSC+IMMDNG+
Sbjct: 121 PIWSNEGFSMSPRKCRSGIRDRKLKDRPSLLAPNLKVECISAQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           AT CDYQRPVQHLQGV ELP+NNIE+RVQ+P+GKQVL  ++QVEGTKVEDREEA QSN S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVL--QMQVEGTKVEDREEARQSNRS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
           SLLRS LLAPLGIPFCSASIGGA K RPVDCGG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGML 360
           QGLGSVSADCANILNKVLDVY+KQLIRSCVDLVG WP  E EKPLAH QQIQGKV+NGML
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGPWPVFEPEKPLAHNQQIQGKVINGML 360

Query: 361 PNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLH  H NGN E++HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF E
Sbjct: 361 PNNQLHRLHSNGNREVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 417

BLAST of Sgr022910 vs. ExPASy TrEMBL
Match: A0A6J1HF85 (uncharacterized protein LOC111463000 OS=Cucurbita moschata OX=3662 GN=LOC111463000 PE=4 SV=1)

HSP 1 Score: 711.4 bits (1835), Expect = 2.1e-201
Identity = 361/420 (85.95%), Positives = 378/420 (90.00%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSK YFFYLNRFL QKLSKNEFDK+CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AKISPVIEDGNEDGGAVFPTSTQGI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           PIWSN GFPVSPRK RSGI DRKLKDRPS L PN KVECIS QSA KEDGSC+IMMDNG+
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           AT CDYQRPVQHLQGV ELP+NNIE+RVQ+P+GKQVL         +VEDREEA QSN S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVLQ-------MQVEDREEARQSNRS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
           SLLRS LLAPLGIPFCSASIGGA K RPVDCGG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGML 360
           QGLGSVSADCANILNKVLDVY+KQLIRSCVDLVGAWP  E EKPLAH QQIQGKV+NGML
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGML 360

Query: 361 PNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLH  H NGNGE++HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF E
Sbjct: 361 PNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 412

BLAST of Sgr022910 vs. ExPASy TrEMBL
Match: A0A5A7VF96 (SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003200 PE=4 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 2.8e-201
Identity = 362/420 (86.19%), Positives = 376/420 (89.52%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSK YFFYLNRFL QKLSKNEFDK CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAKISP++EDGNEDGGAVFPTSTQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           P WSNG   VSPRK RSGI DRKLKDRPS LGPNGKVECISH SA          MDNGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSAN---------MDNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           ATLCDY+RPVQHLQGVAELP+NNIE RV QP+GKQVLH+KIQVE TKVEDREEAGQSNHS
Sbjct: 181 ATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
           SLLRS LLAPLGIPFCSAS GG  K RPVDCGGDF SF D+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDF-SFGDVGHLLDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGML 360
           QGLGSVSADCANILNKVLDVY+KQLIRSCVDLVGAWP  E EKPLAHKQQIQGKV+NGML
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGML 360

Query: 361 PNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLH RH NGN E++HEHRL+CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF E
Sbjct: 361 PNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFGE 407

BLAST of Sgr022910 vs. ExPASy TrEMBL
Match: A0A1S3BCQ5 (uncharacterized protein LOC103488231 OS=Cucumis melo OX=3656 GN=LOC103488231 PE=4 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 2.8e-201
Identity = 362/420 (86.19%), Positives = 376/420 (89.52%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSK YFFYLNRFL QKLSKNEFDK CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAKISP++EDGNEDGGAVFPTSTQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           P WSNG   VSPRK RSGI DRKLKDRPS LGPNGKVECISH SA          MDNGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSAN---------MDNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
           ATLCDY+RPVQHLQGVAELP+NNIE RV QP+GKQVLH+KIQVE TKVEDREEAGQSNHS
Sbjct: 181 ATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHS 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
           SLLRS LLAPLGIPFCSAS GG  K RPVDCGGDF SF D+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDF-SFGDVGHLLDTESLRRRMEQIAAV 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGML 360
           QGLGSVSADCANILNKVLDVY+KQLIRSCVDLVGAWP  E EKPLAHKQQIQGKV+NGML
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGML 360

Query: 361 PNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLH RH NGN E++HEHRL+CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF E
Sbjct: 361 PNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFGE 407

BLAST of Sgr022910 vs. TAIR 10
Match: AT2G24530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 359.4 bits (921), Expect = 4.0e-99
Identity = 201/421 (47.74%), Positives = 263/421 (62.47%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQ  Q  RI L ELK  IVKK G +RS+ YF+YL RFL QKL+K+EFDK C R+LGRENL
Sbjct: 1   MQRSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHNQLI+SIL+NA  AK+ PP   AG+    +  A       DG E  G + P  +Q  
Sbjct: 61  SLHNQLIRSILRNATVAKSPPPDHEAGH----STKANAFQSRGDGLEQSGTLIPNHSQHE 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
           P+WSNG  P+SPRK RSG+ +RK +DRPSPLG NGKVE + HQ   +ED    + M+NG 
Sbjct: 121 PVWSNGVLPISPRKVRSGMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENG- 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
               DYQR  +++        +  +    +P  K  + +K ++    + D +   +    
Sbjct: 181 ----DYQRSGRYV-------ADEKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARV 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
           +L  S L+APLGIPFCSAS+GG+ +  PV    + +S  D G L D E LR+RME IA  
Sbjct: 241 NLSMSPLIAPLGIPFCSASVGGSPRTIPVSTNAELISCYDSGGLPDIEMLRKRMENIAVA 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPT-CELEKPLAHKQQIQGKVMNGM 360
           QGL  VS +CA  LN +LDVY+K+LI SC DLVGA  T  +  K    KQQ Q K++NG+
Sbjct: 301 QGLEGVSMECAKTLNNMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGV 360

Query: 361 LPNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
            P N L ++  NG+ +I  +H    SVS+LDF+ AMELNP+QLGEDWP L E+I +R+FE
Sbjct: 361 WPTNSLKIQTPNGSSDIRQDHH---SVSMLDFRTAMELNPRQLGEDWPTLRERISLRSFE 402

BLAST of Sgr022910 vs. TAIR 10
Match: AT4G31440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2; Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 282.0 bits (720), Expect = 8.1e-76
Identity = 182/421 (43.23%), Positives = 237/421 (56.29%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENL 60
           MQ  Q  RIDL ELK  IVKK+G +RS  YF+YL RFL QKL+K+EFDK C R+LGRENL
Sbjct: 1   MQRLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENL 60

Query: 61  RLHNQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGI 120
            LHN+LI+SIL+NA  AK+ P +  +G+P  S    K     EDG E+  ++ P   +  
Sbjct: 61  SLHNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLGK-----EDGPEESRSLNPDHIRND 120

Query: 121 PIWSNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGD 180
              SNG   V  +       DR ++D+P PLG NGKV                       
Sbjct: 121 LALSNG---VLAKVRPGTCDDRTIRDKPCPLGSNGKV----------------------- 180

Query: 181 ATLCDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHS 240
                Y RP ++         +  +S    PA ++ +  K QV      D E    +   
Sbjct: 181 LGPFAYSRPGRY--------PDERDSAFLCPAEQKAVSGKDQVAAPISRDDE----AQVR 240

Query: 241 SLLRSLLLAPLGIPFCSASIGGARKARPVDCGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            L    ++APLGIPFCSAS+GG R+  PV      +S  D G LSDTE LR+RME IA  
Sbjct: 241 ILSTPPVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLRKRMENIAVT 300

Query: 301 QGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELE-KPLAHKQQIQGKVMNGM 360
           QGLG VSA+C+ +LN +LD+Y+K+L++SCVDL GA        K    KQQ + +++NG+
Sbjct: 301 QGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQSRDELVNGV 360

Query: 361 LPNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
             NN  H++  N   +I  E     SVSLLDF+VAMELNP QLGEDWPLL E+I +  FE
Sbjct: 361 RTNNSFHIQTSNQPSDITREQH---SVSLLDFRVAMELNPHQLGEDWPLLRERISISLFE 375

BLAST of Sgr022910 vs. TAIR 10
Match: AT2G14850.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 156.4 bits (394), Expect = 5.1e-38
Identity = 129/416 (31.01%), Positives = 184/416 (44.23%), Query Frame = 0

Query: 8   RIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENLRLHNQLI 67
           R++  E+K+ I +K+G  R+  YF  L +FL  ++SK+EFDK+C + +GREN+ LHN+L+
Sbjct: 9   RLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRLV 68

Query: 68  QSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGIPIWSNGG 127
           +SILKNA  AK+ PP     YPK S                             ++ +  
Sbjct: 69  RSILKNASVAKSPPPR----YPKKS-----------------------------LYGDPV 128

Query: 128 FPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGDATLCDYQ 187
           FP SPRK RS    RK +DRPSPLGP GK + ++                  D ++   Q
Sbjct: 129 FPPSPRKCRS----RKFRDRPSPLGPLGKPQSLT---------------TTNDESMSKAQ 188

Query: 188 RPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQSNHSSLL--RS 247
           R                                + +E   VED EE  Q   S  +  RS
Sbjct: 189 R--------------------------------LPMEVVSVEDGEEVEQMTGSPSVQSRS 248

Query: 248 LLLAPLGIPFCSASIGGARKARPVDCGG-DFVSFSDIGHLSDTESLRRRMEQIAAVQGLG 307
            L APLG+ F   S     KAR     G +  +    G L D  +LR R+E+   ++G+ 
Sbjct: 249 PLTAPLGVSFHLKS-----KARFSTYNGINRETCQSSGELPDMITLRARLEKKLEMEGI- 291

Query: 308 SVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVMNGMLPNNQ 367
            +S D AN+LN+ L+ Y+++LI  C+ L                                
Sbjct: 309 KLSMDSANLLNRGLNAYMRRLIEPCLSLAS------------------------------ 291

Query: 368 LHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 421
                         + R   +VS+LDF  AME+NP+ LGE+WP+ LEKI  RA EE
Sbjct: 369 -------------QQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEKICCRASEE 291

BLAST of Sgr022910 vs. TAIR 10
Match: AT4G33890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 156.0 bits (393), Expect = 6.7e-38
Identity = 131/424 (30.90%), Positives = 194/424 (45.75%), Query Frame = 0

Query: 4   QQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENLRLH 63
           Q S R+D  E+K+ I +++G  R++ YF  L RF   K++K+EFDK+C + +GR+N+ LH
Sbjct: 5   QGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLH 64

Query: 64  NQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGIPIW 123
           N+LI+SI+KNAC AK+ P +   G              +  GN D       ++Q  P+ 
Sbjct: 65  NRLIRSIIKNACIAKSPPFIKKGG------------SFVRFGNGDS----KKNSQIQPLH 124

Query: 124 SNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGDATL 183
            +  F  S RK RS    RKL+DRPSPLGP GK   ++  +                   
Sbjct: 125 GDSAFSPSTRKCRS----RKLRDRPSPLGPLGKPHSLTTTN------------------- 184

Query: 184 CDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQ---SNHS 243
              +  +   Q   EL                 L S+  VE   VE+ EE  Q    + S
Sbjct: 185 ---EESMSKAQSATELLS---------------LGSRPPVEVVSVEEGEEVEQIAGGSPS 244

Query: 244 SLLRSLLLAPLGIPFCSASIGGARK--ARPVDCGGDF--VSFSDIGHLSDTESLRRRMEQ 303
              R  L APLG+   S   G  RK  +    C   F   +  + G L DT +LR R+E+
Sbjct: 245 VQSRCPLTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLER 304

Query: 304 IAAVQGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVM 363
              ++GL  ++ D  ++LN  LDV++++LI  C+ L      C  +              
Sbjct: 305 RLEMEGL-KITMDSVSLLNSGLDVFMRRLIEPCLSLANT--RCGTD-------------- 342

Query: 364 NGMLPNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMR 421
                      R    N +   + R    VS+ DF+  MELN + LGEDWP+ +EKI  R
Sbjct: 365 -----------RVREMNYQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSR 342

BLAST of Sgr022910 vs. TAIR 10
Match: AT4G33890.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 156.0 bits (393), Expect = 6.7e-38
Identity = 131/424 (30.90%), Positives = 194/424 (45.75%), Query Frame = 0

Query: 4   QQSLRIDLGELKSQIVKKLGTDRSKWYFFYLNRFLCQKLSKNEFDKICCRVLGRENLRLH 63
           Q S R+D  E+K+ I +++G  R++ YF  L RF   K++K+EFDK+C + +GR+N+ LH
Sbjct: 5   QGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLH 64

Query: 64  NQLIQSILKNACQAKAAPPMPVAGYPKTSTQSAKISPVIEDGNEDGGAVFPTSTQGIPIW 123
           N+LI+SI+KNAC AK+ P +   G              +  GN D       ++Q  P+ 
Sbjct: 65  NRLIRSIIKNACIAKSPPFIKKGG------------SFVRFGNGDS----KKNSQIQPLH 124

Query: 124 SNGGFPVSPRKSRSGIHDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCKIMMDNGDATL 183
            +  F  S RK RS    RKL+DRPSPLGP GK   ++  +                   
Sbjct: 125 GDSAFSPSTRKCRS----RKLRDRPSPLGPLGKPHSLTTTN------------------- 184

Query: 184 CDYQRPVQHLQGVAELPQNNIESRVQQPAGKQVLHSKIQVEGTKVEDREEAGQ---SNHS 243
              +  +   Q   EL                 L S+  VE   VE+ EE  Q    + S
Sbjct: 185 ---EESMSKAQSATELLS---------------LGSRPPVEVVSVEEGEEVEQIAGGSPS 244

Query: 244 SLLRSLLLAPLGIPFCSASIGGARK--ARPVDCGGDF--VSFSDIGHLSDTESLRRRMEQ 303
              R  L APLG+   S   G  RK  +    C   F   +  + G L DT +LR R+E+
Sbjct: 245 VQSRCPLTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLER 304

Query: 304 IAAVQGLGSVSADCANILNKVLDVYVKQLIRSCVDLVGAWPTCELEKPLAHKQQIQGKVM 363
              ++GL  ++ D  ++LN  LDV++++LI  C+ L      C  +              
Sbjct: 305 RLEMEGL-KITMDSVSLLNSGLDVFMRRLIEPCLSLANT--RCGTD-------------- 342

Query: 364 NGMLPNNQLHMRHGNGNGEIMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMR 421
                      R    N +   + R    VS+ DF+  MELN + LGEDWP+ +EKI  R
Sbjct: 365 -----------RVREMNYQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSR 342
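
The percentages in the HSP statistics line above are the match counts divided by the alignment length (131 and 194 out of 424 aligned columns). As a minimal sketch, assuming the statistics line keeps the `label = matches/length (percent%)` layout shown in this record, the values can be re-derived as follows. The names `HSP_STATS`, `parse_fractions` and `percent` are hypothetical helpers introduced for illustration and are not part of any BLAST tool.

```python
import re

# Statistics line for the AT4G33890.2 HSP, copied from this record.
HSP_STATS = "Identity = 131/424 (30.90%), Positives = 194/424 (45.75%), Query Frame = 0"


def parse_fractions(line):
    """Return {label: (matches, aligned_length, reported_percent)}.

    Picks up every field of the form 'label = a/b (p%)'; fields without a
    fraction (such as 'Query Frame = 0') are ignored.
    """
    pattern = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")
    return {
        label: (int(a), int(b), float(p))
        for label, a, b, p in pattern.findall(line)
    }


def percent(matches, length):
    """Percentage of aligned columns that are counted as matches."""
    return 100.0 * matches / length


stats = parse_fractions(HSP_STATS)
for label, (matches, length, reported) in stats.items():
    computed = percent(matches, length)
    print(f"{label}: {matches}/{length} = {computed:.2f}% (reported {reported:.2f}%)")
```

The alignment length (424) counts gap columns as well as aligned residues, which is why it is larger than the 418 query residues spanned by the HSP (positions 4 to 421).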

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022132327.1  7.1e-207  87.65     uncharacterized protein LOC111005206 [Momordica charantia]
XP_023546134.1  6.6e-205  87.14     uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]
XP_022997521.1  1.5e-201  86.19     uncharacterized protein LOC111492414 [Cucurbita maxima]
KAG7029751.1    2.0e-201  86.19     hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyro...
XP_022962598.1  4.4e-201  85.95     uncharacterized protein LOC111463000 [Cucurbita moschata] >KAG6598811.1 hypothet...

Match Name  E-value   Identity  Description
A0A6J1BTJ5  3.4e-207  87.65     uncharacterized protein LOC111005206 OS=Momordica charantia OX=3673 GN=LOC111005...
A0A6J1K7Q1  7.4e-202  86.19     uncharacterized protein LOC111492414 OS=Cucurbita maxima OX=3661 GN=LOC111492414...
A0A6J1HF85  2.1e-201  85.95     uncharacterized protein LOC111463000 OS=Cucurbita moschata OX=3662 GN=LOC1114630...
A0A5A7VF96  2.8e-201  86.19     SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6...
A0A1S3BCQ5  2.8e-201  86.19     uncharacterized protein LOC103488231 OS=Cucumis melo OX=3656 GN=LOC103488231 PE=...

Match Name   E-value  Identity  Description
AT2G24530.1  4.0e-99  47.74     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G31440.1  8.1e-76  43.23     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G14850.1  5.1e-38  31.01     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G33890.1  6.7e-38  30.90     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G33890.2  6.7e-38  30.90     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                             Source       Source Term     Source Description                                     Alignment
None       No IPR available                                            COILS        Coil            Coil                                                   coord: 415..420
None       No IPR available                                            MOBIDB_LITE  mobidb-lite     disorder_prediction                                    coord: 124..157
None       No IPR available                                            MOBIDB_LITE  mobidb-lite     disorder_prediction                                    coord: 135..149
None       No IPR available                                            PANTHER      PTHR21277:SF38  TRANSCRIPTIONAL REGULATOR OF RNA POLII, SAGA, SUBUNIT  coord: 1..420
IPR024738  Transcriptional coactivator Hfi1/Transcriptional adapter 1  PFAM         PF12767         SAGA-Tad1                                              coord: 5..333; e-value: 9.1E-63; score: 212.1
IPR024738  Transcriptional coactivator Hfi1/Transcriptional adapter 1  PANTHER      PTHR21277       TRANSCRIPTIONAL ADAPTER 1                              coord: 1..420

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr022910.1   Sgr022910.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0045893      positive regulation of transcription, DNA-templated
biological_process  GO:0006357      regulation of transcription by RNA polymerase II
cellular_component  GO:0000124      SAGA complex
cellular_component  GO:0070461      SAGA-type complex
molecular_function  GO:0003713      transcription coactivator activity