Sgr020877 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr020877
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Trihelix transcription factor GT-2
Location: tig00153574: 823149 .. 830240 (-)
RNA-Seq Expression: Sgr020877
Synteny: Sgr020877
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACCTCTTCACCGCCGATCATCGGATAACCAGCTCCGACGACTTCCCGCAGCACGTAGCGCCATTTCCCGATCCCACGGACCTGCTTTACGCCGCTCCATCGGCCGTATTTCCCCCGGCCGATATTATCGACCACTGTCCGAACCCTCCACCGCCGCCGCAGAAGCTTCGTCCCATCCGGTGCAACGGCAGGTCCCCGGCGGGTTCTCAGGCCGAGAACATCTTCGACGGAGCGCTAAGAAATTTCCAGGGCATACCGTCGTCGCCAGAGGGTGGATTTACTGGTGATCAGCTCTGTGTGGCTAATATTGACCCGAGTGAGTACTTCAATTCCTCGAAGAAGGAAAAGCCTGTTGAGGTCGCGATGGATAATGGCGGCTTCGGGGATATCATCGGAAACAATTATTTCTCGGAGGAAGAGACGAAGGACGGCGGTTCGGGTGCAGTTATCGCTGTGGAGAATTTAAGCCGGAGAGGGGAAGGACCTCAATTAGATGACGATTCTTGGTGAGTTTGAACTCTGAACATCACCTCATAAATTGGGTTTTTTTTTTTTGGTGAACTGTTAAACTTTATTTACTTGGATTATTTTTTGAGTTGCTGGGTGCTTGGAACTTTTTAGGGAATTGCAATTAATTTATGATGGTAAATTCTGAATTCTTATGGAAAAGTGAAGCATTAAAGAAAAAGCTTCCCCAGAAAGTGATCCTTATAAGATATGAAAAAATTTTGTCCTTCAGCTTGGTGGTTTCCTTGGGCTCTCTTCATTCAAATTATGTTTTCTTTCTTTTAATTCTCAAAAGAAAGAAAAGAATAATTTATTTCACCAGATTTTTCGACTAAATATTACTTTCATCCTTGAGGACCCTCTCTCTCTCTTCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAAAAAAAAAAAATATATATATATATATATAATAGAAAAGTAAAAGATACAATGAGGAGATTCCCAAAAAACCCAATTTGCATTAATAAGAAACACAGAGTAATTACAAAACCCTTTTAAGAGAGTGCACCAATTTGAAGTAACAAAATGACAAGGTCAACAAAATCATAAAAGATTCTTCTTTTTCTCTCCTTTTTAAAAGTTCCCATCACTAGCAATCATGATCTTGTCCCAGGAAATACTAGTTTTCCTTTTGAAACCGTGGCCAAGGATAGAGGTTGTTCCAAGAAATGGATATGTACAGAAGAGTTCACAACTACCCAAAGCTTGTGAGAGAGTTATGGTAATTAGATGTTTCTCTAAATACTTAATAGGCTAGAAAAAGACACAAAACTAAAACAACAATCGGAATGAAGAGATGGGGGAAAAAAAACAAACATTATATGTTTACCTCCTTTCTATGGGATTTAAAATGTAGAAGTGCAAGTATGTGTTTCTGATTACCGAACTGGTAGGTTTTTATTCTTGAGAAATGAAATCTTGAAAAATCTTGTGAATTCTTTTGGTCCTGGTGAAGCATTTTCTGATACAAGTAGTATGGTTACTCATGGTTTTTCTTTTTGTGGTGGTCACCTGCCATTACTGTAGCAATGGGTTGGGGATAGAGTGCATAGAACAGGAAGTTCTTATACTGTTATGGTGGTTTCTTTTCATTGCAGTAATTTGAGACTTGTGTAAGGCATGGGAAGTCATTCCATTTTTAAAAAATTTTGCTTTGTATTGCCTTTTTCTGAAAAGGGAACAGAACTTTTAATTGCTAATTGGAAAGTTACAAATTCATAACAATATATTCTCCATTTTAGAAGAATACAACCCCAACCCCCCCCCAAACCACACACGCACACACACAAAAAAGAAGCAAATAAACAAGATCAAACCGCTTTCTATATCGACAAAATACTATCAATCTTTCCCATTTTACAGTATAACATCACATTCTTTATATAGTTAAGCTCAAAATTAGATTAACTAACTCCTTGCATCCTTTAGAAATTTTTTAATTTTTTTCCTCCTGCCACTAGAAATTG
GTGTTAGATACACCCAGTCCTCTCCAAATTGAATTTTTTAACTGATCATTGAGATCCACCCCATTCAAACTTGGAAGCTTGACCAAACAATCTAGGTCTTCTTCCACCATTTCACTTTGACCAAGGTCTTTTCCATAGCTGATTAACTCCCCCTTTTCATTTTCCCCAAAATTAGGGCCAGACATATTATTAACTCCTTGTCAAGGCTGAAACTATCCTCTTCAACACAAGTTGTTCTATGGAATTTCTTTCCTTATATTTTTTAATATAAACAAAGCAGCCCATGTAAAATTATGATTGAAGTTTTATCGAGCAAGCAGGATACATATGTAGTTCAGCTCCAGGAAATTTTGTGGATTGTGGTTGCTTATGTAGTGAGTTCAACTCTTTTGAAATGAAATTATTTGATTGCTTGTTTCCAGTTCGACTTCAGATGTTGGTGATGACATTGTAAGCACAAAAAAACCTTTAAATCATAAGAGAAAGAGAACAAGATCGCTCGAGCTTTTTGTGGAAAATTTGGTAATGAAGGTATTGGATAAACAGGAGCAGATGCATCAGCAGTTGATCGATATGATAGAAAAGAAGGAGAAGGAAAGAATAGTCAGAGAAGAAGCTTGGAAGCAGAGGAGATTGAAAGAATCAGAAGGGATGAGGAGTTAAGAGCCCAAGAAACAACTCGCAGTTTAGCAATTATTTCCTGATCCAAAATTTGCTGGGCCATGAAATTCAAATCTCCCAACCAGTTGAAAACCAATGTACAGAAGATGATGGAGGTGAAAGCAGCATTCAGAAGGAATTAAAAAGTGATCTTAGTAGGAGATGGCCTCAGGCTGAAGTACAGGCCTTGATATCACTACGAACGTCTCTGGAACATAAATTTCGTGCTACAGGCTCAAAAGGATCAATATGGGAGGAGATATCAGTTGAGATGCATAAGATGGGTCACAACCGCTCGGCGAAGAAGTGCAAAGAAAAATGGGAAAATATGAACAAATATTTCAAAAGGACAATTGGAACCGGGAAGACTAGTATTACAAATGGTAAGACATGTCCTTATTTTCAAGAATTAGATATTCTTTATAGAAATGGAGTGGTAAATACCGGAGCTGTCTTTGATAATACAAATACTGAAAATAAGTCCAAGTCTGAAAAACGTATAGACCTTTTTCATGAAGAGACCTTCATACAAGAAGAATGAGAATGAGAGCCTCAAGGTGAAAGAGAGCATATATATAACAGTAGGCCTTGGACATTATACAATTTTAAAGAGATGTATGTACACTACAGATTCCTTCGGTCGGTTTTCACAGCCCAGATTTCTGATAACGCAGTTTCTTCATGTTTACCCTTGGTCCATTCATGGCTGCTTCCCCAATCCGCTTGCATTTCGGAATTTAAACTCATGGATGTTTCAGACAATTGTGCTCCTATGGTAACACAGAGAAATGCTATCTTTTCAGTTCACATTAGCAGAAAATAATAGCAGTTGGTTGATGATACTCTTGAATGAAGTTTCTTACCTGATAATGCTCGAAATTCTTGTAAGCAAATGGATAGTAGCTGAAGACATAACATCTTTATTCTCATCGATTTGTTTGGATTTGCTGCTTAAATTCACGGGTTATGCGATCAGTTAGTTTGTAACCAGAAATAGGTTGACATTGATCCTCTATTTTCTTTAATTGAGGTCATTTCAGGACATCTAACGTTATTTAGCTTTGATTTTTTTTTTTTTTTAATATCATATCCCATAGATGGAGATTTGAACTTATGATCTCTTAAAGAAGTAATAATACCTTAATTATGAACTATGTTTATGTTAATTGGTAGCTTTGATTTGAAAAGTCACTATAGGATTTGCTTATTATTACTTGGGAGAAAATTATTACTTCCTGAATTGTGATGTTAATTTCTACGGTTGAAATAGCTAATTTTTCCATTAATATATTTAAAAACATTGCAAAATATTCCCTTAAAACTTAAAAAGTATCGAGAAA
GGTAACAATTTTATCAAAATTCTACCTTTTTTGGGTTATAAAACTGTAGATATAATGATTTAACTATGATCAATTTGGTAAGTTCAATTCATTAAAACTAATTGATGAAAGAAATATTGATTACATGAGTTAATATCAGAAATAATTACATGAAAGCTTATTAATATATTAGACAAAATGTTAACAAGCTGTTTATGAATTATGAGGTGACATGTCAATGTCGGTATTCCAATTAATATTTCTATAAATTGGACAGTTCAATAATTCTACTGCATTCAATACCCAATAATAGAGACATTAAATTACACGTATAGAAAATAAAGTTTAATATATGCCCCTTTTTTGGCAAAAAAACAATTATACCCCTACAAAAAATCCAGCTGTAAAATGGACTTTCTTATTAACTTTTTGAATCACTCTGAAACGACACCGTTTCATCCACCACTTTCAGTTTCCCGCCACAGAAGAGTATGCGATTATTTGAATTCAGACAGCGCCGAACACAGAGCTGCAGAAGCCGCCACGTGTCTACTCCGGCGCAAACCTTTTAAAATCCAATCTCTTCGCTCCATCTCCTCCGATTCAAAACCCTAAAAATGCGCTGAATCAAACACTGCCAATCTTAGTTGAAGCTCTGGTAGTTGATACCTCCCTCTTAAAGATCTCAAGTTTTCCATCCATGGCCTCCGTCGTCAAGCCCTCTTCGCGCTACAGTTCCTACGATGTGCGCTCTTCAACTTCCTCCCACTTCTCCGACCCTTCTTCTTCCTCCGAGTTCAAGCTCAAGTCCCCCATGGCAGCCAACTCGTCGTCTTCCCGCGCTCTTGTTAAGTGCAAGGCGTCTGATCTGGCTAGAGGCAAATCGAAGCCGTCTGATCAGAACTTGACGGCGATGGTGAAGAAGTTCATGGAGAAACGCTCTGGTTTGAAGCCGAAGACGGCGAAGCAGGCGACGGGGTTGGTGATCCCGTCGGATTTGATTGCGGAGGATTGAAAAGACGGCGAGGAAGGGGACGAACTTCGGTGGGCTGCACAAGAAGCTGTTTGGGAAGGGAACTGCGGCGGTGGAGAAGAAGGAGAAGGAGACGGAGGCGAAGGCGTTGACGGAAGTGAAAGGGAATACGAGGACATTGGCGATGGTGCTGAGAAGTGAAAGAGAGCTTTTGAGTTTGAATAAGGAGCAGGAGTTGGAAATCACTGAGCTCAAATTAGTTCTGGAAGAGAAGTACAGAGAAGTGAGGATACACTTCTTAATTTTACTCTATTTTTGGAAAATTTTCATTTTTTACTGCTGTATTGGTTGTGTTTAGTCCATTTTTAATCTTAAAACCTTACAATTTTATTAAATGAAATCTTAAATTTGAATCAGTAGTAAAATTAGTACACCCAATTAGTTCAGTTTTCACTTTTTAAGATTTATAAGCTAGGAATTATTGTAGTTAGACAAAACTTCTTTTTAACTAGTTTTTCTAGTCATGTTAGATATCCTTTAACTGGATTTTCACCACTCTTTAACTGGAGTAGGTTTGGATATTTTTTATGGTTGAGTTTAGTCATTTCTAAAGTTTGGTTTTTTTTTTTTTTTTTTTTTTTTGAGTATTCTAAAGTTTGGTTTTGATTTTGAGTGTTTTATTTGGATTTAAAAAAAAAAGAGATAAAAAAGAAAATTTGTGAGCAGTGAATAATGTGCCAGCATTTTTTCACATATTGTTTGTTAACACAATCCTGTTGCTTATTGTTGATGTAATCAGATTGAGAAGTTGAAGGATTTATGTTTGAAGCAAAGAGAAGAAATAAAGTCATTGAAAAATGCAATATTGTTCCCAGATGTTATGAATTCTCAGCTTCAAGAGCTGCTTGAAAAGCAGGATTCAGAGTTGAAGCAAGCCAAACAAGTCATCCCTACTCTGCAAAAGCAGGTCACTACTCTCACTGGCCAGCTTCATTCCCTCGCCGAGGACCTTGCCGAGGTACGAAAGAATTTAGTATTAACTTTCTAT
TTAACAAAAATCCTTGAAAAGCATTAAAAGTAAGACCTGGGAAGCGGGAAGACTCTATTTGGTCGATCACCAAGGGGCTTTGTTGCTAATGCTTTTAACAATAGGATGATCCAGCTCTGCTTTAGTCCTCTGAGAGCTTCCATTTCTTTCATATGAACTATGAAATTACTGCTCAAAAGTCAAAACACAGCCTCTAATGAAATTTGGACTGTGCATGATTTATTTCTGGTACCTAGTAACTGCTATTGAATGGTATTGGTTGCTTTCTGTGTTGATTAATTTGAAAAACTGAAGTGTTAGCTGATAGCACTACTGGAATTTTGTTTTGAGCAGGTGAAGGCAGATAAATATTCAGGAAAGGCCTGGTTGCAAAATAATAGCAGTTCTCCTCACACACCAACATATGATCACGAGGATGCTTCTAACTCTTTGGTAAGGACAAAAGATATCCCTCAATCCTCATGTCTCATCTTTTAGTGTTGAGCATATATATCCTGTTACGGCTGCTCAAATGGGAAGCTTGAGAGTATGGCTAAATTATTCCTCAAATTAAGAGTTTAGGAAGGTAAGAAGGCGCATCCTAATAATTAAAATCATCATAGGACCTAGCTTTACCAAAGAAAGTCTATTTATCTACTAGATGCAAAATCACTTATCAAGCATTTATGTAATTCAGGAGTTCAGTGCCTGCGATCCAGCATCCCCTGGCAGTCCAGATGACTTTTTGCTGAAGGATGTGAATCCCTGTCTAACACCCTATTATGCAACTAAATCCAAGGTATGACTGCCCATTTCTGCTTTAATTTCTCAAGTGTTCATGCAGAGTGTATTCCCAGCCAGTCTAGAGTTATTCATCAAACTAATTAACTGAACTCATTTCTACAGGAGTTTGAGGCAATGGGATATGATTCTCCGCGAGATGAAATCTTATCCCACAACAGAATGGAATCTGGTTTTAAATCTTGTTCCAGAAAATTGTCCAAAAGTTCTGATTGCAGGCAGAATTCCAACAAACCAAACACTACAAAAACAGCCCGAAGATCTGATGAAGCCAAATACACATATGGAAAGCCAATGCGTAAATTTTACTGA

mRNA sequence

ATGGACCTCTTCACCGCCGATCATCGGATAACCAGCTCCGACGACTTCCCGCAGCACGTAGCGCCATTTCCCGATCCCACGGACCTGCTTTACGCCGCTCCATCGGCCGTATTTCCCCCGGCCGATATTATCGACCACTGTCCGAACCCTCCACCGCCGCCGCAGAAGCTTCGTCCCATCCGGTGCAACGGCAGGTCCCCGGCGGGTTCTCAGGCCGAGAACATCTTCGACGGAGCGCTAAGAAATTTCCAGGGCATACCGTCGTCGCCAGAGGGTGGATTTACTGGTGATCAGCTCTGTGTGGCTAATATTGACCCGAGTGAGTACTTCAATTCCTCGAAGAAGGAAAAGCCTGTTGAGGTCGCGATGGATAATGGCGGCTTCGGGGATATCATCGGAAACAATTATTTCTCGGAGGAAGAGACGAAGGACGGCGGTTCGGGTGCAGTTATCGCTGTGGAGAATTTAAGCCGGAGAGGGGAAGGACCTCAATTAGATGACGATTCTTGTTCGACTTCAGATGTTGGTGATGACATTGTAAGCACAAAAAAACCTTTAAATCATAAGAGAAAGAGAACAAGATCGCTCGAGCTTTTTGTGGAAAATTTGGTAATGAAGGTATTGGATAAACAGGAGCAGATGCATCAGCAGTTGATCGATATGATAGAAAAGAAGGAGAAGGAAAGAATAGTCAGAGAAGAAGCTTGGAAGCAGAGGAGATTGAAAGAATCAGAAGGGATGAGGATTGAAAACCAATGTACAGAAGATGATGGAGGTGAAAGCAGCATTCAGAAGGAATTAAAAAGTGATCTTAGTAGGAGATGGCCTCAGGCTGAAGTACAGGCCTTGATATCACTACGAACGTCTCTGGAACATAAATTTCGTGCTACAGGCTCAAAAGGATCAATATGGGAGGAGATATCAGTTGAGATGCATAAGATGGGTCACAACCGCTCGGCGAAGAAGTGCAAAGAAAAATGGGAAAATATGAACAAATATTTCAAAAGGACAATTGGAACCGGGAAGACTAGTATTACAAATGCCCAGATTTCTGATAACGCAGTTTCTTCATGTTTACCCTTGGTCCATTCATGGCTGCTTCCCCAATCCGCTTGCATTTCGGAATTTAAACTCATGGATGTTTCAGACAATTGTGCTCCTATGACAGCGCCGAACACAGAGCTGCAGAAGCCGCCACGTGTCTACTCCGGCGCAAACCTTTTAAAATCCAATCTCTTCGCTCCATCTCCTCCGATTCAAAACCCTAAAAATGCGCTGAATCAAACACTGCCAATCTTAGTTGAAGCTCTGGTAGTTGATACCTCCCTCTTAAAGATCTCAAGTTTTCCATCCATGGCCTCCGTCGTCAAGCCCTCTTCGCGCTACAGTTCCTACGATGTGCGCTCTTCAACTTCCTCCCACTTCTCCGACCCTTCTTCTTCCTCCGAGTTCAAGCTCAAGTCCCCCATGGCAGCCAACTCGTCGTCTTCCCGCGCTCTTGTTAAGTGCAAGGCGTCTGATCTGGCTAGAGGCAAATCGAAGCCGTCTGATCAGAACTTGACGGCGATGGTGAAGAAGTTCATGGAGAAACGCTCTGGTTTGAAGCCGAAGACGGCGAAGCAGGCGACGGGGTTGAAGCTGTTTGGGAAGGGAACTGCGGCGGTGGAGAAGAAGGAGAAGGAGACGGAGGCGAAGGCGTTGACGGAAGTGAAAGGGAATACGAGGACATTGGCGATGGTGCTGAGAAGTGAAAGAGAGCTTTTGAGTTTGAATAAGGAGCAGGAGTTGGAAATCACTGAGCTCAAATTAGTTCTGGAAGAGAAGTACAGAGAAATTGAGAAGTTGAAGGATTTATGTTTGAAGCAAAGAGAAGAAATAAAGTCATTGAAAAATGCAATATTGTTCCCAGATGTTATGAATTCTCAGCTTCAAGAGCTGCTTGAAAAGCAGGATTCAGAGTTGAAGCAAGCCAAACAAGTCATCCCTACTCTGCAAAAGCA
GGTCACTACTCTCACTGGCCAGCTTCATTCCCTCGCCGAGGACCTTGCCGAGGTGAAGGCAGATAAATATTCAGGAAAGGCCTGGTTGCAAAATAATAGCAGTTCTCCTCACACACCAACATATGATCACGAGGATGCTTCTAACTCTTTGGAGTTCAGTGCCTGCGATCCAGCATCCCCTGGCAGTCCAGATGACTTTTTGCTGAAGGATGTGAATCCCTGTCTAACACCCTATTATGCAACTAAATCCAAGGAGTTTGAGGCAATGGGATATGATTCTCCGCGAGATGAAATCTTATCCCACAACAGAATGGAATCTGGTTTTAAATCTTGTTCCAGAAAATTGTCCAAAAGTTCTGATTGCAGGCAGAATTCCAACAAACCAAACACTACAAAAACAGCCCGAAGATCTGATGAAGCCAAATACACATATGGAAAGCCAATGCGTAAATTTTACTGA

Coding sequence (CDS)

ATGGACCTCTTCACCGCCGATCATCGGATAACCAGCTCCGACGACTTCCCGCAGCACGTAGCGCCATTTCCCGATCCCACGGACCTGCTTTACGCCGCTCCATCGGCCGTATTTCCCCCGGCCGATATTATCGACCACTGTCCGAACCCTCCACCGCCGCCGCAGAAGCTTCGTCCCATCCGGTGCAACGGCAGGTCCCCGGCGGGTTCTCAGGCCGAGAACATCTTCGACGGAGCGCTAAGAAATTTCCAGGGCATACCGTCGTCGCCAGAGGGTGGATTTACTGGTGATCAGCTCTGTGTGGCTAATATTGACCCGAGTGAGTACTTCAATTCCTCGAAGAAGGAAAAGCCTGTTGAGGTCGCGATGGATAATGGCGGCTTCGGGGATATCATCGGAAACAATTATTTCTCGGAGGAAGAGACGAAGGACGGCGGTTCGGGTGCAGTTATCGCTGTGGAGAATTTAAGCCGGAGAGGGGAAGGACCTCAATTAGATGACGATTCTTGTTCGACTTCAGATGTTGGTGATGACATTGTAAGCACAAAAAAACCTTTAAATCATAAGAGAAAGAGAACAAGATCGCTCGAGCTTTTTGTGGAAAATTTGGTAATGAAGGTATTGGATAAACAGGAGCAGATGCATCAGCAGTTGATCGATATGATAGAAAAGAAGGAGAAGGAAAGAATAGTCAGAGAAGAAGCTTGGAAGCAGAGGAGATTGAAAGAATCAGAAGGGATGAGGATTGAAAACCAATGTACAGAAGATGATGGAGGTGAAAGCAGCATTCAGAAGGAATTAAAAAGTGATCTTAGTAGGAGATGGCCTCAGGCTGAAGTACAGGCCTTGATATCACTACGAACGTCTCTGGAACATAAATTTCGTGCTACAGGCTCAAAAGGATCAATATGGGAGGAGATATCAGTTGAGATGCATAAGATGGGTCACAACCGCTCGGCGAAGAAGTGCAAAGAAAAATGGGAAAATATGAACAAATATTTCAAAAGGACAATTGGAACCGGGAAGACTAGTATTACAAATGCCCAGATTTCTGATAACGCAGTTTCTTCATGTTTACCCTTGGTCCATTCATGGCTGCTTCCCCAATCCGCTTGCATTTCGGAATTTAAACTCATGGATGTTTCAGACAATTGTGCTCCTATGACAGCGCCGAACACAGAGCTGCAGAAGCCGCCACGTGTCTACTCCGGCGCAAACCTTTTAAAATCCAATCTCTTCGCTCCATCTCCTCCGATTCAAAACCCTAAAAATGCGCTGAATCAAACACTGCCAATCTTAGTTGAAGCTCTGGTAGTTGATACCTCCCTCTTAAAGATCTCAAGTTTTCCATCCATGGCCTCCGTCGTCAAGCCCTCTTCGCGCTACAGTTCCTACGATGTGCGCTCTTCAACTTCCTCCCACTTCTCCGACCCTTCTTCTTCCTCCGAGTTCAAGCTCAAGTCCCCCATGGCAGCCAACTCGTCGTCTTCCCGCGCTCTTGTTAAGTGCAAGGCGTCTGATCTGGCTAGAGGCAAATCGAAGCCGTCTGATCAGAACTTGACGGCGATGGTGAAGAAGTTCATGGAGAAACGCTCTGGTTTGAAGCCGAAGACGGCGAAGCAGGCGACGGGGTTGAAGCTGTTTGGGAAGGGAACTGCGGCGGTGGAGAAGAAGGAGAAGGAGACGGAGGCGAAGGCGTTGACGGAAGTGAAAGGGAATACGAGGACATTGGCGATGGTGCTGAGAAGTGAAAGAGAGCTTTTGAGTTTGAATAAGGAGCAGGAGTTGGAAATCACTGAGCTCAAATTAGTTCTGGAAGAGAAGTACAGAGAAATTGAGAAGTTGAAGGATTTATGTTTGAAGCAAAGAGAAGAAATAAAGTCATTGAAAAATGCAATATTGTTCCCAGATGTTATGAATTCTCAGCTTCAAGAGCTGCTTGAAAAGCAGGATTCAGAGTTGAAGCAAGCCAAACAAGTCATCCCTACTCTGCAAAAGCA
GGTCACTACTCTCACTGGCCAGCTTCATTCCCTCGCCGAGGACCTTGCCGAGGTGAAGGCAGATAAATATTCAGGAAAGGCCTGGTTGCAAAATAATAGCAGTTCTCCTCACACACCAACATATGATCACGAGGATGCTTCTAACTCTTTGGAGTTCAGTGCCTGCGATCCAGCATCCCCTGGCAGTCCAGATGACTTTTTGCTGAAGGATGTGAATCCCTGTCTAACACCCTATTATGCAACTAAATCCAAGGAGTTTGAGGCAATGGGATATGATTCTCCGCGAGATGAAATCTTATCCCACAACAGAATGGAATCTGGTTTTAAATCTTGTTCCAGAAAATTGTCCAAAAGTTCTGATTGCAGGCAGAATTCCAACAAACCAAACACTACAAAAACAGCCCGAAGATCTGATGAAGCCAAATACACATATGGAAAGCCAATGCGTAAATTTTACTGA

Protein sequence

MDLFTADHRITSSDDFPQHVAPFPDPTDLLYAAPSAVFPPADIIDHCPNPPPPPQKLRPIRCNGRSPAGSQAENIFDGALRNFQGIPSSPEGGFTGDQLCVANIDPSEYFNSSKKEKPVEVAMDNGGFGDIIGNNYFSEEETKDGGSGAVIAVENLSRRGEGPQLDDDSCSTSDVGDDIVSTKKPLNHKRKRTRSLELFVENLVMKVLDKQEQMHQQLIDMIEKKEKERIVREEAWKQRRLKESEGMRIENQCTEDDGGESSIQKELKSDLSRRWPQAEVQALISLRTSLEHKFRATGSKGSIWEEISVEMHKMGHNRSAKKCKEKWENMNKYFKRTIGTGKTSITNAQISDNAVSSCLPLVHSWLLPQSACISEFKLMDVSDNCAPMTAPNTELQKPPRVYSGANLLKSNLFAPSPPIQNPKNALNQTLPILVEALVVDTSLLKISSFPSMASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARGKSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGLKLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQKQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPASPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLSKSSDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKFY
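The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code; the function name `translate` and the two test fragments (the first 30 and the last 12 nucleotides of the CDS listed above) are chosen only for illustration and are not part of the database record.

```python
# Minimal sketch: translate a CDS fragment with the standard genetic code.
# Assumption: the standard (nuclear) codon table applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Build the codon -> amino acid map in TCAG order for all three positions.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGGACCTCTTCACCGCCGATCATCGGATA"  # first 30 nt of the CDS
cds_end = "AAATTTTACTGA"                     # last 12 nt of the CDS
print(translate(cds_start))  # MDLFTADHRI
print(translate(cds_end))    # KFY
```

The two fragments reproduce the first ten residues (MDLFTADHRI) and the last three residues (KFY) of the listed protein sequence, with TGA acting as the stop codon.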
Homology
BLAST of Sgr020877 vs. NCBI nr
Match: XP_022142583.1 (inner centromere protein A [Momordica charantia])

HSP 1 Score: 616.3 bits (1588), Expect = 3.8e-172
Identity = 343/396 (86.62%), Positives = 353/396 (89.14%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAAN--SSSSRALVKCKASDLA 511
           MASV+KPSSRYSSYDVRSSTSSHFSDPS+SSEFKLKSPMAAN  SSSSRALVK KASDLA
Sbjct: 1   MASVIKPSSRYSSYDVRSSTSSHFSDPSTSSEFKLKSPMAANSSSSSSRALVKSKASDLA 60

Query: 512 RGKSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL------------------------ 571
           R KSKPSDQNLTAMVKKFMEKRS  KPKTAK ATGL                        
Sbjct: 61  RAKSKPSDQNLTAMVKKFMEKRSASKPKTAKHATGLVIPSDLIAEDLKKTARKGTNFGGL 120

Query: 572 --KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL 631
             KLFGKG+AAVEKKEK+ E KALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL
Sbjct: 121 HKKLFGKGSAAVEKKEKKEEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL 180

Query: 632 VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPT 691
           VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQE+LEKQDSELKQAKQ+IPT
Sbjct: 181 VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQEMLEKQDSELKQAKQIIPT 240

Query: 692 LQKQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACD 751
           LQKQVT LTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYD EDASNSLEFSACD
Sbjct: 241 LQKQVTXLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDDEDASNSLEFSACD 300

Query: 752 PASPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLS 811
           P SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNR ESGF+SCSRKLS
Sbjct: 301 PTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRKESGFESCSRKLS 360

Query: 812 KSSDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKFY 820
           +SSDCRQ SN+ NTT+TARRSDEAKY YGKPM KFY
Sbjct: 361 RSSDCRQKSNETNTTRTARRSDEAKYMYGKPMHKFY 396

BLAST of Sgr020877 vs. NCBI nr
Match: XP_023542139.1 (uncharacterized protein LOC111802113 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023542140.1 uncharacterized protein LOC111802113 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 592.8 bits (1527), Expect = 4.5e-165
Identity = 332/394 (84.26%), Positives = 342/394 (86.80%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG 511
           MA V+ PSSRYSSYDVRSS SSHFSDPSSSSEFKLKSPM A+SSSSRA+VK KA+DLAR 
Sbjct: 1   MAKVINPSSRYSSYDVRSSNSSHFSDPSSSSEFKLKSPMKADSSSSRAIVKSKAADLARA 60

Query: 512 KSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL-------------------------- 571
           K+KPSDQNLTAMVKKFMEKRSGLKPKT K ATGL                          
Sbjct: 61  KTKPSDQNLTAMVKKFMEKRSGLKPKTVKHATGLVIPSDLIAEDLKKTARKGTNFGGLHK 120

Query: 572 KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 631
           KLFGKG   VEKKEKE E KALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL
Sbjct: 121 KLFGKG--MVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 180

Query: 632 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQ 691
           EEKY EIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ +LEKQDSELKQAKQ+IPTLQ
Sbjct: 181 EEKYGEIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQGILEKQDSELKQAKQIIPTLQ 240

Query: 692 KQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPA 751
           KQVTTLTGQL+SLAEDLAEVKADKYSGK WLQ  SSSPHTPTYDHEDASN LEFSACDP 
Sbjct: 241 KQVTTLTGQLYSLAEDLAEVKADKYSGKGWLQ-GSSSPHTPTYDHEDASNPLEFSACDPT 300

Query: 752 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLSKS 811
           SP  PDD+LLKDVNPCLTPYYATKSK+FEAMGYDSPRDEILSHNRMESGF SCSRKLSKS
Sbjct: 301 SPSRPDDYLLKDVNPCLTPYYATKSKDFEAMGYDSPRDEILSHNRMESGFTSCSRKLSKS 360

Query: 812 SDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKFY 820
           SDCRQNSNK  TTKTARRSDEAKYTYGKPM KFY
Sbjct: 361 SDCRQNSNKAKTTKTARRSDEAKYTYGKPMHKFY 391

BLAST of Sgr020877 vs. NCBI nr
Match: XP_038895034.1 (uncharacterized protein LOC120083373 [Benincasa hispida])

HSP 1 Score: 589.0 bits (1517), Expect = 6.5e-164
Identity = 333/393 (84.73%), Positives = 342/393 (87.02%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG 511
           MA V+KPSSRYSSYDVRSSTSSHFSDPSSS EF LKSP+ ANSSSSRALVK K SDLAR 
Sbjct: 1   MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSCEFNLKSPLPANSSSSRALVKTKPSDLARA 60

Query: 512 KSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL-------------------------- 571
           K+KPSDQNLTAMVKKFMEKRSG KPKT KQA GL                          
Sbjct: 61  KAKPSDQNLTAMVKKFMEKRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHK 120

Query: 572 KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 631
           KLFGKGT  VEKKE + E KALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL+L
Sbjct: 121 KLFGKGT--VEKKEVK-EVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLIL 180

Query: 632 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQ 691
           EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ +LEKQDSELKQAKQ+IPTLQ
Sbjct: 181 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQ 240

Query: 692 KQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPA 751
           KQVTTLTGQLHSLAEDLAEVKADKYSGK+WLQ  S SPHTPTYD EDASNSLEFSACDP 
Sbjct: 241 KQVTTLTGQLHSLAEDLAEVKADKYSGKSWLQ-GSISPHTPTYDQEDASNSLEFSACDPT 300

Query: 752 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLSKS 811
           SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRME GFKSCSRKLSKS
Sbjct: 301 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLSKS 360

Query: 812 SDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKF 819
           SDCRQNS+K NTTKTARRSDEAKY YGKPM KF
Sbjct: 361 SDCRQNSDKANTTKTARRSDEAKYMYGKPMHKF 389

BLAST of Sgr020877 vs. NCBI nr
Match: KAG6573527.1 (hypothetical protein SDJN03_27414, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 586.6 bits (1511), Expect = 3.2e-163
Identity = 331/394 (84.01%), Positives = 341/394 (86.55%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG 511
           MA V+ PSSRYSSYDVRSS SSHFSDPSSSSEFKLKSPM A+SSSSRA+VK KA+DLAR 
Sbjct: 1   MAKVINPSSRYSSYDVRSSNSSHFSDPSSSSEFKLKSPMKADSSSSRAIVKSKAADLARA 60

Query: 512 KSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL-------------------------- 571
           K+KPSDQNLTAMVKKFMEKRSGLKPKT K ATGL                          
Sbjct: 61  KTKPSDQNLTAMVKKFMEKRSGLKPKTVKHATGLVIPSDLIAEDLKKTARKGTNFGGLHK 120

Query: 572 KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 631
           KLFGKG   VEKKEK  E KALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL
Sbjct: 121 KLFGKG--MVEKKEK--EVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 180

Query: 632 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQ 691
           EEKY EIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ +LEKQDSELKQAKQ+IPTLQ
Sbjct: 181 EEKYGEIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQGILEKQDSELKQAKQIIPTLQ 240

Query: 692 KQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPA 751
           KQVTTLTGQL+SLAEDLAEVKADKYSGK WLQ  SSSPHTPTYDHEDASN LEFSACDP 
Sbjct: 241 KQVTTLTGQLYSLAEDLAEVKADKYSGKGWLQ-GSSSPHTPTYDHEDASNPLEFSACDPT 300

Query: 752 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLSKS 811
           SP  PDD+LLKDVNPCLTPYYATKSK+FEAMGYDSPRDEILSHNRMESGF SCSRKLSKS
Sbjct: 301 SPSRPDDYLLKDVNPCLTPYYATKSKDFEAMGYDSPRDEILSHNRMESGFTSCSRKLSKS 360

Query: 812 SDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKFY 820
           SDCRQNSNK  TTKTARRSDEAKYTYGKPM KFY
Sbjct: 361 SDCRQNSNKAKTTKTARRSDEAKYTYGKPMHKFY 389

BLAST of Sgr020877 vs. NCBI nr
Match: XP_022925334.1 (uncharacterized protein LOC111432624 isoform X1 [Cucurbita moschata] >XP_022925335.1 uncharacterized protein LOC111432624 isoform X1 [Cucurbita moschata])

HSP 1 Score: 583.9 bits (1504), Expect = 2.1e-162
Identity = 330/394 (83.76%), Positives = 340/394 (86.29%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG 511
           MA V+ PSSRYSSYDVRSS SSHFSDPSSSSEFKLKSPM A+SSSSRA+VK KA+DL R 
Sbjct: 1   MAKVINPSSRYSSYDVRSSGSSHFSDPSSSSEFKLKSPMKADSSSSRAIVKSKAADLPRA 60

Query: 512 KSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL-------------------------- 571
           K+KPSDQNLTAMVKKFMEKRSGLKPKT K ATGL                          
Sbjct: 61  KTKPSDQNLTAMVKKFMEKRSGLKPKTVKHATGLVIPSDLIAEDLKKTARKGTNFGGLHK 120

Query: 572 KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 631
           KLFGKG   VEKKEK  E KALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL
Sbjct: 121 KLFGKG--MVEKKEK--EVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 180

Query: 632 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQ 691
           EEKY EIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ +LEKQDSELKQAKQ+IPTLQ
Sbjct: 181 EEKYGEIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQGILEKQDSELKQAKQIIPTLQ 240

Query: 692 KQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPA 751
           KQVTTLTGQL+SLAEDLAEVKADKYSGK WLQ  SSSPHTPTYDHEDASN LEFSACDP 
Sbjct: 241 KQVTTLTGQLYSLAEDLAEVKADKYSGKGWLQ-GSSSPHTPTYDHEDASNPLEFSACDPT 300

Query: 752 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLSKS 811
           SP  PDD+LLKDVNPCLTPYYATKSK+FEAMGYDSPRDEILSHNRMESGF SCSRKLSKS
Sbjct: 301 SPSRPDDYLLKDVNPCLTPYYATKSKDFEAMGYDSPRDEILSHNRMESGFTSCSRKLSKS 360

Query: 812 SDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKFY 820
           SDCRQNSNK  TTKTARRSDEAKYTYGKPM KFY
Sbjct: 361 SDCRQNSNKAKTTKTARRSDEAKYTYGKPMHKFY 389

BLAST of Sgr020877 vs. ExPASy Swiss-Prot
Match: Q9C6K3 (Trihelix transcription factor DF1 OS=Arabidopsis thaliana OX=3702 GN=DF1 PE=4 SV=1)

HSP 1 Score: 103.2 bits (256), Expect = 1.4e-20
Identity = 75/260 (28.85%), Positives = 118/260 (45.38%), Query Frame = 0

Query: 162 GPQLDDDSCSTS---DVGDDIVSTKKPLNHKRKRTRSLELFVENLVMKVLDKQEQMHQQL 221
           G  L D+S S+S       D+         ++KR R  ++F E L+ +V+DKQE++ ++ 
Sbjct: 217 GDFLSDNSTSSSSSYSTSSDMEMGGGTATTRKKRKRKWKVFFERLMKQVVDKQEELQRKF 276

Query: 222 IDMIEKKEKERIVREEAWK-------------------------------QRRLKESE-- 281
           ++ +EK+E ER+VREE+W+                                ++L E +  
Sbjct: 277 LEAVEKREHERLVREESWRVQEIARINREHEILAQERSMSAAKDAAVMAFLQKLSEKQPN 336

Query: 282 -------------GMRIEN------------------------------QCTEDDGGESS 341
                         M++ N                                T+ D G   
Sbjct: 337 QPQPQPQPQQVRPSMQLNNNNQQQPPQRSPPPQPPAPLPQPIQAVVSTLDTTKTDNGGDQ 396

Query: 342 IQKELKSDLSRRWPQAEVQALISLRTSLEHKFRATGSKGSIWEEISVEMHKMGHNRSAKK 343
                 S  S RWP+ E++ALI LRT+L+ K++  G KG +WEEIS  M ++G NR++K+
Sbjct: 397 NMTPAASASSSRWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKR 456

BLAST of Sgr020877 vs. ExPASy Swiss-Prot
Match: Q39117 (Trihelix transcription factor GT-2 OS=Arabidopsis thaliana OX=3702 GN=GT-2 PE=1 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 3.5e-19
Identity = 76/236 (32.20%), Positives = 119/236 (50.42%), Query Frame = 0

Query: 165 LDDDSCSTSDVGDDIVSTKKPLNHKRKRTRSLELFVENLVMKVLDKQEQMHQQLIDMIEK 224
           L   S S+S   D+     +  + ++KR     LF + L  ++++KQE+M ++ ++ +E 
Sbjct: 231 LFSSSTSSSTASDEEEDHHQVKSSRKKRKYWKGLFTK-LTKELMEKQEKMQKRFLETLEY 290

Query: 225 KEKERIVREEAWKQRRL----KESEGMRIE--NQCTED----------DGGE-------- 284
           +EKERI REEAW+ + +    +E E +  E  N   +D           GG+        
Sbjct: 291 REKERISREEAWRVQEIGRINREHETLIHERSNAAAKDAAIISFLHKISGGQPQQPQQHN 350

Query: 285 --SSIQKELKSD--------------------------------LSRRWPQAEVQALISL 343
              S +K+ +SD                                 S RWP+ EV+ALI +
Sbjct: 351 HKPSQRKQYQSDHSITFESKEPRAVLLDTTIKMGNYDNNHSVSPSSSRWPKTEVEALIRI 410

BLAST of Sgr020877 vs. ExPASy Swiss-Prot
Match: Q9C882 (Trihelix transcription factor GTL1 OS=Arabidopsis thaliana OX=3702 GN=GTL1 PE=1 SV=2)

HSP 1 Score: 89.0 bits (219), Expect = 2.8e-16
Identity = 43/82 (52.44%), Positives = 60/82 (73.17%), Query Frame = 0

Query: 261 SSIQKELKSDLSRRWPQAEVQALISLRTSLEHKFRATGSKGSIWEEISVEMHKMGHNRSA 320
           SS Q  L S  S RWP+AE+ ALI+LR+ +E +++    KG +WEEIS  M +MG+NR+A
Sbjct: 424 SSEQSSLPS--SSRWPKAEILALINLRSGMEPRYQDNVPKGLLWEEISTSMKRMGYNRNA 483

Query: 321 KKCKEKWENMNKYFKRTIGTGK 343
           K+CKEKWEN+NKY+K+   + K
Sbjct: 484 KRCKEKWENINKYYKKVKESNK 503


HSP 2 Score: 41.2 bits (95), Expect = 6.6e-02
Identity = 32/88 (36.36%), Positives = 49/88 (55.68%), Query Frame = 0

Query: 160 GEGPQLDDDSCSTSDVGDDIVSTKKPLNHKRKR------TRSLELFVENLVMKVLDKQEQ 219
           G G   DDD     D+  D  +     + KRKR       + +ELF E LV +V+ KQ  
Sbjct: 230 GMGSDDDDD-----DMDVDQANIAGSSSRKRKRGNRGGGGKMMELF-EGLVRQVMQKQAA 289

Query: 220 MHQQLIDMIEKKEKERIVREEAWKQRRL 242
           M +  ++ +EK+E+ER+ REEAWK++ +
Sbjct: 290 MQRSFLEALEKREQERLDREEAWKRQEM 311

BLAST of Sgr020877 vs. ExPASy Swiss-Prot
Match: Q8H181 (Trihelix transcription factor GTL2 OS=Arabidopsis thaliana OX=3702 GN=At5g28300 PE=2 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 5.2e-15
Identity = 89/332 (26.81%), Positives = 146/332 (43.98%), Query Frame = 0

Query: 111 NSSKKEKPVEVAMDNGGFGDIIGNNYFSEEETKDGGSGAVIAVENLSRR-----GEGPQL 170
           N +K+   VE     G  G+ +  +  +E++ +D   G V      ++R     G+   +
Sbjct: 210 NQNKRTNLVE---GKGNVGETV-QDLMAEDKLRDQDQGQVEEASMENQRNSIEVGKVGNV 269

Query: 171 DDDSCSTSDVGDDIVSTKKPLNHKRK---RTRSLELFVENLVMKVLDKQEQMHQQLIDMI 230
           +DD+ S+S     ++  +K    ++K   R   L+ F E LV  ++ +QE+MH++L++ +
Sbjct: 270 EDDAKSSSSSSLMMIMKEKKRKKRKKEKERFGVLKGFCEGLVRNMIAQQEEMHKKLLEDM 329

Query: 231 EKKEKERIVREEAWKQR---RLKESEGMRIENQCTEDDGGES------------------ 290
            KKE+E+I REEAWK++   R+ +   +R + Q    D   +                  
Sbjct: 330 VKKEEEKIAREEAWKKQEIERVNKEVEIRAQEQAMASDRNTNIIKFISKFTDHDLDVVQN 389

Query: 291 ---------------------------------------SIQKEL--------------- 343
                                                  +I K L               
Sbjct: 390 PTSPSQDSSSLALRKTQGRRKFQTSSSLLPQTLTPHNLLTIDKSLEPFSTKTLKPKNQNP 449

BLAST of Sgr020877 vs. ExPASy Swiss-Prot
Match: Q9LZS0 (Trihelix transcription factor PTL OS=Arabidopsis thaliana OX=3702 GN=PTL PE=1 SV=1)

HSP 1 Score: 68.6 bits (166), Expect = 3.9e-10
Identity = 28/65 (43.08%), Positives = 46/65 (70.77%), Query Frame = 0

Query: 274 RWPQAEVQALISLRTSLEHKFRATGSKGSIWEEIS-VEMHKMGHNRSAKKCKEKWENMNK 333
           RWP+ E   L+ +R+ L+HKF+    KG +W+E+S +   + G+ RS KKC+EK+EN+ K
Sbjct: 119 RWPRQETLTLLEIRSRLDHKFKEANQKGPLWDEVSRIMSEEHGYQRSGKKCREKFENLYK 178

Query: 334 YFKRT 338
           Y+++T
Sbjct: 179 YYRKT 183

BLAST of Sgr020877 vs. ExPASy TrEMBL
Match: A0A6J1CNL5 (inner centromere protein A OS=Momordica charantia OX=3673 GN=LOC111012662 PE=4 SV=1)

HSP 1 Score: 616.3 bits (1588), Expect = 1.8e-172
Identity = 343/396 (86.62%), Positives = 353/396 (89.14%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAAN--SSSSRALVKCKASDLA 511
           MASV+KPSSRYSSYDVRSSTSSHFSDPS+SSEFKLKSPMAAN  SSSSRALVK KASDLA
Sbjct: 1   MASVIKPSSRYSSYDVRSSTSSHFSDPSTSSEFKLKSPMAANSSSSSSRALVKSKASDLA 60

Query: 512 RGKSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL------------------------ 571
           R KSKPSDQNLTAMVKKFMEKRS  KPKTAK ATGL                        
Sbjct: 61  RAKSKPSDQNLTAMVKKFMEKRSASKPKTAKHATGLVIPSDLIAEDLKKTARKGTNFGGL 120

Query: 572 --KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL 631
             KLFGKG+AAVEKKEK+ E KALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL
Sbjct: 121 HKKLFGKGSAAVEKKEKKEEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL 180

Query: 632 VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPT 691
           VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQE+LEKQDSELKQAKQ+IPT
Sbjct: 181 VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQEMLEKQDSELKQAKQIIPT 240

Query: 692 LQKQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACD 751
           LQKQVT LTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYD EDASNSLEFSACD
Sbjct: 241 LQKQVTXLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDDEDASNSLEFSACD 300

Query: 752 PASPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLS 811
           P SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNR ESGF+SCSRKLS
Sbjct: 301 PTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRKESGFESCSRKLS 360

Query: 812 KSSDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKFY 820
           +SSDCRQ SN+ NTT+TARRSDEAKY YGKPM KFY
Sbjct: 361 RSSDCRQKSNETNTTRTARRSDEAKYMYGKPMHKFY 396

BLAST of Sgr020877 vs. ExPASy TrEMBL
Match: A0A6J1EHN1 (uncharacterized protein LOC111432624 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432624 PE=4 SV=1)

HSP 1 Score: 583.9 bits (1504), Expect = 1.0e-162
Identity = 330/394 (83.76%), Positives = 340/394 (86.29%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG 511
           MA V+ PSSRYSSYDVRSS SSHFSDPSSSSEFKLKSPM A+SSSSRA+VK KA+DL R 
Sbjct: 1   MAKVINPSSRYSSYDVRSSGSSHFSDPSSSSEFKLKSPMKADSSSSRAIVKSKAADLPRA 60

Query: 512 KSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL-------------------------- 571
           K+KPSDQNLTAMVKKFMEKRSGLKPKT K ATGL                          
Sbjct: 61  KTKPSDQNLTAMVKKFMEKRSGLKPKTVKHATGLVIPSDLIAEDLKKTARKGTNFGGLHK 120

Query: 572 KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 631
           KLFGKG   VEKKEK  E KALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL
Sbjct: 121 KLFGKG--MVEKKEK--EVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 180

Query: 632 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQ 691
           EEKY EIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ +LEKQDSELKQAKQ+IPTLQ
Sbjct: 181 EEKYGEIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQGILEKQDSELKQAKQIIPTLQ 240

Query: 692 KQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPA 751
           KQVTTLTGQL+SLAEDLAEVKADKYSGK WLQ  SSSPHTPTYDHEDASN LEFSACDP 
Sbjct: 241 KQVTTLTGQLYSLAEDLAEVKADKYSGKGWLQ-GSSSPHTPTYDHEDASNPLEFSACDPT 300

Query: 752 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLSKS 811
           SP  PDD+LLKDVNPCLTPYYATKSK+FEAMGYDSPRDEILSHNRMESGF SCSRKLSKS
Sbjct: 301 SPSRPDDYLLKDVNPCLTPYYATKSKDFEAMGYDSPRDEILSHNRMESGFTSCSRKLSKS 360

Query: 812 SDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKFY 820
           SDCRQNSNK  TTKTARRSDEAKYTYGKPM KFY
Sbjct: 361 SDCRQNSNKAKTTKTARRSDEAKYTYGKPMHKFY 389

BLAST of Sgr020877 vs. ExPASy TrEMBL
Match: A0A1S3CL74 (uncharacterized protein LOC103501712 OS=Cucumis melo OX=3656 GN=LOC103501712 PE=4 SV=1)

HSP 1 Score: 579.7 bits (1493), Expect = 1.9e-161
Identity = 326/393 (82.95%), Positives = 339/393 (86.26%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG 511
           MA V+KPSSRYSSYDVRSSTSSHFSDPSSSS+FK+KSP+ ANSSSSRALVK K +DLAR 
Sbjct: 1   MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSSDFKIKSPLPANSSSSRALVKTKPTDLARA 60

Query: 512 KSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL-------------------------- 571
           K KPSDQNLTAMVKKFMEKRSG KPK  K A GL                          
Sbjct: 61  KMKPSDQNLTAMVKKFMEKRSGSKPKAVKHAAGLVIPSDLIAEDLKKTARKGTSFGGLHK 120

Query: 572 KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 631
           KLFGKGT  +EKK+ + E KALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL
Sbjct: 121 KLFGKGT--MEKKDAK-EVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 180

Query: 632 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQ 691
           EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ +LEKQDSELKQAKQ+IPTLQ
Sbjct: 181 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQ 240

Query: 692 KQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPA 751
           KQVTTLTGQLHSLAEDLAEVKADKYSGK+WLQ  S SPHTPTYDHEDASNSLEFS CDP 
Sbjct: 241 KQVTTLTGQLHSLAEDLAEVKADKYSGKSWLQ-GSISPHTPTYDHEDASNSLEFSVCDPT 300

Query: 752 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLSKS 811
           SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPR E +S NRMESGFKSCSRKLSKS
Sbjct: 301 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRGETVSQNRMESGFKSCSRKLSKS 360

Query: 812 SDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKF 819
           SDCRQNSNK NTTKT R+SDEAKYTYGKPM KF
Sbjct: 361 SDCRQNSNKANTTKTGRQSDEAKYTYGKPMHKF 389

BLAST of Sgr020877 vs. ExPASy TrEMBL
Match: A0A6J1HTF1 (uncharacterized protein LOC111466593 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466593 PE=4 SV=1)

HSP 1 Score: 578.9 bits (1491), Expect = 3.2e-161
Identity = 328/394 (83.25%), Positives = 337/394 (85.53%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG 511
           MA V+ PSSRYSSYDVRSS SSHFSDPSSSSEFKLKSPM A+SSSSR +VK KA DLAR 
Sbjct: 1   MAKVINPSSRYSSYDVRSSNSSHFSDPSSSSEFKLKSPMKADSSSSRTIVKSKAVDLARA 60

Query: 512 KSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL-------------------------- 571
           K+KP DQNLTAMVKKFMEKRSGLKPKT K ATGL                          
Sbjct: 61  KTKPLDQNLTAMVKKFMEKRSGLKPKTVKHATGLVIPSDLIAEDLKKTARKGTNFGGLHK 120

Query: 572 KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 631
           KLFGKG   VEKKEK  E KALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL
Sbjct: 121 KLFGKG--MVEKKEK--EVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 180

Query: 632 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQ 691
           EEKY EIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ +LEKQDSELKQAKQ+IPTLQ
Sbjct: 181 EEKYGEIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQGILEKQDSELKQAKQIIPTLQ 240

Query: 692 KQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPA 751
           KQVTTLTGQL+SLAEDLAEVKADKYSGK WLQ  SSSPHTPTYDHEDASN LEFSACDP 
Sbjct: 241 KQVTTLTGQLYSLAEDLAEVKADKYSGKGWLQ-GSSSPHTPTYDHEDASNPLEFSACDPT 300

Query: 752 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLSKS 811
           SP  PDD+LLKDVNPCLTPYYATKSK+FEAMGYDSPRDEILSHNRMES F SCSRKLSKS
Sbjct: 301 SPSRPDDYLLKDVNPCLTPYYATKSKDFEAMGYDSPRDEILSHNRMESDFTSCSRKLSKS 360

Query: 812 SDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKFY 820
           SDCRQNSNK  TTKTARRSDEAKYTYGKPM KFY
Sbjct: 361 SDCRQNSNKAKTTKTARRSDEAKYTYGKPMHKFY 389

BLAST of Sgr020877 vs. ExPASy TrEMBL
Match: A0A0A0LTE0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G181410 PE=4 SV=1)

HSP 1 Score: 578.6 bits (1490), Expect = 4.2e-161
Identity = 325/394 (82.49%), Positives = 340/394 (86.29%), Query Frame = 0

Query: 452 MASVVKPSSRYSSYDVRSSTSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG 511
           MA V+KPSSRY+SYD+RSSTSSHFSDPSSSS+F +KSP+  NSSSSRALVK K SDLAR 
Sbjct: 1   MAKVMKPSSRYTSYDIRSSTSSHFSDPSSSSDFNIKSPLPPNSSSSRALVKTKPSDLARA 60

Query: 512 KSKPSDQNLTAMVKKFMEKRSGLKPKTAKQATGL-------------------------- 571
           K KPSDQNLTAMVKKFMEKRSG KPKT K A GL                          
Sbjct: 61  KVKPSDQNLTAMVKKFMEKRSGSKPKTLKHAAGLVISSDLIAEDLKKTARKGTNFGGLHK 120

Query: 572 KLFGKGTAAVEKKEKETEAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVL 631
           KLFGKGT  VEKKE + E KALTEVKGNTRTLAMVLRSERELLSLNK+QELEITELKLVL
Sbjct: 121 KLFGKGT--VEKKEVK-EVKALTEVKGNTRTLAMVLRSERELLSLNKDQELEITELKLVL 180

Query: 632 EEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQ 691
           EEKYREIEKLKDLCLKQREEIKSLKNA+LFPDVMNSQLQ +LEKQDSELKQAKQ+IPTLQ
Sbjct: 181 EEKYREIEKLKDLCLKQREEIKSLKNAVLFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQ 240

Query: 692 KQVTTLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPA 751
           KQVTTLTGQL+SLAEDLAEVKADKYSGK+WLQ  S SPHTPTYDHEDASNSLEFS CDP 
Sbjct: 241 KQVTTLTGQLYSLAEDLAEVKADKYSGKSWLQ-GSISPHTPTYDHEDASNSLEFSVCDPT 300

Query: 752 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMESGFKSCSRKLSKS 811
           SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEIL  NRMESGFKSCSRKLSKS
Sbjct: 301 SPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILPQNRMESGFKSCSRKLSKS 360

Query: 812 SDCRQNSNKPNTTKTARRSDEAKYTYGKPMRKFY 820
           SDC+Q SNK NTTKT R+SDEAKYTYGKPMRKFY
Sbjct: 361 SDCKQISNKANTTKTGRQSDEAKYTYGKPMRKFY 390

BLAST of Sgr020877 vs. TAIR 10
Match: AT4G17240.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 8 growth stages; Has 1142 Blast hits to 1055 proteins in 252 species: Archae - 22; Bacteria - 318; Metazoa - 248; Fungi - 96; Plants - 59; Viruses - 3; Other Eukaryotes - 396 (source: NCBI BLink). )

HSP 1 Score: 223.8 bits (569), Expect = 5.1e-58
Identity = 179/375 (47.73%), Positives = 233/375 (62.13%), Query Frame = 0

Query: 459 SSRYSSYDVRSS-TSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG----KS 518
           +SRY+SYD RSS TSS  SD SSS+EFK   P+     SS+A+V+ K+S L +     K 
Sbjct: 2   ASRYNSYDSRSSVTSSIHSDLSSSAEFKSNKPI-----SSKAIVRSKSSYLTKTTKPIKP 61

Query: 519 KPSDQNLTAMVKKFME-KRSGLK--------PKTAKQATGLKLFGKGTAAVEKKE--KET 578
             +  NLT M+KK ME K+S  K        P+  K+    K  GK T    +++   + 
Sbjct: 62  DSNPGNLTNMMKKLMEMKKSNSKSKRVELVIPEELKKIDTGKGGGKSTLGTLQRKLFGKE 121

Query: 579 EAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVLEEKYREIEKLKDLCLKQ 638
           + KALTEVK NTRTL+MVLRSERELL +NK+QE+EI ELK  LEEK RE+EKLKDLCLKQ
Sbjct: 122 KVKALTEVKSNTRTLSMVLRSERELLGMNKDQEVEIAELKFQLEEKNREVEKLKDLCLKQ 181

Query: 639 REEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQKQVTTLTGQLHSLAEDL 698
           REEIKSLK+A+LFPD MNSQ+ ++      EL QA+++IP LQKQV +L GQL  +A+DL
Sbjct: 182 REEIKSLKSAVLFPDSMNSQINQM-----QELNQAREIIPNLQKQVISLNGQLQCIAQDL 241

Query: 699 AEVKADKY-SGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPASPGSPDDFLLKDVNPC 758
           AEVKA+KY S   + Q  +SS     YD      SLEFS+      GSPD   L+D+NPC
Sbjct: 242 AEVKANKYLSESCYWQAQTSS-----YD------SLEFSS------GSPDGLALEDLNPC 301

Query: 759 LTPYYATKSKEFEAMGYDSPRDEILSHNRMES---GFKSCSR--KLSKSSDCRQNSNKPN 812
           LTPY   K KE+E +  DS  + +   + + +     KS SR  K+S+SS+         
Sbjct: 302 LTPYTKKKPKEYERV--DSAEESLSGRSTITTTGGKVKSSSRSVKMSRSSE--------- 337

BLAST of Sgr020877 vs. TAIR 10
Match: AT4G17240.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 8 growth stages. )

HSP 1 Score: 173.7 bits (439), Expect = 6.1e-43
Identity = 158/375 (42.13%), Positives = 215/375 (57.33%), Query Frame = 0

Query: 459 SSRYSSYDVRSS-TSSHFSDPSSSSEFKLKSPMAANSSSSRALVKCKASDLARG----KS 518
           +SRY+SYD RSS TSS  SD SSS+EFK   P+     SS+A+V+ K+S L +     K 
Sbjct: 2   ASRYNSYDSRSSVTSSIHSDLSSSAEFKSNKPI-----SSKAIVRSKSSYLTKTTKPIKP 61

Query: 519 KPSDQNLTAMVKKFME-KRSGLK--------PKTAKQATGLKLFGKGTAAVEKKE--KET 578
             +  NLT M+KK ME K+S  K        P+  K+    K  GK T    +++   + 
Sbjct: 62  DSNPGNLTNMMKKLMEMKKSNSKSKRVELVIPEELKKIDTGKGGGKSTLGTLQRKLFGKE 121

Query: 579 EAKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVLEEKYREIEKLKDLCLKQ 638
           + KALTEVK NTRTL+M+            E+     ++K+ L     ++EKLKDLCLKQ
Sbjct: 122 KVKALTEVKSNTRTLSMI-----------HERLAVCNQIKVFL-----QVEKLKDLCLKQ 181

Query: 639 REEIKSLKNAILFPDVMNSQLQELLEKQDSELKQAKQVIPTLQKQVTTLTGQLHSLAEDL 698
           REEIKSLK+A+LFPD MNSQ+ ++      EL QA+++IP LQKQV +L GQL  +A+DL
Sbjct: 182 REEIKSLKSAVLFPDSMNSQINQM-----QELNQAREIIPNLQKQVISLNGQLQCIAQDL 241

Query: 699 AEVKADKY-SGKAWLQNNSSSPHTPTYDHEDASNSLEFSACDPASPGSPDDFLLKDVNPC 758
           AEVKA+KY S   + Q  +SS     YD      SLEFS+      GSPD   L+D+NPC
Sbjct: 242 AEVKANKYLSESCYWQAQTSS-----YD------SLEFSS------GSPDGLALEDLNPC 301

Query: 759 LTPYYATKSKEFEAMGYDSPRDEILSHNRMES---GFKSCSR--KLSKSSDCRQNSNKPN 812
           LTPY   K KE+E +  DS  + +   + + +     KS SR  K+S+SS+         
Sbjct: 302 LTPYTKKKPKEYERV--DSAEESLSGRSTITTTGGKVKSSSRSVKMSRSSE--------- 321

BLAST of Sgr020877 vs. TAIR 10
Match: AT5G47660.1 (Homeodomain-like superfamily protein )

HSP 1 Score: 119.8 bits (299), Expect = 1.0e-26
Identity = 127/397 (31.99%), Positives = 177/397 (44.58%), Query Frame = 0

Query: 1   MDLFTADHRITSSDDFPQHVAPFPDPTD-----LLYAAPSAVFPPADIIDHCPNPPPPPQ 60
           M+L   D R    DDF + + PF D +D     +            D +    +   PPQ
Sbjct: 1   MELLAGDCRKRVGDDFEEDINPF-DGSDGGCGWMYGTRQMGSNGNDDALATLADLASPPQ 60

Query: 61  KLRPIRCNGRSPAGSQAENIFDGALRNFQGIPSSPEGGFTGDQLCVANIDPSEYFNSSKK 120
           KL+PIRC  + P+ S+  +  D        +P    G F             E    SK 
Sbjct: 61  KLKPIRCGVKLPSSSEDRHPLDILAGTLDRLPEMGFGCF-------------EAPLGSKI 120

Query: 121 EKPVEVAMDNGGFGDIIGNNYFSEEETKDGGSGAVIAVENLSRRGEGPQLDDDSCSTSDV 180
               E      GF          E+++         A   +S  G       DS S SD 
Sbjct: 121 ADVEESGQLTRGFSK-------EEDDSLPPLQMEFQARNRISWDGLSLSSSVDS-SDSDS 180

Query: 181 GDDIVSTKKPLNHKRKR-TR-SLELFVENLVMKVLDKQEQMHQQLIDMIEKKEKERIVRE 240
             D+   +K +  KRKR TR  LE F+E LV  ++ +QE+MH QLI+++EK E ERI RE
Sbjct: 181 SPDV---RKTVTGKRKRETRVKLEHFLEKLVGSMMKRQEKMHNQLINVMEKMEVERIRRE 240

Query: 241 EAWKQR---RLKESEGMR-------------------------------------IENQC 300
           EAW+Q+   R+ ++E  R                                     +  QC
Sbjct: 241 EAWRQQETERMTQNEEARKQEMARNLSLISFIRSVTGDEIEIPKQCEFPQPLQQILPEQC 300

Query: 301 TEDDGGESSIQKELKSDLS-------RRWPQAEVQALISLRTSLEHKFRATG-SKGSIWE 343
            ++    +  ++E+K   S       RRWPQ EVQALIS R+ +E K   TG +KG+IW+
Sbjct: 301 KDEKCESAQREREIKFRYSSGSGSSGRRWPQEEVQALISSRSDVEEK---TGINKGAIWD 360

BLAST of Sgr020877 vs. TAIR 10
Match: AT1G76880.1 (Duplicated homeodomain-like superfamily protein )

HSP 1 Score: 103.2 bits (256), Expect = 1.0e-21
Identity = 75/260 (28.85%), Positives = 118/260 (45.38%), Query Frame = 0

Query: 162 GPQLDDDSCSTS---DVGDDIVSTKKPLNHKRKRTRSLELFVENLVMKVLDKQEQMHQQL 221
           G  L D+S S+S       D+         ++KR R  ++F E L+ +V+DKQE++ ++ 
Sbjct: 217 GDFLSDNSTSSSSSYSTSSDMEMGGGTATTRKKRKRKWKVFFERLMKQVVDKQEELQRKF 276

Query: 222 IDMIEKKEKERIVREEAWK-------------------------------QRRLKESE-- 281
           ++ +EK+E ER+VREE+W+                                ++L E +  
Sbjct: 277 LEAVEKREHERLVREESWRVQEIARINREHEILAQERSMSAAKDAAVMAFLQKLSEKQPN 336

Query: 282 -------------GMRIEN------------------------------QCTEDDGGESS 341
                         M++ N                                T+ D G   
Sbjct: 337 QPQPQPQPQQVRPSMQLNNNNQQQPPQRSPPPQPPAPLPQPIQAVVSTLDTTKTDNGGDQ 396

Query: 342 IQKELKSDLSRRWPQAEVQALISLRTSLEHKFRATGSKGSIWEEISVEMHKMGHNRSAKK 343
                 S  S RWP+ E++ALI LRT+L+ K++  G KG +WEEIS  M ++G NR++K+
Sbjct: 397 NMTPAASASSSRWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKR 456

BLAST of Sgr020877 vs. TAIR 10
Match: AT1G76890.2 (Duplicated homeodomain-like superfamily protein )

HSP 1 Score: 98.6 bits (244), Expect = 2.5e-20
Identity = 76/236 (32.20%), Positives = 119/236 (50.42%), Query Frame = 0

Query: 165 LDDDSCSTSDVGDDIVSTKKPLNHKRKRTRSLELFVENLVMKVLDKQEQMHQQLIDMIEK 224
           L   S S+S   D+     +  + ++KR     LF + L  ++++KQE+M ++ ++ +E 
Sbjct: 231 LFSSSTSSSTASDEEEDHHQVKSSRKKRKYWKGLFTK-LTKELMEKQEKMQKRFLETLEY 290

Query: 225 KEKERIVREEAWKQRRL----KESEGMRIE--NQCTED----------DGGE-------- 284
           +EKERI REEAW+ + +    +E E +  E  N   +D           GG+        
Sbjct: 291 REKERISREEAWRVQEIGRINREHETLIHERSNAAAKDAAIISFLHKISGGQPQQPQQHN 350

Query: 285 --SSIQKELKSD--------------------------------LSRRWPQAEVQALISL 343
              S +K+ +SD                                 S RWP+ EV+ALI +
Sbjct: 351 HKPSQRKQYQSDHSITFESKEPRAVLLDTTIKMGNYDNNHSVSPSSSRWPKTEVEALIRI 410

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022142583.1 | 3.8e-172 | 86.62 | inner centromere protein A [Momordica charantia]
XP_023542139.1 | 4.5e-165 | 84.26 | uncharacterized protein LOC111802113 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
XP_038895034.1 | 6.5e-164 | 84.73 | uncharacterized protein LOC120083373 [Benincasa hispida]
KAG6573527.1 | 3.2e-163 | 84.01 | hypothetical protein SDJN03_27414, partial [Cucurbita argyrosperma subsp. sorori...
XP_022925334.1 | 2.1e-162 | 83.76 | uncharacterized protein LOC111432624 isoform X1 [Cucurbita moschata] >XP_0229253...
Match Name | E-value | Identity (%) | Description
Q9C6K3 | 1.4e-20 | 28.85 | Trihelix transcription factor DF1 OS=Arabidopsis thaliana OX=3702 GN=DF1 PE=4 SV...
Q39117 | 3.5e-19 | 32.20 | Trihelix transcription factor GT-2 OS=Arabidopsis thaliana OX=3702 GN=GT-2 PE=1 ...
Q9C882 | 2.8e-16 | 52.44 | Trihelix transcription factor GTL1 OS=Arabidopsis thaliana OX=3702 GN=GTL1 PE=1 ...
Q8H181 | 5.2e-15 | 26.81 | Trihelix transcription factor GTL2 OS=Arabidopsis thaliana OX=3702 GN=At5g28300 ...
Q9LZS0 | 3.9e-10 | 43.08 | Trihelix transcription factor PTL OS=Arabidopsis thaliana OX=3702 GN=PTL PE=1 SV...
Match Name | E-value | Identity (%) | Description
A0A6J1CNL5 | 1.8e-172 | 86.62 | inner centromere protein A OS=Momordica charantia OX=3673 GN=LOC111012662 PE=4 S...
A0A6J1EHN1 | 1.0e-162 | 83.76 | uncharacterized protein LOC111432624 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A1S3CL74 | 1.9e-161 | 82.95 | uncharacterized protein LOC103501712 OS=Cucumis melo OX=3656 GN=LOC103501712 PE=...
A0A6J1HTF1 | 3.2e-161 | 83.25 | uncharacterized protein LOC111466593 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0LTE0 | 4.2e-161 | 82.49 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G181410 PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT4G17240.1 | 5.1e-58 | 47.73 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G17240.2 | 6.1e-43 | 42.13 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G47660.1 | 1.0e-26 | 31.99 | Homeodomain-like superfamily protein
AT1G76880.1 | 1.0e-21 | 28.85 | Duplicated homeodomain-like superfamily protein
AT1G76890.2 | 2.5e-20 | 32.20 | Duplicated homeodomain-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 591..632
None | No IPR available | COILS | Coil | Coil | coord: 208..228
None | No IPR available | COILS | Coil | Coil | coord: 640..667
None | No IPR available | GENE3D | 1.10.10.60 |  | coord: 275..336; e-value: 4.1E-22; score: 80.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 782..798
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 782..819
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 466..489
None | No IPR available | PANTHER | PTHR35493:SF1 | STRUCTURAL MAINTENANCE OF CHROMOSOMES PROTEIN | coord: 452..540
None | No IPR available | PANTHER | PTHR35493 | STRUCTURAL MAINTENANCE OF CHROMOSOMES PROTEIN | coord: 452..540
None | No IPR available | PANTHER | PTHR35493 | STRUCTURAL MAINTENANCE OF CHROMOSOMES PROTEIN | coord: 546..816
None | No IPR available | PANTHER | PTHR35493:SF1 | STRUCTURAL MAINTENANCE OF CHROMOSOMES PROTEIN | coord: 546..816
None | No IPR available | CDD | cd12203 | GT1 | coord: 273..337; e-value: 2.06707E-27; score: 103.514
IPR001005 | SANT/Myb domain | SMART | SM00717 | sant | coord: 271..333; e-value: 0.0048; score: 26.1
IPR001005 | SANT/Myb domain | PROSITE | PS50090 | MYB_LIKE | coord: 274..331; score: 7.259158
IPR044822 | Myb/SANT-like DNA-binding domain 4 | PFAM | PF13837 | Myb_DNA-bind_4 | coord: 273..342; e-value: 1.7E-16; score: 60.3

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr020877.1 | Sgr020877.1 | mRNA