MC06g1006 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC06g1006
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: pseudouridine-5'-phosphate glycosidase
Location: MC06: 9181571 .. 9187815 (-)
RNA-Seq Expression: MC06g1006
Synteny: MC06g1006
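
The Location field above combines a chromosome name, a start and end coordinate, and a strand. As a rough illustration, the following sketch parses such a string and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state explicitly; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'MC06: 9181571 .. 9187815 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("MC06: 9181571 .. 9187815 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 6245 bp on the minus strand of MC06.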
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGATCGAATTATTTATTTATTTATTTTTCAACCTCCTTTTAAATGGTAATGTTGCGGGGAATTCTGTAAACTCTATCATTCTGCCGCAACTAATTTTGTGATTTAGCCCAATTATCCACCACCCCCCAGTCTGATCAAAAGAACAATGGCGTCTTATCCTTCTTCTTCTTCTTCTTCAGCTCTCTCAAGAATATCTAATCTCAGTAGACACTTCCACCCTTCAGACTCGAGAAACACAGATAACCCACAGGTCAGAATCATCAAAATTATCTCTTTGAACTACTTTTCCATCTTTTGCTTACTGATGTGTCTTCTTCTGTAATGTTGAAAGGGTGCCATTGACGCTTCAAATGGAGGAGGCCTAATTAAGATGTCTGCCGAAGTTTCTGCTGCCTTGTCTCGCGGCCATCCCGTTGTTGCCCTTGAATCAACCATCATCTCACATGGTGACTCTGTTTTTCTTTGCCATTTCAATAATTTCATGCCATGAGCTTTCGGGTTTTGTAGAATTATTTTAAACTTTTGATCCCCTTTCGTCAAAACTAGTTTTCTTCATCCTTATGATGGATGATGATATGATTGTACCTTGGACAAACTTGGACAAAAAGAAACAAAGTGACTCGGACAAATCTTGTAATGGATCAAACACTAATATTGTGTATGATTCTCAGAGTAGAAGAACCATTGCATTTGAAATTAATTGGTTTTACAGGAGCTTGAAACACAGTAGGGGGCATTACAAAAATGAAGATTCATTTCTGATGACTTAGGCTGTAGGATCCAATAATGTTAAGTTTGAAAACTAGGCAAATTAGTGAAGGTCGACTATTTTTGGATATTCAAAAACTAATGAACACAATAAAATTGTTATTCTTTTCATTCATTTCGTAATAGTTGGAAGCTGTTTTCGCTCTTTGTATCTAGGGATGCCATATCCACAAAATCTGGAAACTGCAAAGGAAGTAGAGGCGATAGTGAGGAAGAATGGAGCAGTTCCTGCAACTGTAGCGATTTTAGATGGCACACCATGCGTAGGTGTTTATATTAAACTACTCTATCTCTATATACTTGTTGACACTGCTTGTGCTTGCTAATGATAGTCTTACACAATTTATTAACCATCTAGATATATTTGAAGGCCTACAACTTCAATGTAATGAGTGAACTAAATTTCGTGTCATGGTTTGAACTGATTTCAATTTTTGATTGTTTAACGCATTGACTTGTGCTTGAAAATGGTAAAACTTTGTGCTGATGCTTCTTGGGGCTTCTTTGGAGATCATGAAGTCACCAATGAATTGTCGAGGTGATAGATCAGATCCGGACACGGTTGCTGCTTTACGGCGATTCGGACAATTATGATCTTCAATATATCTTTGAGACATAATTTTGCATAATGTATGTCTCTGTAAGTGGTAGTTGTAGTGTCTCAACCTCATCCTTGAAGATGAAATTGTGTGTTGCTCGACATATCAAATAATATATCTATGTCCTGCCCATTGATTTGGCCTTCTATGCACGTGGTCTTTGGCTCTTTGAGACCTGCCTACTACTTAGTCTCCTCGTCTTCTATTAGATTAGTAACTGCAACAGTTAGTTCTTGCATCCTTTCCATGGAGCAAACTTTTATCACAATTAACAAAAGCTCGTTCTCTGGGAAACTTTCACTCCAATTCAGATAACTTCTTAAATTCCATCCAACCCTTCCTGTTGATCGAGTTTTGTTAATGTGTCCAACCAATCTAGCCCTTGAAGTCAACTAATTCAGAAGAATCAACCCATCCCATGTATGATGTATCCAACCCTCCGTATCCATACTGGAAAACACTACCCTTTCTAGCTTTCTCACTCTCAACTTGCTCATGGAGAAGTATCCCCGCCAATTTTCCCAATGTAAGCATGTGCACTCACAAGGGCTCCACCTTCCCCCATTGGTTTCTTAAGCTTTTGGAGTTTCCACTTCCACTCAAGTTTTGGGAATGGTTATACATCGTTGCCCC
CAATTTCTCATGCAGAGCTTCCATGATTCTCTCTAGGGTTGCTGGCTTGATACATGGCCCAAGGTTGTAGTGCCCTGATTCCAAACAATATGTTCTCTACTCATAACTTGAAGTAGTTTCAAGTGTTACTTTGAGTTAAGAAGAGTTACAGCTAGAAAAGATTTCAAGAGTCCAAAGTCTTTCAACAAACTCACTTTCTAATAATTGTATATTTTGCCTTTCTCCTCTTCTCCCCTTTTAGCCCAGGTTCCTCTTTATTTCCTTTGTTCTCGTGCATGGTTTCCTACCGAATATAAATATAACCAACTATTAAAGGACACCTCTAATAACTCCCTACTATTTATTGCCCTGCATTTCTGGCTAGATATTCTCTTCTTCCTCAGTGATTTTCTCATTGTCAGCAAGCAGAACAGTTTAGTTTTTCTTCTCACGTCGTTTATACATGAAGAACATGGGGATTCTATAACTTGCATTTTGAGGACAGCCCCCTTTTCATTTCCAGGTATAATGGTGTGTGCTTTACTAAGAAGTGGTCTAGTTGGGGGTCTAGACCATGTTCTAGTGTATTGTCCTTTTACCTTCTGCGTGTGGAATTAGTTTTTCCCGTGCTTTGGTATCTTATGGGTTCACTACTGCAAGAGTGACTCTGTGTTGAAAGGAGCTCTAAAGCAGTTATCATTTGACAATATCTTTTTGGAGTGTGGTATTGTAACATATTTGTTGAGGAATTTGCAGAGGTATGAAATTGTTTTGGAGGAAAAAGGCTTTGGATGCATTGAATTGGAATAGAGTCTATTTACCACCAACCCTTCTGCTAATTTCACTTCATTGTTGAAAGTATTCATTCAGAATTAAAACTTCAAAACTTTCCTTATGTTGGTTCTGTTTTTCCCACAACTACTTATCTCATTGATTGCTGCATTTTCACTTTGAGATTTCTAACTCTGTGGAGCTCTCTGTATGCAGGCCTAAATGAGGAAGAATTGGAGAGGTTGAGTATTCTGGGACACCGAGTCCAAAAGACTGCTAGGAGGGATATAGCACAAGTTGTGAGTATTAGTTTACCTTTTGTTTTGTCATTGACCAATTTGGTAATGTAATGAAATTTACTCAGAAATTATCAAGGTTTGAACTTAAATTGAATATTTGAATAATAGTCATAAAAATATCGATGTTGGTAGTTGTATGATATAATAATTATTTCCTTTCCTTTTCTCCATTAGTCTTAATTATAGGACATGGACTTATTATAAACTACGTGGTAGTTTATAATTAATGTGTTTTTTTGGAAATATGCTTTTTACATGGATAATTACATGTTATCTCGTCATGTCTTATAAGGTGGCTAGCAGAGGAAATGGTGCTACTACTGTTTCTGCAACAATGTTCTTTGCTTCTATGGTAAGCACCTCATGGTGGTTCCGTCGTATTCTGCATTTGATTCAATAAACATACTTCATCATTTTGCAGTCTTTTAACCTTCTCACCAGTTATTTACGTGGAATTATTGTAGGTTGGTATCTCTGTGTTTGTGACTGGGGGCATTGGAGGAGTTCATAGACATGGTGAACAAAGTAAGATACGTTTGTTTACAGTTTGGAACAATTATTTTGGGGAAGGGTGGGCGAGGGAAGATACTTTCATTTACAAGCTAGAAAGAGAGAAAATGATTGAGACAAAAACTCATGTGATGAACAAAATTAGAAGCGAGATATGGAGGAGAACAAAAGAATGATTGATGATATGCATGGTCAATCAAGATACATTCATAAGCTATAGAGAGAGAGGGAGGGAGGGAGGGATTGGAGAAATGGGTGCACAACCTCAAAGCAAGGGCATTTGTGGAAGGCTTGGAGAGCATGCTTTCGTGCACGATTTTGAGTGATGTATAGATTAGATTGCACCTTTCTCTTATCAATCATGGGCCTTACAGCTTTTGTAGAATCGATAGCTTTATCTTAAATCATATTCGTGTATTGCTCCCACTTCGGTCCAATGGT
GTCTAATGGAATATGGAGATGTTAGTTGGATTCTTTGCGAGCTTTTTGACAAATTTTTCTCCTAAGGTTCTCTGGTAAGTCAAATATTAATTTAGTTTTTTACTGGATGCTCACGCTGAGATTATAGATCATCACATTAATTTTTTTCTGGAGTGTTCAACATAAAGTTCTGACATGTTAATGTTTCAACATGGAGCATTGAATAACTTTATCGATCTCCCGTACTTATTTTCAACTGCTTTTTGCAGCACTGGACATATCTTCTGATCTCACTGAGCTAGGAAGAACTCCTGTAGCAGTCATCTCGGCTGGTGTAAAATCAATTTTAGACATTCCTAGGACACTTGAATATTTGGTACAGTTGTCTATTTCTCTCCCTTCAATCTTTCTACACATCATGCACTTTTCTTTTTAATCTATTCACTGTCCCAAAAATATCATTAACAGGAAACCCAAGGAGTGTGTGTTGCAGCTTACGGGACGAACGAGTTTCCTGCATTTTTCACTGAAACCAGTGGCTGCAAGGTATTTACCTAAATATCTGCTTTTTGCCTTCTAGATCTCTATATATTTCTAATGCTGCTGGTAATTTACCAAAGCCAAAAAGTTTCTTCACTTCCTCTATTTAGTTTAACGGGTATTAAATTCCAGTTATATTGTCCCTAATATTTGGTGTTATATCAAGCCCCTAAGTGACAGCTTAGTTGGCAAGAGTTTGGGGTCTCTTGTTCATATTGGCTCAGAAGTCTCAAGTTTCAGAACTTCTGGTGAGTTTAATATCAATAACCTTTGAAGTCTCTCCAGTTTAAGCCTTGGGCTGGGCACGGGTGCCCCTGGATATAGGAGAGCAAAGCTCTGACTCAAGTTCTAAAATATATATATATATATGGTGTTATACCTAAAGATAATCTTTAAGTTTGTGGTTGGTTATCTTCACTCATTCTATAGTTAGGGTGTAAATTTTCACTGGTTGATGTGCTAGAATGTTCTTTATTTGAAACTAAGATTGCTTTTTTGTTCATGACATACAAAATGCAGGCGCCTTGTCGTGTCGATACCCCAGAAGATGCTGCAAAGCTTATTGGCAAGTCACTACAGAGCTTTCAACCTTTGTTGATCTAATAATAGACTTCTTGAATATTTCTTCGCATAGTTGGCAATCTTGTGATTTATTCCATTGTTTTCTTTATTTATTTATTTTTTATGGTCTTCCTCTGCAGATGCCAACTTTAAGCTTGGGCTTGGAAGTGGAATTCTGATTGCTGTTCCAATTCCAAAAGAGCATTCTGCTTCCGGAAGCTTAACTGAGAATGCAATACAAAGCGCACTTCAAGAAGCTCGGTAAATAATAAAGAGTTGAAGCTATTATTCAGAAGAACTACTAATAAAGTCAGAATATTACTTATATTTCTCTTTTAATTATTTTGAAATATAACATTGCAAATATGAAATGAAGATTTGAACCTTTTGACTTGTAGGAGAGGATGCCTTTACCAGTAAACTCAAAAACTCTTTTTATTTGTATGTATCGATTACAGGGAGAAGAATATACTTGGAAATGCTGAAACTCCGTTCTTACTTAAAAGAGTGAATGAGCTTACTGGAGGAGCCTCGCTTGCTTCAAGTATCCTTCAATTGAGACCATTTTTTTTTCCAAATATTTTTAATCATAACGAACATGCATTGTCAGTTAGGCTAACAAGGAAGTTTTCGCATATTTCTTACATGATCGAGACGAAGCGAAAAAAGTTATTTATTTATTTTTTTACTTTTCCAAGCTATCATATTCAGACCTGGATTTTTGTTTTAAGCTTATGTTTCTTGGTTTCCTTAACTTCCCCAAGATATTGCACTTGTTAAGAATAATGCTATTGTTGGGGCTGGAATTGCTGTAGCCCTTGCCAAGCTTAGAGTCTAATAGCAGGAAGTATTTTCAAGTACTTTTTTTTTTCGTTTTTTTTTATTAATAGAATAACAATGGCCAAATAATTGTATTGTAT
GGTCATTGGAAAAACAATCTGTTAAGGCAGCCTAACTCTTTGTTTTTAATTTCATTTATTGGTTGGGATAGTTCAGATATAGCAATGTGATGAACATCACATCAGAGAATTTCCTCTCTTAAGTTAGTATGTTAAATTGTGTGTGCTCGTGCGTGTGTGATAAGAACCTGAATAAACGCTCGATATAAAAGAGGAAGACGAGTAATCTTCCAACAATCTTACGACAAAAGTGTAACAATTTCGTG

mRNA sequence

CGATCGAATTATTTATTTATTTATTTTTCAACCTCCTTTTAAATGGTAATGTTGCGGGGAATTCTGTAAACTCTATCATTCTGCCGCAACTAATTTTGTGATTTAGCCCAATTATCCACCACCCCCCAGTCTGATCAAAAGAACAATGGCGTCTTATCCTTCTTCTTCTTCTTCTTCAGCTCTCTCAAGAATATCTAATCTCAGTAGACACTTCCACCCTTCAGACTCGAGAAACACAGATAACCCACAGGGTGCCATTGACGCTTCAAATGGAGGAGGCCTAATTAAGATGTCTGCCGAAGTTTCTGCTGCCTTGTCTCGCGGCCATCCCGTTGTTGCCCTTGAATCAACCATCATCTCACATGGGATGCCATATCCACAAAATCTGGAAACTGCAAAGGAAGTAGAGGCGATAGTGAGGAAGAATGGAGCAGTTCCTGCAACTGTAGCGATTTTAGATGGCACACCATGCGTAGGCCTAAATGAGGAAGAATTGGAGAGGTTGAGTATTCTGGGACACCGAGTCCAAAAGACTGCTAGGAGGGATATAGCACAAGTTGTGGCTAGCAGAGGAAATGGTGCTACTACTGTTTCTGCAACAATGTTCTTTGCTTCTATGGTTGGTATCTCTGTGTTTGTGACTGGGGGCATTGGAGGAGTTCATAGACATGGTGAACAAACACTGGACATATCTTCTGATCTCACTGAGCTAGGAAGAACTCCTGTAGCAGTCATCTCGGCTGGTGTAAAATCAATTTTAGACATTCCTAGGACACTTGAATATTTGGAAACCCAAGGAGTGTGTGTTGCAGCTTACGGGACGAACGAGTTTCCTGCATTTTTCACTGAAACCAGTGGCTGCAAGGCGCCTTGTCGTGTCGATACCCCAGAAGATGCTGCAAAGCTTATTGGCAAGTCACTACAGAGCTTTCAACCTGATCTAATAATAGACTTCTTGAATATTTCTTCGCATAATGCCAACTTTAAGCTTGGGCTTGGAAGTGGAATTCTGATTGCTGTTCCAATTCCAAAAGAGCATTCTGCTTCCGGAAGCTTAACTGAGAATGCAATACAAAGCGCACTTCAAGAAGCTCGGGAGAAGAATATACTTGGAAATGCTGAAACTCCGTTCTTACTTAAAAGAGTGAATGAGCTTACTGGAGGAGCCTCGCTTGCTTCAAATATTGCACTTGTTAAGAATAATGCTATTGTTGGGGCTGGAATTGCTGTAGCCCTTGCCAAGCTTAGAGTCTAATAGCAGGAAGTATTTTCAAGTACTTTTTTTTTTCGTTTTTTTTTATTAATAGAATAACAATGGCCAAATAATTGTATTGTATGGTCATTGGAAAAACAATCTGTTAAGGCAGCCTAACTCTTTGTTTTTAATTTCATTTATTGGTTGGGATAGTTCAGATATAGCAATGTGATGAACATCACATCAGAGAATTTCCTCTCTTAAGTTAGTATGTTAAATTGTGTGTGCTCGTGCGTGTGTGATAAGAACCTGAATAAACGCTCGATATAAAAGAGGAAGACGAGTAATCTTCCAACAATCTTACGACAAAAGTGTAACAATTTCGTG

Coding sequence (CDS)

ATGGCGTCTTATCCTTCTTCTTCTTCTTCTTCAGCTCTCTCAAGAATATCTAATCTCAGTAGACACTTCCACCCTTCAGACTCGAGAAACACAGATAACCCACAGGGTGCCATTGACGCTTCAAATGGAGGAGGCCTAATTAAGATGTCTGCCGAAGTTTCTGCTGCCTTGTCTCGCGGCCATCCCGTTGTTGCCCTTGAATCAACCATCATCTCACATGGGATGCCATATCCACAAAATCTGGAAACTGCAAAGGAAGTAGAGGCGATAGTGAGGAAGAATGGAGCAGTTCCTGCAACTGTAGCGATTTTAGATGGCACACCATGCGTAGGCCTAAATGAGGAAGAATTGGAGAGGTTGAGTATTCTGGGACACCGAGTCCAAAAGACTGCTAGGAGGGATATAGCACAAGTTGTGGCTAGCAGAGGAAATGGTGCTACTACTGTTTCTGCAACAATGTTCTTTGCTTCTATGGTTGGTATCTCTGTGTTTGTGACTGGGGGCATTGGAGGAGTTCATAGACATGGTGAACAAACACTGGACATATCTTCTGATCTCACTGAGCTAGGAAGAACTCCTGTAGCAGTCATCTCGGCTGGTGTAAAATCAATTTTAGACATTCCTAGGACACTTGAATATTTGGAAACCCAAGGAGTGTGTGTTGCAGCTTACGGGACGAACGAGTTTCCTGCATTTTTCACTGAAACCAGTGGCTGCAAGGCGCCTTGTCGTGTCGATACCCCAGAAGATGCTGCAAAGCTTATTGGCAAGTCACTACAGAGCTTTCAACCTGATCTAATAATAGACTTCTTGAATATTTCTTCGCATAATGCCAACTTTAAGCTTGGGCTTGGAAGTGGAATTCTGATTGCTGTTCCAATTCCAAAAGAGCATTCTGCTTCCGGAAGCTTAACTGAGAATGCAATACAAAGCGCACTTCAAGAAGCTCGGGAGAAGAATATACTTGGAAATGCTGAAACTCCGTTCTTACTTAAAAGAGTGAATGAGCTTACTGGAGGAGCCTCGCTTGCTTCAAATATTGCACTTGTTAAGAATAATGCTATTGTTGGGGCTGGAATTGCTGTAGCCCTTGCCAAGCTTAGAGTCTAA

Protein sequence

MASYPSSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRGHPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTLDISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCKAPCRVDTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSASGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGIAVALAKLRV
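
The protein sequence above corresponds to a translation of the coding sequence. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base, could look like the following; `translate` is a hypothetical helper written for illustration, not the tool used to build this page.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First five codons of the CDS listed above.
print(translate("ATGGCGTCTTATCCT"))
```

The first five codons of the CDS (ATG GCG TCT TAT CCT) translate to MASYP, which agrees with the start of the protein sequence listed above.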
Homology
BLAST of MC06g1006 vs. ExPASy Swiss-Prot
Match: B6IRJ4 (Pseudouridine-5'-phosphate glycosidase OS=Rhodospirillum centenum (strain ATCC 51521 / SW) OX=414684 GN=psuG PE=3 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 1.2e-80
Identity = 171/322 (53.11%), Positives = 217/322 (67.39%), Query Frame = 0

Query: 47  IKMSAEVSAALSRGHPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDG 106
           + +  EV+AAL  G PVVALEST+ISHG+P P NLETA+ +EA VR NGAVPAT+A+LDG
Sbjct: 5   LSIHPEVAAALKAGRPVVALESTLISHGLPAPANLETAQAIEAAVRANGAVPATIAVLDG 64

Query: 107 TPCVGLNEEELERLSILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVT 166
              VGL+ E+++RL+  G    K +RRD+  V+A   +GATTV+ATM  A + GI+VF T
Sbjct: 65  RIRVGLDAEDMQRLAAPG--TAKVSRRDLPLVLAKGADGATTVAATMIAADLAGIAVFAT 124

Query: 167 GGIGGVHRHGEQTLDISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGT 226
           GGIGGVHR  E T DIS+DL EL  T VAV+ AG K+ILD+PRTLEYLET+GV V  +GT
Sbjct: 125 GGIGGVHRGVETTGDISADLEELATTSVAVVCAGAKAILDLPRTLEYLETRGVPVVGFGT 184

Query: 227 NEFPAFFTETSGCKAPCRVDTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGS 286
           + FPAF+   SG     R DTPEDAA+++                     NA ++LGL  
Sbjct: 185 DAFPAFYHRDSGLPVDGRCDTPEDAARVL---------------------NAKWRLGLAG 244

Query: 287 GILIAVPIPKEHSASGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASN 346
           GI++AVPIP E +   +  E A+Q A+ EA    + G A TPFLL R+  LTGGASL +N
Sbjct: 245 GIVVAVPIPDEAALDAAQAEAAVQQAVAEAATGGVRGKALTPFLLHRLETLTGGASLTAN 303

Query: 347 IALVKNNAIVGAGIAVALAKLR 369
            AL+ NNA VGA IAVA A+L+
Sbjct: 305 RALLLNNAAVGARIAVAYARLK 303
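
The Identity and Positives figures reported for each HSP are ratios over the alignment length. As a hedged sketch, assuming the usual protein BLAST convention that the middle line of an alignment shows a letter for an identical residue and `+` for a conservative substitution, they can be recomputed as follows; `percent` and `count_midline` are hypothetical helpers for illustration only.

```python
def percent(matches, aligned_length):
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * matches / aligned_length, 2)

def count_midline(midline):
    """Return (identities, positives) from a BLAST alignment midline.

    Assumption: letters mark identical residues and '+' marks
    conservative substitutions; positives include identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives

# Header values of the first Swiss-Prot hit (B6IRJ4).
print(percent(171, 322), percent(217, 322))

# Midline of the last alignment row of that hit, trimmed to the aligned columns.
print(count_midline(" AL+ NNA VGA IAVA A+L+"))
```

Summing the per-row counts over all rows of an HSP should give the numerators in its header, provided the rows are read with their original column alignment intact.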

BLAST of MC06g1006 vs. ExPASy Swiss-Prot
Match: Q8RCT3 (Pseudouridine-5'-phosphate glycosidase OS=Caldanaerobacter subterraneus subsp. tengcongensis (strain DSM 15242 / JCM 11007 / NBRC 100824 / MB4) OX=273068 GN=psuG PE=3 SV=1)

HSP 1 Score: 298.9 bits (764), Expect = 7.9e-80
Identity = 174/321 (54.21%), Positives = 214/321 (66.67%), Query Frame = 0

Query: 47  IKMSAEVSAALSRGHPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDG 106
           I +S EV +AL    PVVALESTIISHGMPYPQN+ETA+ +E IVR+NGAVPAT+AI+ G
Sbjct: 5   IDLSEEVKSALEERRPVVALESTIISHGMPYPQNIETARALEEIVRENGAVPATIAIIGG 64

Query: 107 TPCVGLNEEELERLSILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVT 166
              +GLNEEELE +      + K ++RD+  V+A   N ATTVSATM  A++ GI VFVT
Sbjct: 65  KIKIGLNEEELEFMG-TSKEILKASKRDLPVVLAKGLNAATTVSATMICANLAGIKVFVT 124

Query: 167 GGIGGVHRHGEQTLDISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGT 226
           GGIGGVHR  E+T DIS+DL EL  T VAV+ AG K+ILD+PRTLEYLET GV V  + T
Sbjct: 125 GGIGGVHRGAEETFDISADLQELANTNVAVVCAGAKAILDLPRTLEYLETFGVPVIGFRT 184

Query: 227 NEFPAFFTETSGCKAPCRVDTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGS 286
            EFPAF+T  SG K   RV+   +AAK+I                        + LGL  
Sbjct: 185 EEFPAFYTRESGLKVDYRVEDEVEAAKVI---------------------KTKWDLGLKG 244

Query: 287 GILIAVPIPKEHSASGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASN 346
           GILIA PIP+E++   +  E AI+ A+ EA  + I G A TPFLL+++ +LT G SL +N
Sbjct: 245 GILIANPIPEEYALDRAYIEKAIEEAIFEADRRGIRGKALTPFLLEKIKDLTEGKSLKAN 303

Query: 347 IALVKNNAIVGAGIAVALAKL 368
           I LVKNNA VGA IAV L KL
Sbjct: 305 IELVKNNARVGAKIAVQLNKL 303

BLAST of MC06g1006 vs. ExPASy Swiss-Prot
Match: B1HV79 (Pseudouridine-5'-phosphate glycosidase OS=Lysinibacillus sphaericus (strain C3-41) OX=444177 GN=psuG PE=3 SV=1)

HSP 1 Score: 297.4 bits (760), Expect = 2.3e-79
Identity = 169/321 (52.65%), Positives = 216/321 (67.29%), Query Frame = 0

Query: 47  IKMSAEVSAALSRGHPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDG 106
           I +S EV A  ++G P+VALESTIISHGMPYPQN++TA+EVE I+R NGAVPAT+A++DG
Sbjct: 5   IVLSEEVKAGQAKGLPIVALESTIISHGMPYPQNVQTAREVEQIIRDNGAVPATIALIDG 64

Query: 107 TPCVGLNEEELERLSILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVT 166
              +GL++EELE        V K +RRD+  ++A++  GATTV+ATM  A + GI +FVT
Sbjct: 65  KIKIGLSDEELEMFG-NAQGVAKASRRDLGYLLATKKLGATTVAATMICAELAGIEIFVT 124

Query: 167 GGIGGVHRHGEQTLDISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGT 226
           GGIGGVHR  E T+D+S+DL EL +T VAVI AG KSILDI  TLEYLET+GV V  YGT
Sbjct: 125 GGIGGVHRGAETTMDVSADLEELAQTNVAVICAGAKSILDIGLTLEYLETKGVPVVGYGT 184

Query: 227 NEFPAFFTETSGCKAPCRVDTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGS 286
           +E PAF+T  SG     ++DTPE+ A+++                     +A ++LGL  
Sbjct: 185 DELPAFYTRQSGFDVNFQLDTPEEIAEML---------------------SAKWQLGLKG 244

Query: 287 GILIAVPIPKEHSASGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASN 346
           G +IA PIP+  +       N I+ AL EA E  I G   TPFLL +V ELT G SL +N
Sbjct: 245 GAVIANPIPEAEALEHGFITNIIEKALVEAEENGIQGKNVTPFLLGKVKELTEGKSLDAN 303

Query: 347 IALVKNNAIVGAGIAVALAKL 368
           IALVKNNA+VGA IAVA  +L
Sbjct: 305 IALVKNNAVVGAKIAVAFNQL 303

BLAST of MC06g1006 vs. ExPASy Swiss-Prot
Match: C0ZIY1 (Pseudouridine-5'-phosphate glycosidase OS=Brevibacillus brevis (strain 47 / JCM 6285 / NBRC 100599) OX=358681 GN=psuG PE=3 SV=1)

HSP 1 Score: 297.0 bits (759), Expect = 3.0e-79
Identity = 170/320 (53.12%), Positives = 208/320 (65.00%), Query Frame = 0

Query: 47  IKMSAEVSAALSRGHPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDG 106
           +  + EV  AL    PVVALE+TIISHGMPYPQN+E AKEVE I+R NGAVPAT+ I+DG
Sbjct: 5   LTFTEEVRHALENNLPVVALETTIISHGMPYPQNIEMAKEVEQIIRDNGAVPATIGIMDG 64

Query: 107 TPCVGLNEEELERLSILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVT 166
              +GL + ELE  +     V K +RRD A ++AS   GATTV+ATM  A M GI +F T
Sbjct: 65  KIKIGLTDSELEEFA-TNKNVAKVSRRDFAYILASGKIGATTVAATMIAAEMAGIHMFAT 124

Query: 167 GGIGGVHRHGEQTLDISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGT 226
           GGIGGVHR GE T D+S+DLTEL +T VAV+ AG KSILDI RTLEYLETQGV V  Y T
Sbjct: 125 GGIGGVHREGEITWDVSADLTELAQTDVAVVCAGAKSILDIGRTLEYLETQGVPVVGYRT 184

Query: 227 NEFPAFFTETSGCKAPCRVDTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGS 286
           +EFP+FF   SG     R+DTPE+  K++                     N  + LGL  
Sbjct: 185 DEFPSFFARKSGFGVDMRIDTPEEVGKMM---------------------NTKWDLGLKG 244

Query: 287 GILIAVPIPKEHSASGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASN 346
           G++IA P+P+  + +    E  IQ AL EA+E NI G   TPF+L +V +LT G SLA+N
Sbjct: 245 GMIIANPVPESDALNHEEIEAVIQKALAEAKENNIAGKQVTPFMLDKVKKLTEGKSLATN 302

Query: 347 IALVKNNAIVGAGIAVALAK 367
           IALVK+NA V A IAVA  K
Sbjct: 305 IALVKHNAEVAAKIAVAYQK 302

BLAST of MC06g1006 vs. ExPASy Swiss-Prot
Match: Q1M4T3 (Pseudouridine-5'-phosphate glycosidase 2 OS=Rhizobium leguminosarum bv. viciae (strain 3841) OX=216596 GN=psuG2 PE=3 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 2.8e-77
Identity = 166/321 (51.71%), Positives = 212/321 (66.04%), Query Frame = 0

Query: 48  KMSAEVSAALSRGHPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGT 107
           ++S E++ A++ G PVVALESTII+HGMPYP NLETA  VE ++R+NGA+PAT+A++ G 
Sbjct: 7   RLSREMAEAIAAGSPVVALESTIITHGMPYPANLETALGVETVIRENGAIPATIAVVKGE 66

Query: 108 PCVGLNEEELERLSILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTG 167
             VGL  +ELE L+     + K + RD+A  +    +  TTVSATM  A + GI VF TG
Sbjct: 67  LRVGLEHDELEELA-QSKGIVKASGRDLAVAMIRGQSAGTTVSATMLMADLAGIDVFATG 126

Query: 168 GIGGVHRHGEQTLDISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTN 227
           G+GGVHR  EQT DIS+DLTELGRT  AV+ AGVKSILDI +TLEYLETQ V V AYGT 
Sbjct: 127 GVGGVHRGAEQTFDISADLTELGRTKTAVVCAGVKSILDIAKTLEYLETQRVPVIAYGTE 186

Query: 228 EFPAFFTETSGCKAPCRVDTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSG 287
           +FPAFFT  SG KA  R+DTPE+ AK +                       + +LG G+G
Sbjct: 187 DFPAFFTRRSGFKADHRLDTPEEIAKAMW---------------------LHHQLGTGTG 246

Query: 288 ILIAVPIPKEHSASGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNI 347
           +LIA PIP+  + +    +  I  A+++A E+ I     TPFLL R+NELT G SL +NI
Sbjct: 247 LLIANPIPEASALAPDFIDGTIADAVRDADERGIDRKELTPFLLARINELTKGESLKANI 305

Query: 348 ALVKNNAIVGAGIAVALAKLR 369
            LVKNNA + A IAVA A L+
Sbjct: 307 ELVKNNARLAARIAVAYAPLK 305

BLAST of MC06g1006 vs. NCBI nr
Match: XP_022154373.1 (uncharacterized protein LOC111021655 [Momordica charantia])

HSP 1 Score: 642 bits (1656), Expect = 5.06e-231
Identity = 347/369 (94.04%), Positives = 348/369 (94.31%), Query Frame = 0

Query: 1   MASYPSSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRG 60
           MASYPSSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRG
Sbjct: 1   MASYPSSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRG 60

Query: 61  HPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERL 120
           HPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERL
Sbjct: 61  HPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERL 120

Query: 121 SILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTL 180
           SILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTL
Sbjct: 121 SILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTL 180

Query: 181 DISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCK 240
           DISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCK
Sbjct: 181 DISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCK 240

Query: 241 APCRVDTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSA 300
           APCRVDTPEDAAKLI                     +ANFKLGLGSGILIAVPIPKEHSA
Sbjct: 241 APCRVDTPEDAAKLI---------------------DANFKLGLGSGILIAVPIPKEHSA 300

Query: 301 SGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGI 360
           SGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGI
Sbjct: 301 SGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGI 348

Query: 361 AVALAKLRV 369
           AVALAKLRV
Sbjct: 361 AVALAKLRV 348

BLAST of MC06g1006 vs. NCBI nr
Match: XP_038878447.1 (pseudouridine-5'-phosphate glycosidase [Benincasa hispida] >XP_038878448.1 pseudouridine-5'-phosphate glycosidase [Benincasa hispida] >XP_038878450.1 pseudouridine-5'-phosphate glycosidase [Benincasa hispida])

HSP 1 Score: 582 bits (1501), Expect = 4.30e-207
Identity = 319/369 (86.45%), Positives = 332/369 (89.97%), Query Frame = 0

Query: 1   MASYPSSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRG 60
           MAS  SSSSSSALSRISNLSRHFH  DS+ +DNPQGAIDA  G GLIK+S+EVSAA+SRG
Sbjct: 21  MASSSSSSSSSALSRISNLSRHFHSPDSKTSDNPQGAIDAL-GRGLIKISSEVSAAISRG 80

Query: 61  HPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERL 120
           HPVVALESTIISHGMPYPQNLETAKEVEA+VRKNGAVPATVAI+DGTPCVGLNEEELERL
Sbjct: 81  HPVVALESTIISHGMPYPQNLETAKEVEAMVRKNGAVPATVAIIDGTPCVGLNEEELERL 140

Query: 121 SILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTL 180
           SILG++VQKTARRDIAQVVASRGNGATTVSATMFFAS VGI VFVTGGIGGVHRHGEQT+
Sbjct: 141 SILGNQVQKTARRDIAQVVASRGNGATTVSATMFFASKVGIPVFVTGGIGGVHRHGEQTM 200

Query: 181 DISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCK 240
           DISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAY TNEFPAFFTETSGCK
Sbjct: 201 DISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYRTNEFPAFFTETSGCK 260

Query: 241 APCRVDTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSA 300
           APCRVDTPEDAAKLI                     +AN  LGLGSGILIAVPIP EHSA
Sbjct: 261 APCRVDTPEDAAKLI---------------------DANMNLGLGSGILIAVPIPNEHSA 320

Query: 301 SGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGI 360
           SGSL ENAIQSALQEAREKNI+GNAETPFLLKRVNELTGGASLASNIALVKNNA+VGA I
Sbjct: 321 SGSLIENAIQSALQEAREKNIVGNAETPFLLKRVNELTGGASLASNIALVKNNALVGARI 367

Query: 361 AVALAKLRV 369
           AVALAKLRV
Sbjct: 381 AVALAKLRV 367

BLAST of MC06g1006 vs. NCBI nr
Match: XP_023515450.1 (uncharacterized protein LOC111779605 [Cucurbita pepo subsp. pepo] >XP_023515451.1 uncharacterized protein LOC111779605 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 573 bits (1478), Expect = 6.10e-204
Identity = 314/364 (86.26%), Positives = 329/364 (90.38%), Query Frame = 0

Query: 6   SSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRGHPVVA 65
           SSSSSSALSRISNLSRHFHPSDS+++DNPQ AIDA   G LIK+S EVSAA+SRGHPVVA
Sbjct: 4   SSSSSSALSRISNLSRHFHPSDSKSSDNPQAAIDAPFRG-LIKISTEVSAAMSRGHPVVA 63

Query: 66  LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGH 125
           LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILG+
Sbjct: 64  LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGN 123

Query: 126 RVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTLDISSD 185
           RV+KTARRDIAQVVAS+GNGATTVSATMFFAS VGI VFVTGGIGGVHRHGE+T+DISSD
Sbjct: 124 RVRKTARRDIAQVVASKGNGATTVSATMFFASRVGIPVFVTGGIGGVHRHGEKTMDISSD 183

Query: 186 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCKAPCRV 245
           LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAY T+EFPAFFTETSGCKAPCRV
Sbjct: 184 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYRTDEFPAFFTETSGCKAPCRV 243

Query: 246 DTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSASGSLT 305
           DTPEDAAKLI                     +AN  LGLGSGILI+VPIPKEHSASGSL 
Sbjct: 244 DTPEDAAKLI---------------------DANLNLGLGSGILISVPIPKEHSASGSLI 303

Query: 306 ENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGIAVALA 365
           ENAIQSALQEAREKNI+GNAETPFLLKRVNELT GASLASNIALVKNNA+VGA IAVALA
Sbjct: 304 ENAIQSALQEAREKNIVGNAETPFLLKRVNELTQGASLASNIALVKNNALVGARIAVALA 345

Query: 366 KLRV 369
           KLRV
Sbjct: 364 KLRV 345

BLAST of MC06g1006 vs. NCBI nr
Match: XP_022988167.1 (uncharacterized protein LOC111485484 [Cucurbita maxima])

HSP 1 Score: 572 bits (1473), Expect = 3.39e-203
Identity = 314/364 (86.26%), Positives = 328/364 (90.11%), Query Frame = 0

Query: 6   SSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRGHPVVA 65
           SSSSSSALSRISNLSRHFHPSDS+++DNPQ AIDA   G LIK+S EVSAA+SRGHPVVA
Sbjct: 3   SSSSSSALSRISNLSRHFHPSDSKSSDNPQAAIDAPFRG-LIKISTEVSAAMSRGHPVVA 62

Query: 66  LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGH 125
           LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILG 
Sbjct: 63  LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGT 122

Query: 126 RVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTLDISSD 185
           RVQKTARRDIAQVVAS+GNGATTVSATMFFAS VGI VFVTGGIGGVHRHGE+T+DISSD
Sbjct: 123 RVQKTARRDIAQVVASKGNGATTVSATMFFASRVGIPVFVTGGIGGVHRHGEKTMDISSD 182

Query: 186 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCKAPCRV 245
           LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAY T+EFPAFFTETSGCKAPCRV
Sbjct: 183 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYRTDEFPAFFTETSGCKAPCRV 242

Query: 246 DTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSASGSLT 305
           DTPE+AAKLI                     +A+  LGLGSGILIAVPIPKEHSASGSL 
Sbjct: 243 DTPEEAAKLI---------------------DADMNLGLGSGILIAVPIPKEHSASGSLI 302

Query: 306 ENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGIAVALA 365
           ENAIQSALQEAREKNI+GNAETPFLLKRVNELT GASLASNIALVKNNA+VGA IAVALA
Sbjct: 303 ENAIQSALQEAREKNIVGNAETPFLLKRVNELTQGASLASNIALVKNNALVGARIAVALA 344

Query: 366 KLRV 369
           KLRV
Sbjct: 363 KLRV 344

BLAST of MC06g1006 vs. NCBI nr
Match: XP_022960638.1 (uncharacterized protein LOC111461367 [Cucurbita moschata])

HSP 1 Score: 572 bits (1473), Expect = 3.65e-203
Identity = 313/364 (85.99%), Positives = 327/364 (89.84%), Query Frame = 0

Query: 6   SSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRGHPVVA 65
           SSSSSSALSRISNLSRHFHPSDS+++DNPQGAIDA   G LIK+S EVSAA+SRGHPVVA
Sbjct: 5   SSSSSSALSRISNLSRHFHPSDSKSSDNPQGAIDAPFRG-LIKISTEVSAAMSRGHPVVA 64

Query: 66  LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGH 125
           LESTIISHGMPYPQNLETAKEVEAIVRKNGA PATVAILDGTPCVGLNEEELERLSILG 
Sbjct: 65  LESTIISHGMPYPQNLETAKEVEAIVRKNGAFPATVAILDGTPCVGLNEEELERLSILGT 124

Query: 126 RVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTLDISSD 185
           RV+KTARRDIAQVVA +GNGATTVSATMFFAS VGI VFVTGGIGGVHRHGE+T+DISSD
Sbjct: 125 RVRKTARRDIAQVVAGKGNGATTVSATMFFASRVGIPVFVTGGIGGVHRHGEKTMDISSD 184

Query: 186 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCKAPCRV 245
           LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAY TNEFPAFFTETSGCKAPCRV
Sbjct: 185 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYRTNEFPAFFTETSGCKAPCRV 244

Query: 246 DTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSASGSLT 305
           DTPE+AAKLI                     +AN  LGLGSGILI+VPIPKEHSASGSL 
Sbjct: 245 DTPEEAAKLI---------------------DANMNLGLGSGILISVPIPKEHSASGSLI 304

Query: 306 ENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGIAVALA 365
           ENAIQSALQEAREKNI+GNAETPFLLKRVNELT GASLASNIALVKNNA+VGA IAVALA
Sbjct: 305 ENAIQSALQEAREKNIVGNAETPFLLKRVNELTQGASLASNIALVKNNALVGARIAVALA 346

Query: 366 KLRV 369
           KLRV
Sbjct: 365 KLRV 346

BLAST of MC06g1006 vs. ExPASy TrEMBL
Match: A0A6J1DK54 (uncharacterized protein LOC111021655 OS=Momordica charantia OX=3673 GN=LOC111021655 PE=3 SV=1)

HSP 1 Score: 642 bits (1656), Expect = 2.45e-231
Identity = 347/369 (94.04%), Positives = 348/369 (94.31%), Query Frame = 0

Query: 1   MASYPSSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRG 60
           MASYPSSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRG
Sbjct: 1   MASYPSSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRG 60

Query: 61  HPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERL 120
           HPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERL
Sbjct: 61  HPVVALESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERL 120

Query: 121 SILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTL 180
           SILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTL
Sbjct: 121 SILGHRVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTL 180

Query: 181 DISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCK 240
           DISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCK
Sbjct: 181 DISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCK 240

Query: 241 APCRVDTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSA 300
           APCRVDTPEDAAKLI                     +ANFKLGLGSGILIAVPIPKEHSA
Sbjct: 241 APCRVDTPEDAAKLI---------------------DANFKLGLGSGILIAVPIPKEHSA 300

Query: 301 SGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGI 360
           SGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGI
Sbjct: 301 SGSLTENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGI 348

Query: 361 AVALAKLRV 369
           AVALAKLRV
Sbjct: 361 AVALAKLRV 348

BLAST of MC06g1006 vs. ExPASy TrEMBL
Match: A0A6J1JCB2 (uncharacterized protein LOC111485484 OS=Cucurbita maxima OX=3661 GN=LOC111485484 PE=3 SV=1)

HSP 1 Score: 572 bits (1473), Expect = 1.64e-203
Identity = 314/364 (86.26%), Positives = 328/364 (90.11%), Query Frame = 0

Query: 6   SSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRGHPVVA 65
           SSSSSSALSRISNLSRHFHPSDS+++DNPQ AIDA   G LIK+S EVSAA+SRGHPVVA
Sbjct: 3   SSSSSSALSRISNLSRHFHPSDSKSSDNPQAAIDAPFRG-LIKISTEVSAAMSRGHPVVA 62

Query: 66  LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGH 125
           LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILG 
Sbjct: 63  LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGT 122

Query: 126 RVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTLDISSD 185
           RVQKTARRDIAQVVAS+GNGATTVSATMFFAS VGI VFVTGGIGGVHRHGE+T+DISSD
Sbjct: 123 RVQKTARRDIAQVVASKGNGATTVSATMFFASRVGIPVFVTGGIGGVHRHGEKTMDISSD 182

Query: 186 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCKAPCRV 245
           LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAY T+EFPAFFTETSGCKAPCRV
Sbjct: 183 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYRTDEFPAFFTETSGCKAPCRV 242

Query: 246 DTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSASGSLT 305
           DTPE+AAKLI                     +A+  LGLGSGILIAVPIPKEHSASGSL 
Sbjct: 243 DTPEEAAKLI---------------------DADMNLGLGSGILIAVPIPKEHSASGSLI 302

Query: 306 ENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGIAVALA 365
           ENAIQSALQEAREKNI+GNAETPFLLKRVNELT GASLASNIALVKNNA+VGA IAVALA
Sbjct: 303 ENAIQSALQEAREKNIVGNAETPFLLKRVNELTQGASLASNIALVKNNALVGARIAVALA 344

Query: 366 KLRV 369
           KLRV
Sbjct: 363 KLRV 344

BLAST of MC06g1006 vs. ExPASy TrEMBL
Match: A0A6J1H9J4 (uncharacterized protein LOC111461367 OS=Cucurbita moschata OX=3662 GN=LOC111461367 PE=3 SV=1)

HSP 1 Score: 572 bits (1473), Expect = 1.77e-203
Identity = 313/364 (85.99%), Positives = 327/364 (89.84%), Query Frame = 0

Query: 6   SSSSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRGHPVVA 65
           SSSSSSALSRISNLSRHFHPSDS+++DNPQGAIDA   G LIK+S EVSAA+SRGHPVVA
Sbjct: 5   SSSSSSALSRISNLSRHFHPSDSKSSDNPQGAIDAPFRG-LIKISTEVSAAMSRGHPVVA 64

Query: 66  LESTIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGH 125
           LESTIISHGMPYPQNLETAKEVEAIVRKNGA PATVAILDGTPCVGLNEEELERLSILG 
Sbjct: 65  LESTIISHGMPYPQNLETAKEVEAIVRKNGAFPATVAILDGTPCVGLNEEELERLSILGT 124

Query: 126 RVQKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTLDISSD 185
           RV+KTARRDIAQVVA +GNGATTVSATMFFAS VGI VFVTGGIGGVHRHGE+T+DISSD
Sbjct: 125 RVRKTARRDIAQVVAGKGNGATTVSATMFFASRVGIPVFVTGGIGGVHRHGEKTMDISSD 184

Query: 186 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCKAPCRV 245
           LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAY TNEFPAFFTETSGCKAPCRV
Sbjct: 185 LTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYRTNEFPAFFTETSGCKAPCRV 244

Query: 246 DTPEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSASGSLT 305
           DTPE+AAKLI                     +AN  LGLGSGILI+VPIPKEHSASGSL 
Sbjct: 245 DTPEEAAKLI---------------------DANMNLGLGSGILISVPIPKEHSASGSLI 304

Query: 306 ENAIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGIAVALA 365
           ENAIQSALQEAREKNI+GNAETPFLLKRVNELT GASLASNIALVKNNA+VGA IAVALA
Sbjct: 305 ENAIQSALQEAREKNIVGNAETPFLLKRVNELTQGASLASNIALVKNNALVGARIAVALA 346

Query: 366 KLRV 369
           KLRV
Sbjct: 365 KLRV 346

BLAST of MC06g1006 vs. ExPASy TrEMBL
Match: A0A1S3BNH4 (pseudouridine-5'-phosphate glycosidase isoform X2 OS=Cucumis melo OX=3656 GN=LOC103492003 PE=3 SV=1)

HSP 1 Score: 568 bits (1463), Expect = 5.08e-202
Identity = 310/362 (85.64%), Positives = 324/362 (89.50%), Query Frame = 0

Query: 8   SSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRGHPVVALE 67
           SSSSALSRISNLSRHFH  +S+ +D+PQGAIDA  GGGLIK+S+EVSAA+SRGHPVVALE
Sbjct: 4   SSSSALSRISNLSRHFHSPNSKTSDDPQGAIDAP-GGGLIKISSEVSAAISRGHPVVALE 62

Query: 68  STIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGHRV 127
           STIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILG+RV
Sbjct: 63  STIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGNRV 122

Query: 128 QKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTLDISSDLT 187
           QKTARRDIAQVVAS GNGATTVSATMFFAS VGI VFVTGGIGGVHRHGEQT+DISSDLT
Sbjct: 123 QKTARRDIAQVVASGGNGATTVSATMFFASKVGIPVFVTGGIGGVHRHGEQTMDISSDLT 182

Query: 188 ELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCKAPCRVDT 247
           ELGRTPVAVISAG+KSILDIPRTLEYLETQGVCVAAY TNEFPAFFTETSGCKAPCRVDT
Sbjct: 183 ELGRTPVAVISAGIKSILDIPRTLEYLETQGVCVAAYRTNEFPAFFTETSGCKAPCRVDT 242

Query: 248 PEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSASGSLTEN 307
           PE+AAKLI                      AN  L LGSGILIAVPIP EHSASGSL E 
Sbjct: 243 PEEAAKLI----------------------ANMNLELGSGILIAVPIPNEHSASGSLIEK 280

Query: 308 AIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGIAVALAKL 367
           AIQ+ALQEAREKNI+GNAETPFLLKRVNELTGGASLASNIALVKNNA+VGA IAVALAKL
Sbjct: 281 AIQTALQEAREKNIVGNAETPFLLKRVNELTGGASLASNIALVKNNALVGAKIAVALAKL 340

Query: 368 RV 369
           RV
Sbjct: 341 RV 342
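
The percentages in an HSP header can be checked against the raw counts it reports. The following is a minimal Python sketch, assuming the header text has the `label = n/m (p%)` shape seen above; the function name `percentages` is illustrative and not part of any BLAST tool.

```python
import re

# Header text as reported for the A0A1S3BNH4 hit (counts taken from the record above).
HEADER = "Identity = 310/362 (85.64%), Positives = 324/362 (89.50%)"

def percentages(header):
    """Recompute each percentage from its 'label = n/m' pair, rounded to 2 decimals."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", header)
    return {label: round(100 * int(n) / int(m), 2) for label, n, m in pairs}

if __name__ == "__main__":
    for label, pct in percentages(HEADER).items():
        print(f"{label}: {pct:.2f}%")
```

Both recomputed values agree with the percentages printed in the header, which suggests the identity and positives figures are taken over the full alignment length of 362 columns, gaps included.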

BLAST of MC06g1006 vs. ExPASy TrEMBL
Match: A0A1S3BQ44 (pseudouridine-5'-phosphate glycosidase isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492003 PE=3 SV=1)

HSP 1 Score: 567 bits (1461), Expect = 1.06e-201
Identity = 310/362 (85.64%), Positives = 324/362 (89.50%), Query Frame = 0

Query: 8   SSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRGHPVVALE 67
           SSSSALSRISNLSRHFH  +S+ +D+PQGAIDA  GGGLIK+S+EVSAA+SRGHPVVALE
Sbjct: 4   SSSSALSRISNLSRHFHSPNSKTSDDPQGAIDAP-GGGLIKISSEVSAAISRGHPVVALE 62

Query: 68  STIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGHRV 127
           STIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILG+RV
Sbjct: 63  STIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGNRV 122

Query: 128 QKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTLDISSDLT 187
           QKTARRDIAQVVAS GNGATTVSATMFFAS VGI VFVTGGIGGVHRHGEQT+DISSDLT
Sbjct: 123 QKTARRDIAQVVASGGNGATTVSATMFFASKVGIPVFVTGGIGGVHRHGEQTMDISSDLT 182

Query: 188 ELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCKAPCRVDT 247
           ELGRTPVAVISAG+KSILDIPRTLEYLETQGVCVAAY TNEFPAFFTETSGCKAPCRVDT
Sbjct: 183 ELGRTPVAVISAGIKSILDIPRTLEYLETQGVCVAAYRTNEFPAFFTETSGCKAPCRVDT 242

Query: 248 PEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSASGSLTEN 307
           PE+AAKLI                      AN  L LGSGILIAVPIP EHSASGSL E 
Sbjct: 243 PEEAAKLIV---------------------ANMNLELGSGILIAVPIPNEHSASGSLIEK 281

Query: 308 AIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGIAVALAKL 367
           AIQ+ALQEAREKNI+GNAETPFLLKRVNELTGGASLASNIALVKNNA+VGA IAVALAKL
Sbjct: 282 AIQTALQEAREKNIVGNAETPFLLKRVNELTGGASLASNIALVKNNALVGAKIAVALAKL 341

Query: 368 RV 369
           RV
Sbjct: 342 RV 343

BLAST of MC06g1006 vs. TAIR 10
Match: AT1G50510.1 (indigoidine synthase A family protein )

HSP 1 Score: 435.6 bits (1119), Expect = 3.8e-122
Identity = 239/360 (66.39%), Positives = 280/360 (77.78%), Query Frame = 0

Query: 8   SSSSALSRISNLSRHFHPSDSRNTDNPQGAIDASNGGGLIKMSAEVSAALSRGHPVVALE 67
           +SS A SRISNL  H  P ++ N               L+K+S +VS ALS G  VVALE
Sbjct: 2   ASSLAQSRISNLQNHLSPLEANNKLR-----------SLVKISPQVSEALSNGRAVVALE 50

Query: 68  STIISHGMPYPQNLETAKEVEAIVRKNGAVPATVAILDGTPCVGLNEEELERLSILGHRV 127
           STIISHGMPYPQNL+TAKEVE+IVR+NGA+PAT+AIL+G PC+GL+EEELERL+ LG  V
Sbjct: 51  STIISHGMPYPQNLQTAKEVESIVRENGAIPATIAILNGVPCIGLSEEELERLASLGKSV 110

Query: 128 QKTARRDIAQVVASRGNGATTVSATMFFASMVGISVFVTGGIGGVHRHGEQTLDISSDLT 187
           QKTA RDIA VVA+RGNGATTVSAT+FFASMVGI VFVTGGIGGVHRH   ++DISSDLT
Sbjct: 111 QKTAGRDIANVVATRGNGATTVSATLFFASMVGIQVFVTGGIGGVHRHANHSMDISSDLT 170

Query: 188 ELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYGTNEFPAFFTETSGCKAPCRVDT 247
            LGRTP+AVISAGVKSILDIP+TLEYLETQ V VAAY ++EFPAFFTE SGCKAP RV++
Sbjct: 171 ALGRTPIAVISAGVKSILDIPKTLEYLETQEVYVAAYKSDEFPAFFTEKSGCKAPSRVNS 230

Query: 248 PEDAAKLIGKSLQSFQPDLIIDFLNISSHNANFKLGLGSGILIAVPIPKEHSASGSLTEN 307
           PED A++I                     +AN KL   +GIL A+PIPK HSA+G+L E+
Sbjct: 231 PEDCARVI---------------------DANMKLNRQAGILFAIPIPKHHSAAGNLIES 269

Query: 308 AIQSALQEAREKNILGNAETPFLLKRVNELTGGASLASNIALVKNNAIVGAGIAVALAKL 367
           A Q AL EARE+N+ GNAETPFLL RVNELTGG SLA+NIALVKNNA++G+ IAVAL++L
Sbjct: 270 ATQRALTEAREQNVTGNAETPFLLARVNELTGGTSLAANIALVKNNALIGSQIAVALSQL 329
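
Gap characters (`-`) in a subject line do not consume subject residues, so the end coordinate of an HSP follows from the start coordinate plus the number of non-gap characters. The Python sketch below applies this to the subject lines of the AT1G50510.1 alignment above; the names `TAIR_SBJCT_BLOCKS` and `subject_end` are hypothetical helpers introduced only for this illustration.

```python
# Subject lines of the AT1G50510.1 HSP, copied from the alignment above.
TAIR_SBJCT_BLOCKS = [
    "ASSLAQSRISNLQNHLSPLEANNKLR-----------SLVKISPQVSEALSNGRAVVALE",
    "STIISHGMPYPQNLQTAKEVESIVRENGAIPATIAILNGVPCIGLSEEELERLASLGKSV",
    "QKTAGRDIANVVATRGNGATTVSATLFFASMVGIQVFVTGGIGGVHRHANHSMDISSDLT",
    "ALGRTPIAVISAGVKSILDIPKTLEYLETQEVYVAAYKSDEFPAFFTEKSGCKAPSRVNS",
    "PEDCARVI---------------------DANMKLNRQAGILFAIPIPKHHSAAGNLIES",
    "ATQRALTEAREQNVTGNAETPFLLARVNELTGGTSLAANIALVKNNALIGSQIAVALSQL",
]

def subject_end(start, blocks, gap="-"):
    """Return the 1-based end coordinate of an HSP that begins at `start`.

    Only non-gap characters advance the coordinate.
    """
    residues = sum(len(block) - block.count(gap) for block in blocks)
    return start + residues - 1

if __name__ == "__main__":
    # The HSP starts at subject position 2.
    print(subject_end(2, TAIR_SBJCT_BLOCKS))
```

The result agrees with the final subject coordinate printed for this hit (329). The same count can be applied block by block to check intermediate coordinates.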

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
B6IRJ4          1.2e-80    53.11     Pseudouridine-5'-phosphate glycosidase OS=Rhodospirillum centenum (strain ATCC 5...
Q8RCT3          7.9e-80    54.21     Pseudouridine-5'-phosphate glycosidase OS=Caldanaerobacter subterraneus subsp. t...
B1HV79          2.3e-79    52.65     Pseudouridine-5'-phosphate glycosidase OS=Lysinibacillus sphaericus (strain C3-4...
C0ZIY1          3.0e-79    53.13     Pseudouridine-5'-phosphate glycosidase OS=Brevibacillus brevis (strain 47 / JCM ...
Q1M4T3          2.8e-77    51.71     Pseudouridine-5'-phosphate glycosidase 2 OS=Rhizobium leguminosarum bv. viciae (...

Match Name      E-value    Identity  Description
XP_022154373.1  5.06e-231  94.04     uncharacterized protein LOC111021655 [Momordica charantia]
XP_038878447.1  4.30e-207  86.45     pseudouridine-5'-phosphate glycosidase [Benincasa hispida] >XP_038878448.1 pseud...
XP_023515450.1  6.10e-204  86.26     uncharacterized protein LOC111779605 [Cucurbita pepo subsp. pepo] >XP_023515451....
XP_022988167.1  3.39e-203  86.26     uncharacterized protein LOC111485484 [Cucurbita maxima]
XP_022960638.1  3.65e-203  85.99     uncharacterized protein LOC111461367 [Cucurbita moschata]

Match Name      E-value    Identity  Description
A0A6J1DK54      2.45e-231  94.04     uncharacterized protein LOC111021655 OS=Momordica charantia OX=3673 GN=LOC111021...
A0A6J1JCB2      1.64e-203  86.26     uncharacterized protein LOC111485484 OS=Cucurbita maxima OX=3661 GN=LOC111485484...
A0A6J1H9J4      1.77e-203  85.99     uncharacterized protein LOC111461367 OS=Cucurbita moschata OX=3662 GN=LOC1114613...
A0A1S3BNH4      5.08e-202  85.64     pseudouridine-5'-phosphate glycosidase isoform X2 OS=Cucumis melo OX=3656 GN=LOC...
A0A1S3BQ44      1.06e-201  85.64     pseudouridine-5'-phosphate glycosidase isoform X1 OS=Cucumis melo OX=3656 GN=LOC...

Match Name      E-value    Identity  Description
AT1G50510.1     3.8e-122   66.39     indigoidine synthase A family protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                         Source       Source Term    Source Description           Alignment
IPR007342  Pseudouridine-5'-phosphate glycosidase  PFAM         PF04227        Indigoidine_A                coord: 277..363; e-value: 2.0E-25; score: 89.7
IPR007342  Pseudouridine-5'-phosphate glycosidase  PFAM         PF04227        Indigoidine_A                coord: 52..258; e-value: 5.8E-100; score: 334.4
IPR007342  Pseudouridine-5'-phosphate glycosidase  PANTHER      PTHR42909      ZGC:136858                   coord: 276..367
IPR007342  Pseudouridine-5'-phosphate glycosidase  PANTHER      PTHR42909      ZGC:136858                   coord: 7..254
IPR007342  Pseudouridine-5'-phosphate glycosidase  HAMAP        MF_01876       PsiMP_glycosidase            coord: 51..366; score: 40.233646
IPR022830  Indigoidine synthase A-like             GENE3D       3.40.1790.10   Indigoidine synthase domain  coord: 44..369; e-value: 3.2E-146; score: 488.2
IPR022830  Indigoidine synthase A-like             SUPERFAMILY  110581         Indigoidine synthase A-like  coord: 46..364
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction          coord: 21..43
None       No IPR available                        MOBIDB_LITE  mobidb-lite    disorder_prediction          coord: 21..38
None       No IPR available                        PANTHER      PTHR42909:SF3  BNAA06G02750D PROTEIN        coord: 276..367
None       No IPR available                        PANTHER      PTHR42909:SF3  BNAA06G02750D PROTEIN        coord: 7..254
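
The `coord: start..end` values in the InterPro table are inclusive residue ranges on the protein, so the length of each match is `end - start + 1`. A minimal Python sketch of that arithmetic follows, assuming the `coord:` text format shown above; `parse_coord` and `span_length` are illustrative names, not part of InterProScan.

```python
import re

def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 52..258'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coord field in {text!r}")
    return int(match.group(1)), int(match.group(2))

def span_length(text):
    """Number of residues covered by an inclusive coord range."""
    start, end = parse_coord(text)
    return end - start + 1

if __name__ == "__main__":
    # The two PF04227 (Indigoidine_A) matches listed in the table.
    for field in ("coord: 52..258", "coord: 277..363"):
        print(field, "->", span_length(field), "residues")
```

For the two Pfam matches this gives 207 and 87 residues, leaving residues 259 to 276 of the 369-residue query outside both matches.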

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MC06g1006.1   MC06g1006.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0001522      pseudouridine synthesis
molecular_function  GO:0016798      hydrolase activity, acting on glycosyl bonds
molecular_function  GO:0046872      metal ion binding
molecular_function  GO:0004730      pseudouridylate synthase activity