CSPI02G23720 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI02G23720
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: hydroxyproline O-arabinosyltransferase 1-like
Location: Chr2: 20377276 .. 20380199 (-)
RNA-Seq Expression: CSPI02G23720
Synteny: CSPI02G23720
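
The Location value in the overview packs chromosome, start, end, and strand into one string. The sketch below shows one way to split it apart and derive the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

# Pattern for strings such as "Chr2: 20377276 .. 20380199 (-)".
LOCATION_RE = re.compile(r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into chromosome, start, end, strand and length."""
    m = LOCATION_RE.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }

loc = parse_location("Chr2: 20377276 .. 20380199 (-)")
print(loc)
```

The "(-)" strand means the gene is read from the reverse strand of the chromosome assembly.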
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCCAGCCGCCGCCCGCTTAAAGATCCAAACTTTCATCTTATGAGTTATTATTATATAATTCCACCAGAAAAAACTAATCCAACTCTCCATTTTGGTCAGCAAATTTCTTCAATAAACTCTCTTCTAGGCCTAGGCGCGACGACTTTTCTCAAACTTGAAGTTCCTACCTTTATTCTGTCTTCGCTTCCAAGGGGAGTTTGATGGGCAAATTCATTTGAGCTGAATTCCAGCAAGTTGGAGACGAATCCTAAAATGGGTTGTGGGAATTTGTTCTTCTTGGTTCTGGTAACCTTCTCGGTAGCTCTAATTACTTACAACATCATTCTCTCTGCTAATGCCCCTCTCAAGCAAGAACTTCCTGGTCCATCAAGATCTTCATCTTCCATTACTGTGGACCCTGTGATTAAGATGCCCCTGGATAGATCAGAAACTTCTTCCTCCAAACGACTCTTCCATACGGCAGTTACTGCGTCGGATTCGGTGTACAATACTTGGCAGTGTCGGATCATGTACTACTGGTTCAAGAAGTTCAAGGACGGACCTAATTCTGAAATGGGTGGCTTCACCAGGATCTTGCATTCAGGGAAGCCCGACAAGTATATGGATGAGATCCCCACCTTCGTTGCTCAGCCTCTGCCAGCGGGAATGGATCGGGTACTAATTTATTTGCTTGATTTCTATTATTTACGGGGTTGACTTTTGAGCGTGGCCTTGTTTTTGCTTCTGGTGTTCTATTATAAGCTTTGAGAATCTCAGGGGAAGTGTTCATCTGTGACTTTGTTATGTGCTATCTATGGTTTTATTTATGCTCTACACCTGTTTGAAGGAATTCGCCTTAGGAAAGCAAGGGGTCGCAGTGAATTTCTATTATGGCTAACTCCTTGGCTAATGAAATGACGCCCAATGGGCGACGTCTCTAATTGAATTAAGAATTTTCTATTTTGCTTTTATTACCCTCTCTGGGTTTCATTTTCATGGAGCTGAAATTGCCTTCTATTAGCTGCTCATCTGTGGTTTGAAACTGTTACAGGGCTACATTGTCCTCAACAGACCGTGGGCATTTGTGCAATGGCTTCAACAAGCGGACATCAAGGAAGAGTAGGGGGCGCTTTCCTCTGTTCTAATTAACTGATAGATCAATAATGTTTTTACTTTGAGTTTACCCTGAATGTTTTTTTATCTAAAATTATGTTTTACAGTTATATACTCATGTCAGAGCCGGATCATATTATTGTCAAGCCTATACCAAATTTGTCTAAGGATGGGCTTGGGGCAGCGTTTCCATTCTTTTACATCGAACCCAAGAAGTATGAGAGCCAACTACGGAAGTTCTTTCCTGAGGACAAGGGTCCTATTACCAACATTGATCCAATAGGGAATTCACCTGTTATTGTTGGAAAGGTATGCAATATCATGATTCTAAGTTATTACATTTAGCATTCTTCTCTCCTTTCTCCATTCTGTCTTACTTTTCTCTTCTCTTTTGAATAAATTTTGCTAGGAGTCTCTCAAGAAAATTGCTCCCACTTGGATGAATGTTTCTTTGGCAATGAAAAAGGATCCTGAGACAGATAAGGCTTTTGGTTGGGTTCTTGAAATGTAAGTTTTTCTTCTTTGAGGTTTCCTTTTCATCTTTCTCACTTCGTATGGTCTTTTTACTTCTTTATTTTCTTTGCTTCCCATGTTTCTTCTCCTTGTATTTTAACTGTAGGTATGCTTATGCTGTCGCTTCTGCTCTACATGATGTTGGTAACATCTTGTATAAGGACTTCATGATTCAGGTTCTTAGAGAAATATTTCTGTTTCTTTATCTTTTTGGTTGGATTATCATTCGCTATGCAGCATAGCATTTTTACCGATTCTCATTGCTGTTTGATATACTTCCTCCACGAAACATAATCAGCTTGAATTGGCTTCACAATTTGAAATCCTTCTGAAAACTTTTATGTTTACTTTCTACCAAACCACATGAAATTTTATCCAATTCCGTTGGCTGATAG
GATGTGATGTAGGAAACACGTGATCTTAGGAAATGAGAGGATTTGACCCTTCATTCTCAAACTTGATCAATATTCTGATTTTATATATTAAGTAACTTATTGTACGTTGACACTTTACAGCCCCCATGGGACACAGAAGTGGGCAAGAAGTTCATAATCCATTACACTTATGGTTGTGACTATGATATGAAGGTATCAAACAAAGGTTGATTATGTGTATACCGATCTCTCTCCTCTATTATACATCATAGCTTATCTTTGTCTATGGCTACTTGTTATACCTTGGTATACAGGGAAAACTGACTTACGGTAAGATTGGAGAATGGAGGTTTGACAAGAGATCATATGATAATGTTGTTCCTCCCAGGAACCTTCCCCTACCACCACCCGGCGTCCCTGAAAGCGTGGTATATGTCCTATAACCCTTGCATGTGGATGATGTGGAATGCTTTTTATCCTTTGCTTGTCGATATGTTTCTACTATTATTAACTGATCAAGTAGTTTGTAATGAATTTGATAAATTTTAATATTCTTGTGATGCCTGAGACAACATATTCCATATATGCATTTTCAGGTGACACTAGTGAAAATGGTTAATGAAGCCACAGCAAACATTCCTAATTGGGGATCTTAAGAAAGTCTGTTACGTGTACTTTTGAAAGGCGTGGAGACCGGTACTTAATTTAATTCTTACACAATATAATTACCTTTTTGAATATGTATTGATTAGCTTGTATCTGATTGGCAAAGGCATGGGCAAATGTATTAAATGTGTAAAACGTTGTTTCGTTTTAAGATAAAAATAAATAGCTAGCGTTCTCTTCAATCATCTCATGTATATTAGAAAGCTAACACCTATGTCAATACATCACGAACGTTGAATATTTGTTCCAAATGCAATGGCAAAACCCATTAATCGAGCA

mRNA sequence

CCCAGCCGCCGCCCGCTTAAAGATCCAAACTTTCATCTTATGAGTTATTATTATATAATTCCACCAGAAAAAACTAATCCAACTCTCCATTTTGGTCAGCAAATTTCTTCAATAAACTCTCTTCTAGGCCTAGGCGCGACGACTTTTCTCAAACTTGAAGTTCCTACCTTTATTCTGTCTTCGCTTCCAAGGGGAGTTTGATGGGCAAATTCATTTGAGCTGAATTCCAGCAAGTTGGAGACGAATCCTAAAATGGGTTGTGGGAATTTGTTCTTCTTGGTTCTGGTAACCTTCTCGGTAGCTCTAATTACTTACAACATCATTCTCTCTGCTAATGCCCCTCTCAAGCAAGAACTTCCTGGTCCATCAAGATCTTCATCTTCCATTACTGTGGACCCTGTGATTAAGATGCCCCTGGATAGATCAGAAACTTCTTCCTCCAAACGACTCTTCCATACGGCAGTTACTGCGTCGGATTCGGTGTACAATACTTGGCAGTGTCGGATCATGTACTACTGGTTCAAGAAGTTCAAGGACGGACCTAATTCTGAAATGGGTGGCTTCACCAGGATCTTGCATTCAGGGAAGCCCGACAAGTATATGGATGAGATCCCCACCTTCGTTGCTCAGCCTCTGCCAGCGGGAATGGATCGGGGCTACATTGTCCTCAACAGACCGTGGGCATTTGTGCAATGGCTTCAACAAGCGGACATCAAGGAAGATTATATACTCATGTCAGAGCCGGATCATATTATTGTCAAGCCTATACCAAATTTGTCTAAGGATGGGCTTGGGGCAGCGTTTCCATTCTTTTACATCGAACCCAAGAAGTATGAGAGCCAACTACGGAAGTTCTTTCCTGAGGACAAGGGTCCTATTACCAACATTGATCCAATAGGGAATTCACCTGTTATTGTTGGAAAGGAGTCTCTCAAGAAAATTGCTCCCACTTGGATGAATGTTTCTTTGGCAATGAAAAAGGATCCTGAGACAGATAAGGCTTTTGGTTGGGTTCTTGAAATGTATGCTTATGCTGTCGCTTCTGCTCTACATGATGTTGGTAACATCTTGTATAAGGACTTCATGATTCAGCCCCCATGGGACACAGAAGTGGGCAAGAAGTTCATAATCCATTACACTTATGGTTGTGACTATGATATGAAGGGAAAACTGACTTACGGTAAGATTGGAGAATGGAGGTTTGACAAGAGATCATATGATAATGTTGTTCCTCCCAGGAACCTTCCCCTACCACCACCCGGCGTCCCTGAAAGCGTGGTGACACTAGTGAAAATGGTTAATGAAGCCACAGCAAACATTCCTAATTGGGGATCTTAAGAAAGTCTGTTACGTGTACTTTTGAAAGGCGTGGAGACCGGTACTTAATTTAATTCTTACACAATATAATTACCTTTTTGAATATGTATTGATTAGCTTGTATCTGATTGGCAAAGGCATGGGCAAATGTATTAAATGTGTAAAACGTTGTTTCGTTTTAAGATAAAAATAAATAGCTAGCGTTCTCTTCAATCATCTCATGTATATTAGAAAGCTAACACCTATGTCAATACATCACGAACGTTGAATATTTGTTCCAAATGCAATGGCAAAACCCATTAATCGAGCA

Coding sequence (CDS)

ATGGGTTGTGGGAATTTGTTCTTCTTGGTTCTGGTAACCTTCTCGGTAGCTCTAATTACTTACAACATCATTCTCTCTGCTAATGCCCCTCTCAAGCAAGAACTTCCTGGTCCATCAAGATCTTCATCTTCCATTACTGTGGACCCTGTGATTAAGATGCCCCTGGATAGATCAGAAACTTCTTCCTCCAAACGACTCTTCCATACGGCAGTTACTGCGTCGGATTCGGTGTACAATACTTGGCAGTGTCGGATCATGTACTACTGGTTCAAGAAGTTCAAGGACGGACCTAATTCTGAAATGGGTGGCTTCACCAGGATCTTGCATTCAGGGAAGCCCGACAAGTATATGGATGAGATCCCCACCTTCGTTGCTCAGCCTCTGCCAGCGGGAATGGATCGGGGCTACATTGTCCTCAACAGACCGTGGGCATTTGTGCAATGGCTTCAACAAGCGGACATCAAGGAAGATTATATACTCATGTCAGAGCCGGATCATATTATTGTCAAGCCTATACCAAATTTGTCTAAGGATGGGCTTGGGGCAGCGTTTCCATTCTTTTACATCGAACCCAAGAAGTATGAGAGCCAACTACGGAAGTTCTTTCCTGAGGACAAGGGTCCTATTACCAACATTGATCCAATAGGGAATTCACCTGTTATTGTTGGAAAGGAGTCTCTCAAGAAAATTGCTCCCACTTGGATGAATGTTTCTTTGGCAATGAAAAAGGATCCTGAGACAGATAAGGCTTTTGGTTGGGTTCTTGAAATGTATGCTTATGCTGTCGCTTCTGCTCTACATGATGTTGGTAACATCTTGTATAAGGACTTCATGATTCAGCCCCCATGGGACACAGAAGTGGGCAAGAAGTTCATAATCCATTACACTTATGGTTGTGACTATGATATGAAGGGAAAACTGACTTACGGTAAGATTGGAGAATGGAGGTTTGACAAGAGATCATATGATAATGTTGTTCCTCCCAGGAACCTTCCCCTACCACCACCCGGCGTCCCTGAAAGCGTGGTGACACTAGTGAAAATGGTTAATGAAGCCACAGCAAACATTCCTAATTGGGGATCTTAA

Protein sequence

MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSETSSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEIPTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGLGAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLAMKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCDYDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWGS*
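
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence, with the trailing "*" marking the stop codon. The sketch below illustrates this on the first and last few codons of the CDS only. It assumes the standard genetic code applies to this nuclear gene; the helper name `translate` is hypothetical.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven codons and last five codons of the CDS shown on this page.
cds_start = "ATGGGTTGTGGGAATTTGTTC"
cds_end = "AATTGGGGATCTTAA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS string to `translate` in the same way gives a sequence that can be compared against the protein sequence listed here.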
Homology
BLAST of CSPI02G23720 vs. ExPASy Swiss-Prot
Match: Q8W4E6 (Hydroxyproline O-arabinosyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=HPAT1 PE=1 SV=1)

HSP 1 Score: 630.2 bits (1624), Expect = 1.5e-179
Identity = 300/367 (81.74%), Positives = 329/367 (89.65%), Query Frame = 0

Query: 1   MGC-GNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPL---D 60
           MGC G LF+ +L+T SVALITYNII+SANAPLKQ  PG S SSS I++DPVI++P     
Sbjct: 1   MGCGGTLFYPLLITLSVALITYNIIISANAPLKQGFPGRS-SSSDISIDPVIELPRGGGS 60

Query: 61  RSETSSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFK--DGPNSEMGGFTRILHSGKPD 120
           R+      RLFHTAVTASDSVYNTWQCR+MYYWFKK +   GP SEMGGFTRILHSGKPD
Sbjct: 61  RNNDGKRIRLFHTAVTASDSVYNTWQCRVMYYWFKKIQASAGPGSEMGGFTRILHSGKPD 120

Query: 121 KYMDEIPTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPN 180
           +YMDEIPTFVAQPLP+GMD+GY+VLNRPWAFVQWLQQ DIKEDYILMSEPDHIIVKPIPN
Sbjct: 121 QYMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMSEPDHIIVKPIPN 180

Query: 181 LSKDGLGAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTW 240
           L+KDGLGAAFPFFYIEPKKYE  LRK++PE +GP+TNIDPIGNSPVIVGK++LKKIAPTW
Sbjct: 181 LAKDGLGAAFPFFYIEPKKYEKVLRKYYPEVRGPVTNIDPIGNSPVIVGKDALKKIAPTW 240

Query: 241 MNVSLAMKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIH 300
           MNVSLAMKKDPE DKAFGWVLEMYAYAV+SALH V NIL+KDFMIQPPWD EVG K+IIH
Sbjct: 241 MNVSLAMKKDPEADKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDIEVGDKYIIH 300

Query: 301 YTYGCDYDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATA 360
           YTYGCDYDMKGKLTYGKIGEWRFDKRSYD+  PPRNL +PPPGV +SVVTLVKM+NEATA
Sbjct: 301 YTYGCDYDMKGKLTYGKIGEWRFDKRSYDSKPPPRNLTMPPPGVSQSVVTLVKMINEATA 360

Query: 361 NIPNWGS 362
           NIPNWGS
Sbjct: 361 NIPNWGS 366
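
Each HSP summary line reports identical and positive-scoring positions as a fraction of the alignment length, followed by the same value as a percentage. The sketch below recomputes the percentages from the fractions as a consistency check. The function name `check_hsp_line` is hypothetical, and the regular expression is an assumption about this page's layout rather than a general BLAST parser; it accepts both the "Positives" and "Postives" spellings of the field label.

```python
import re

# Matches e.g. "Identity = 300/367 (81.74%)" and the Positives field.
FIELD_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def check_hsp_line(line):
    """Return, per field, the counts, the reported and the recomputed percentage."""
    result = {}
    for name, matches, length, reported in FIELD_RE.findall(line):
        matches, length, reported = int(matches), int(length), float(reported)
        exact = 100.0 * matches / length
        key = "Identity" if name == "Identity" else "Positives"
        result[key] = {
            "matches": matches,
            "length": length,
            "reported": reported,
            "recomputed": round(exact, 2),
            # Consistent if the reported value is the exact one rounded to 2 places.
            "consistent": abs(reported - exact) <= 0.005 + 1e-9,
        }
    return result

line = "Identity = 300/367 (81.74%), Positives = 329/367 (89.65%), Query Frame = 0"
for field, info in check_hsp_line(line).items():
    print(field, info)
```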

BLAST of CSPI02G23720 vs. ExPASy Swiss-Prot
Match: Q494Q2 (Hydroxyproline O-arabinosyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=HPAT2 PE=1 SV=1)

HSP 1 Score: 557.4 bits (1435), Expect = 1.2e-157
Identity = 262/359 (72.98%), Positives = 301/359 (83.84%), Query Frame = 0

Query: 4   GNLFFLVLVTFSVAL-ITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSETSS 63
           G  FF +L+T S+ L I YN I+S + PL+QELPG   +SS   +   +K P     +  
Sbjct: 5   GKYFFPILMTLSLFLIIRYNYIVSDDPPLRQELPGRRSASSGDDITYTVKTP-----SKK 64

Query: 64  SKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEIPT 123
           +KRLFHTAVTA+DSVY+TWQCR+MYYW+ +F+D P S+MGG+TRILHSG+PD  MDEIPT
Sbjct: 65  TKRLFHTAVTATDSVYSTWQCRVMYYWYNRFRDEPGSDMGGYTRILHSGRPDGLMDEIPT 124

Query: 124 FVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGLGA 183
           FVA PLP+G+D+GY+VLNRPWAFVQWLQQA I+EDYILM+EPDHIIVKPIPNL++  L A
Sbjct: 125 FVADPLPSGVDKGYVVLNRPWAFVQWLQQAHIEEDYILMAEPDHIIVKPIPNLARGNLAA 184

Query: 184 AFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLAMK 243
           AFPFFYIEPKKYES LRKFFP++ GPI+ IDPIGNSPVIV K +L KIAPTWMNVSLAMK
Sbjct: 185 AFPFFYIEPKKYESVLRKFFPKENGPISRIDPIGNSPVIVTKNALMKIAPTWMNVSLAMK 244

Query: 244 KDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCDYD 303
            DP+TDKAFGWVLEMYAYAV+SALH V NIL+KDFMIQPPWDTE  K FIIHYTYGCD+D
Sbjct: 245 NDPQTDKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDTETKKTFIIHYTYGCDFD 304

Query: 304 MKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWGS 362
           MKGK+  GKIGEWRFDKRSY +  PPRNL LPP GVPESVVTLV M+NEATANIPNW S
Sbjct: 305 MKGKMMVGKIGEWRFDKRSYGDKPPPRNLTLPPRGVPESVVTLVTMINEATANIPNWES 358

BLAST of CSPI02G23720 vs. ExPASy Swiss-Prot
Match: Q9FY51 (Hydroxyproline O-arabinosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=HPAT3 PE=1 SV=1)

HSP 1 Score: 513.5 bits (1321), Expect = 2.0e-144
Identity = 238/359 (66.30%), Positives = 282/359 (78.55%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MG  +   L L+ F   ++TYN++      +     G S S  S  +DPV++MPL+  + 
Sbjct: 1   MGKASGLLLFLLGFGFFVVTYNLL----TLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKA 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
            SS   FH A+TA+D+ YN WQCRIMYYW+K+ K  P S+MGGFTRILHSG  D  MDEI
Sbjct: 61  KSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFV  PLP G+DRGY+VLNRPWAFVQWL++A IKEDY+LM+EPDH+ V P+PNL+  G 
Sbjct: 121 PTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGF 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
            AAFPFFYI P+KYE+ +RK++P + GP+TNIDPIGNSPVI+ KESL+KIAPTWMNVSL 
Sbjct: 181 PAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLT 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MK DPETDKAFGWVLEMY YA+ASA+H V +IL KDFM+QPPWD     KFIIHYTYGCD
Sbjct: 241 MKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNW 360
           Y+MKG+LTYGKIGEWRFDKRS+    PPRN+ LPPPGVPESVVTLVKMVNEATA IPNW
Sbjct: 301 YNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNW 355

BLAST of CSPI02G23720 vs. ExPASy Swiss-Prot
Match: G7LG31 (Hydroxyproline O-arabinosyltransferase RDN2 OS=Medicago truncatula OX=3880 GN=RDN2 PE=3 SV=1)

HSP 1 Score: 500.4 bits (1287), Expect = 1.8e-140
Identity = 221/312 (70.83%), Positives = 265/312 (84.94%), Query Frame = 0

Query: 48  DPVIKMPLDRSETSSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRI 107
           DP+++MP     T +SK  FH A+TA+D++YN WQCRIMYYW+KK +  P SEMGGFTRI
Sbjct: 46  DPIVEMPEHVKNTKTSKAPFHIALTATDAIYNKWQCRIMYYWYKKQRSLPGSEMGGFTRI 105

Query: 108 LHSGKPDKYMDEIPTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHI 167
           LHSGK D  MDEIPT V  PLP G+DRGY+VLNRPWAFVQWL++A+I+E+YILM+EPDH+
Sbjct: 106 LHSGKADNLMDEIPTVVVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHV 165

Query: 168 IVKPIPNLSKDGLGAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESL 227
            V+P+PNL+     AAFPFFYI+PK+ E  +RK++PE+ GP+TN+DPIGNSPVI+ K+ +
Sbjct: 166 FVRPLPNLAFGENPAAFPFFYIKPKENEKIVRKYYPEENGPVTNVDPIGNSPVIIRKDLI 225

Query: 228 KKIAPTWMNVSLAMKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEV 287
            KIAPTWMN+S+ MK+DPETDKAFGWVLEMY YAVASALH V +IL KDFM+QPPWDTE 
Sbjct: 226 AKIAPTWMNISMKMKEDPETDKAFGWVLEMYGYAVASALHGVRHILRKDFMLQPPWDTET 285

Query: 288 GKKFIIHYTYGCDYDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVK 347
             K+IIHYTYGCDY++KG+LTYGKIGEWRFDKRS+    PPRNLPLPPPGVPESV TLVK
Sbjct: 286 FNKYIIHYTYGCDYNLKGELTYGKIGEWRFDKRSHLRGPPPRNLPLPPPGVPESVATLVK 345

Query: 348 MVNEATANIPNW 360
           MVNEA+ANIPNW
Sbjct: 346 MVNEASANIPNW 357

BLAST of CSPI02G23720 vs. ExPASy Swiss-Prot
Match: A0A0A1H7M6 (Hydroxyproline O-arabinosyltransferase PLENTY OS=Lotus japonicus OX=34305 GN=PLENTY PE=1 SV=1)

HSP 1 Score: 483.0 bits (1242), Expect = 2.9e-135
Identity = 216/302 (71.52%), Positives = 255/302 (84.44%), Query Frame = 0

Query: 60  TSSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDE 119
           ++S+   +H A+TA+D+ Y+ WQCRIMYYW+KK KD P S MG FTRILHSG+ D+ MDE
Sbjct: 56  SASTNAKYHVALTATDAAYSQWQCRIMYYWYKKVKDMPGSNMGKFTRILHSGRTDQLMDE 115

Query: 120 IPTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDG 179
           IPTFV  PLP G+DRGYIVLNRPWAFVQWL++ADI+E+YILM+EPDHI V P+PNL+   
Sbjct: 116 IPTFVVDPLPEGLDRGYIVLNRPWAFVQWLEKADIEEEYILMAEPDHIFVNPLPNLASRT 175

Query: 180 LGAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSL 239
             A +PFFYI+P + E  +RKF+P+DKGP+T++DPIGNSPVI+ K  +++IAPTW+NVSL
Sbjct: 176 QPAGYPFFYIKPAENEKIIRKFYPKDKGPVTDVDPIGNSPVIIQKSLIEEIAPTWVNVSL 235

Query: 240 AMKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGC 299
            MK DPETDKAFGWVLEMYAYAVASALH V +IL KDFM+QPPWD  VGK FIIHYTYGC
Sbjct: 236 RMKDDPETDKAFGWVLEMYAYAVASALHGVKHILRKDFMLQPPWDRHVGKTFIIHYTYGC 295

Query: 300 DYDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNW 359
           DY++KG+LTYGKIGEWRFDKRSY    PP+NL LPPPGVPESVV LVKMVNEATANIP W
Sbjct: 296 DYNLKGELTYGKIGEWRFDKRSYLMGPPPKNLSLPPPGVPESVVRLVKMVNEATANIPEW 355

Query: 360 GS 362
            S
Sbjct: 356 DS 357

BLAST of CSPI02G23720 vs. ExPASy TrEMBL
Match: A0A0A0LQB9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G382460 PE=4 SV=1)

HSP 1 Score: 756.1 bits (1951), Expect = 6.6e-215
Identity = 361/361 (100.00%), Positives = 361/361 (100.00%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET
Sbjct: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
           SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI
Sbjct: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL
Sbjct: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA
Sbjct: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD
Sbjct: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG
Sbjct: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. ExPASy TrEMBL
Match: A0A5A7VF01 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G002240 PE=4 SV=1)

HSP 1 Score: 752.3 bits (1941), Expect = 9.5e-214
Identity = 359/361 (99.45%), Positives = 360/361 (99.72%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET
Sbjct: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
           SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI
Sbjct: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL
Sbjct: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA
Sbjct: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDP+TDKAFGWVLEMYAYAVASALH VGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD
Sbjct: 241 MKKDPDTDKAFGWVLEMYAYAVASALHGVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG
Sbjct: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. ExPASy TrEMBL
Match: A0A1S3BBR3 (uncharacterized protein LOC103488330 OS=Cucumis melo OX=3656 GN=LOC103488330 PE=4 SV=1)

HSP 1 Score: 752.3 bits (1941), Expect = 9.5e-214
Identity = 359/361 (99.45%), Positives = 360/361 (99.72%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET
Sbjct: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
           SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI
Sbjct: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL
Sbjct: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA
Sbjct: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDP+TDKAFGWVLEMYAYAVASALH VGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD
Sbjct: 241 MKKDPDTDKAFGWVLEMYAYAVASALHGVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG
Sbjct: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. ExPASy TrEMBL
Match: A0A6J1BRV6 (hydroxyproline O-arabinosyltransferase 1-like OS=Momordica charantia OX=3673 GN=LOC111005182 PE=4 SV=1)

HSP 1 Score: 728.8 bits (1880), Expect = 1.1e-206
Identity = 344/361 (95.29%), Positives = 355/361 (98.34%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLVTFSVALITYNII+SANAPLKQELPGPSRSS SITVDPVIKMPLDRS+T
Sbjct: 1   MGCGNLFFLVLVTFSVALITYNIIISANAPLKQELPGPSRSSPSITVDPVIKMPLDRSKT 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
           SSSKRLFHTAVTASD VYNTWQCRIMYYW+KKFKDGPNS+MGGFTRILHSGKPDKYMDEI
Sbjct: 61  SSSKRLFHTAVTASDGVYNTWQCRIMYYWYKKFKDGPNSQMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVA+PLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNL+KDGL
Sbjct: 121 PTFVAKPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLAKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVG+ESLKKIAPTWMN+SLA
Sbjct: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGRESLKKIAPTWMNISLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDPE DKAFGWVLEMYAYAVASALH VGNILYKDFMIQPPWD EVG+KFIIHYTYGCD
Sbjct: 241 MKKDPEADKAFGWVLEMYAYAVASALHGVGNILYKDFMIQPPWDKEVGEKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           Y+MKG+LTYGKIGEWRFDKRSYD VVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG
Sbjct: 301 YNMKGELTYGKIGEWRFDKRSYDAVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. ExPASy TrEMBL
Match: A0A6J1HC52 (hydroxyproline O-arabinosyltransferase 1-like OS=Cucurbita moschata OX=3662 GN=LOC111462740 PE=4 SV=1)

HSP 1 Score: 728.0 bits (1878), Expect = 1.9e-206
Identity = 344/361 (95.29%), Positives = 355/361 (98.34%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLVTFSVALITYNII+SAN PLKQELPGPSRSSSSITVDPVIKMP+ +S+T
Sbjct: 1   MGCGNLFFLVLVTFSVALITYNIIISANVPLKQELPGPSRSSSSITVDPVIKMPMAKSKT 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
            SSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI
Sbjct: 61  PSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL
Sbjct: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIV +ESLKKIAPTWMN+SLA
Sbjct: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVQRESLKKIAPTWMNISLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFM+QPPWDTEVGKKFIIHYTYGCD
Sbjct: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMLQPPWDTEVGKKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           ++MKG+ T GKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKM+NEATANIPNWG
Sbjct: 301 FNMKGQPTPGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMINEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. NCBI nr
Match: XP_004138714.1 (hydroxyproline O-arabinosyltransferase 1 [Cucumis sativus] >KGN62972.1 hypothetical protein Csa_022148 [Cucumis sativus])

HSP 1 Score: 756.1 bits (1951), Expect = 1.4e-214
Identity = 361/361 (100.00%), Positives = 361/361 (100.00%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET
Sbjct: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
           SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI
Sbjct: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL
Sbjct: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA
Sbjct: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD
Sbjct: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG
Sbjct: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. NCBI nr
Match: XP_008445244.1 (PREDICTED: uncharacterized protein LOC103488330 [Cucumis melo] >KAA0064906.1 uncharacterized protein E6C27_scaffold82G002240 [Cucumis melo var. makuwa])

HSP 1 Score: 752.3 bits (1941), Expect = 2.0e-213
Identity = 359/361 (99.45%), Positives = 360/361 (99.72%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET
Sbjct: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
           SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI
Sbjct: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL
Sbjct: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA
Sbjct: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDP+TDKAFGWVLEMYAYAVASALH VGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD
Sbjct: 241 MKKDPDTDKAFGWVLEMYAYAVASALHGVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG
Sbjct: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. NCBI nr
Match: XP_038885217.1 (hydroxyproline O-arabinosyltransferase 1-like [Benincasa hispida])

HSP 1 Score: 734.9 bits (1896), Expect = 3.2e-208
Identity = 348/361 (96.40%), Positives = 358/361 (99.17%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLV+FSVALITYNII+SANAPLKQELPGPSRSSSSITVDPVIKMP+DRS+T
Sbjct: 1   MGCGNLFFLVLVSFSVALITYNIIISANAPLKQELPGPSRSSSSITVDPVIKMPIDRSKT 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
           SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI
Sbjct: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVAQPLPAGMD+GYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL
Sbjct: 121 PTFVAQPLPAGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPK+YESQLRKFFPEDKGPITNIDPIGNSPVIVG+ESLKKIAPTWMNVSLA
Sbjct: 181 GAAFPFFYIEPKRYESQLRKFFPEDKGPITNIDPIGNSPVIVGRESLKKIAPTWMNVSLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDPETDKAFGWVLEMYAYAVASALH V NILYKDFMIQPPWDTEVGKKFIIHYTYGCD
Sbjct: 241 MKKDPETDKAFGWVLEMYAYAVASALHGVSNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           Y+MKG+LTYGK+GEWRFDKRSYDNVVPPRNL LPPPGVPESVVTLVKMVNEATANIPNWG
Sbjct: 301 YNMKGELTYGKMGEWRFDKRSYDNVVPPRNLSLPPPGVPESVVTLVKMVNEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. NCBI nr
Match: XP_022132296.1 (hydroxyproline O-arabinosyltransferase 1-like [Momordica charantia])

HSP 1 Score: 728.8 bits (1880), Expect = 2.3e-206
Identity = 344/361 (95.29%), Positives = 355/361 (98.34%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLVTFSVALITYNII+SANAPLKQELPGPSRSS SITVDPVIKMPLDRS+T
Sbjct: 1   MGCGNLFFLVLVTFSVALITYNIIISANAPLKQELPGPSRSSPSITVDPVIKMPLDRSKT 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
           SSSKRLFHTAVTASD VYNTWQCRIMYYW+KKFKDGPNS+MGGFTRILHSGKPDKYMDEI
Sbjct: 61  SSSKRLFHTAVTASDGVYNTWQCRIMYYWYKKFKDGPNSQMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVA+PLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNL+KDGL
Sbjct: 121 PTFVAKPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLAKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVG+ESLKKIAPTWMN+SLA
Sbjct: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGRESLKKIAPTWMNISLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDPE DKAFGWVLEMYAYAVASALH VGNILYKDFMIQPPWD EVG+KFIIHYTYGCD
Sbjct: 241 MKKDPEADKAFGWVLEMYAYAVASALHGVGNILYKDFMIQPPWDKEVGEKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           Y+MKG+LTYGKIGEWRFDKRSYD VVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG
Sbjct: 301 YNMKGELTYGKIGEWRFDKRSYDAVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. NCBI nr
Match: KAG6598558.1 (Hydroxyproline O-arabinosyltransferase 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 728.8 bits (1880), Expect = 2.3e-206
Identity = 344/361 (95.29%), Positives = 355/361 (98.34%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MGCGNLFFLVLVTFSVALITYNII+SAN PLKQELPGPSRSSSSITVDPVIKMP+ +S+T
Sbjct: 1   MGCGNLFFLVLVTFSVALITYNIIISANVPLKQELPGPSRSSSSITVDPVIKMPMAKSKT 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
            SSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI
Sbjct: 61  PSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL
Sbjct: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
           GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIV +ESLKKIAPTWMN+SLA
Sbjct: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVSRESLKKIAPTWMNISLA 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFM+QPPWDTEVGKKFIIHYTYGCD
Sbjct: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMLQPPWDTEVGKKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWG 360
           ++MKG+ T GKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKM+NEATANIPNWG
Sbjct: 301 FNMKGQPTPGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMINEATANIPNWG 360

Query: 361 S 362
           S
Sbjct: 361 S 361

BLAST of CSPI02G23720 vs. TAIR 10
Match: AT5G25265.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane, membrane; EXPRESSED IN: cultured cell, leaf; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G25260.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 630.2 bits (1624), Expect = 1.0e-180
Identity = 300/367 (81.74%), Positives = 329/367 (89.65%), Query Frame = 0

Query: 1   MGC-GNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPL---D 60
           MGC G LF+ +L+T SVALITYNII+SANAPLKQ  PG S SSS I++DPVI++P     
Sbjct: 1   MGCGGTLFYPLLITLSVALITYNIIISANAPLKQGFPGRS-SSSDISIDPVIELPRGGGS 60

Query: 61  RSETSSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFK--DGPNSEMGGFTRILHSGKPD 120
           R+      RLFHTAVTASDSVYNTWQCR+MYYWFKK +   GP SEMGGFTRILHSGKPD
Sbjct: 61  RNNDGKRIRLFHTAVTASDSVYNTWQCRVMYYWFKKIQASAGPGSEMGGFTRILHSGKPD 120

Query: 121 KYMDEIPTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPN 180
           +YMDEIPTFVAQPLP+GMD+GY+VLNRPWAFVQWLQQ DIKEDYILMSEPDHIIVKPIPN
Sbjct: 121 QYMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMSEPDHIIVKPIPN 180

Query: 181 LSKDGLGAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTW 240
           L+KDGLGAAFPFFYIEPKKYE  LRK++PE +GP+TNIDPIGNSPVIVGK++LKKIAPTW
Sbjct: 181 LAKDGLGAAFPFFYIEPKKYEKVLRKYYPEVRGPVTNIDPIGNSPVIVGKDALKKIAPTW 240

Query: 241 MNVSLAMKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIH 300
           MNVSLAMKKDPE DKAFGWVLEMYAYAV+SALH V NIL+KDFMIQPPWD EVG K+IIH
Sbjct: 241 MNVSLAMKKDPEADKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDIEVGDKYIIH 300

Query: 301 YTYGCDYDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATA 360
           YTYGCDYDMKGKLTYGKIGEWRFDKRSYD+  PPRNL +PPPGV +SVVTLVKM+NEATA
Sbjct: 301 YTYGCDYDMKGKLTYGKIGEWRFDKRSYDSKPPPRNLTMPPPGVSQSVVTLVKMINEATA 360

Query: 361 NIPNWGS 362
           NIPNWGS
Sbjct: 361 NIPNWGS 366
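
The percentages in an HSP summary line follow directly from the reported counts: matches divided by aligned length. The following is a minimal Python sketch that parses such a line and recomputes the percentages, using the values reported for the AT5G25265.1 hit above. The helper names `parse_hsp_stats` and `recomputed_percent` are hypothetical and introduced only for illustration; the pattern accepts both "Positives" and the "Postives" spelling that appears in some report pages.

```python
import re

# Summary line with the values reported for the AT5G25265.1 hit above.
HSP_LINE = (
    "Identity = 300/367 (81.74%), Positives = 329/367 (89.65%), "
    "Query Frame = 0"
)

# Matches "Identity = a/b (p%)" and "Positives = a/b (p%)";
# the optional "i" also tolerates the "Postives" spelling.
PATTERN = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    stats = {}
    for label, hits, length, pct in PATTERN.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(hits), int(length), float(pct))
    return stats


def recomputed_percent(hits, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * hits / length, 2)


if __name__ == "__main__":
    for key, (hits, length, reported) in parse_hsp_stats(HSP_LINE).items():
        print(key, hits, length, reported, recomputed_percent(hits, length))
```

A check of this kind is a cheap way to catch rows that were damaged during extraction, since a count and its percentage that disagree cannot both be right.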

BLAST of CSPI02G23720 vs. TAIR 10
Match: AT2G25260.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 557.4 bits (1435), Expect = 8.6e-159
Identity = 262/359 (72.98%), Positives = 301/359 (83.84%), Query Frame = 0

Query: 4   GNLFFLVLVTFSVAL-ITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSETSS 63
           G  FF +L+T S+ L I YN I+S + PL+QELPG   +SS   +   +K P     +  
Sbjct: 5   GKYFFPILMTLSLFLIIRYNYIVSDDPPLRQELPGRRSASSGDDITYTVKTP-----SKK 64

Query: 64  SKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEIPT 123
           +KRLFHTAVTA+DSVY+TWQCR+MYYW+ +F+D P S+MGG+TRILHSG+PD  MDEIPT
Sbjct: 65  TKRLFHTAVTATDSVYSTWQCRVMYYWYNRFRDEPGSDMGGYTRILHSGRPDGLMDEIPT 124

Query: 124 FVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGLGA 183
           FVA PLP+G+D+GY+VLNRPWAFVQWLQQA I+EDYILM+EPDHIIVKPIPNL++  L A
Sbjct: 125 FVADPLPSGVDKGYVVLNRPWAFVQWLQQAHIEEDYILMAEPDHIIVKPIPNLARGNLAA 184

Query: 184 AFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLAMK 243
           AFPFFYIEPKKYES LRKFFP++ GPI+ IDPIGNSPVIV K +L KIAPTWMNVSLAMK
Sbjct: 185 AFPFFYIEPKKYESVLRKFFPKENGPISRIDPIGNSPVIVTKNALMKIAPTWMNVSLAMK 244

Query: 244 KDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCDYD 303
            DP+TDKAFGWVLEMYAYAV+SALH V NIL+KDFMIQPPWDTE  K FIIHYTYGCD+D
Sbjct: 245 NDPQTDKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDTETKKTFIIHYTYGCDFD 304

Query: 304 MKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNWGS 362
           MKGK+  GKIGEWRFDKRSY +  PPRNL LPP GVPESVVTLV M+NEATANIPNW S
Sbjct: 305 MKGKMMVGKIGEWRFDKRSYGDKPPPRNLTLPPRGVPESVVTLVTMINEATANIPNWES 358

BLAST of CSPI02G23720 vs. TAIR 10
Match: AT5G13500.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 513.5 bits (1321), Expect = 1.4e-145
Identity = 238/359 (66.30%), Positives = 282/359 (78.55%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MG  +   L L+ F   ++TYN++      +     G S S  S  +DPV++MPL+  + 
Sbjct: 1   MGKASGLLLFLLGFGFFVVTYNLL----TLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKA 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
            SS   FH A+TA+D+ YN WQCRIMYYW+K+ K  P S+MGGFTRILHSG  D  MDEI
Sbjct: 61  KSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFV  PLP G+DRGY+VLNRPWAFVQWL++A IKEDY+LM+EPDH+ V P+PNL+  G 
Sbjct: 121 PTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGF 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
            AAFPFFYI P+KYE+ +RK++P + GP+TNIDPIGNSPVI+ KESL+KIAPTWMNVSL 
Sbjct: 181 PAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLT 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MK DPETDKAFGWVLEMY YA+ASA+H V +IL KDFM+QPPWD     KFIIHYTYGCD
Sbjct: 241 MKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNW 360
           Y+MKG+LTYGKIGEWRFDKRS+    PPRN+ LPPPGVPESVVTLVKMVNEATA IPNW
Sbjct: 301 YNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNW 355

BLAST of CSPI02G23720 vs. TAIR 10
Match: AT5G13500.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 228 Blast hits to 200 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213; Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink). )

HSP 1 Score: 513.5 bits (1321), Expect = 1.4e-145
Identity = 238/359 (66.30%), Positives = 282/359 (78.55%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MG  +   L L+ F   ++TYN++      +     G S S  S  +DPV++MPL+  + 
Sbjct: 1   MGKASGLLLFLLGFGFFVVTYNLL----TLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKA 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
            SS   FH A+TA+D+ YN WQCRIMYYW+K+ K  P S+MGGFTRILHSG  D  MDEI
Sbjct: 61  KSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFV  PLP G+DRGY+VLNRPWAFVQWL++A IKEDY+LM+EPDH+ V P+PNL+  G 
Sbjct: 121 PTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGF 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
            AAFPFFYI P+KYE+ +RK++P + GP+TNIDPIGNSPVI+ KESL+KIAPTWMNVSL 
Sbjct: 181 PAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLT 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MK DPETDKAFGWVLEMY YA+ASA+H V +IL KDFM+QPPWD     KFIIHYTYGCD
Sbjct: 241 MKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNW 360
           Y+MKG+LTYGKIGEWRFDKRS+    PPRN+ LPPPGVPESVVTLVKMVNEATA IPNW
Sbjct: 301 YNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNW 355

BLAST of CSPI02G23720 vs. TAIR 10
Match: AT5G13500.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25265.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 513.5 bits (1321), Expect = 1.4e-145
Identity = 238/359 (66.30%), Positives = 282/359 (78.55%), Query Frame = 0

Query: 1   MGCGNLFFLVLVTFSVALITYNIILSANAPLKQELPGPSRSSSSITVDPVIKMPLDRSET 60
           MG  +   L L+ F   ++TYN++      +     G S S  S  +DPV++MPL+  + 
Sbjct: 1   MGKASGLLLFLLGFGFFVVTYNLL----TLIVHNRSGVSNSDGSPLLDPVVQMPLNIRKA 60

Query: 61  SSSKRLFHTAVTASDSVYNTWQCRIMYYWFKKFKDGPNSEMGGFTRILHSGKPDKYMDEI 120
            SS   FH A+TA+D+ YN WQCRIMYYW+K+ K  P S+MGGFTRILHSG  D  MDEI
Sbjct: 61  KSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTRILHSGNSDNLMDEI 120

Query: 121 PTFVAQPLPAGMDRGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLSKDGL 180
           PTFV  PLP G+DRGY+VLNRPWAFVQWL++A IKEDY+LM+EPDH+ V P+PNL+  G 
Sbjct: 121 PTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDHVFVNPLPNLAVGGF 180

Query: 181 GAAFPFFYIEPKKYESQLRKFFPEDKGPITNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 240
            AAFPFFYI P+KYE+ +RK++P + GP+TNIDPIGNSPVI+ KESL+KIAPTWMNVSL 
Sbjct: 181 PAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKESLEKIAPTWMNVSLT 240

Query: 241 MKKDPETDKAFGWVLEMYAYAVASALHDVGNILYKDFMIQPPWDTEVGKKFIIHYTYGCD 300
           MK DPETDKAFGWVLEMY YA+ASA+H V +IL KDFM+QPPWD     KFIIHYTYGCD
Sbjct: 241 MKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCD 300

Query: 301 YDMKGKLTYGKIGEWRFDKRSYDNVVPPRNLPLPPPGVPESVVTLVKMVNEATANIPNW 360
           Y+MKG+LTYGKIGEWRFDKRS+    PPRN+ LPPPGVPESVVTLVKMVNEATA IPNW
Sbjct: 301 YNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLVKMVNEATATIPNW 355

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q8W4E6          1.5e-179  81.74     Hydroxyproline O-arabinosyltransferase 1 OS=Arabidopsis thaliana OX=3702 GN=HPAT...
Q494Q2          1.2e-157  72.98     Hydroxyproline O-arabinosyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=HPAT...
Q9FY51          2.0e-144  66.30     Hydroxyproline O-arabinosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=HPAT...
G7LG31          1.8e-140  70.83     Hydroxyproline O-arabinosyltransferase RDN2 OS=Medicago truncatula OX=3880 GN=RD...
A0A0A1H7M6      2.9e-135  71.52     Hydroxyproline O-arabinosyltransferase PLENTY OS=Lotus japonicus OX=34305 GN=PLE...

Match Name      E-value   Identity  Description
A0A0A0LQB9      6.6e-215  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G382460 PE=4 SV=1
A0A5A7VF01      9.5e-214  99.45     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3BBR3      9.5e-214  99.45     uncharacterized protein LOC103488330 OS=Cucumis melo OX=3656 GN=LOC103488330 PE=...
A0A6J1BRV6      1.1e-206  95.29     hydroxyproline O-arabinosyltransferase 1-like OS=Momordica charantia OX=3673 GN=...
A0A6J1HC52      1.9e-206  95.29     hydroxyproline O-arabinosyltransferase 1-like OS=Cucurbita moschata OX=3662 GN=L...

Match Name      E-value   Identity  Description
XP_004138714.1  1.4e-214  100.00    hydroxyproline O-arabinosyltransferase 1 [Cucumis sativus] >KGN62972.1 hypotheti...
XP_008445244.1  2.0e-213  99.45     PREDICTED: uncharacterized protein LOC103488330 [Cucumis melo] >KAA0064906.1 unc...
XP_038885217.1  3.2e-208  96.40     hydroxyproline O-arabinosyltransferase 1-like [Benincasa hispida]
XP_022132296.1  2.3e-206  95.29     hydroxyproline O-arabinosyltransferase 1-like [Momordica charantia]
KAG6598558.1    2.3e-206  95.29     Hydroxyproline O-arabinosyltransferase 1, partial [Cucurbita argyrosperma subsp....

Match Name      E-value   Identity  Description
AT5G25265.1     1.0e-180  81.74     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G25260.1     8.6e-159  72.98     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G13500.1     1.4e-145  66.30     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G13500.2     1.4e-145  66.30     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G13500.3     1.4e-145  66.30     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                      Source   Source Term    Source Description                           Alignment
IPR044845  Glycosyltransferase HPAT/SRGT1-like  PANTHER  PTHR31485      PEPTIDYL SERINE ALPHA-GALACTOSYLTRANSFERASE  coord: 1..360
None       No IPR available                     PANTHER  PTHR31485:SF3  HYDROXYPROLINE O-ARABINOSYLTRANSFERASE 1     coord: 1..360

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI02G23720.1  CSPI02G23720.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0005801      cis-Golgi network
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:1990585      hydroxyproline O-arabinosyltransferase activity
molecular_function  GO:0016757      glycosyltransferase activity