Cp4.1LG20g00040 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG20g00040
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionUV-B-induced protein At3g17800, chloroplastic-like
LocationCp4.1LG20: 7734 .. 10045 (+)
RNA-Seq ExpressionCp4.1LG20g00040
SyntenyCp4.1LG20g00040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTATGGTATTGTATGTTTTGTTTATTTTATTTTTTAAATTCAAAATTAGGAATGATCCAAACCTCTCAAAGATTGTGGTGGAGAATATGTGTTAATAGATTCTATCTAGTTTTCTTGATGTAATGCACAGAGAAGAAAGCCAAGAGTGTCCACAGAAATTAGGAGATGCCTACAAAGATCTTTTTTTTTTTTTTGCATGTCGTTGCCGTCTCTCCGTCCATTCATACACAAACATACGCGTGCCCAAACGTGTTTCACAATTTCATTCTATAAACATCTCCTCTCCACAAACTGCACCACACTCTCCAGTTTCTAGTTTCTTCTTCCCAGGTTTGTATCTTTCATACTCCCTCGCTCGATGAAACTATTCATTCAGTTTCGCTCCCTTCATCAATTTTCATATGCCTTTTGCAGATCCATTCGACACAACACTACACTACTCAGAGACACCGCCATGGAAGCAGCCACTGGTTCAACTTCAACCCTTGCCATTGGAATTGGATCGCCATTTCGGGACACCGACCCCAGGCCCCCTGCCTCCCGTTCCCTTTATTTTCCTTCCGAATCCCTCATCTCTGTTCCTGTAAGCTGGAATCATTTTTCTTGTATTTCTCACTCCATATTTATCAATCATTTCATAGTCAATTTCTGTGTATCTCATTTTGATATCTTCTGTTTTTTTCCCAACTTTCCGTTGTCGCATCGCAGCATTATCGGTCCTTCGTTTCTCCATCAAAACTTGGAAAGAAGTCGATTACTCTCCCCTGTAGTGGCCGGGGTCGGGGATTGGGATTCCCAATGGTTAAAGCGTCTCTGTCTCCGGATCCGGATGGTTCTGCTGCCCAAATTGCTCCACTTCGGCTCCAGTCTCCAATTGGCCAGTTTCTGTCTCAAATCCTGACTACCCATCCCCACCTTCTTCCTGCAGCCGTCGACCAGCAGCTTCAACAGCTGCAGACCCAACGTCATGCTGAAGAACTAACTCAAGAGCCCTCCGCTTCAGCTACTCATGACATTGTCTTGTACAGGTTAGTTCCGCATCCCTTAGAGGCCGAGTGTCAACAGGTAAGGATGGATCTAGGAACGAAAATGTTTTAACAGTCGATTGAAGCATATTGTTCGATTAGAAGAGCACCGAAGCTGATGAAGCTTGTTTTTTTTTTTTTTTGGACAACCCTGCTCACGTCGCTTAGCGTTTGAATTTGTTAGGAGGATTGCTGAGGTTAAGGCAATTGAAAGGAAGAGGGCCTTAGAAGAGATATTATATGCAATGGTGGTGCAACGATTCATGGACGCCGATGTTCCTCTAATACCAGCTGTTGCCCCGTCGTCTACGGATCCATATGGCCGAGTTGACACATGGGCACGAGATGATGAAAAGCTGGAGCGGCTTCACTCGTCGGAAGCAAGCGAAATGATTCAGAACCACCTAGCGCTGGTTTTGGGGAATCGGATTGGTGACTTTGCGTCAGTAGCGCAGATAAGCAAACTAAGAGTGGGGCAGGTGTATGCTGCGTCTGTGATGTATGGGTACTTCCTCAAGCGAGTGGATGAGAGATTTCAGCTTGAGAAGACTGTGAAAGTGCTACCAGCCAGTGCAACTGTTGAGGGCTCCTTCTCCAATGCACCAGTGCATCCTGAAATCTCTTCCATGGCAGCTGAACAGGGAGATGTTAGTCCTGGGGAGTCGGGTATGGGGATCAAGCCCTCCCGACTGCGAACATACGTAATGTCATTTGATGGGGAGACACTGCAGAGATTTGCCACAATAAGGTCAAAAGAGGCCGTTAGCATCATTGAGAGACACACGGAGGCCTTGTTTGGAAGACCCCAGATTGCAATCACCCCGCAAGGAACAGTAGATACCTCCAAAGACGAGCTTATCAAAATCAGCTTTGGTGGGTTGAAGAGACTAGTTTTGGAAGCCGTGACTTTCGGTTCTTTCCTGTGGGATGTGGAGACGTATGTGGACTCCAGGTATCATTTTGTCATGAATTGAGACATGAACTTACTCGTGCATTCTGGACAATGGATGCCTCCAGCGAATGTAATGTTAACTGTATATGTTTTGTTTAACTCAACCATTTACATTTTCACATTTTAAAATTATCTTATTTTTTTAAAGACAGAATGTCAAAATTGATGGACAGGAACTTTAAACGATACTTAACAAGTTCATATAATTTTAGAGGAGGAACTAAGTTACGTGAGAGATACAATTTAACAGAGTTTTGATATTTACATCTCAGAAGAAGAACGAAGTAACATGAATATGATACAAAGTTCAATATTTATTCAAATTTTTTTAGCA

mRNA sequence

ATTATGGTATTGTATGTTTTGTTTATTTTATTTTTTAAATTCAAAATTAGGAATGATCCAAACCTCTCAAAGATTGTGGTGGAGAATATGTGTTAATAGATTCTATCTAGTTTTCTTGATGTAATGCACAGAGAAGAAAGCCAAGAGTGTCCACAGAAATTAGGAGATGCCTACAAAGATCTTTTTTTTTTTTTTGCATGTCGTTGCCGTCTCTCCGTCCATTCATACACAAACATACGCGTGCCCAAACGTGTTTCACAATTTCATTCTATAAACATCTCCTCTCCACAAACTGCACCACACTCTCCAGTTTCTAGTTTCTTCTTCCCAGATCCATTCGACACAACACTACACTACTCAGAGACACCGCCATGGAAGCAGCCACTGGTTCAACTTCAACCCTTGCCATTGGAATTGGATCGCCATTTCGGGACACCGACCCCAGGCCCCCTGCCTCCCGTTCCCTTTATTTTCCTTCCGAATCCCTCATCTCTGTTCCTCATTATCGGTCCTTCGTTTCTCCATCAAAACTTGGAAAGAAGTCGATTACTCTCCCCTGTAGTGGCCGGGGTCGGGGATTGGGATTCCCAATGGTTAAAGCGTCTCTGTCTCCGGATCCGGATGGTTCTGCTGCCCAAATTGCTCCACTTCGGCTCCAGTCTCCAATTGGCCAGTTTCTGTCTCAAATCCTGACTACCCATCCCCACCTTCTTCCTGCAGCCGTCGACCAGCAGCTTCAACAGCTGCAGACCCAACGTCATGCTGAAGAACTAACTCAAGAGCCCTCCGCTTCAGCTACTCATGACATTGTCTTGTACAGGAGGATTGCTGAGGTTAAGGCAATTGAAAGGAAGAGGGCCTTAGAAGAGATATTATATGCAATGGTGGTGCAACGATTCATGGACGCCGATGTTCCTCTAATACCAGCTGTTGCCCCGTCGTCTACGGATCCATATGGCCGAGTTGACACATGGGCACGAGATGATGAAAAGCTGGAGCGGCTTCACTCGTCGGAAGCAAGCGAAATGATTCAGAACCACCTAGCGCTGGTTTTGGGGAATCGGATTGGTGACTTTGCGTCAGTAGCGCAGATAAGCAAACTAAGAGTGGGGCAGGTGTATGCTGCGTCTGTGATGTATGGGTACTTCCTCAAGCGAGTGGATGAGAGATTTCAGCTTGAGAAGACTGTGAAAGTGCTACCAGCCAGTGCAACTGTTGAGGGCTCCTTCTCCAATGCACCAGTGCATCCTGAAATCTCTTCCATGGCAGCTGAACAGGGAGATGTTAGTCCTGGGGAGTCGGGTATGGGGATCAAGCCCTCCCGACTGCGAACATACGTAATGTCATTTGATGGGGAGACACTGCAGAGATTTGCCACAATAAGGTCAAAAGAGGCCGTTAGCATCATTGAGAGACACACGGAGGCCTTGTTTGGAAGACCCCAGATTGCAATCACCCCGCAAGGAACAGTAGATACCTCCAAAGACGAGCTTATCAAAATCAGCTTTGGTGGGTTGAAGAGACTAGTTTTGGAAGCCGTGACTTTCGGTTCTTTCCTGTGGGATGTGGAGACGTATGTGGACTCCAGGTATCATTTTGTCATGAATTGAGACATGAACTTACTCGTGCATTCTGGACAATGGATGCCTCCAGCGAATGTAATGTTAACTGTATATGTTTTGTTTAACTCAACCATTTACATTTTCACATTTTAAAATTATCTTATTTTTTTAAAGACAGAATGTCAAAATTGATGGACAGGAACTTTAAACGATACTTAACAAGTTCATATAATTTTAGAGGAGGAACTAAGTTACGTGAGAGATACAATTTAACAGAGTTTTGATATTTACATCTCAGAAGAAGAACGAAGTAACATGAATATGATACAAAGTTCAATATTTATTCAAATTTTTTTAGCA

Coding sequence (CDS)

ATGGAAGCAGCCACTGGTTCAACTTCAACCCTTGCCATTGGAATTGGATCGCCATTTCGGGACACCGACCCCAGGCCCCCTGCCTCCCGTTCCCTTTATTTTCCTTCCGAATCCCTCATCTCTGTTCCTCATTATCGGTCCTTCGTTTCTCCATCAAAACTTGGAAAGAAGTCGATTACTCTCCCCTGTAGTGGCCGGGGTCGGGGATTGGGATTCCCAATGGTTAAAGCGTCTCTGTCTCCGGATCCGGATGGTTCTGCTGCCCAAATTGCTCCACTTCGGCTCCAGTCTCCAATTGGCCAGTTTCTGTCTCAAATCCTGACTACCCATCCCCACCTTCTTCCTGCAGCCGTCGACCAGCAGCTTCAACAGCTGCAGACCCAACGTCATGCTGAAGAACTAACTCAAGAGCCCTCCGCTTCAGCTACTCATGACATTGTCTTGTACAGGAGGATTGCTGAGGTTAAGGCAATTGAAAGGAAGAGGGCCTTAGAAGAGATATTATATGCAATGGTGGTGCAACGATTCATGGACGCCGATGTTCCTCTAATACCAGCTGTTGCCCCGTCGTCTACGGATCCATATGGCCGAGTTGACACATGGGCACGAGATGATGAAAAGCTGGAGCGGCTTCACTCGTCGGAAGCAAGCGAAATGATTCAGAACCACCTAGCGCTGGTTTTGGGGAATCGGATTGGTGACTTTGCGTCAGTAGCGCAGATAAGCAAACTAAGAGTGGGGCAGGTGTATGCTGCGTCTGTGATGTATGGGTACTTCCTCAAGCGAGTGGATGAGAGATTTCAGCTTGAGAAGACTGTGAAAGTGCTACCAGCCAGTGCAACTGTTGAGGGCTCCTTCTCCAATGCACCAGTGCATCCTGAAATCTCTTCCATGGCAGCTGAACAGGGAGATGTTAGTCCTGGGGAGTCGGGTATGGGGATCAAGCCCTCCCGACTGCGAACATACGTAATGTCATTTGATGGGGAGACACTGCAGAGATTTGCCACAATAAGGTCAAAAGAGGCCGTTAGCATCATTGAGAGACACACGGAGGCCTTGTTTGGAAGACCCCAGATTGCAATCACCCCGCAAGGAACAGTAGATACCTCCAAAGACGAGCTTATCAAAATCAGCTTTGGTGGGTTGAAGAGACTAGTTTTGGAAGCCGTGACTTTCGGTTCTTTCCTGTGGGATGTGGAGACGTATGTGGACTCCAGGTATCATTTTGTCATGAATTGA

Protein sequence

MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSITLPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDADVPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAAEQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN
Homology
BLAST of Cp4.1LG20g00040 vs. ExPASy Swiss-Prot
Match: Q9LVJ0 (UV-B-induced protein At3g17800, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g17800 PE=2 SV=1)

HSP 1 Score: 426.0 bits (1094), Expect = 4.8e-118
Identity = 230/350 (65.71%), Postives = 275/350 (78.57%), Query Frame = 0

Query: 77  ASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHAEELTQ 136
           AS       S   IAPL+LQSP GQFLSQIL +HPHL+PAAV+QQL+QLQT R ++   +
Sbjct: 79  ASNDASSGSSPKPIAPLQLQSPAGQFLSQILVSHPHLVPAAVEQQLEQLQTDRDSQGQNK 138

Query: 137 EPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDADVPLIPAVAPSSTDPYG 196
           + ++    DIVLYRRIAE+K  ER+R LEEILYA+VVQ+FM+A+V L+P+V+PSS DP G
Sbjct: 139 DSASVPGTDIVLYRRIAELKENERRRTLEEILYALVVQKFMEANVSLVPSVSPSS-DPSG 198

Query: 197 RVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQISKLRVGQVYAASVMY 256
           RVDTW    EKLERLHS E  EMI NHLAL+LG+R+GD  SVAQISKLRVGQVYAASVMY
Sbjct: 199 RVDTWPTKVEKLERLHSPEMYEMIHNHLALILGSRMGDLNSVAQISKLRVGQVYAASVMY 258

Query: 257 GYFLKRVDERFQLEKTVKVLP--------------ASATVEGSFSNAPVHPEISSMAAEQ 316
           GYFLKRVD+RFQLEKT+K+LP               +AT + + S+   HPE+ + A   
Sbjct: 259 GYFLKRVDQRFQLEKTMKILPGGSDESKTSVEQAEGTATYQAAVSS---HPEVGAFA--- 318

Query: 317 GDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIAIT 376
           G VS    G  IKPSRLR+YVMSFD ETLQR+ATIRS+EAV IIE+HTEALFG+P+I IT
Sbjct: 319 GGVSAKGFGSEIKPSRLRSYVMSFDAETLQRYATIRSREAVGIIEKHTEALFGKPEIVIT 378

Query: 377 PQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 413
           P+GTVD+SKDE IKISFGG+KRLVLEAVTFGSFLWDVE++VD+RYHFV+N
Sbjct: 379 PEGTVDSSKDEQIKISFGGMKRLVLEAVTFGSFLWDVESHVDARYHFVLN 421

BLAST of Cp4.1LG20g00040 vs. NCBI nr
Match: XP_023519193.1 (UV-B-induced protein At3g17800, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 795 bits (2053), Expect = 9.96e-290
Identity = 412/412 (100.00%), Postives = 412/412 (100.00%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60
           MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT
Sbjct: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60

Query: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120
           LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ
Sbjct: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120

Query: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180
           QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD
Sbjct: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180

Query: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240
           VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ
Sbjct: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240

Query: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300
           ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA
Sbjct: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300

Query: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360
           EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA
Sbjct: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360

Query: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 412
           ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN
Sbjct: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 412

BLAST of Cp4.1LG20g00040 vs. NCBI nr
Match: XP_022923896.1 (UV-B-induced protein At3g17800, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 790 bits (2041), Expect = 6.72e-288
Identity = 410/412 (99.51%), Postives = 411/412 (99.76%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60
           MEAATGSTSTLAIGIGSPFRDT PRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT
Sbjct: 1   MEAATGSTSTLAIGIGSPFRDTAPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60

Query: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120
           LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ
Sbjct: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120

Query: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180
           QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD
Sbjct: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180

Query: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240
           VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ
Sbjct: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240

Query: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300
           ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA
Sbjct: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300

Query: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360
           EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA
Sbjct: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360

Query: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 412
           ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSR+HFVMN
Sbjct: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRFHFVMN 412

BLAST of Cp4.1LG20g00040 vs. NCBI nr
Match: KAG7020067.1 (UV-B-induced protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 790 bits (2040), Expect = 9.54e-288
Identity = 410/412 (99.51%), Postives = 410/412 (99.51%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60
           MEAATGSTSTLAIGIGSPFRDT PRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT
Sbjct: 1   MEAATGSTSTLAIGIGSPFRDTAPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60

Query: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120
           LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ
Sbjct: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120

Query: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180
           QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD
Sbjct: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180

Query: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240
           VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ
Sbjct: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240

Query: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300
           ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA
Sbjct: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300

Query: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360
           EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERH EALFGRPQIA
Sbjct: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHAEALFGRPQIA 360

Query: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 412
           ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN
Sbjct: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 412

BLAST of Cp4.1LG20g00040 vs. NCBI nr
Match: KAG6584480.1 (UV-B-induced protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 792 bits (2045), Expect = 1.49e-285
Identity = 411/412 (99.76%), Postives = 411/412 (99.76%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60
           MEAATGSTSTLAIGIGSPFRDT PRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT
Sbjct: 184 MEAATGSTSTLAIGIGSPFRDTAPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 243

Query: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120
           LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ
Sbjct: 244 LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 303

Query: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180
           QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD
Sbjct: 304 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 363

Query: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240
           VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ
Sbjct: 364 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 423

Query: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300
           ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA
Sbjct: 424 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 483

Query: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360
           EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA
Sbjct: 484 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 543

Query: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 412
           ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN
Sbjct: 544 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 595

BLAST of Cp4.1LG20g00040 vs. NCBI nr
Match: XP_023001153.1 (UV-B-induced protein At3g17800, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 776 bits (2005), Expect = 2.06e-282
Identity = 404/412 (98.06%), Postives = 407/412 (98.79%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60
           MEAATGSTSTLAIGIGSPFRD  PRPPASRSLYFPS+SLISVPHYRSFVSPSKLGKK+IT
Sbjct: 1   MEAATGSTSTLAIGIGSPFRDIAPRPPASRSLYFPSKSLISVPHYRSFVSPSKLGKKTIT 60

Query: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120
           LPCSGRGRGLGFP VKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ
Sbjct: 61  LPCSGRGRGLGFPTVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120

Query: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180
           QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD
Sbjct: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180

Query: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240
           VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ
Sbjct: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240

Query: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300
           ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA
Sbjct: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300

Query: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360
           EQGDVS GESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA
Sbjct: 301 EQGDVSHGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360

Query: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 412
           ITP GTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRY+FVMN
Sbjct: 361 ITPHGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYNFVMN 412

BLAST of Cp4.1LG20g00040 vs. ExPASy TrEMBL
Match: A0A6J1E7Z1 (UV-B-induced protein At3g17800, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111431480 PE=4 SV=1)

HSP 1 Score: 790 bits (2041), Expect = 3.25e-288
Identity = 410/412 (99.51%), Postives = 411/412 (99.76%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60
           MEAATGSTSTLAIGIGSPFRDT PRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT
Sbjct: 1   MEAATGSTSTLAIGIGSPFRDTAPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60

Query: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120
           LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ
Sbjct: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120

Query: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180
           QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD
Sbjct: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180

Query: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240
           VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ
Sbjct: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240

Query: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300
           ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA
Sbjct: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300

Query: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360
           EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA
Sbjct: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360

Query: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 412
           ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSR+HFVMN
Sbjct: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRFHFVMN 412

BLAST of Cp4.1LG20g00040 vs. ExPASy TrEMBL
Match: A0A6J1KPP2 (UV-B-induced protein At3g17800, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111495375 PE=4 SV=1)

HSP 1 Score: 776 bits (2005), Expect = 9.97e-283
Identity = 404/412 (98.06%), Postives = 407/412 (98.79%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVPHYRSFVSPSKLGKKSIT 60
           MEAATGSTSTLAIGIGSPFRD  PRPPASRSLYFPS+SLISVPHYRSFVSPSKLGKK+IT
Sbjct: 1   MEAATGSTSTLAIGIGSPFRDIAPRPPASRSLYFPSKSLISVPHYRSFVSPSKLGKKTIT 60

Query: 61  LPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120
           LPCSGRGRGLGFP VKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ
Sbjct: 61  LPCSGRGRGLGFPTVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQ 120

Query: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180
           QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD
Sbjct: 121 QLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDAD 180

Query: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240
           VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ
Sbjct: 181 VPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQ 240

Query: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300
           ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA
Sbjct: 241 ISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVEGSFSNAPVHPEISSMAA 300

Query: 301 EQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360
           EQGDVS GESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA
Sbjct: 301 EQGDVSHGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIA 360

Query: 361 ITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 412
           ITP GTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRY+FVMN
Sbjct: 361 ITPHGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYNFVMN 412

BLAST of Cp4.1LG20g00040 vs. ExPASy TrEMBL
Match: A0A6J1C7Y6 (UV-B-induced protein At3g17800, chloroplastic-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111008818 PE=4 SV=1)

HSP 1 Score: 564 bits (1453), Expect = 8.61e-199
Identity = 326/426 (76.53%), Postives = 351/426 (82.39%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVP--HYRSFVSPSKLGKKS 60
           MEAAT STST AIG  S FRD   RP   R L  PS+SL+SVP  H R      ++GKK+
Sbjct: 1   MEAAT-STSTFAIG--SLFRD---RP---RLLSLPSKSLLSVPVAHDRP-----RVGKKA 60

Query: 61  ITLPCSGRGRGLGFPMVKASLSPDPD--GSAAQIAPLRLQSPIGQFLSQILTTHPHLLPA 120
           I    S RGR +GFP+VKA +SPDP   GSAAQIAPL+LQSPIG FLSQILT HPHLLPA
Sbjct: 61  I----SHRGRRVGFPVVKACVSPDPPQAGSAAQIAPLQLQSPIGLFLSQILTIHPHLLPA 120

Query: 121 AVDQQLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRF 180
           A+DQQL+QLQTQRHAEE    P+++ATHDIVLYRRIAEVK  ER+RALEEILYAMVVQRF
Sbjct: 121 AIDQQLEQLQTQRHAEE---PPASAATHDIVLYRRIAEVKENERRRALEEILYAMVVQRF 180

Query: 181 MDADVPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFA 240
           MDA+V LIPAV P S DPYGRVDTW +DDEKLERLHSSEA EMIQNHLAL+LGNRIG+ A
Sbjct: 181 MDANVGLIPAVGPPSGDPYGRVDTWPQDDEKLERLHSSEAGEMIQNHLALILGNRIGESA 240

Query: 241 SVAQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASA------TVEG----SF 300
           SVAQISK+RVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPA A      T EG    S 
Sbjct: 241 SVAQISKVRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPAPADGDAMATHEGEEWDSP 300

Query: 301 SNAPVHPEISSMAAEQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSII 360
           SNA VHPEISSM A    V+P ES +GIKPSRLRTYVMSFDGETLQR ATIRSKEAV +I
Sbjct: 301 SNAAVHPEISSMPAGPAGVTPWESRLGIKPSRLRTYVMSFDGETLQRLATIRSKEAVGLI 360

Query: 361 ERHTEALFGRPQIAITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSR 412
           ERHT+ALFGRPQ  ITPQGTVDT+KDELI ISFGGLKRLVLEAVTFGSFLWDVE YVDSR
Sbjct: 361 ERHTDALFGRPQTVITPQGTVDTTKDELITISFGGLKRLVLEAVTFGSFLWDVEAYVDSR 405

BLAST of Cp4.1LG20g00040 vs. ExPASy TrEMBL
Match: A0A6J1CA19 (UV-B-induced protein At3g17800, chloroplastic-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111008818 PE=4 SV=1)

HSP 1 Score: 551 bits (1419), Expect = 1.14e-193
Identity = 320/420 (76.19%), Postives = 345/420 (82.14%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPASRSLYFPSESLISVP--HYRSFVSPSKLGKKS 60
           MEAAT STST AIG  S FRD   RP   R L  PS+SL+SVP  H R      ++GKK+
Sbjct: 1   MEAAT-STSTFAIG--SLFRD---RP---RLLSLPSKSLLSVPVAHDRP-----RVGKKA 60

Query: 61  ITLPCSGRGRGLGFPMVKASLSPDPD--GSAAQIAPLRLQSPIGQFLSQILTTHPHLLPA 120
           I    S RGR +GFP+VKA +SPDP   GSAAQIAPL+LQSPIG FLSQILT HPHLLPA
Sbjct: 61  I----SHRGRRVGFPVVKACVSPDPPQAGSAAQIAPLQLQSPIGLFLSQILTIHPHLLPA 120

Query: 121 AVDQQLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRF 180
           A+DQQL+QLQTQRHAEE    P+++ATHDIVLYRRIAEVK  ER+RALEEILYAMVVQRF
Sbjct: 121 AIDQQLEQLQTQRHAEE---PPASAATHDIVLYRRIAEVKENERRRALEEILYAMVVQRF 180

Query: 181 MDADVPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFA 240
           MDA+V LIPAV P S DPYGRVDTW +DDEKLERLHSSEA EMIQNHLAL+LGNRIG+ A
Sbjct: 181 MDANVGLIPAVGPPSGDPYGRVDTWPQDDEKLERLHSSEAGEMIQNHLALILGNRIGESA 240

Query: 241 SVAQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASA------TVEG----SF 300
           SVAQISK+RVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPA A      T EG    S 
Sbjct: 241 SVAQISKVRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPAPADGDAMATHEGEEWDSP 300

Query: 301 SNAPVHPEISSMAAEQGDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSII 360
           SNA VHPEISSM A    V+P ES +GIKPSRLRTYVMSFDGETLQR ATIRSKEAV +I
Sbjct: 301 SNAAVHPEISSMPAGPAGVTPWESRLGIKPSRLRTYVMSFDGETLQRLATIRSKEAVGLI 360

Query: 361 ERHTEALFGRPQIAITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSR 406
           ERHT+ALFGRPQ  ITPQGTVDT+KDELI ISFGGLKRLVLEAVTFGSFLWDVE YVDSR
Sbjct: 361 ERHTDALFGRPQTVITPQGTVDTTKDELITISFGGLKRLVLEAVTFGSFLWDVEAYVDSR 399

BLAST of Cp4.1LG20g00040 vs. ExPASy TrEMBL
Match: A0A061FWL1 (Uncharacterized protein isoform 2 OS=Theobroma cacao OX=3641 GN=TCM_013026 PE=4 SV=1)

HSP 1 Score: 490 bits (1262), Expect = 2.06e-169
Identity = 279/444 (62.84%), Postives = 322/444 (72.52%), Query Frame = 0

Query: 1   MEAATGSTSTLAIGIGSPFRDTDPRPPAS--RSLYFPSESLISVPHYRSFV--------- 60
           M+AAT S S +   +      T  RPP+S  RS    +      PH+  F          
Sbjct: 1   MDAATASASVVGSSM------TTRRPPSSVTRSAILTANE----PHFLRFAAKPRLPFSI 60

Query: 61  ---SPSKLGKKSITLPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQI 120
              SP    K        G  RG+   +V+AS SPD  G  A IAPL+++SPIGQFLSQI
Sbjct: 61  KHYSPLSYSKPQNRRMALGSRRGM---VVRASSSPDSAGPTAPIAPLQMESPIGQFLSQI 120

Query: 121 LTTHPHLLPAAVDQQLQQLQTQRHAEELTQEPSASATHDIVLYRRIAEVKAIERKRALEE 180
           L +HPHL+PAAV+QQL+QLQT R AEE  +EPSASA  D+VLYRRIAEVKA ERK+ALEE
Sbjct: 121 LISHPHLVPAAVEQQLEQLQTDRDAEEKKEEPSASAGTDLVLYRRIAEVKANERKKALEE 180

Query: 181 ILYAMVVQRFMDADVPLIPAVAPSSTDPYGRVDTWARDDEKLERLHSSEASEMIQNHLAL 240
           ILYA+VVQ+FMDA+V L+PA+ PSSTDP GRVD W  +++KLE LHS EA EMIQNHLAL
Sbjct: 181 ILYALVVQKFMDANVSLVPAMTPSSTDPSGRVDMWPSEEDKLELLHSPEAYEMIQNHLAL 240

Query: 241 VLGNRIGDFASVAQISKLRVGQVYAASVMYGYFLKRVDERFQLEKTVKVLPASATVE--- 300
           +LGNR+GD  SVAQISKLRVGQVYAASVMYGYFLKRVD+RFQLEKT+K+LP ++  E   
Sbjct: 241 ILGNRLGDSTSVAQISKLRVGQVYAASVMYGYFLKRVDQRFQLEKTMKILPNASNGEESG 300

Query: 301 ---------------GSFSNAPVHPEISSMAAEQGDVSPGESGMGIKPSRLRTYVMSFDG 360
                           S+     HPE+SS +   G +SPG  G GIKP RLRTYVMSFDG
Sbjct: 301 VEQSVGEDMGTAGLGDSYKAVSSHPEVSSWS---GGISPGGFGHGIKPCRLRTYVMSFDG 360

Query: 361 ETLQRFATIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKISFGGLKRLVLE 412
           ETLQ+FA IRSKEAVSIIE+HTEALFGRP+I ITPQGTVD+SKDELIKISF GLKRLVLE
Sbjct: 361 ETLQKFAAIRSKEAVSIIEKHTEALFGRPEIVITPQGTVDSSKDELIKISFNGLKRLVLE 420

BLAST of Cp4.1LG20g00040 vs. TAIR 10
Match: AT3G17800.1 (Protein of unknown function (DUF760) )

HSP 1 Score: 426.0 bits (1094), Expect = 3.4e-119
Identity = 230/350 (65.71%), Postives = 275/350 (78.57%), Query Frame = 0

Query: 77  ASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHAEELTQ 136
           AS       S   IAPL+LQSP GQFLSQIL +HPHL+PAAV+QQL+QLQT R ++   +
Sbjct: 79  ASNDASSGSSPKPIAPLQLQSPAGQFLSQILVSHPHLVPAAVEQQLEQLQTDRDSQGQNK 138

Query: 137 EPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDADVPLIPAVAPSSTDPYG 196
           + ++    DIVLYRRIAE+K  ER+R LEEILYA+VVQ+FM+A+V L+P+V+PSS DP G
Sbjct: 139 DSASVPGTDIVLYRRIAELKENERRRTLEEILYALVVQKFMEANVSLVPSVSPSS-DPSG 198

Query: 197 RVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQISKLRVGQVYAASVMY 256
           RVDTW    EKLERLHS E  EMI NHLAL+LG+R+GD  SVAQISKLRVGQVYAASVMY
Sbjct: 199 RVDTWPTKVEKLERLHSPEMYEMIHNHLALILGSRMGDLNSVAQISKLRVGQVYAASVMY 258

Query: 257 GYFLKRVDERFQLEKTVKVLP--------------ASATVEGSFSNAPVHPEISSMAAEQ 316
           GYFLKRVD+RFQLEKT+K+LP               +AT + + S+   HPE+ + A   
Sbjct: 259 GYFLKRVDQRFQLEKTMKILPGGSDESKTSVEQAEGTATYQAAVSS---HPEVGAFA--- 318

Query: 317 GDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIAIT 376
           G VS    G  IKPSRLR+YVMSFD ETLQR+ATIRS+EAV IIE+HTEALFG+P+I IT
Sbjct: 319 GGVSAKGFGSEIKPSRLRSYVMSFDAETLQRYATIRSREAVGIIEKHTEALFGKPEIVIT 378

Query: 377 PQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 413
           P+GTVD+SKDE IKISFGG+KRLVLEAVTFGSFLWDVE++VD+RYHFV+N
Sbjct: 379 PEGTVDSSKDEQIKISFGGMKRLVLEAVTFGSFLWDVESHVDARYHFVLN 421

BLAST of Cp4.1LG20g00040 vs. TAIR 10
Match: AT3G17800.2 (Protein of unknown function (DUF760) )

HSP 1 Score: 426.0 bits (1094), Expect = 3.4e-119
Identity = 230/350 (65.71%), Postives = 275/350 (78.57%), Query Frame = 0

Query: 77  ASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHAEELTQ 136
           AS       S   IAPL+LQSP GQFLSQIL +HPHL+PAAV+QQL+QLQT R ++   +
Sbjct: 85  ASNDASSGSSPKPIAPLQLQSPAGQFLSQILVSHPHLVPAAVEQQLEQLQTDRDSQGQNK 144

Query: 137 EPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDADVPLIPAVAPSSTDPYG 196
           + ++    DIVLYRRIAE+K  ER+R LEEILYA+VVQ+FM+A+V L+P+V+PSS DP G
Sbjct: 145 DSASVPGTDIVLYRRIAELKENERRRTLEEILYALVVQKFMEANVSLVPSVSPSS-DPSG 204

Query: 197 RVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQISKLRVGQVYAASVMY 256
           RVDTW    EKLERLHS E  EMI NHLAL+LG+R+GD  SVAQISKLRVGQVYAASVMY
Sbjct: 205 RVDTWPTKVEKLERLHSPEMYEMIHNHLALILGSRMGDLNSVAQISKLRVGQVYAASVMY 264

Query: 257 GYFLKRVDERFQLEKTVKVLP--------------ASATVEGSFSNAPVHPEISSMAAEQ 316
           GYFLKRVD+RFQLEKT+K+LP               +AT + + S+   HPE+ + A   
Sbjct: 265 GYFLKRVDQRFQLEKTMKILPGGSDESKTSVEQAEGTATYQAAVSS---HPEVGAFA--- 324

Query: 317 GDVSPGESGMGIKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIAIT 376
           G VS    G  IKPSRLR+YVMSFD ETLQR+ATIRS+EAV IIE+HTEALFG+P+I IT
Sbjct: 325 GGVSAKGFGSEIKPSRLRSYVMSFDAETLQRYATIRSREAVGIIEKHTEALFGKPEIVIT 384

Query: 377 PQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 413
           P+GTVD+SKDE IKISFGG+KRLVLEAVTFGSFLWDVE++VD+RYHFV+N
Sbjct: 385 PEGTVDSSKDEQIKISFGGMKRLVLEAVTFGSFLWDVESHVDARYHFVLN 427

BLAST of Cp4.1LG20g00040 vs. TAIR 10
Match: AT1G48450.1 (Protein of unknown function (DUF760) )

HSP 1 Score: 415.2 bits (1066), Expect = 6.0e-116
Identity = 228/357 (63.87%), Postives = 273/357 (76.47%), Query Frame = 0

Query: 74  MVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHAEE 133
           +VKAS S   D S   IAPL+L+SP+GQFLSQIL +HPHL+PAAV+QQL+QLQ  R AEE
Sbjct: 69  VVKASAS--GDASTESIAPLQLKSPVGQFLSQILVSHPHLVPAAVEQQLEQLQIDRDAEE 128

Query: 134 LTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDADVPLIPAVAPSSTD 193
            +++ S+    DIVLYRRIAEVK  ER+RALEEILYA+VVQ+FMDA+V L+P++  SS D
Sbjct: 129 QSKDASSVLGTDIVLYRRIAEVKEKERRRALEEILYALVVQKFMDANVTLVPSITSSSAD 188

Query: 194 PYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQISKLRVGQVYAAS 253
           P GRVDTW   D +LERLHS E  EMIQNHL+++L NR  D  +VAQISKL VGQVYAAS
Sbjct: 189 PSGRVDTWPTLDGELERLHSPEVYEMIQNHLSIILKNRTDDLTAVAQISKLGVGQVYAAS 248

Query: 254 VMYGYFLKRVDERFQLEKTVKVLP------------ASATVEGSF-SNAPVHPEISSMAA 313
           VMYGYFLKR+D+RFQLEKT+++LP            A   VE +F   A    +  S   
Sbjct: 249 VMYGYFLKRIDQRFQLEKTMRILPGGSDEGETSIEQAGRDVERNFYEEAEETYQAVSSNQ 308

Query: 314 EQGDVSPGESGMG-----IKPSRLRTYVMSFDGETLQRFATIRSKEAVSIIERHTEALFG 373
           + G    G +  G     +K SRL+TYVMSFDGETLQR+ATIRS+E+V IIE+HTEALFG
Sbjct: 309 DVGSFVGGINASGGFSSDMKQSRLKTYVMSFDGETLQRYATIRSRESVGIIEKHTEALFG 368

Query: 374 RPQIAITPQGTVDTSKDELIKISFGGLKRLVLEAVTFGSFLWDVETYVDSRYHFVMN 413
           RP+I ITPQGT+D+SKDE IKISF GLKRLVLEAVTFGSFLWDVE++VDSRYHFV+N
Sbjct: 369 RPEIVITPQGTIDSSKDEHIKISFKGLKRLVLEAVTFGSFLWDVESHVDSRYHFVLN 423

BLAST of Cp4.1LG20g00040 vs. TAIR 10
Match: AT1G32160.1 (Protein of unknown function (DUF760) )

HSP 1 Score: 303.1 bits (775), Expect = 3.3e-82
Identity = 179/389 (46.02%), Postives = 253/389 (65.04%), Query Frame = 0

Query: 35  PSESLISVP-HYRSFVSPSKLGKKSITLPCSGRGRGLGFPMVKASLSPDPDGSAAQIAPL 94
           PS S   +P    SF  P KLG  S     +GRGR +    V+AS   D + + A +AP+
Sbjct: 28  PSSSPSLLPQRCHSFCIP-KLGSSSTNE--NGRGRSV---TVRASGDEDSNENFAPLAPV 87

Query: 95  RLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHAEELTQEPSASATHDIVLYRRIA 154
            L+SP+GQ L QIL THPHLLP  VD+QL++      AE  +++  +S+T DI L +RI+
Sbjct: 88  ELESPVGQLLEQILRTHPHLLPVTVDEQLEKFA----AESESRKADSSSTQDI-LQKRIS 147

Query: 155 EVKAIERKRALEEILYAMVVQRFMDADVPLIPAVAPSSTDPYGRVDTWARDDEKLERLHS 214
           EV+  ER++ L EI+Y +VV RF++  + +IP + P+S DP GR+D W   +EKLE +HS
Sbjct: 148 EVRDKERRKTLAEIIYCLVVHRFVEKGISMIPRIKPTS-DPAGRIDLWPNQEEKLEVIHS 207

Query: 215 SEASEMIQNHLALVLGN--RIGDFASVAQISKLRVGQVYAASVMYGYFLKRVDERFQLEK 274
           ++A EMIQ+HL+ VLG+   +G  +S+ QI K+++G++YAAS MYGYFL+RVD+R+QLE+
Sbjct: 208 ADAFEMIQSHLSSVLGDGPAVGPLSSIVQIGKIKLGKLYAASAMYGYFLRRVDQRYQLER 267

Query: 275 TVKVLP--ASATVEGSFSNAPVHP---EISSMAAEQGDVSPGESGMGIKPSR-----LRT 334
           T+  LP     T E     +P +P     S +  +  +  P E  +           LR+
Sbjct: 268 TMNTLPKRPEKTRERFEEPSPPYPLWDPDSLIRIQPEEYDPDEYAIQRNEDESSSYGLRS 327

Query: 335 YVMSFDGETLQRFATIRSKEAVSIIERHTEALFGRPQIAITPQGTVDTSKDELIKISFGG 394
           YV   D +TLQR+ATIRSKEA+++IE+ T+ALFGRP I I   G +DTS DE++ +S  G
Sbjct: 328 YVTYLDSDTLQRYATIRSKEAMTLIEKQTQALFGRPDIRILEDGKLDTSNDEVLSLSVSG 387

Query: 395 LKRLVLEAVTFGSFLWDVETYVDSRYHFV 411
           L  LVLEAV FGSFLWD E+YV+S+YHF+
Sbjct: 388 LAMLVLEAVAFGSFLWDSESYVESKYHFL 404

BLAST of Cp4.1LG20g00040 vs. TAIR 10
Match: AT1G48450.2 (Protein of unknown function (DUF760) )

HSP 1 Score: 260.4 bits (664), Expect = 2.5e-69
Identity = 137/207 (66.18%), Postives = 167/207 (80.68%), Query Frame = 0

Query: 74  MVKASLSPDPDGSAAQIAPLRLQSPIGQFLSQILTTHPHLLPAAVDQQLQQLQTQRHAEE 133
           +VKAS S   D S   IAPL+L+SP+GQFLSQIL +HPHL+PAAV+QQL+QLQ  R AEE
Sbjct: 69  VVKASAS--GDASTESIAPLQLKSPVGQFLSQILVSHPHLVPAAVEQQLEQLQIDRDAEE 128

Query: 134 LTQEPSASATHDIVLYRRIAEVKAIERKRALEEILYAMVVQRFMDADVPLIPAVAPSSTD 193
            +++ S+    DIVLYRRIAEVK  ER+RALEEILYA+VVQ+FMDA+V L+P++  SS D
Sbjct: 129 QSKDASSVLGTDIVLYRRIAEVKEKERRRALEEILYALVVQKFMDANVTLVPSITSSSAD 188

Query: 194 PYGRVDTWARDDEKLERLHSSEASEMIQNHLALVLGNRIGDFASVAQISKLRVGQVYAAS 253
           P GRVDTW   D +LERLHS E  EMIQNHL+++L NR  D  +VAQISKL VGQVYAAS
Sbjct: 189 PSGRVDTWPTLDGELERLHSPEVYEMIQNHLSIILKNRTDDLTAVAQISKLGVGQVYAAS 248

Query: 254 VMYGYFLKRVDERFQLEKTVKVLPASA 281
           VMYGYFLKR+D+RFQLEKT+++LP  +
Sbjct: 249 VMYGYFLKRIDQRFQLEKTMRILPGGS 273

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LVJ04.8e-11865.71UV-B-induced protein At3g17800, chloroplastic OS=Arabidopsis thaliana OX=3702 GN... [more]
Match NameE-valueIdentityDescription
XP_023519193.19.96e-290100.00UV-B-induced protein At3g17800, chloroplastic-like [Cucurbita pepo subsp. pepo][more]
XP_022923896.16.72e-28899.51UV-B-induced protein At3g17800, chloroplastic-like [Cucurbita moschata][more]
KAG7020067.19.54e-28899.51UV-B-induced protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma][more]
KAG6584480.11.49e-28599.76UV-B-induced protein, chloroplastic, partial [Cucurbita argyrosperma subsp. soro... [more]
XP_023001153.12.06e-28298.06UV-B-induced protein At3g17800, chloroplastic-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1E7Z13.25e-28899.51UV-B-induced protein At3g17800, chloroplastic-like OS=Cucurbita moschata OX=3662... [more]
A0A6J1KPP29.97e-28398.06UV-B-induced protein At3g17800, chloroplastic-like OS=Cucurbita maxima OX=3661 G... [more]
A0A6J1C7Y68.61e-19976.53UV-B-induced protein At3g17800, chloroplastic-like isoform X1 OS=Momordica chara... [more]
A0A6J1CA191.14e-19376.19UV-B-induced protein At3g17800, chloroplastic-like isoform X2 OS=Momordica chara... [more]
A0A061FWL12.06e-16962.84Uncharacterized protein isoform 2 OS=Theobroma cacao OX=3641 GN=TCM_013026 PE=4 ... [more]
Match NameE-valueIdentityDescription
AT3G17800.13.4e-11965.71Protein of unknown function (DUF760) [more]
AT3G17800.23.4e-11965.71Protein of unknown function (DUF760) [more]
AT1G48450.16.0e-11663.87Protein of unknown function (DUF760) [more]
AT1G32160.13.3e-8246.02Protein of unknown function (DUF760) [more]
AT1G48450.22.5e-6966.18Protein of unknown function (DUF760) [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008479Protein of unknown function DUF760PFAMPF05542DUF760coord: 147..272
e-value: 1.4E-21
score: 76.7
coord: 319..400
e-value: 4.4E-4
score: 20.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..28
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..15
NoneNo IPR availablePANTHERPTHR31808:SF4EXPRESSED PROTEINcoord: 73..412
IPR038925UV-B-induced protein At3g17800-likePANTHERPTHR31808EXPRESSED PROTEINcoord: 73..412

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g00040.1Cp4.1LG20g00040.1mRNA