CmaCh02G005210 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh02G005210
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic-like isoform X2
Location: Cma_Chr02: 2861210 .. 2865160 (+)
RNA-Seq Expression: CmaCh02G005210
Synteny: CmaCh02G005210
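
The Location field packs the chromosome, start and end coordinates, and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3 convention; this page does not state it explicitly), the genomic span of the gene could be derived as follows. The function name `parse_location` is hypothetical and not part of any database tooling.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Cma_Chr02: 2861210 .. 2865160 (+)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr02: 2861210 .. 2865160 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 3951 bp on the plus strand of Cma_Chr02; if the coordinates were 0-based half-open instead, the length would be one less.
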
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTGTTCACAGAAAAGATGAGATAAAAGCCGTTATCAATGAGGCCCATGTATTGCTGCGTTTAGCGATTGTAAATCGAAAAGGTCGAAAAAGCAGGATGAAAATTGGTGGTGGAGTGGTCTGTGGAAGTCCACGCGCCGCCGTTCTACCCTCACTGCTCCTCGGCCGCCGTGGAGTCACAATCCGTTGCGCTTCATCTTCTTCGACTTCCAGTAAGTTCACCGCGAGCTCATTTTTCGCACATTTCCCTAATAGATTTTCAGTATCTTAATGCCTTTTTGAACCATCGCCGTGTTGCAGACCATGTATCATTCATCAAGGATGTTGCGGCAACTGAGCCTCCTCAGCATTTGTCTAATTTGTTGAAAATGCTGAAGACTAGAGGTATGTCGTTGGTCAAGCGCTTCAAATCATTCCTATCATGAATTTTATACCAATTAATCGCTAACCTGCCCGATTGAATCCATCAATTTTGTCCAGGCACTGAATTCTTCGTCCTTTGGAGTTGTTTACAAATTTTATTAGGAGTGTGAAGCTTGACTCTCTAATTAGAATTGAAACAATAATGTCCTCGTGAATAGATTTCGATTCTATTGGAAATCCGTAGACCTAATTGGTTATATCTCAACATTTTTATAAAACGATTAATAACCTGAGTTTTTGCTTCAATTTTCCTGTGGCACAAGTTTTTATTATTGTACGCTTAATGTATCGTTATGTATGCGTTTCTGCAATATATTGGGCTTGGTTGTTCCTTTATTTTTTTTTATGGCTACTTCTCTCATAAATTAAGGCCTCTAATAGCATTTGAATCCCTGGAGGTCTAGTGGTCGAGGCCCTTCTTGAACTTCATTCCATGTTCTATTTTCTGAAAATTATATGCTGAGTCATGTCATGCCTATCTGCATCCCCAATGGTTCTATCAGCTTCTTTGCATTTTCCTCTGCCCTATAAACTGTTGCATTGGCTTTTCTTATACTAGTAAATACTAAAAATAACTATAGAATTATACAACCAATTATAGGCCTGAGTTGCCCCATTTTTTGAACCTGCAGACATATTCTTATCTACATGGGAATCTCCTGTAAATTGTTGGTTTTAAGCAGGTGAATCCATAATTTCTCCGGGAGCCAAGCAAGGAATTATTCCTCTTGCCATTCCACTGGCTAAAAACAACTCAGGTATTATGTTAGAATTTTTCATTCAAATTGGTTCGCTTTGTAGGTGTATGATATCTCAATCTATTCTTTCTTCTTGTTTCAAGGTACTATAACTGCATTGCTGCGCTGGCCAACAGCACCCGCTGGGTAATAAGAATAAACTTCTTACATTTTCTTATGGGTGGTTCATCCTCCATCTTAATAAACTTCTATCCTCAGGATGGAAATGCCAGTAGTGGACGTCAATAGGAATGGAGTGTGGCTACTAGCCAAGAACGTAAGTGAAGCATCTATTGTGCAATTGTTTCCCATTTTTCAATGTTCTAAAACCATTTCCTTAGACACTCTGATATAGAATTCAGTTCTCTGTTGTGCAGGTGGATCAATTTATTCACAGACTTCTAGTGGAAGAAGATGCCAAAGGAAGTGGAGAGCAAAATGATGAGCTATTTCTTGCAGCAGCTGATGCTGGGAAGAAACTTTATGTAAGGGGTGATTTTGCTGAATCTCAGATCAAAAACATTGATGGGTATTTGCTGAAAAAGGTATTCTACAATTTCTCATGGTACTTCTCTTGATTGTCCCCATCAACGGATAAATTTGTATACAAATGTTTTCTTAAATGGGGCAAGGCGTTTCCATGTGGTGACAGACAAGTTGGTTGTGCTCCTGACAGAGCTGCTTGCCTTGATCATCACCTTTTTGGGGTCTCTCTAAAGAAACAGAATTTTCAGTTTCTGATGTTTTTAGACTTCAGAATCTTCTTTTTCATGCTTCTTGAAAATAAAAGATAATTTTTTATGCAAAAAATATTTTTATGCATAAACAGGTATCTTTAAAGC
ATATTTAAGTAATATATTTTATCTACTCTACTTCTGTCTTGTTGCTGAAGAAGTAACATTTTTGCATAATGCATTTTGTGCTGGCATTATAGCCTTCAATTAGAAAATTTCCTCTTTTCTTTTCTTTTTCCTGTTTTTAAGTCTCACTTCACACAAGTTATACTTCCTAAGCACATTTTGGGTGAACATAATAATAAAAAAAGGTTTATACCATTTGCAATGTATCTTGCAGTGCCAGGTCTGACAATGTTTCAATCTCAGGTTGGGATATTTCCAGATATCATAGAACGTAAAATATTGCGCCATTTTGAAGAAGGCGATCTTGTAAGGCTGTTGTTTATGAAATACTGAAATGCAGTGAGTGAAAGAAGCTTTGACTCCAATTTACGCACTCGAATATCTTTACCTGCTTGTGTAGGTTTCAGCTTTGGTGACGGGAGAATTCTATACTAAAAAGGAGCACTTCCCAGGATTTGCACGGCCATATGTATTTAATGCAGAGGTTTTGCTGAAGTATGTGGGGATTTTTTTCATCACTCCAAAAATGCTTTGGCTGAGAATGTACCTAAAGAATCTGTGCTTTCTTCATGCTTCAGGGTGGGACGTAAAACAGAAGCAAAGGATGCTGCGAGGGGAGCCTTAAAATCACCATGGTGGACCCTAGGCTGTAAGTATGAGGTATTGACTTTCTGGTTGATCTCTTTTGCCTTTTTCTTAATAGCTCTCTGTTTTAAGCCGCTTGCCTTGGGAGAGATGGGGAAAGATTAAGCAAAGAATATTTGCAAAAAGTTGATCCTTGAGTTGGTTTTGGCTCATATTCAGGAAGTTGCTAATATCGCTCAATGGGAAGATGAACAAATTGAGTATTTGAAAGAGAAGGTCACAGAAGAAGGAAAGCTAGAAGACCTCAAAAAGGGAAAGGCTCCTGCCCAGGTTTTTCACAGTTCTTTATGGATTGCTTTAAAATTATCAAGCAGACAAACATGAATTAGAACAGGTTATTTTGTCAAAACTACCCTTTGCATATGAACATTTTCATGTGTGTTGAGTGTCCATTTGATAACCATTTGATTTTTTTTTGCTAATTAAGCTATAAATATTACATCCACCTGTGGATTTCTTTGTTTTGTTATCTACTTCTCACCTGTGTTTTAAAAAACTAACCCAAGTTTTGAAAACTAAAAATAGTTGTGTTCAAAAAATAGTTTTTGTTTTTGAAATTTGGATTAAGAATTCAAAAGTTTTCTTAGGAAAGATGAAACCTATCATAGAGAAATGATGAGAAAACAAGTATAATTTTCAGAAATCAAATGGCTATCTGACCGGGTCTAAACTAAACAAGTGGGTAACTTGTGTCTATGTCTTTCATATTAATAATCTATTATTATTTTTAAGTTTAAACTCCTTTCAATATGTTGACAATACTACTGTTGAAACATTAAAACTATACGAAAAATCTAGATTGAAGATCAATTTTAAACCAAGAAATATCATTTAGTAGACTAAAAAATTACTTCTATAAATTTTGTTACTGGCTCGTGCACTGCAGGTTGCCTTGGATCAAGCTGCCTTTTTGTTGGATTTAGCTTCGGTTGATGGGACTTGGGACATCTCTGTGGAGCGTATTGCTCAATGTTATGAGGAGGCTGGCCTTCAGGAGATTGCGAGATTCGTACTTTACAGAGGCTGAATACAAACAGAGGCATTTTTCTTCTCTAAACTCTTCATTTTCATTGTCTTCTTCCTTCTGGCTATATAATTATATCTGTATTTCTTCATTGTGTAAACATCATTCTTTCTCTCCCCTATGGATGAGACGATCCTTTCCCTACAAGAATCGGGTAGACTGAGGGACAAGGAATACTCAAAGTTTCTCTTGATAAATTATAATGTTTTTTTCTAAAGCATATAAGAAATGTCAGAGAAATGTGTTTGAACATTTTTGTTTCCG

mRNA sequence

TTGTTCACAGAAAAGATGAGATAAAAGCCGTTATCAATGAGGCCCATGTATTGCTGCGTTTAGCGATTGTAAATCGAAAAGGTCGAAAAAGCAGGATGAAAATTGGTGGTGGAGTGGTCTGTGGAAGTCCACGCGCCGCCGTTCTACCCTCACTGCTCCTCGGCCGCCGTGGAGTCACAATCCGTTGCGCTTCATCTTCTTCGACTTCCAACCATGTATCATTCATCAAGGATGTTGCGGCAACTGAGCCTCCTCAGCATTTGTCTAATTTGTTGAAAATGCTGAAGACTAGAGGTGAATCCATAATTTCTCCGGGAGCCAAGCAAGGAATTATTCCTCTTGCCATTCCACTGGCTAAAAACAACTCAGGTACTATAACTGCATTGCTGCGCTGGCCAACAGCACCCGCTGGGATGGAAATGCCAGTAGTGGACGTCAATAGGAATGGAGTGTGGCTACTAGCCAAGAACGTGGATCAATTTATTCACAGACTTCTAGTGGAAGAAGATGCCAAAGGAAGTGGAGAGCAAAATGATGAGCTATTTCTTGCAGCAGCTGATGCTGGGAAGAAACTTTATGTAAGGGGTGATTTTGCTGAATCTCAGATCAAAAACATTGATGGGTATTTGCTGAAAAAGGTTGGGATATTTCCAGATATCATAGAACGTAAAATATTGCGCCATTTTGAAGAAGGCGATCTTGTTTCAGCTTTGGTGACGGGAGAATTCTATACTAAAAAGGAGCACTTCCCAGGATTTGCACGGCCATATGTATTTAATGCAGAGGTTTTGCTGAAGGTGGGACGTAAAACAGAAGCAAAGGATGCTGCGAGGGGAGCCTTAAAATCACCATGGTGGACCCTAGGCTGTAAGTATGAGGAAGTTGCTAATATCGCTCAATGGGAAGATGAACAAATTGAGTATTTGAAAGAGAAGGTCACAGAAGAAGGAAAGCTAGAAGACCTCAAAAAGGGAAAGGCTCCTGCCCAGGTTGCCTTGGATCAAGCTGCCTTTTTGTTGGATTTAGCTTCGGTTGATGGGACTTGGGACATCTCTGTGGAGCGTATTGCTCAATGTTATGAGGAGGCTGGCCTTCAGGAGATTGCGAGATTCGTACTTTACAGAGGCTGAATACAAACAGAGGCATTTTTCTTCTCTAAACTCTTCATTTTCATTGTCTTCTTCCTTCTGGCTATATAATTATATCTGTATTTCTTCATTGTGTAAACATCATTCTTTCTCTCCCCTATGGATGAGACGATCCTTTCCCTACAAGAATCGGGTAGACTGAGGGACAAGGAATACTCAAAGTTTCTCTTGATAAATTATAATGTTTTTTTCTAAAGCATATAAGAAATGTCAGAGAAATGTGTTTGAACATTTTTGTTTCCG

Coding sequence (CDS)

ATGAAAATTGGTGGTGGAGTGGTCTGTGGAAGTCCACGCGCCGCCGTTCTACCCTCACTGCTCCTCGGCCGCCGTGGAGTCACAATCCGTTGCGCTTCATCTTCTTCGACTTCCAACCATGTATCATTCATCAAGGATGTTGCGGCAACTGAGCCTCCTCAGCATTTGTCTAATTTGTTGAAAATGCTGAAGACTAGAGGTGAATCCATAATTTCTCCGGGAGCCAAGCAAGGAATTATTCCTCTTGCCATTCCACTGGCTAAAAACAACTCAGGTACTATAACTGCATTGCTGCGCTGGCCAACAGCACCCGCTGGGATGGAAATGCCAGTAGTGGACGTCAATAGGAATGGAGTGTGGCTACTAGCCAAGAACGTGGATCAATTTATTCACAGACTTCTAGTGGAAGAAGATGCCAAAGGAAGTGGAGAGCAAAATGATGAGCTATTTCTTGCAGCAGCTGATGCTGGGAAGAAACTTTATGTAAGGGGTGATTTTGCTGAATCTCAGATCAAAAACATTGATGGGTATTTGCTGAAAAAGGTTGGGATATTTCCAGATATCATAGAACGTAAAATATTGCGCCATTTTGAAGAAGGCGATCTTGTTTCAGCTTTGGTGACGGGAGAATTCTATACTAAAAAGGAGCACTTCCCAGGATTTGCACGGCCATATGTATTTAATGCAGAGGTTTTGCTGAAGGTGGGACGTAAAACAGAAGCAAAGGATGCTGCGAGGGGAGCCTTAAAATCACCATGGTGGACCCTAGGCTGTAAGTATGAGGAAGTTGCTAATATCGCTCAATGGGAAGATGAACAAATTGAGTATTTGAAAGAGAAGGTCACAGAAGAAGGAAAGCTAGAAGACCTCAAAAAGGGAAAGGCTCCTGCCCAGGTTGCCTTGGATCAAGCTGCCTTTTTGTTGGATTTAGCTTCGGTTGATGGGACTTGGGACATCTCTGTGGAGCGTATTGCTCAATGTTATGAGGAGGCTGGCCTTCAGGAGATTGCGAGATTCGTACTTTACAGAGGCTGA

Protein sequence

MKIGGGVVCGSPRAAVLPSLLLGRRGVTIRCASSSSTSNHVSFIKDVAATEPPQHLSNLLKMLKTRGESIISPGAKQGIIPLAIPLAKNNSGTITALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQNDELFLAAADAGKKLYVRGDFAESQIKNIDGYLLKKVGIFPDIIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLDLASVDGTWDISVERIAQCYEEAGLQEIARFVLYRG
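
The protein sequence is the conceptual translation of the CDS listed above it. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1), which is the expected table for a nuclear-encoded plant gene. The `translate` helper is illustrative only and is not how the database itself produced the sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS (length divisible by 3) up to the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Ambiguous bases such as N are not handled and raise KeyError.
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS shown above.
print(translate("ATGAAAATTGGTGGTGGAGTG"))
print(translate("TACAGAGGCTGA"))
```

The two fragments translate to MKIGGGV and YRG, matching the start and end of the protein sequence on this page.
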
Homology
BLAST of CmaCh02G005210 vs. ExPASy Swiss-Prot
Match: Q94JY0 (Protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PAB PE=1 SV=1)

HSP 1 Score: 453.8 bits (1166), Expect = 1.8e-126
Identity = 224/333 (67.27%), Positives = 276/333 (82.88%), Query Frame = 0

Query: 10  GSPRAAVLPSLLLGRRGVTIRCASSSSTSNHVSFIKDVAATEPPQHLSNLLKMLKTRGES 69
           GS    + PS  L  R    R   S  +S HVSFIKDVAATEPP HL +LLK+L+TRGE+
Sbjct: 2   GSISMHITPSTALPIR--HFRARVSCCSSGHVSFIKDVAATEPPMHLHHLLKVLQTRGET 61

Query: 70  IISPGAKQGIIPLAIPLAKNNSGTITALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQF 129
           IISPGAKQG+IPLAIPL+KN+SG++TALLRWPTAP GM+MPVV+V R+GV L+A+NVD++
Sbjct: 62  IISPGAKQGLIPLAIPLSKNSSGSVTALLRWPTAPPGMDMPVVEVWRSGVRLIARNVDEY 121

Query: 130 IHRLLVEEDAKGSGEQNDELFLAAADAGKKLYVRGDFAESQIKNIDGYLLKKVGIFPDII 189
           IHR+LVEEDA    ++  EL+ A+ +AG+KLY +G FAES+I N+D Y+LKKVG+FPD++
Sbjct: 122 IHRILVEEDA----QELTELYRASGEAGEKLYEKGAFAESEIDNLDVYVLKKVGLFPDLL 181

Query: 190 ERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGAL 249
           ERK+LRHF+EGD VSA+VTGEFYTKK+ FPGF RP+V+ A +L KVGR  EAKDAAR AL
Sbjct: 182 ERKVLRHFDEGDHVSAMVTGEFYTKKDLFPGFGRPFVYYANILQKVGRNVEAKDAARVAL 241

Query: 250 KSPWWTLGCKYEEVANIAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLD 309
           +SPWWTLGC YEEVA+IAQWEDEQIE+++EKV++EG+ EDL KGKAP QVALD AAFLLD
Sbjct: 242 RSPWWTLGCPYEEVASIAQWEDEQIEFIREKVSDEGRFEDLHKGKAPIQVALDVAAFLLD 301

Query: 310 LASVDGTWDISVERIAQCYEEAGLQEIARFVLY 343
           LAS++GTW  S+  IA+CYEEAGL  I+ FVLY
Sbjct: 302 LASIEGTWSESLNHIAKCYEEAGLHHISNFVLY 328
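
The percentages in each HSP header are the identical and positive (conservatively substituted) positions divided by the alignment length, gaps included. As a rough sketch, they can be recomputed from the counts in the header line; `hsp_percentages` is a hypothetical helper, and the pattern accepts any label beginning with "Pos" for the positives field.

```python
import re

def hsp_percentages(header):
    """Return (identity %, positives %) recomputed from an HSP header line."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", header)
    pos = re.search(r"Pos\w*\s*=\s*(\d+)/(\d+)", header)
    if ident is None or pos is None:
        raise ValueError("not an HSP header line")

    def pct(m):
        # matches / alignment length, as a percentage rounded to 2 decimals
        return round(100 * int(m.group(1)) / int(m.group(2)), 2)

    return pct(ident), pct(pos)

line = "Identity = 224/333 (67.27%), Positives = 276/333 (82.88%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Swiss-Prot hit this reproduces 67.27% identity and 82.88% positives over a 333-column alignment.
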

BLAST of CmaCh02G005210 vs. TAIR 10
Match: AT4G34090.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, chloroplast stroma; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G23370.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 453.8 bits (1166), Expect = 1.3e-127
Identity = 224/333 (67.27%), Positives = 276/333 (82.88%), Query Frame = 0

Query: 10  GSPRAAVLPSLLLGRRGVTIRCASSSSTSNHVSFIKDVAATEPPQHLSNLLKMLKTRGES 69
           GS    + PS  L  R    R   S  +S HVSFIKDVAATEPP HL +LLK+L+TRGE+
Sbjct: 2   GSISMHITPSTALPIR--HFRARVSCCSSGHVSFIKDVAATEPPMHLHHLLKVLQTRGET 61

Query: 70  IISPGAKQGIIPLAIPLAKNNSGTITALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQF 129
           IISPGAKQG+IPLAIPL+KN+SG++TALLRWPTAP GM+MPVV+V R+GV L+A+NVD++
Sbjct: 62  IISPGAKQGLIPLAIPLSKNSSGSVTALLRWPTAPPGMDMPVVEVWRSGVRLIARNVDEY 121

Query: 130 IHRLLVEEDAKGSGEQNDELFLAAADAGKKLYVRGDFAESQIKNIDGYLLKKVGIFPDII 189
           IHR+LVEEDA    ++  EL+ A+ +AG+KLY +G FAES+I N+D Y+LKKVG+FPD++
Sbjct: 122 IHRILVEEDA----QELTELYRASGEAGEKLYEKGAFAESEIDNLDVYVLKKVGLFPDLL 181

Query: 190 ERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGAL 249
           ERK+LRHF+EGD VSA+VTGEFYTKK+ FPGF RP+V+ A +L KVGR  EAKDAAR AL
Sbjct: 182 ERKVLRHFDEGDHVSAMVTGEFYTKKDLFPGFGRPFVYYANILQKVGRNVEAKDAARVAL 241

Query: 250 KSPWWTLGCKYEEVANIAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLD 309
           +SPWWTLGC YEEVA+IAQWEDEQIE+++EKV++EG+ EDL KGKAP QVALD AAFLLD
Sbjct: 242 RSPWWTLGCPYEEVASIAQWEDEQIEFIREKVSDEGRFEDLHKGKAPIQVALDVAAFLLD 301

Query: 310 LASVDGTWDISVERIAQCYEEAGLQEIARFVLY 343
           LAS++GTW  S+  IA+CYEEAGL  I+ FVLY
Sbjct: 302 LASIEGTWSESLNHIAKCYEEAGLHHISNFVLY 328

BLAST of CmaCh02G005210 vs. TAIR 10
Match: AT4G34090.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G23370.1); Has 75 Blast hits to 73 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 67; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 449.1 bits (1154), Expect = 3.1e-126
Identity = 224/334 (67.07%), Positives = 276/334 (82.63%), Query Frame = 0

Query: 10  GSPRAAVLPSLLLGRRGVTIRCASSSSTSNHVSFIKDVAATEPPQHLSNLLKMLKTRGES 69
           GS    + PS  L  R    R   S  +S HVSFIKDVAATEPP HL +LLK+L+TRGE+
Sbjct: 2   GSISMHITPSTALPIR--HFRARVSCCSSGHVSFIKDVAATEPPMHLHHLLKVLQTRGET 61

Query: 70  IISPGAKQGIIPLAIPLAKNNS-GTITALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQ 129
           IISPGAKQG+IPLAIPL+KN+S G++TALLRWPTAP GM+MPVV+V R+GV L+A+NVD+
Sbjct: 62  IISPGAKQGLIPLAIPLSKNSSVGSVTALLRWPTAPPGMDMPVVEVWRSGVRLIARNVDE 121

Query: 130 FIHRLLVEEDAKGSGEQNDELFLAAADAGKKLYVRGDFAESQIKNIDGYLLKKVGIFPDI 189
           +IHR+LVEEDA    ++  EL+ A+ +AG+KLY +G FAES+I N+D Y+LKKVG+FPD+
Sbjct: 122 YIHRILVEEDA----QELTELYRASGEAGEKLYEKGAFAESEIDNLDVYVLKKVGLFPDL 181

Query: 190 IERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGA 249
           +ERK+LRHF+EGD VSA+VTGEFYTKK+ FPGF RP+V+ A +L KVGR  EAKDAAR A
Sbjct: 182 LERKVLRHFDEGDHVSAMVTGEFYTKKDLFPGFGRPFVYYANILQKVGRNVEAKDAARVA 241

Query: 250 LKSPWWTLGCKYEEVANIAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLL 309
           L+SPWWTLGC YEEVA+IAQWEDEQIE+++EKV++EG+ EDL KGKAP QVALD AAFLL
Sbjct: 242 LRSPWWTLGCPYEEVASIAQWEDEQIEFIREKVSDEGRFEDLHKGKAPIQVALDVAAFLL 301

Query: 310 DLASVDGTWDISVERIAQCYEEAGLQEIARFVLY 343
           DLAS++GTW  S+  IA+CYEEAGL  I+ FVLY
Sbjct: 302 DLASIEGTWSESLNHIAKCYEEAGLHHISNFVLY 329

BLAST of CmaCh02G005210 vs. TAIR 10
Match: AT4G34090.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G23370.1). )

HSP 1 Score: 443.7 bits (1140), Expect = 1.3e-124
Identity = 215/309 (69.58%), Positives = 265/309 (85.76%), Query Frame = 0

Query: 40  HVSFIKDVAATEPPQHLSNLLKMLKTRGESIISPGAKQGIIPLAIPLAKNNSGTITALLR 99
           HVSFIKDVAATEPP HL +LLK+L+TRGE+IISPGAKQG+IPLAIPL+KN+SG++TALLR
Sbjct: 84  HVSFIKDVAATEPPMHLHHLLKVLQTRGETIISPGAKQGLIPLAIPLSKNSSGSVTALLR 143

Query: 100 WPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQNDELFLAAADAGKK 159
           WPTAP GM+MPVV+V R+GV L+A+NVD++IHR+LVEEDA    ++  EL+ A+ +AG+K
Sbjct: 144 WPTAPPGMDMPVVEVWRSGVRLIARNVDEYIHRILVEEDA----QELTELYRASGEAGEK 203

Query: 160 LYVRGDFAESQIKNIDGYLLKKVGIFPDIIERKILRHFEEGDLVSALVTGEFYTKKEHFP 219
           LY +G FAES+I N+D Y+LKKVG+FPD++ERK+LRHF+EGD VSA+VTGEFYTKK+ FP
Sbjct: 204 LYEKGAFAESEIDNLDVYVLKKVGLFPDLLERKVLRHFDEGDHVSAMVTGEFYTKKDLFP 263

Query: 220 GFARPYVFNAEVLLK------VGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQ 279
           GF RP+V+ A +L K      VGR  EAKDAAR AL+SPWWTLGC YEEVA+IAQWEDEQ
Sbjct: 264 GFGRPFVYYANILQKFILIRRVGRNVEAKDAARVALRSPWWTLGCPYEEVASIAQWEDEQ 323

Query: 280 IEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLDLASVDGTWDISVERIAQCYEEAGL 339
           IE+++EKV++EG+ EDL KGKAP QVALD AAFLLDLAS++GTW  S+  IA+CYEEAGL
Sbjct: 324 IEFIREKVSDEGRFEDLHKGKAPIQVALDVAAFLLDLASIEGTWSESLNHIAKCYEEAGL 383

Query: 340 QEIARFVLY 343
             I+ FVLY
Sbjct: 384 HHISNFVLY 388

BLAST of CmaCh02G005210 vs. TAIR 10
Match: AT2G23370.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G34090.1); Has 73 Blast hits to 73 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 65; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 429.5 bits (1103), Expect = 2.6e-120
Identity = 209/339 (61.65%), Positives = 260/339 (76.70%), Query Frame = 0

Query: 5   GGVVCGSPRAAVLPSLLLGRRGVTIRCASSSSTSNHVSFIKDVAATEPPQHLSNLLKMLK 64
           G  V G  R  +   LL G R       SSSS S H  FIKD+A  +PP+HL  LL +  
Sbjct: 4   GAAVFGRKRRLI---LLHGSRNFARSFCSSSSLSEHECFIKDIAKAQPPKHLMQLLNIFT 63

Query: 65  TRGESIISPGAKQGIIPLAIPLAKNNSGTITALLRWPTAPAGMEMPVVDVNRNGVWLLAK 124
            RG+SI+SPGAKQG++PL IPL K + G+  ALLRWPTAP+ MEMPVV+V ++GVW LA 
Sbjct: 64  ARGKSIVSPGAKQGLLPLTIPLVKMSPGSSIALLRWPTAPSSMEMPVVEVQKHGVWFLAN 123

Query: 125 NVDQFIHRLLVEEDAKGSGEQNDELFLAAADAGKKLYVRGDFAESQIKNIDGYLLKKVGI 184
           NVDQFIHR+LVEED     E + E+F AA +AGKKLY +GDFA S++ ++D YLL+KVG+
Sbjct: 124 NVDQFIHRILVEEDVSKPEECSQEIFNAAGEAGKKLYSKGDFASSRLMDLDAYLLRKVGL 183

Query: 185 FPDIIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDA 244
           FPD +ERK++RH E GD VSALV  EFYTK+ +FPGFARP+ FNA+VLLK+GR  EAKDA
Sbjct: 184 FPDSLERKVIRHIENGDHVSALVATEFYTKRGNFPGFARPFAFNAKVLLKLGRNLEAKDA 243

Query: 245 ARGALKSPWWTLGCKYEEVANIAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQA 304
           ARGALKS WWTLGC+YEE+A IA+W +EQI   KE+VT EGK  D+ +GK  AQ +LD+A
Sbjct: 244 ARGALKSSWWTLGCRYEEIAQIAEWGEEQIAQYKERVTGEGKQRDIDRGKPMAQASLDEA 303

Query: 305 AFLLDLASVDGTWDISVERIAQCYEEAGLQEIARFVLYR 344
           AFLL+LAS++GTWD S+ER+AQCY+EAGL +IA+FVLYR
Sbjct: 304 AFLLNLASLEGTWDESLERVAQCYKEAGLNDIAKFVLYR 339

The following BLAST results are available for this feature:
Match Name    E-value     Identity   Description
Q94JY0        1.8e-126    67.27      Protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic OS=Arabidopsis thaliana ...

Match Name    E-value     Identity   Description
AT4G34090.1   1.3e-127    67.27      unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G34090.2   3.1e-126    67.07      unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G34090.3   1.3e-124    69.58      unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G23370.1   2.6e-120    61.65      unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source    Source Term     Source Description                                        Alignment
None       No IPR available   PANTHER   PTHR35115       CYCLIN DELTA-3                                            coord: 8..343
None       No IPR available   PANTHER   PTHR35115:SF4   PROTEIN IN CHLOROPLAST ATPASE BIOGENESIS, CHLOROPLASTIC   coord: 8..343

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh02G005210.1   CmaCh02G005210.1   mRNA