Cp4.1LG10g01040 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG10g01040
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG10: 3406419 .. 3408269 (-)
RNA-Seq Expression: Cp4.1LG10g01040
Synteny: Cp4.1LG10g01040
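
The Location field above packs the linkage group, the start and end coordinates, and the strand into a single string. A minimal sketch of how it might be unpacked is given below; the names `Location`, `parse_location`, and `span_length` are illustrative and not part of the database, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cp4.1LG10: 3406419 .. 3408269 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Cp4.1LG10: 3406419 .. 3408269 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that coordinate assumption the span covers 1851 bases on the minus strand, which can be compared against the length of the gene sequence listed under Sequences.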
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCCTTTCTTTTTCTCTGCAATTTTCTCCCTCATTTACTCCATTTTCTTCTTCTTCCCGAAACTTCCAACTCTCTCAGACATGGCTTCCACTCCCATTGTCTCTGAGGAGGGCCGTTTCGACCCGAGTGGTCTGTATTTCAACTCGACCCAATAGGAAATCCGGGGCTAAGACCGACAGGTCGGAGGCTGAAGAACTGGTTCGCGGTGTAATCAGAAAGTTCAGCGATAAAGAACCATTGCTCAAGACACTGGACAAGTACGTCCGTGTTATGAGAACTGAGCATTGTTTTCTTCTATTTGAGGAATTAGGGAAAAAGGATAAATGGCTTCAGTGCCTCGAGGTTCGCTCTTCTTCTTTTGAACTCTGAATTAGGGCCTCTGGGTGTTGATTTTTCTAACTTCTTTAGCTGCCTGTTAAGTGTTGATAAATGGGTAATGATGAAAATGCTCACAGACAGAGTTGATTGAATTTGAAAGCGAGGATATGAATTTCAATTTGTGAACTATACACTTCCTTGTAAAATATAGGAAATGAATGGAATTTGTAATGTAGTTCAAGCTCTATTTGGCATGGATGAACAATGGAGAGTCTAAATTTTTCTGGGTTAAGCTAAAAAGGCAAAGAAATTCTTATTGATGTATTTCATTTTCAGGTTTTCAGATGGATGCAGAAACAGCGCTGGTATATTGCTGACAATGGGGTTTATTCTAAACTGATATCTATTATGGGAAAGAAGGGCCAAATTCGGATGGCTATATGGCTCTTTTCTGAGATGCGAAATAGTGGGTGTCGACCTGATACTTCTGTCTACAATGCACTTATCACAGCGCACCTTCATTCTAAGGACAAAGCCAAGGCTTTGGTTAAAGTCCTTTGGTACTTTGAGAAGATGAAGGGAATGGAACGGTGTAAGCCAAATGTTGTTACTTACAACATTCTTACCAGAGCTTTTGCTCAAGCTGGAAAAGTGGATCAAGTTAATGCCTTGTTTAAAGATCTTGATGAGAGCATTGTTTCAGCTGATATATACACATATAATGGAGTGATGGATGCATATGGGAAAAATGGGATGATCAAAGAGATGGAAACGATGCTTGCTCGCATGAAAAGTAATGAAATCAAACCAGATATCATATCCTTTAACTTGTTGATTGATTCATATGGGAAGAAACAGCTATTTGACAAGATGGAACAGGTGTTCAAGAGTTTATTACGATCGAAGGAGAGGCCGACGCTACCTACATTCAATTCGATGATCACAAACTACGGGAAGGCACGACTTAGAGAGAGAGCTGAAGAAGTTTTTAGGAAGATGAAGGATATGGGATATGATCCAAGCTATGTCACTTGTGAAAGTCTAATCATGATGTATGGTCACTGTGACTGTGTTTCTAAGGCTAGAGAAATCTTTGATGGAATGGTTAATTCTGGGAAGGCAGTGAGAGTCTCGACCCTCAACGCCATGCTTGATGTTTACTGCATGAATGGTTTGCCCATGGAAGCCGATTTGCTTTTCGAGAGTGCGAGTAGTATGAAGGTGTTTCCAGATTCAACAACATACAAGCTTTTGTATAAAGCTTACACCAAAGCTGATGCGAAGGAACTTGTGGAGAAGCTCTTGAAGAACATGGACAAAGCTGGCATCATTCCAAATAAGAGATTCTTCCTCGATGCTTTGGGAACTGTTGGATCTTCACCAGAAAAACCAGAACCTGCAAGAACTGGAACAGGTTCAAGACACTCAGAAAGTAGTGTGGAAAAACCAATATCTTCAAGAAATAGAAGAGGTTCAAGAAAATTGGAAAGTAGTGTGAAAAACCATGCTCCAGAGTTGGGCTTAGCTTAA

mRNA sequence

ATGCCCCTTTCTTTTTCTCTGCAATTTTCTCCCTCATTTACTCCATTTTCTTCTTCTTCCCGAAACTTCCAACTCTCTCAGACATGGCTTCCACTCCCATTGTCTCTGAGGAGGGCCGTTTCGACCCGAGTGGTCTGTATTTCAACTCGACCCAATAGGAAATCCGGGGCTAAGACCGACAGGTCGGAGGCTGAAGAACTGGTTCGCGGTGTAATCAGAAAGTTCAGCGATAAAGAACCATTGCTCAAGACACTGGACAAGTACGTCCGTGTTATGAGAACTGAGCATTGTTTTCTTCTATTTGAGGAATTAGGGAAAAAGGATAAATGGCTTCAGTGCCTCGAGGTTTTCAGATGGATGCAGAAACAGCGCTGGTATATTGCTGACAATGGGGTTTATTCTAAACTGATATCTATTATGGGAAAGAAGGGCCAAATTCGGATGGCTATATGGCTCTTTTCTGAGATGCGAAATAGTGGGTGTCGACCTGATACTTCTGTCTACAATGCACTTATCACAGCGCACCTTCATTCTAAGGACAAAGCCAAGGCTTTGGTTAAAGTCCTTTGGTACTTTGAGAAGATGAAGGGAATGGAACGGTGTAAGCCAAATGTTGTTACTTACAACATTCTTACCAGAGCTTTTGCTCAAGCTGGAAAAGTGGATCAAGTTAATGCCTTGTTTAAAGATCTTGATGAGAGCATTGTTTCAGCTGATATATACACATATAATGGAGTGATGGATGCATATGGGAAAAATGGGATGATCAAAGAGATGGAAACGATGCTTGCTCGCATGAAAAGTAATGAAATCAAACCAGATATCATATCCTTTAACTTGTTGATTGATTCATATGGGAAGAAACAGCTATTTGACAAGATGGAACAGGTGTTCAAGAGTTTATTACGATCGAAGGAGAGGCCGACGCTACCTACATTCAATTCGATGATCACAAACTACGGGAAGGCACGACTTAGAGAGAGAGCTGAAGAAGTTTTTAGGAAGATGAAGGATATGGGATATGATCCAAGCTATGTCACTTGTGAAAGTCTAATCATGATGTATGGTCACTGTGACTGTGTTTCTAAGGCTAGAGAAATCTTTGATGGAATGGTTAATTCTGGGAAGGCAGTGAGAGTCTCGACCCTCAACGCCATGCTTGATGTTTACTGCATGAATGGTTTGCCCATGGAAGCCGATTTGCTTTTCGAGAGTGCGAGTAGTATGAAGGTGTTTCCAGATTCAACAACATACAAGCTTTTGTATAAAGCTTACACCAAAGCTGATGCGAAGGAACTTGTGGAGAAGCTCTTGAAGAACATGGACAAAGCTGGCATCATTCCAAATAAGAGATTCTTCCTCGATGCTTTGGGAACTGTTGGATCTTCACCAGAAAAACCAGAACCTGCAAGAACTGGAACAGGTTCAAGACACTCAGAAAGTAGTGTGGAAAAACCAATATCTTCAAGAAATAGAAGAGGTTCAAGAAAATTGGAAAGTAGTGTGAAAAACCATGCTCCAGAGTTGGGCTTAGCTTAA

Coding sequence (CDS)

ATGCCCCTTTCTTTTTCTCTGCAATTTTCTCCCTCATTTACTCCATTTTCTTCTTCTTCCCGAAACTTCCAACTCTCTCAGACATGGCTTCCACTCCCATTGTCTCTGAGGAGGGCCGTTTCGACCCGAGTGGTCTGTATTTCAACTCGACCCAATAGGAAATCCGGGGCTAAGACCGACAGGTCGGAGGCTGAAGAACTGGTTCGCGGTGTAATCAGAAAGTTCAGCGATAAAGAACCATTGCTCAAGACACTGGACAAGTACGTCCGTGTTATGAGAACTGAGCATTGTTTTCTTCTATTTGAGGAATTAGGGAAAAAGGATAAATGGCTTCAGTGCCTCGAGGTTTTCAGATGGATGCAGAAACAGCGCTGGTATATTGCTGACAATGGGGTTTATTCTAAACTGATATCTATTATGGGAAAGAAGGGCCAAATTCGGATGGCTATATGGCTCTTTTCTGAGATGCGAAATAGTGGGTGTCGACCTGATACTTCTGTCTACAATGCACTTATCACAGCGCACCTTCATTCTAAGGACAAAGCCAAGGCTTTGGTTAAAGTCCTTTGGTACTTTGAGAAGATGAAGGGAATGGAACGGTGTAAGCCAAATGTTGTTACTTACAACATTCTTACCAGAGCTTTTGCTCAAGCTGGAAAAGTGGATCAAGTTAATGCCTTGTTTAAAGATCTTGATGAGAGCATTGTTTCAGCTGATATATACACATATAATGGAGTGATGGATGCATATGGGAAAAATGGGATGATCAAAGAGATGGAAACGATGCTTGCTCGCATGAAAAGTAATGAAATCAAACCAGATATCATATCCTTTAACTTGTTGATTGATTCATATGGGAAGAAACAGCTATTTGACAAGATGGAACAGGTGTTCAAGAGTTTATTACGATCGAAGGAGAGGCCGACGCTACCTACATTCAATTCGATGATCACAAACTACGGGAAGGCACGACTTAGAGAGAGAGCTGAAGAAGTTTTTAGGAAGATGAAGGATATGGGATATGATCCAAGCTATGTCACTTGTGAAAGTCTAATCATGATGTATGGTCACTGTGACTGTGTTTCTAAGGCTAGAGAAATCTTTGATGGAATGGTTAATTCTGGGAAGGCAGTGAGAGTCTCGACCCTCAACGCCATGCTTGATGTTTACTGCATGAATGGTTTGCCCATGGAAGCCGATTTGCTTTTCGAGAGTGCGAGTAGTATGAAGGTGTTTCCAGATTCAACAACATACAAGCTTTTGTATAAAGCTTACACCAAAGCTGATGCGAAGGAACTTGTGGAGAAGCTCTTGAAGAACATGGACAAAGCTGGCATCATTCCAAATAAGAGATTCTTCCTCGATGCTTTGGGAACTGTTGGATCTTCACCAGAAAAACCAGAACCTGCAAGAACTGGAACAGGTTCAAGACACTCAGAAAGTAGTGTGGAAAAACCAATATCTTCAAGAAATAGAAGAGGTTCAAGAAAATTGGAAAGTAGTGTGAAAAACCATGCTCCAGAGTTGGGCTTAGCTTAA

Protein sequence

MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTDRSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWMQKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKDKAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADIYTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKSLLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDCVSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKLLYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSESSVEKPISSRNRRGSRKLESSVKNHAPELGLA
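
The protein sequence is the translation of the coding sequence (CDS) listed before it. The sketch below shows one way to check that relationship. It assumes the standard genetic code (an assumption; the record does not name a translation table), and `translate` is an illustrative helper rather than a database function. Only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        # Codons with ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons and last four codons of the CDS in this record.
print(translate("ATGCCCCTTTCTTTTTCTCTGCAATTT"))
print(translate("GGCTTAGCTTAA"))
```

The two fragments translate to the first nine and last three residues of the protein sequence; passing the full CDS string to `translate` allows the same comparison over the whole sequence.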
Homology
BLAST of Cp4.1LG10g01040 vs. ExPASy Swiss-Prot
Match: Q9SV96 (Pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2453 PE=2 SV=1)

HSP 1 Score: 614.4 bits (1583), Expect = 1.2e-174
Identity = 321/503 (63.82%), Positives = 390/503 (77.53%), Query Frame = 0

Query: 10  SPSFTPFSS--SSRNFQLSQTWLPLPLSL---RRAVSTRVVCISTRPNRKSGAKTDRSEA 69
           SPS   FS   SS   +    WL   ++L   RR+  TR+ C +    RK  A+ + +E 
Sbjct: 7   SPSSLRFSDFISSIPKETDHKWLRFSVNLGDARRSTRTRITCGAISSRRKL-AERESAER 66

Query: 70  EE--LVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWMQK 129
           E   LVR ++ + SD+EPL+KTLDKYV+V+R +HCFLLFEELGK DKWLQCLEVFRWMQK
Sbjct: 67  ENRVLVRSLMSRISDREPLVKTLDKYVKVVRCDHCFLLFEELGKSDKWLQCLEVFRWMQK 126

Query: 130 QRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKDKA 189
           QRWYI DNGVYSKLIS+MGKKGQ RMA+WLFSEM+NSGCRPD SVYNALITAHLH++DKA
Sbjct: 127 QRWYIPDNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVYNALITAHLHTRDKA 186

Query: 190 KALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADIYT 249
           KAL KV  Y +KMKG+ERC+PNVVTYNIL RAFAQ+GKVDQVNALFKDLD S VS D+YT
Sbjct: 187 KALEKVRGYLDKMKGIERCQPNVVTYNILLRAFAQSGKVDQVNALFKDLDMSPVSPDVYT 246

Query: 250 YNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKSLL 309
           +NGVMDAYGKNGMIKEME +L RM+SNE KPDII+FN+LIDSYGKKQ F+KMEQ FKSL+
Sbjct: 247 FNGVMDAYGKNGMIKEMEAVLTRMRSNECKPDIITFNVLIDSYGKKQEFEKMEQTFKSLM 306

Query: 310 RSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDCVS 369
           RSKE+PTLPTFNSMI NYGKAR+ ++AE VF+KM DM Y PS++T E +IMMYG+C  VS
Sbjct: 307 RSKEKPTLPTFNSMIINYGKARMIDKAEWVFKKMNDMNYIPSFITYECMIMMYGYCGSVS 366

Query: 370 KAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKLLY 429
           +AREIF+ +  S + ++ STLNAML+VYC NGL +EAD LF +AS+ +V PD++TYK LY
Sbjct: 367 RAREIFEEVGESDRVLKASTLNAMLEVYCRNGLYIEADKLFHNASAFRVHPDASTYKFLY 426

Query: 430 KAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSESS 489
           KAYTKAD KE V+ L+K M+K GI+PNKRFFL+AL   GS         +G+ +R S  S
Sbjct: 427 KAYTKADMKEQVQILMKKMEKDGIVPNKRFFLEALEVFGS-----RLPGSGSENRKSTRS 486

Query: 490 VEKPISSRNRRGSRKLESSVKNH 506
                S + R G++  E   K++
Sbjct: 487 SRSRDSPKGRGGNQLTEFQDKDN 503
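
Each HSP block reports its identity and positive-match counts as fractions of the alignment length. The following sketch recomputes the percentages from those fractions, which can serve as a sanity check when processing many records. `parse_hsp_stats` is a hypothetical helper, and the pattern deliberately accepts any label beginning with "Pos" because the spelling of that label can vary between exports.

```python
import re

# Matches e.g. "Identity = 321/503 (63.82%), Positives = 390/503 (77.53%)".
STAT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return alignment length and recomputed identity/positive percentages."""
    match = STAT_RE.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, length, positive, length_pos = map(int, match.groups())
    if length != length_pos:
        raise ValueError("identity and positive counts use different lengths")
    return {
        "length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positive_pct": round(100 * positive / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 321/503 (63.82%), Positives = 390/503 (77.53%), Query Frame = 0"
)
print(stats)
```

For the Q9SV96 hit above, the recomputed values agree with the percentages printed in the record.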

BLAST of Cp4.1LG10g01040 vs. ExPASy Swiss-Prot
Match: A7LN87 (Pentatricopeptide repeat-containing protein PPR5, chloroplastic OS=Zea mays OX=4577 GN=PPR5 PE=2 SV=1)

HSP 1 Score: 596.3 bits (1536), Expect = 3.3e-169
Identity = 300/470 (63.83%), Positives = 366/470 (77.87%), Query Frame = 0

Query: 41  STRVVCISTRPNRK-------SGAKTDRSEAEELVRGVIRKFS-DKEPLLKTLDKYVRVM 100
           +TR V ++ R  R+        G     +EA ELVR ++R+ +  KE L+  LD++VRV+
Sbjct: 27  ATRHVALAARSKRRGAGPAAAEGVDEAAAEAAELVRSLLRRTAGGKERLVPVLDRHVRVV 86

Query: 101 RTEHCFLLFEELGKKDKWLQCLEVFRWMQKQRWYIADNGVYSKLISIMGKKGQIRMAIWL 160
           RTEHCFLLFEELG++D WLQCL+VFRWMQKQRWY+ADNG+YSKLIS+MG+KGQIRMA+WL
Sbjct: 87  RTEHCFLLFEELGRRDAWLQCLDVFRWMQKQRWYVADNGIYSKLISVMGRKGQIRMAMWL 146

Query: 161 FSEMRNSGCRPDTSVYNALITAHLHSKDKAKALVKVLWYFEKMKGMERCKPNVVTYNILT 220
           FS+MRNSGC+PDTSVYN+LI AHLHS+DK KAL K L YFEKMK +ERC+P +VTYNIL 
Sbjct: 147 FSQMRNSGCKPDTSVYNSLIGAHLHSRDKTKALAKALGYFEKMKCIERCQPTIVTYNILL 206

Query: 221 RAFAQAGKVDQVNALFKDLDESIVSADIYTYNGVMDAYGKNGMIKEMETMLARMKSNEIK 280
           RAFAQAG   QV+ LFKDLDES+VS D+YTYNGV+DAYGKNGMIKEME++L RMKS + +
Sbjct: 207 RAFAQAGDTKQVDMLFKDLDESVVSPDVYTYNGVLDAYGKNGMIKEMESVLVRMKSTQCR 266

Query: 281 PDIISFNLLIDSYGKKQLFDKMEQVFKSLLRSKERPTLPTFNSMITNYGKARLRERAEEV 340
           PD+I+FN+LIDSYG+KQ FDKMEQVFKSLLRSKERPT PTFNSMITNYG+ARLRE+AE V
Sbjct: 267 PDVITFNILIDSYGRKQTFDKMEQVFKSLLRSKERPTHPTFNSMITNYGRARLREKAESV 326

Query: 341 FRKMKDMGYDPSYVTCESLIMMYGHCDCVSKAREIFDGMVNSGKAVRVSTLNAMLDVYCM 400
             KM+++G+ P+YVT E LI+MY HCDCVSKAR++FD +V S   V +S+LN+ML+ YCM
Sbjct: 327 VEKMEELGFKPNYVTQECLIIMYAHCDCVSKARQVFDELVTSQTKVHLSSLNSMLEAYCM 386

Query: 401 NGLPMEADLLFESASSMKVFPDSTTYKLLYKAYTKADAKELVEKLLKNMDKAGIIPNKRF 460
           NGL  EAD L ++A    V P+ +TYKLLYKAYTKA+ K LV+KLLK M+K GI+PNK+F
Sbjct: 387 NGLHTEADRLLDTALQQCVVPNGSTYKLLYKAYTKANDKLLVQKLLKRMNKQGIVPNKKF 446

Query: 461 FLDALGTVGSSPEKPEPARTGTGSRHSESSVEKPISSRNRRGSRKLESSV 503
           FLDAL   G+S  KP   RT  G   +               S K E SV
Sbjct: 447 FLDALEAFGTSDRKP---RTSPGINSASKPSTDSAGDSETATSDKPEVSV 493

BLAST of Cp4.1LG10g01040 vs. ExPASy Swiss-Prot
Match: A3ABE1 (Pentatricopeptide repeat-containing protein PPR5 homolog, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=Os02g0750400 PE=3 SV=2)

HSP 1 Score: 595.9 bits (1535), Expect = 4.3e-169
Identity = 294/438 (67.12%), Positives = 359/438 (81.96%), Query Frame = 0

Query: 60  DRSEAEELVRGVIRKFS-DKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFR 119
           + +EA +LVR  +R+ S  KE L+  LD++V+V+RTEHCFLLFEELG++D WLQCLEVFR
Sbjct: 50  EAAEAADLVRFFLRRTSGGKERLVAVLDRHVKVVRTEHCFLLFEELGRRDGWLQCLEVFR 109

Query: 120 WMQKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHS 179
           WMQKQRWY+ADNG+YSKLIS+MG+KGQIRMA+WLFS+MRNSGCRPDTSVYN+LI  HLHS
Sbjct: 110 WMQKQRWYVADNGIYSKLISVMGRKGQIRMAMWLFSQMRNSGCRPDTSVYNSLIGTHLHS 169

Query: 180 KDKAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSA 239
           +DK+KAL K L YFEKMK ++RC+PN+VTYNIL RAFAQAG   Q++ LFKDLDES VS 
Sbjct: 170 RDKSKALAKALGYFEKMKTIDRCQPNIVTYNILLRAFAQAGDTKQLDILFKDLDESPVSP 229

Query: 240 DIYTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVF 299
           DIYTYNGVMDAYGKNGMI EME++L RMKSN+ +PD+I+FN+LIDSYG+KQ FDKMEQVF
Sbjct: 230 DIYTYNGVMDAYGKNGMITEMESVLVRMKSNQCRPDVITFNILIDSYGRKQAFDKMEQVF 289

Query: 300 KSLLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHC 359
           KSLLRSKE+PT PTFNSMITNYGKARLRE+AE V  KM +MG+ P+YVT E LIMMY +C
Sbjct: 290 KSLLRSKEKPTHPTFNSMITNYGKARLREKAECVLDKMTEMGFKPNYVTQECLIMMYAYC 349

Query: 360 DCVSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTY 419
           DCVS+AR+IFD +V+S   V +S++NAMLD YCMNGLPMEAD L +S       P ++TY
Sbjct: 350 DCVSRARQIFDELVSSQNNVHLSSVNAMLDAYCMNGLPMEADQLLDSVIKKGAVPSASTY 409

Query: 420 KLLYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRH 479
           KLLYKAYTKA+ K+L++KLLK M+  GI+PNK+FFLDAL   G++ +KP           
Sbjct: 410 KLLYKAYTKANDKKLIQKLLKRMNSQGIVPNKKFFLDALEAFGNTDKKPRTV-------P 469

Query: 480 SESSVEKP-ISSRNRRGS 496
           S++S  KP + S N  G+
Sbjct: 470 SKNSASKPDVESANNSGT 480

BLAST of Cp4.1LG10g01040 vs. ExPASy Swiss-Prot
Match: Q9SCP4 (Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana OX=3702 GN=At3g53170 PE=3 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 8.5e-40
Identity = 109/380 (28.68%), Positives = 174/380 (45.79%), Query Frame = 0

Query: 102 EELGKKDKWLQCLEVFRWMQKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGC 161
           +E  K+++W   L++F  ++KQ WY      Y+KL  ++G   Q   A  LF  M + G 
Sbjct: 66  DEAIKENRWQSALKIFNLLRKQHWYEPRCKTYTKLFKVLGNCKQPDQASLLFEVMLSEGL 125

Query: 162 RPDTSVYNALITAHLHSKDKAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKV 221
           +P   VY +LI+ +     K++ L K     E MK +  CKP+V T+ +L     + G+ 
Sbjct: 126 KPTIDVYTSLISVY----GKSELLDKAFSTLEYMKSVSDCKPDVFTFTVLISCCCKLGRF 185

Query: 222 DQVNALFKDLDESIVSADIYTYNGVMDAYGKNGMIKEMETMLA----------------- 281
           D V ++  ++    V     TYN ++D YGK GM +EME++LA                 
Sbjct: 186 DLVKSIVLEMSYLGVGCSTVTYNTIIDGYGKAGMFEEMESVLADMIEDGDSLPDVCTLNS 245

Query: 282 -------------------RMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKSLLRSK 341
                              R +   ++PDI +FN+LI S+GK  ++ KM  V   + +  
Sbjct: 246 IIGSYGNGRNMRKMESWYSRFQLMGVQPDITTFNILILSFGKAGMYKKMCSVMDFMEKRF 305

Query: 342 ERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDCVSKAR 401
              T  T+N +I  +GKA   E+ ++VFRKMK  G  P+ +T  SL+  Y     V K  
Sbjct: 306 FSLTTVTYNIVIETFGKAGRIEKMDDVFRKMKYQGVKPNSITYCSLVNAYSKAGLVVKID 365

Query: 402 EIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKLLYKAY 446
            +   +VNS   +     N +++ Y   G       L+      K  PD  T+  + K Y
Sbjct: 366 SVLRQIVNSDVVLDTPFFNCIINAYGQAGDLATMKELYIQMEERKCKPDKITFATMIKTY 425

BLAST of Cp4.1LG10g01040 vs. ExPASy Swiss-Prot
Match: Q9SQU6 (Pentatricopeptide repeat-containing protein At3g06430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2750 PE=1 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 1.2e-38
Identity = 100/354 (28.25%), Positives = 174/354 (49.15%), Query Frame = 0

Query: 103 ELGKKDKWLQCLEVFRWMQKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCR 162
           +L  K +WLQ LEVF  +++Q +Y    G Y KL+ ++GK GQ   A  LF EM   G  
Sbjct: 97  DLIAKKQWLQALEVFDMLREQTFYQPKEGTYMKLLVLLGKSGQPNRAQKLFDEMLEEGLE 156

Query: 163 PDTSVYNALITAHLHSK--DKAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGK 222
           P   +Y AL+ A+  S   D A +++      +KMK   +C+P+V TY+ L +A   A +
Sbjct: 157 PTVELYTALLAAYTRSNLIDDAFSIL------DKMKSFPQCQPDVFTYSTLLKACVDASQ 216

Query: 223 VDQVNALFKDLDESIVSADIYTYNGVMDAYGKNGMIKEMETMLARM-KSNEIKPDIISFN 282
            D V++L+K++DE +++ +  T N V+  YG+ G   +ME +L+ M  S   KPD+ + N
Sbjct: 217 FDLVDSLYKEMDERLITPNTVTQNIVLSGYGRVGRFDQMEKVLSDMLVSTACKPDVWTMN 276

Query: 283 LLIDSYGKKQLFDKMEQVFKSLLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDM 342
           +++  +G     D ME  ++        P   TFN +I +YGK R+ ++   V   M+ +
Sbjct: 277 IILSVFGNMGKIDMMESWYEKFRNFGIEPETRTFNILIGSYGKKRMYDKMSSVMEYMRKL 336

Query: 343 GYDPSYVTCESLIMMYGHCDCVSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEA 402
            +  +  T  ++I  +            FD M + G      T   +++ Y   GL  + 
Sbjct: 337 EFPWTTSTYNNIIEAFADVGDAKNMELTFDQMRSEGMKADTKTFCCLINGYANAGLFHKV 396

Query: 403 DLLFESASSMKVFPDSTTYKLLYKAYTKADAKELVEKLLKNMDKAGIIPNKRFF 454
               + A+  ++  ++  Y  +  A  KAD    +E++   M +   + + R F
Sbjct: 397 ISSVQLAAKFEIPENTAFYNAVISACAKADDLIEMERVYIRMKERQCVCDSRTF 444

BLAST of Cp4.1LG10g01040 vs. NCBI nr
Match: XP_023543919.1 (pentatricopeptide repeat-containing protein At4g39620, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1009 bits (2609), Expect = 0.0
Identity = 512/512 (100.00%), Positives = 512/512 (100.00%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD
Sbjct: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM
Sbjct: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL
Sbjct: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE
Sbjct: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480

Query: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVEKPISSRNRRGSRKLESSVKNHAPELGLA
Sbjct: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512

BLAST of Cp4.1LG10g01040 vs. NCBI nr
Match: KAG7034169.1 (Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1006 bits (2600), Expect = 0.0
Identity = 510/512 (99.61%), Positives = 511/512 (99.80%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKS AKTD
Sbjct: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSAAKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM
Sbjct: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLRE+AEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLREKAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL
Sbjct: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE
Sbjct: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480

Query: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVEKPISSRNRRGSRKLESSVKNHAPELGLA
Sbjct: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512

BLAST of Cp4.1LG10g01040 vs. NCBI nr
Match: XP_022950765.1 (pentatricopeptide repeat-containing protein At4g39620, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1005 bits (2599), Expect = 0.0
Identity = 510/512 (99.61%), Positives = 510/512 (99.61%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKS AKTD
Sbjct: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSAAKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSE EELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM
Sbjct: 61  RSETEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL
Sbjct: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE
Sbjct: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480

Query: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVEKPISSRNRRGSRKLESSVKNHAPELGLA
Sbjct: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512

BLAST of Cp4.1LG10g01040 vs. NCBI nr
Match: KAG6604006.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1004 bits (2595), Expect = 0.0
Identity = 509/512 (99.41%), Positives = 511/512 (99.80%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKS AKTD
Sbjct: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSAAKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM
Sbjct: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLRE+AEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLREKAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIF+GMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL
Sbjct: 361 VSKAREIFNGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE
Sbjct: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480

Query: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVEKPISSRNRRGSRKLESSVKNHAPELGLA
Sbjct: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512

BLAST of Cp4.1LG10g01040 vs. NCBI nr
Match: XP_022978471.1 (pentatricopeptide repeat-containing protein At4g39620, chloroplastic [Cucurbita maxima])

HSP 1 Score: 994 bits (2569), Expect = 0.0
Identity = 504/512 (98.44%), Positives = 506/512 (98.83%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLSFSLQFSPSFTPFSSSSRNFQ SQTWLPLPLSLRRAVSTRVVCISTRP RKSGAKTD
Sbjct: 1   MPLSFSLQFSPSFTPFSSSSRNFQRSQTWLPLPLSLRRAVSTRVVCISTRPKRKSGAKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM
Sbjct: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMA+WLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMK MGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKGMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSM+VFPDSTTYKL
Sbjct: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMRVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPA TGTGSRHSE
Sbjct: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPASTGTGSRHSE 480

Query: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVE PISSRNRRGSRKLESSVKNH PELGLA
Sbjct: 481 SSVENPISSRNRRGSRKLESSVKNHTPELGLA 512

BLAST of Cp4.1LG10g01040 vs. ExPASy TrEMBL
Match: A0A6J1GGN3 (pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111453772 PE=3 SV=1)

HSP 1 Score: 1005 bits (2599), Expect = 0.0
Identity = 510/512 (99.61%), Positives = 510/512 (99.61%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKS AKTD
Sbjct: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSAAKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSE EELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM
Sbjct: 61  RSETEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL
Sbjct: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE
Sbjct: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480

Query: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVEKPISSRNRRGSRKLESSVKNHAPELGLA
Sbjct: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512

BLAST of Cp4.1LG10g01040 vs. ExPASy TrEMBL
Match: A0A6J1IMR8 (pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111478441 PE=3 SV=1)

HSP 1 Score: 994 bits (2569), Expect = 0.0
Identity = 504/512 (98.44%), Positives = 506/512 (98.83%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLSFSLQFSPSFTPFSSSSRNFQ SQTWLPLPLSLRRAVSTRVVCISTRP RKSGAKTD
Sbjct: 1   MPLSFSLQFSPSFTPFSSSSRNFQRSQTWLPLPLSLRRAVSTRVVCISTRPKRKSGAKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM
Sbjct: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMA+WLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMK MGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKGMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSM+VFPDSTTYKL
Sbjct: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMRVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPA TGTGSRHSE
Sbjct: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPASTGTGSRHSE 480

Query: 481 SSVEKPISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVE PISSRNRRGSRKLESSVKNH PELGLA
Sbjct: 481 SSVENPISSRNRRGSRKLESSVKNHTPELGLA 512

BLAST of Cp4.1LG10g01040 vs. ExPASy TrEMBL
Match: A0A5A7T266 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold18G00650 PE=3 SV=1)

HSP 1 Score: 910 bits (2351), Expect = 0.0
Identity = 462/531 (87.01%), Positives = 484/531 (91.15%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLS S QFSP FT   SSSR  QLS+ W P+P SL++  STRVVC+STRP+RKSG KTD
Sbjct: 1   MPLSLSPQFSPIFTTSYSSSRTSQLSRLWFPVPSSLKKTFSTRVVCVSTRPSRKSGVKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSEAEELVRG+I+KFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGK+DKWLQCLEVFRWM
Sbjct: 61  RSEAEELVRGIIKKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKRDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMA+WLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPN+VTYNILTRAFAQA KVDQVN LFKDLDES+VSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNIVTYNILTRAFAQAAKVDQVNTLFKDLDESVVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNG IKEME MLARMKSN+IKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGNIKEMELMLARMKSNQIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLR++AEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLRDKAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIFDGMVNSGK VRVSTLNAMLDVYC+NGLP EAD+LFESA +MKVFPDSTTYKL
Sbjct: 361 VSKAREIFDGMVNSGKEVRVSTLNAMLDVYCINGLPFEADMLFESAGNMKVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKAD KEL+EKLLKNMDKAGIIPNKRFFLDALGT+GSS EKPEPART TGSR+SE
Sbjct: 421 LYKAYTKADMKELLEKLLKNMDKAGIIPNKRFFLDALGTIGSSREKPEPARTKTGSRNSE 480

Query: 481 SSVEK-------------------PISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVEK                   PISSR R GSR LES V+NHAPELGLA
Sbjct: 481 SSVEKQSSSRTRSGPRNLESNVQKPISSRTRTGSRNLESGVENHAPELGLA 531

BLAST of Cp4.1LG10g01040 vs. ExPASy TrEMBL
Match: A0A5D3CMK0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G002480 PE=3 SV=1)

HSP 1 Score: 907 bits (2345), Expect = 0.0
Identity = 462/531 (87.01%), Positives = 484/531 (91.15%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLS S QFSP FT   SSSR  QLS+ WLP+P SL++  STRVVC+STR +RKSG KTD
Sbjct: 1   MPLSLSPQFSPIFTTSYSSSRTSQLSRLWLPVPSSLKKTFSTRVVCVSTRLSRKSGVKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSEAEELVRG+I+KFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGK+DKWLQCLEVFRWM
Sbjct: 61  RSEAEELVRGIIKKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKRDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMA+WLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPN+VTYNILTRAFAQA KVDQVN LFKDLDES+VSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNIVTYNILTRAFAQAAKVDQVNTLFKDLDESVVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNG IKEME MLARMKSN+IKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGNIKEMELMLARMKSNQIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLR++AEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLRDKAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIFDGMVNSGK VRVSTLNAMLDVYC+NGLP EAD+LFESA +MKVFPDSTTYKL
Sbjct: 361 VSKAREIFDGMVNSGKEVRVSTLNAMLDVYCINGLPFEADMLFESAGNMKVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKAD KEL+EKLLKNMDKAGIIPNKRFFLDALGT+GSS EKPEPART TGSR+SE
Sbjct: 421 LYKAYTKADMKELLEKLLKNMDKAGIIPNKRFFLDALGTIGSSREKPEPARTKTGSRNSE 480

Query: 481 SSVEK-------------------PISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVEK                   PISSR R GSR LES V+NHAPELGLA
Sbjct: 481 SSVEKQSSSRTRSGPRNLESNVQKPISSRTRTGSRNLESGVENHAPELGLA 531

BLAST of Cp4.1LG10g01040 vs. ExPASy TrEMBL
Match: A0A1S3B2D9 (pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103485009 PE=3 SV=1)

HSP 1 Score: 907 bits (2345), Expect = 0.0
Identity = 462/531 (87.01%), Positives = 484/531 (91.15%), Query Frame = 0

Query: 1   MPLSFSLQFSPSFTPFSSSSRNFQLSQTWLPLPLSLRRAVSTRVVCISTRPNRKSGAKTD 60
           MPLS S QFSP FT   SSSR  QLS+ WLP+P SL++  STRVVC+STR +RKSG KTD
Sbjct: 1   MPLSLSPQFSPIFTTSYSSSRTSQLSRLWLPVPSSLKKTFSTRVVCVSTRLSRKSGVKTD 60

Query: 61  RSEAEELVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWM 120
           RSEAEELVRG+I+KFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGK+DKWLQCLEVFRWM
Sbjct: 61  RSEAEELVRGIIKKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKRDKWLQCLEVFRWM 120

Query: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180
           QKQRWYIADNGVYSKLISIMGKKGQIRMA+WLFSEMRNSGCRPDTSVYNALITAHLHSKD
Sbjct: 121 QKQRWYIADNGVYSKLISIMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKD 180

Query: 181 KAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADI 240
           KAKALVKVLWYFEKMKGMERCKPN+VTYNILTRAFAQA KVDQVN LFKDLDES+VSADI
Sbjct: 181 KAKALVKVLWYFEKMKGMERCKPNIVTYNILTRAFAQAAKVDQVNTLFKDLDESVVSADI 240

Query: 241 YTYNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300
           YTYNGVMDAYGKNG IKEME MLARMKSN+IKPDIISFNLLIDSYGKKQLFDKMEQVFKS
Sbjct: 241 YTYNGVMDAYGKNGNIKEMELMLARMKSNQIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 300

Query: 301 LLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360
           LLRSKERPTLPTFNSMITNYGKARLR++AEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC
Sbjct: 301 LLRSKERPTLPTFNSMITNYGKARLRDKAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDC 360

Query: 361 VSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKL 420
           VSKAREIFDGMVNSGK VRVSTLNAMLDVYC+NGLP EAD+LFESA +MKVFPDSTTYKL
Sbjct: 361 VSKAREIFDGMVNSGKEVRVSTLNAMLDVYCINGLPFEADMLFESAGNMKVFPDSTTYKL 420

Query: 421 LYKAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSE 480
           LYKAYTKAD KEL+EKLLKNMDKAGIIPNKRFFLDALGT+GSS EKPEPART TGSR+SE
Sbjct: 421 LYKAYTKADMKELLEKLLKNMDKAGIIPNKRFFLDALGTIGSSREKPEPARTKTGSRNSE 480

Query: 481 SSVEK-------------------PISSRNRRGSRKLESSVKNHAPELGLA 512
           SSVEK                   PISSR R GSR LES V+NHAPELGLA
Sbjct: 481 SSVEKQSSSRTRSGPRNLESNVQKPISSRTRTGSRNLESGVENHAPELGLA 531

BLAST of Cp4.1LG10g01040 vs. TAIR 10
Match: AT4G39620.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 614.4 bits (1583), Expect = 8.4e-176
Identity = 321/503 (63.82%), Positives = 390/503 (77.53%), Query Frame = 0

Query: 10  SPSFTPFSS--SSRNFQLSQTWLPLPLSL---RRAVSTRVVCISTRPNRKSGAKTDRSEA 69
           SPS   FS   SS   +    WL   ++L   RR+  TR+ C +    RK  A+ + +E 
Sbjct: 7   SPSSLRFSDFISSIPKETDHKWLRFSVNLGDARRSTRTRITCGAISSRRKL-AERESAER 66

Query: 70  EE--LVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWMQK 129
           E   LVR ++ + SD+EPL+KTLDKYV+V+R +HCFLLFEELGK DKWLQCLEVFRWMQK
Sbjct: 67  ENRVLVRSLMSRISDREPLVKTLDKYVKVVRCDHCFLLFEELGKSDKWLQCLEVFRWMQK 126

Query: 130 QRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKDKA 189
           QRWYI DNGVYSKLIS+MGKKGQ RMA+WLFSEM+NSGCRPD SVYNALITAHLH++DKA
Sbjct: 127 QRWYIPDNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVYNALITAHLHTRDKA 186

Query: 190 KALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADIYT 249
           KAL KV  Y +KMKG+ERC+PNVVTYNIL RAFAQ+GKVDQVNALFKDLD S VS D+YT
Sbjct: 187 KALEKVRGYLDKMKGIERCQPNVVTYNILLRAFAQSGKVDQVNALFKDLDMSPVSPDVYT 246

Query: 250 YNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKSLL 309
           +NGVMDAYGKNGMIKEME +L RM+SNE KPDII+FN+LIDSYGKKQ F+KMEQ FKSL+
Sbjct: 247 FNGVMDAYGKNGMIKEMEAVLTRMRSNECKPDIITFNVLIDSYGKKQEFEKMEQTFKSLM 306

Query: 310 RSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDCVS 369
           RSKE+PTLPTFNSMI NYGKAR+ ++AE VF+KM DM Y PS++T E +IMMYG+C  VS
Sbjct: 307 RSKEKPTLPTFNSMIINYGKARMIDKAEWVFKKMNDMNYIPSFITYECMIMMYGYCGSVS 366

Query: 370 KAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKLLY 429
           +AREIF+ +  S + ++ STLNAML+VYC NGL +EAD LF +AS+ +V PD++TYK LY
Sbjct: 367 RAREIFEEVGESDRVLKASTLNAMLEVYCRNGLYIEADKLFHNASAFRVHPDASTYKFLY 426

Query: 430 KAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSESS 489
           KAYTKAD KE V+ L+K M+K GI+PNKRFFL+AL   GS         +G+ +R S  S
Sbjct: 427 KAYTKADMKEQVQILMKKMEKDGIVPNKRFFLEALEVFGS-----RLPGSGSENRKSTRS 486

Query: 490 VEKPISSRNRRGSRKLESSVKNH 506
                S + R G++  E   K++
Sbjct: 487 SRSRDSPKGRGGNQLTEFQDKDN 503

BLAST of Cp4.1LG10g01040 vs. TAIR 10
Match: AT4G39620.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 614.0 bits (1582), Expect = 1.1e-175
Identity = 321/502 (63.94%), Positives = 389/502 (77.49%), Query Frame = 0

Query: 10  SPSFTPFSS--SSRNFQLSQTWLPLPLSL---RRAVSTRVVCISTRPNRKSGAKTDRSEA 69
           SPS   FS   SS   +    WL   ++L   RR+  TR+ C +    RK  A+ + +E 
Sbjct: 7   SPSSLRFSDFISSIPKETDHKWLRFSVNLGDARRSTRTRITCGAISSRRKL-AERESAER 66

Query: 70  EE--LVRGVIRKFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKKDKWLQCLEVFRWMQK 129
           E   LVR ++ + SD+EPL+KTLDKYV+V+R +HCFLLFEELGK DKWLQCLEVFRWMQK
Sbjct: 67  ENRVLVRSLMSRISDREPLVKTLDKYVKVVRCDHCFLLFEELGKSDKWLQCLEVFRWMQK 126

Query: 130 QRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVYNALITAHLHSKDKA 189
           QRWYI DNGVYSKLIS+MGKKGQ RMA+WLFSEM+NSGCRPD SVYNALITAHLH++DKA
Sbjct: 127 QRWYIPDNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVYNALITAHLHTRDKA 186

Query: 190 KALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNALFKDLDESIVSADIYT 249
           KAL KV  Y +KMKG+ERC+PNVVTYNIL RAFAQ+GKVDQVNALFKDLD S VS D+YT
Sbjct: 187 KALEKVRGYLDKMKGIERCQPNVVTYNILLRAFAQSGKVDQVNALFKDLDMSPVSPDVYT 246

Query: 250 YNGVMDAYGKNGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKSLL 309
           +NGVMDAYGKNGMIKEME +L RM+SNE KPDII+FN+LIDSYGKKQ F+KMEQ FKSL+
Sbjct: 247 FNGVMDAYGKNGMIKEMEAVLTRMRSNECKPDIITFNVLIDSYGKKQEFEKMEQTFKSLM 306

Query: 310 RSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDCVS 369
           RSKE+PTLPTFNSMI NYGKAR+ ++AE VF+KM DM Y PS++T E +IMMYG+C  VS
Sbjct: 307 RSKEKPTLPTFNSMIINYGKARMIDKAEWVFKKMNDMNYIPSFITYECMIMMYGYCGSVS 366

Query: 370 KAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKLLY 429
           +AREIF+ +  S + ++ STLNAML+VYC NGL +EAD LF +AS+ +V PD++TYK LY
Sbjct: 367 RAREIFEEVGESDRVLKASTLNAMLEVYCRNGLYIEADKLFHNASAFRVHPDASTYKFLY 426

Query: 430 KAYTKADAKELVEKLLKNMDKAGIIPNKRFFLDALGTVGSSPEKPEPARTGTGSRHSESS 489
           KAYTKAD KE V+ L+K M+K GI+PNKRFFL+AL   GS         +G+ +R S  S
Sbjct: 427 KAYTKADMKEQVQILMKKMEKDGIVPNKRFFLEALEVFGS-----RLPGSGSENRKSTRS 486

Query: 490 VEKPISSRNRRGSRKLESSVKN 505
                S + R G++  E   K+
Sbjct: 487 SRSRDSPKGRGGNQLTEFQDKD 502

BLAST of Cp4.1LG10g01040 vs. TAIR 10
Match: AT3G53170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 165.2 bits (417), Expect = 1.3e-40
Identity = 109/381 (28.61%), Positives = 174/381 (45.67%), Query Frame = 0

Query: 102 EELGKKDKWLQCLEVFRWMQKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGC 161
           +E  K+++W   L++F  ++KQ WY      Y+KL  ++G   Q   A  LF  M + G 
Sbjct: 116 DEAIKENRWQSALKIFNLLRKQHWYEPRCKTYTKLFKVLGNCKQPDQASLLFEVMLSEGL 175

Query: 162 RPDTSVYNALITAHLHSKDKAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKV 221
           +P   VY +LI+ +     K++ L K     E MK +  CKP+V T+ +L     + G+ 
Sbjct: 176 KPTIDVYTSLISVY----GKSELLDKAFSTLEYMKSVSDCKPDVFTFTVLISCCCKLGRF 235

Query: 222 DQVNALFKDLDESIVSADIYTYNGVMDAYGKNGMIKEMETMLA----------------- 281
           D V ++  ++    V     TYN ++D YGK GM +EME++LA                 
Sbjct: 236 DLVKSIVLEMSYLGVGCSTVTYNTIIDGYGKAGMFEEMESVLADMIEDGDSLPDVCTLNS 295

Query: 282 -------------------RMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKSLLRSK 341
                              R +   ++PDI +FN+LI S+GK  ++ KM  V   + +  
Sbjct: 296 IIGSYGNGRNMRKMESWYSRFQLMGVQPDITTFNILILSFGKAGMYKKMCSVMDFMEKRF 355

Query: 342 ERPTLPTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDCVSKAR 401
              T  T+N +I  +GKA   E+ ++VFRKMK  G  P+ +T  SL+  Y     V K  
Sbjct: 356 FSLTTVTYNIVIETFGKAGRIEKMDDVFRKMKYQGVKPNSITYCSLVNAYSKAGLVVKID 415

Query: 402 EIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKLLYKAY 447
            +   +VNS   +     N +++ Y   G       L+      K  PD  T+  + K Y
Sbjct: 416 SVLRQIVNSDVVLDTPFFNCIINAYGQAGDLATMKELYIQMEERKCKPDKITFATMIKTY 475

BLAST of Cp4.1LG10g01040 vs. TAIR 10
Match: AT3G06430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 162.5 bits (410), Expect = 8.7e-40
Identity = 100/354 (28.25%), Positives = 174/354 (49.15%), Query Frame = 0

Query: 103 ELGKKDKWLQCLEVFRWMQKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCR 162
           +L  K +WLQ LEVF  +++Q +Y    G Y KL+ ++GK GQ   A  LF EM   G  
Sbjct: 97  DLIAKKQWLQALEVFDMLREQTFYQPKEGTYMKLLVLLGKSGQPNRAQKLFDEMLEEGLE 156

Query: 163 PDTSVYNALITAHLHSK--DKAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGK 222
           P   +Y AL+ A+  S   D A +++      +KMK   +C+P+V TY+ L +A   A +
Sbjct: 157 PTVELYTALLAAYTRSNLIDDAFSIL------DKMKSFPQCQPDVFTYSTLLKACVDASQ 216

Query: 223 VDQVNALFKDLDESIVSADIYTYNGVMDAYGKNGMIKEMETMLARM-KSNEIKPDIISFN 282
            D V++L+K++DE +++ +  T N V+  YG+ G   +ME +L+ M  S   KPD+ + N
Sbjct: 217 FDLVDSLYKEMDERLITPNTVTQNIVLSGYGRVGRFDQMEKVLSDMLVSTACKPDVWTMN 276

Query: 283 LLIDSYGKKQLFDKMEQVFKSLLRSKERPTLPTFNSMITNYGKARLRERAEEVFRKMKDM 342
           +++  +G     D ME  ++        P   TFN +I +YGK R+ ++   V   M+ +
Sbjct: 277 IILSVFGNMGKIDMMESWYEKFRNFGIEPETRTFNILIGSYGKKRMYDKMSSVMEYMRKL 336

Query: 343 GYDPSYVTCESLIMMYGHCDCVSKAREIFDGMVNSGKAVRVSTLNAMLDVYCMNGLPMEA 402
            +  +  T  ++I  +            FD M + G      T   +++ Y   GL  + 
Sbjct: 337 EFPWTTSTYNNIIEAFADVGDAKNMELTFDQMRSEGMKADTKTFCCLINGYANAGLFHKV 396

Query: 403 DLLFESASSMKVFPDSTTYKLLYKAYTKADAKELVEKLLKNMDKAGIIPNKRFF 454
               + A+  ++  ++  Y  +  A  KAD    +E++   M +   + + R F
Sbjct: 397 ISSVQLAAKFEIPENTAFYNAVISACAKADDLIEMERVYIRMKERQCVCDSRTF 444

BLAST of Cp4.1LG10g01040 vs. TAIR 10
Match: AT5G48730.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 158.3 bits (399), Expect = 1.6e-38
Identity = 113/386 (29.27%), Positives = 182/386 (47.15%), Query Frame = 0

Query: 109 KWLQCLEVFRWMQKQRWYIADNGVYSKLISIMGKKGQIRMAIWLFSEMRNSGCRPDTSVY 168
           +W   ++VF  +++Q WY  + G+Y KLI ++GK  Q   A  LF EM N GC  +  VY
Sbjct: 129 RWESAIQVFELLREQLWYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVY 188

Query: 169 NALITAHLHSK--DKAKALVKVLWYFEKMKGMERCKPNVVTYNILTRAFAQAGKVDQVNA 228
            AL++A+  S   D A  L+      E+MK    C+P+V TY+IL ++F Q    D+V  
Sbjct: 189 TALVSAYSRSGRFDAAFTLL------ERMKSSHNCQPDVHTYSILIKSFLQVFAFDKVQD 248

Query: 229 LFKDLDESIVSADIYTYNGVMDAYGK---------------------------------- 288
           L  D+    +  +  TYN ++DAYGK                                  
Sbjct: 249 LLSDMRRQGIRPNTITYNTLIDAYGKAKMFVEMESTLIQMLGEDDCKPDSWTMNSTLRAF 308

Query: 289 --NGMIKEMETMLARMKSNEIKPDIISFNLLIDSYGKKQLFDKMEQVFKSLLRSKERPTL 348
             NG I+ ME    + +S+ I+P+I +FN+L+DSYGK   + KM  V + + +     T+
Sbjct: 309 GGNGQIEMMENCYEKFQSSGIEPNIRTFNILLDSYGKSGNYKKMSAVMEYMQKYHYSWTI 368

Query: 349 PTFNSMITNYGKARLRERAEEVFRKMKDMGYDPSYVTCESLIMMYGHCDCVSKAREIFDG 408
            T+N +I  +G+A   ++ E +FR M+     PS VT  SL+  YG      K   +   
Sbjct: 369 VTYNVVIDAFGRAGDLKQMEYLFRLMQSERIFPSCVTLCSLVRAYGRASKADKIGGVLRF 428

Query: 409 MVNSGKAVRVSTLNAMLDVYCMNGLPMEADLLFESASSMKVFPDSTTYKLLYKAYTKADA 454
           + NS   + +   N ++D Y       E   + E        PD  TY+ + KAY  +  
Sbjct: 429 IENSDIRLDLVFFNCLVDAYGRMEKFAEMKGVLELMEKKGFKPDKITYRTMVKAYRISGM 488
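The percentages in each HSP summary line are the match counts divided by the alignment length. A minimal Python sketch of that check follows; the function names `parse_hsp_stats` and `recompute_pct` are hypothetical helpers for illustration and are not part of any BLAST tool.

```python
import re

# Matches the HSP summary line. "Posi?tives" accepts the correct spelling
# as well as the "Postives" misspelling that some page exports contain.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (matches, alignment length, reported %) for identity and positives."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def recompute_pct(matches, length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# Summary line of the Cucumis melo TrEMBL hits listed in this record.
stats = parse_hsp_stats(
    "Identity = 462/531 (87.01%), Positives = 484/531 (91.15%), Query Frame = 0"
)
identity_pct = recompute_pct(*stats["identity"][:2])
positives_pct = recompute_pct(*stats["positives"][:2])
```

The alignment length (531 for these hits) counts gap columns, which is why it can exceed the 512-residue query.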

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9SV96	1.2e-174	63.82	Pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Arabidop...
A7LN87	3.3e-169	63.83	Pentatricopeptide repeat-containing protein PPR5, chloroplastic OS=Zea mays OX=4...
A3ABE1	4.3e-169	67.12	Pentatricopeptide repeat-containing protein PPR5 homolog, chloroplastic OS=Oryza...
Q9SCP4	8.5e-40	28.68	Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana OX...
Q9SQU6	1.2e-38	28.25	Pentatricopeptide repeat-containing protein At3g06430, chloroplastic OS=Arabidop...

Match Name	E-value	Identity	Description
XP_023543919.1	0.0	100.00	pentatricopeptide repeat-containing protein At4g39620, chloroplastic [Cucurbita ...
KAG7034169.1	0.0	99.61	Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosper...
XP_022950765.1	0.0	99.61	pentatricopeptide repeat-containing protein At4g39620, chloroplastic [Cucurbita ...
KAG6604006.1	0.0	99.41	Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
XP_022978471.1	0.0	98.44	pentatricopeptide repeat-containing protein At4g39620, chloroplastic [Cucurbita ...

Match Name	E-value	Identity	Description
A0A6J1GGN3	0.0	99.61	pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Cucurbit...
A0A6J1IMR8	0.0	98.44	pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Cucurbit...
A0A5A7T266	0.0	87.01	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5D3CMK0	0.0	87.01	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3B2D9	0.0	87.01	pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Cucumis ...

Match Name	E-value	Identity	Description
AT4G39620.1	8.4e-176	63.82	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G39620.2	1.1e-175	63.94	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G53170.1	1.3e-40	28.61	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G06430.1	8.7e-40	28.25	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G48730.1	1.6e-38	29.27	Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 304..502; e-value: 1.6E-30; score: 108.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 62..233; e-value: 1.0E-37; score: 131.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 234..303; e-value: 3.3E-16; score: 61.3
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 346..375; e-value: 0.0098; score: 16.1
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 312..340; e-value: 8.6E-6; score: 25.7
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 382..405; e-value: 0.0025; score: 17.9
IPR002885	Pentatricopeptide repeat	PFAM	PF13812	PPR_3	coord: 152..215; e-value: 3.2E-9; score: 36.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13812	PPR_3	coord: 227..287; e-value: 1.2E-9; score: 38.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 241..275; e-value: 2.0E-9; score: 35.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 382..414; e-value: 4.3E-4; score: 18.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 312..344; e-value: 1.3E-6; score: 26.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 206..237; e-value: 0.0021; score: 16.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 132..165; e-value: 3.0E-5; score: 21.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 276..309; e-value: 1.6E-5; score: 22.8
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 309..343; score: 11.41077
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 379..413; score: 8.53891
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 129..163; score: 11.32308
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 274..308; score: 10.128299
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 239..273; score: 11.893068
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 204..238; score: 9.382931
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 414..448; score: 9.174665
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 483..503
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 458..512
None	No IPR available	PANTHER	PTHR47874:SF3	BNAA01G05620D PROTEIN	coord: 37..476
IPR044179	Pentatricopeptide repeat-containing protein PPR5-like	PANTHER	PTHR47874	EXPRESSED PROTEIN	coord: 37..476
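Each `coord: start..end` entry above is an inclusive residue range on the 512-residue protein, so its length is end - start + 1. The short Python sketch below extracts such ranges from text and computes their lengths; `span_lengths` is a hypothetical helper name, and the sample input simply repeats the seven PROSITE PS51375 ranges from this table.

```python
import re

# "coord: 309..343" -> start and end residue positions (1-based, inclusive).
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every coord entry found in text."""
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


# The seven PROSITE PS51375 (PPR) ranges listed for this protein.
prosite_ppr = """
coord: 309..343
coord: 379..413
coord: 129..163
coord: 274..308
coord: 239..273
coord: 204..238
coord: 414..448
"""

spans = sorted(span_lengths(prosite_ppr))
lengths = {length for _, _, length in spans}
```

For these PROSITE hits every range is 35 residues long, which matches the 35-amino-acid motif that gives pentatricopeptide repeats their name.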

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG10g01040.1	Cp4.1LG10g01040.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0003729	mRNA binding
molecular_function	GO:0005515	protein binding