Cp4.1LG16g01900 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG16g01900
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG16: 4096698 .. 4101366 (-)
RNA-Seq Expression: Cp4.1LG16g01900
Synteny: Cp4.1LG16g01900
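
The Location field gives start and end coordinates on Cp4.1LG16. As a minimal sketch, assuming those coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), the genomic span of the gene can be computed as follows. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates copied from the Location field; the strand (-) does not change the length.
gene_span = feature_length(4096698, 4101366)
print(gene_span)
```

Under that assumption the span is 4669 bp, covering exons and introns together.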
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AATTTCGCAAACGCTTTGCCTTCTCTTCCACATTTAGCTCACTTCCAGATCCCCACGACCTTAGTAGTTGTCTTCACTTTTTTTTTTTTCAAAACAAAATTTAACATTCCAATCTCTTCTTCAATTTCATTCCCTCATTATCACCACTGGCAACTCGAACAATGCCTTCTTTGCCACTAAGCTCATGGCCTTTTATGCCTGTCATGGGCAACCTGCGTTCTCCACGCAATTGTTTCGATTTGTTCATCCTAAGGACAAATTTCTTTGGAATTCCATTATCCAATCCCATTTCTCCAATGGTGATTACCTACAGGCATTTGATTTCTACCTTGAGATGCGAGCATCGAGTAGCCTGCCAAACCAATTTACAATTCCCATGGTGGTTTCCACTTGTGCGGAACTAATGATGCTCAACCATGGCATGAACATTCATGGGTTGGCTTTGAAACTTGGGCTCTTTGTTGGTAATTCTGCTGTTGGTTCTTCTTTGATATACATGTATTCCAAATGTGGTAACGCAGAAAGTGCATCTCTCATGTTCAATGAAATTACTGTTAAGGATGTAGTTGCTTGGACTGCCCTTATAATTGGTTATGTCCAGAATAACGAGAGTGAGAAAGGTTTGAAATGTTTGTTTGAGATGCATAGGAATGGATGTACCCCAAATTATAGAACAATAGGAGGTGGGTTTCAAGCTTGTGTTGATTTGGAGGCTTTAGTAGAGGGTAGATGCTTACATGGTTTGGCTTTAAAAAGTGGATTTCTCTGTTTTGAAGTCGTTAAATCTTCTATTCTCTCGATGTACTCGAGGTGTGGGTCACCTGAAGAAGCTTATCGTTGTTTTTTTAAATTGGAGCAAAAAGATCTCATCTCTTGGACATCAATTATTGCAGTTCACTCTAAACTCGGGTTGATGAGTGAATGTCTACATTTATTTTGGGAGATGCAGGCCAGTGGAATAATTCCAGATGACATCGTGATCAGTTGCATGCTTCTGGGTTTTGGTAATTTTGATAGAATCTCTGAAGGAAAAGCCTTACATGCTTGGATTCTGAAACAATGTTGTGCAATGAGTGGAATAACTCACAATGCATTACTCTCCATGTATTGTAAGTTCGGACTCTTACGTATGGCAGATAAGATCTTCCATAGTTTCCATAAAAGCAGTGAAGATTGGAACACAATGATATTAGGATACAGCAATATGGGGGAGAAAGAAAAGTGTATAGACTTTTTCAGGGAGATGCACCTCTTAGGCATAGAACCTGATTTGAATAGTTTAGTTTCGGTCATTTCTTCATGTTTACCAGTTGGAGCTGTGAATATTGGTCGGTCTGTGCACTGCTATGCGATTAAAAACTCGATCATTGACAATGTATCAATAGCTAACTCACTCTTGGACATGTACGGAAAAAGTGGTAATTTAACCGCCGCATGGAGGATATTTCATAGGACACAACAAAAGGATATTGTCTCATGGAATACACTGATTTCGTCCTACAAGCAAAGTGGGCACCCTTCTGAAGCAATTGATTTATTCGATAAAATGATTAAAGAAAAGTTCAACCCCAACGGAGTTACCTGCGTAATAGTTCTTTCGGCATGTTCTCATCTTGCATCCTTAGATAAAGGTGAAAAAATTCACCAGTACATAAAGGAAAATGGATTTGAGACTGATATCACTGTTAGAACTGCATTGATTGATATGTATGCAAAATGTGGGGAGCTCGAGACATCAAGAACATTGTTCAACTCAATGGAAGAGAGGGATGTTATTTTGTGTAATGTCATGATATCAAATTATGGGATGCATGGACATGTGGAATCTGCTATTGAGATCTTCCAACTAATGGAAGACTCAAACATTAAACCAAATGCACTTACCTTTCTTTCTCTTCTCTCAGCTTGTAATCATGCAGGCCATATGGTAGAAGGAAGGCGTCTCTTTGATGTAATGCATAAATATGGTATCAAACCTAGTCTTAAGCACTATGCTTCTA
TGGTAGATCTTCTTGGCAGGTCAGGTAGCCTTGAAGAAGCGGAGGCTCTTGTTTTATCAATGCCCATCACGCCTGATGGCACTGTGTGGGGCTCCTTGTTAAGTGCTTGTAAACTTCATAATGAATTTGAAATGGGTATAAGGATTGCCAGACATGCAATTGAGTCTGATCCAAAAAATGATGGGTATTATATAGTATTGTCTGATCTGTATGGTTGCTTGGGAAGGTGGGAGGAAGTGGAAAAAGTGCGAGGCTTGATGAAGCAAAGAGGGGTGGAGAAGAGAGCTGGCTGGAGTGCCTTATGAACGAAGTAATTGCTTGAAACATTTGATACCTTTGACTATACAACTTCGTCGAGTCTTGAGAATTATTTCGAAGGACGAATTTGACAATCAATATGAAAAAAAGTTCGATCCTCACTCGTTCGGAACTTTTTCGACTTTGGTAATGCTTCCTCATACATTGAAAATAGTTTGCTTTCTGGTACATTTTCTTTTGGATCAAATTACGTTACTTTATATAATTCTTTCCAAATGGTTTATGCAGGTTCTGATATCAACCTTCGATTCGTTACCTTACATGGAAGAACGTATCGAGGAACTCACTGAAAAGTAGGCTTCCTTTTCGTGTAGCTGGTAATGTCGTTTGGAACTCGGTTCGAGCATTCACAAGAATCCATATGGGTATGTGGGCATTATTCAGGAAACAACTAAGAAATGGCAATAGATAAGTTATATATTAGTTATTGAATTAAATTCATAAACATTATATATTAAGTTATATTGTATAAAGTTATTTCTATTATGAAACCAAGAATAAACAAAATATTTATGTATACTTCCATTAGACATATTTGATTGTTCTTCAAAATATTACTAATGTATAAGATTTCTTTTTTGCAAATGTATTTGATATGAAATTAATTATAAAGAAAATGTTATATATTGTTTATTTAATTATTCATTATTTGCTCGGTTAATTGAATTGATATAAGGTCTATTTGGTTTAACTTTTCGAGTACTTAAAAGGTAAAACATGTTTTTACCCTGTGTTGGAATCCCTTTTAATTTGGAGGAATTTTGAATTATAATGATTTCGGATGAACATCATTGTTTGTTATCATAAGAAAGAGAAAGTCGAAGGTTTAGATGAATGATCTTTGTGAGATCACCCATGAATCTCGAGAACTAAGTGTGGGGCTTGAAGGTCCTACTGTATTTCAGCCCTGGAGCCTGAAATTGTTACCACAGTTGCCACACGTGAGATCCTGATCGTCTTCCTTCTAGTGTTGTTGTCCGTTGGGAGAAGGAAAGTTAGTGGGTTTTCCTTCCAGGATCGAAGAAGGTCAAACTCTGGGTGGGTGGGCTGCTCAAGTTTTCCCTACGCTGTGGTTTTACGAGAATAAACTTGTTTTCTTTAATCGGTGTAAAGAGTTCTAGAGCACGAGGGAGTGAGCTTTCATTCCAACGACCAAAATGGTTTGAAAATTTTAAGAAAAAATTTCTTTGTTATTAACATGTTAATTAAAATTTTAAAAATATAATAATAATTTTAGTAAACTAAGGGTGCTTGCCATTACGTGCAATAGAAGTACATGTACACACAAAATACATGAGATTCCTATGGTTTTCATCTTTAAATCATCTTCGTGACACAACCCTATACACCATTCATATATTACTCTAAGTAACATGTAGGTGTTGGTTTTCATACACTCATCCTCATTTTCTTCAATTAGCGTCATGTCTTTAGAAAAACAAATCATTGTCAGCCATCATGACATTTAATACATGCACGTCATCATAACATGGAATAACATCATGTTAAACTATCTCATCATCATACAATCTTAGGTCATCATATTATCTTACTTGGTCATCATTATGTCATCATCTTATCTTACATCGCCATTAATATAGTCTAATATTGAACAATAATAGAATATTATTTTTATTATTTTCTCATAATCTAATATTATATGAAAAAAGTCAAGTTGGTAA
TAAAATATAAATGTTTAAGGGTGGAGCGTGGAAGGTATGTTCGTGAATCACGTGATGGAAGAGAACGTTAAAATGACTAAAATGCCCCTATTATTATTATTATTATGAGTACAAATAGGCATTCATAAACTAATACTAATAATAAGAATCACTTTATTCCCCAAAAGTGGATTTTCCATTGGCATCATAACCAACCATGTGGTTTCTGATGCCAAAACATCCACCATGTTCTTCAAATCATGGGCTTCCATTTGTAGTACACTCTATAATACTAATAATAAGAATCTCCCCACGTTGTCATCTGAGTTGACACCATGTTTTGATAGAACGTTCGCCACGGATCCAAATGGATTGCATACACTTTACGTTAAGTCTTTTGAAATCTTTGTCACTAACCCGAAAGAGGTGATCTCGGATGATGTGGTGTATGCCACGTTTGAGCTTACACGCATTGACATAGAGAAGGTGAGGAGAAGAGTGGTAGCAACTTCTTCATCCACTCCTCGTCGTTTAACCACTTTTATGTTGGCGTTTTCTCTTGTATCGACTTGTATTCTCTTGTATGGGGTTTTTTCAATGGCGGAGAGTAGAAATGGGGATGGAGGAGTTGAGCTTGGAATTGCTCTTCCACCTCAAGCTATGGACAAGTTTTGCTCTATATTTTCAGAG

mRNA sequence

AATTTCGCAAACGCTTTGCCTTCTCTTCCACATTTAGCTCACTTCCAGATCCCCACGACCTTATTGTCTTCACTTTTTTTTTTTTCAAAACAAAATTTAACATTCCAATCTCTTCTTCAATTTCATTCCCTCATTATCACCACTGGCAACTCGAACAATGCCTTCTTTGCCACTAAGCTCATGGCCTTTTATGCCTGTCATGGGCAACCTGCGTTCTCCACGCAATTGTTTCGATTTGTTCATCCTAAGGACAAATTTCTTTGGAATTCCATTATCCAATCCCATTTCTCCAATGGTGATTACCTACAGGCATTTGATTTCTACCTTGAGATGCGAGCATCGAGTAGCCTGCCAAACCAATTTACAATTCCCATGGTGGTTTCCACTTGTGCGGAACTAATGATGCTCAACCATGGCATGAACATTCATGGGTTGGCTTTGAAACTTGGGCTCTTTGTTGGTAATTCTGCTGTTGGTTCTTCTTTGATATACATGTATTCCAAATGTGGTAACGCAGAAAGTGCATCTCTCATGTTCAATGAAATTACTGTTAAGGATGTAGTTGCTTGGACTGCCCTTATAATTGGTTATGTCCAGAATAACGAGAGTGAGAAAGGTTTGAAATGTTTGTTTGAGATGCATAGGAATGGATGTACCCCAAATTATAGAACAATAGGAGGTGGGTTTCAAGCTTGTGTTGATTTGGAGGCTTTAGTAGAGGGTAGATGCTTACATGGTTTGGCTTTAAAAAGTGGATTTCTCTGTTTTGAAGTCGTTAAATCTTCTATTCTCTCGATGTACTCGAGGTGTGGGTCACCTGAAGAAGCTTATCGTTGTTTTTTTAAATTGGAGCAAAAAGATCTCATCTCTTGGACATCAATTATTGCAGTTCACTCTAAACTCGGGTTGATGAGTGAATGTCTACATTTATTTTGGGAGATGCAGGCCAGTGGAATAATTCCAGATGACATCGTGATCAGTTGCATGCTTCTGGGTTTTGGTAATTTTGATAGAATCTCTGAAGGAAAAGCCTTACATGCTTGGATTCTGAAACAATGTTGTGCAATGAGTGGAATAACTCACAATGCATTACTCTCCATGTATTGTAAGTTCGGACTCTTACGTATGGCAGATAAGATCTTCCATAGTTTCCATAAAAGCAGTGAAGATTGGAACACAATGATATTAGGATACAGCAATATGGGGGAGAAAGAAAAGTGTATAGACTTTTTCAGGGAGATGCACCTCTTAGGCATAGAACCTGATTTGAATAGTTTAGTTTCGGTCATTTCTTCATGTTTACCAGTTGGAGCTGTGAATATTGGTCGGTCTGTGCACTGCTATGCGATTAAAAACTCGATCATTGACAATGTATCAATAGCTAACTCACTCTTGGACATGTACGGAAAAAGTGGTAATTTAACCGCCGCATGGAGGATATTTCATAGGACACAACAAAAGGATATTGTCTCATGGAATACACTGATTTCGTCCTACAAGCAAAGTGGGCACCCTTCTGAAGCAATTGATTTATTCGATAAAATGATTAAAGAAAAGTTCAACCCCAACGGAGTTACCTGCGTAATAGTTCTGATATCAACCTTCGATTCGTTACCTTACATGGAAGAACGTATCGAGGAACTCACTGAAAAAACGTTCGCCACGGATCCAAATGGATTGCATACACTTTACGTTAAGTCTTTTGAAATCTTTGTCACTAACCCGAAAGAGGTGATCTCGGATGATGTGGTGTATGCCACGTTTGAGCTTACACGCATTGACATAGAGAAGGTGAGGAGAAGAGTGGTAGCAACTTCTTCATCCACTCCTCGTCGTTTAACCACTTTTATGTTGGCGTTTTCTCTTGTATCGACTTGTATTCTCTTGTATGGGGTTTTTTCAATGGCGGAGAGTAGAAATGGGGATGGAGGAGTTGAGCTTGGAATTGCTCTTCCACCTCAAGCTATGGACAAGTTTTGCTCTATATTTTCAGAG

Coding sequence (CDS)

AATTTCGCAAACGCTTTGCCTTCTCTTCCACATTTAGCTCACTTCCAGATCCCCACGACCTTATTGTCTTCACTTTTTTTTTTTTCAAAACAAAATTTAACATTCCAATCTCTTCTTCAATTTCATTCCCTCATTATCACCACTGGCAACTCGAACAATGCCTTCTTTGCCACTAAGCTCATGGCCTTTTATGCCTGTCATGGGCAACCTGCGTTCTCCACGCAATTGTTTCGATTTGTTCATCCTAAGGACAAATTTCTTTGGAATTCCATTATCCAATCCCATTTCTCCAATGGTGATTACCTACAGGCATTTGATTTCTACCTTGAGATGCGAGCATCGAGTAGCCTGCCAAACCAATTTACAATTCCCATGGTGGTTTCCACTTGTGCGGAACTAATGATGCTCAACCATGGCATGAACATTCATGGGTTGGCTTTGAAACTTGGGCTCTTTGTTGGTAATTCTGCTGTTGGTTCTTCTTTGATATACATGTATTCCAAATGTGGTAACGCAGAAAGTGCATCTCTCATGTTCAATGAAATTACTGTTAAGGATGTAGTTGCTTGGACTGCCCTTATAATTGGTTATGTCCAGAATAACGAGAGTGAGAAAGGTTTGAAATGTTTGTTTGAGATGCATAGGAATGGATGTACCCCAAATTATAGAACAATAGGAGGTGGGTTTCAAGCTTGTGTTGATTTGGAGGCTTTAGTAGAGGGTAGATGCTTACATGGTTTGGCTTTAAAAAGTGGATTTCTCTGTTTTGAAGTCGTTAAATCTTCTATTCTCTCGATGTACTCGAGGTGTGGGTCACCTGAAGAAGCTTATCGTTGTTTTTTTAAATTGGAGCAAAAAGATCTCATCTCTTGGACATCAATTATTGCAGTTCACTCTAAACTCGGGTTGATGAGTGAATGTCTACATTTATTTTGGGAGATGCAGGCCAGTGGAATAATTCCAGATGACATCGTGATCAGTTGCATGCTTCTGGGTTTTGGTAATTTTGATAGAATCTCTGAAGGAAAAGCCTTACATGCTTGGATTCTGAAACAATGTTGTGCAATGAGTGGAATAACTCACAATGCATTACTCTCCATGTATTGTAAGTTCGGACTCTTACGTATGGCAGATAAGATCTTCCATAGTTTCCATAAAAGCAGTGAAGATTGGAACACAATGATATTAGGATACAGCAATATGGGGGAGAAAGAAAAGTGTATAGACTTTTTCAGGGAGATGCACCTCTTAGGCATAGAACCTGATTTGAATAGTTTAGTTTCGGTCATTTCTTCATGTTTACCAGTTGGAGCTGTGAATATTGGTCGGTCTGTGCACTGCTATGCGATTAAAAACTCGATCATTGACAATGTATCAATAGCTAACTCACTCTTGGACATGTACGGAAAAAGTGGTAATTTAACCGCCGCATGGAGGATATTTCATAGGACACAACAAAAGGATATTGTCTCATGGAATACACTGATTTCGTCCTACAAGCAAAGTGGGCACCCTTCTGAAGCAATTGATTTATTCGATAAAATGATTAAAGAAAAGTTCAACCCCAACGGAGTTACCTGCGTAATAGTTCTGATATCAACCTTCGATTCGTTACCTTACATGGAAGAACGTATCGAGGAACTCACTGAAAAAACGTTCGCCACGGATCCAAATGGATTGCATACACTTTACGTTAAGTCTTTTGAAATCTTTGTCACTAACCCGAAAGAGGTGATCTCGGATGATGTGGTGTATGCCACGTTTGAGCTTACACGCATTGACATAGAGAAGGTGAGGAGAAGAGTGGTAGCAACTTCTTCATCCACTCCTCGTCGTTTAACCACTTTTATGTTGGCGTTTTCTCTTGTATCGACTTGTATTCTCTTGTATGGGGTTTTTTCAATGGCGGAGAGTAGAAATGGGGATGGAGGAGTTGAGCTTGGAATTGCTCTTCCACCTCAAGCTATGGACAAGTTTTGCTCTATATTTTCAGAG

Protein sequence

NFANALPSLPHLAHFQIPTTLLSSLFFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFHKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVLISTFDSLPYMEERIEELTEKTFATDPNGLHTLYVKSFEIFVTNPKEVISDDVVYATFELTRIDIEKVRRRVVATSSSTPRRLTTFMLAFSLVSTCILLYGVFSMAESRNGDGGVELGIALPPQAMDKFCSIFSE
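
The protein sequence above corresponds to a frame-0 translation of the listed coding sequence. The sketch below illustrates that relationship on the first 27 bases of the CDS. It assumes the standard nuclear genetic code, and the helper name `translate` is hypothetical rather than part of any tool used to build this record.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First nine codons of the CDS listed above.
cds_prefix = "AATTTCGCAAACGCTTTGCCTTCTCTT"
print(translate(cds_prefix))
```

The printed peptide, NFANALPSL, matches the first nine residues of the protein sequence on this page.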
Homology
BLAST of Cp4.1LG16g01900 vs. ExPASy Swiss-Prot
Match: Q3E9N1 (Pentatricopeptide repeat-containing protein At4g39952, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E98 PE=2 SV=2)

HSP 1 Score: 478.0 bits (1229), Expect = 1.7e-133
Identity = 242/508 (47.64%), Positives = 339/508 (66.73%), Query Frame = 0

Query: 31  QNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDKFLWNS 90
           Q+L+ +SL + ++LIIT G S N F A+KL++ YA +G+P  S+++F  V  +D FLWNS
Sbjct: 36  QSLSLESLRKHNALIITGGLSENIFVASKLISSYASYGKPNLSSRVFHLVTRRDIFLWNS 95

Query: 91  IIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLG 150
           II++HFSNGDY ++  F+  M  S   P+ FT PMVVS CAEL+  + G  +HGL LK G
Sbjct: 96  IIKAHFSNGDYARSLCFFFSMLLSGQSPDHFTAPMVVSACAELLWFHVGTFVHGLVLKHG 155

Query: 151 LFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCL 210
            F  N+AVG+S +Y YSKCG  + A L+F+E+  +DVVAWTA+I G+VQN ESE GL  L
Sbjct: 156 GFDRNTAVGASFVYFYSKCGFLQDACLVFDEMPDRDVVAWTAIISGHVQNGESEGGLGYL 215

Query: 211 FEMHRNGC---TPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILSMY 270
            +MH  G     PN RT+  GFQAC +L AL EGRCLHG A+K+G    + V+SS+ S Y
Sbjct: 216 CKMHSAGSDVDKPNPRTLECGFQACSNLGALKEGRCLHGFAVKNGLASSKFVQSSMFSFY 275

Query: 271 SRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVIS 330
           S+ G+P EAY  F +L  +D+ SWTSIIA  ++ G M E   +FWEMQ  G+ PD +VIS
Sbjct: 276 SKSGNPSEAYLSFRELGDEDMFSWTSIIASLARSGDMEESFDMFWEMQNKGMHPDGVVIS 335

Query: 331 CMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIF--HSFH 390
           C++   G    + +GKA H ++++ C ++     N+LLSMYCKF LL +A+K+F   S  
Sbjct: 336 CLINELGKMMLVPQGKAFHGFVIRHCFSLDSTVCNSLLSMYCKFELLSVAEKLFCRISEE 395

Query: 391 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 450
            + E WNTM+ GY  M    KCI+ FR++  LGIE D  S  SVISSC  +GAV +G+S+
Sbjct: 396 GNKEAWNTMLKGYGKMKCHVKCIELFRKIQNLGIEIDSASATSVISSCSHIGAVLLGKSL 455

Query: 451 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 510
           HCY +K S+   +S+ NSL+D+YGK G+LT AWR+F      ++++WN +I+SY      
Sbjct: 456 HCYVVKTSLDLTISVVNSLIDLYGKMGDLTVAWRMFCEA-DTNVITWNAMIASYVHCEQS 515

Query: 511 SEAIDLFDKMIKEKFNPNGVTCVIVLIS 534
            +AI LFD+M+ E F P+ +T V +L++
Sbjct: 516 EKAIALFDRMVSENFKPSSITLVTLLMA 542
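
The percentages in each HSP summary line are the listed fractions expressed over the alignment length. A small sketch of that arithmetic, using the counts from the Q3E9N1 hit above, is given here. The function name `percent` is hypothetical, and two-decimal rounding is assumed from the values shown on this page.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return f"{100 * matches / alignment_length:.2f}%"


# Counts from the first Swiss-Prot hit (Q3E9N1): 242 identical and
# 339 positive-scoring positions over 508 aligned columns.
print(percent(242, 508))
print(percent(339, 508))
```

These reproduce the 47.64% and 66.73% reported for that hit.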

BLAST of Cp4.1LG16g01900 vs. ExPASy Swiss-Prot
Match: Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 253.1 bits (645), Expect = 9.0e-66
Identity = 152/505 (30.10%), Positives = 275/505 (54.46%), Query Frame = 0

Query: 29  SKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHP-KDKFL 88
           S  NL    L + H+L+I+ G  ++ FF+ KL+  Y+   +PA S  +FR V P K+ +L
Sbjct: 16  SSSNL--NELRRIHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVSPAKNVYL 75

Query: 89  WNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLAL 148
           WNSII++   NG + +A +FY ++R S   P+++T P V+  CA L     G  ++   L
Sbjct: 76  WNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQIL 135

Query: 149 KLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGL 208
            +G F  +  VG++L+ MYS+ G    A  +F+E+ V+D+V+W +LI GY  +   E+ L
Sbjct: 136 DMG-FESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEAL 195

Query: 209 KCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILSMY 268
           +   E+  +   P+  T+     A  +L  + +G+ LHG ALKSG     VV + +++MY
Sbjct: 196 EIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMY 255

Query: 269 SRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVIS 328
            +   P +A R F +++ +D +S+ ++I  + KL ++ E + +F E       PD + +S
Sbjct: 256 LKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLE-NLDQFKPDLLTVS 315

Query: 329 CMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFH-K 388
            +L   G+   +S  K ++ ++LK    +     N L+ +Y K G +  A  +F+S   K
Sbjct: 316 SVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSMECK 375

Query: 389 SSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSVH 448
            +  WN++I GY   G+  + +  F+ M ++  + D  + + +IS    +  +  G+ +H
Sbjct: 376 DTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLH 435

Query: 449 CYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPS 508
              IK+ I  ++S++N+L+DMY K G +  + +IF      D V+WNT+IS+  + G  +
Sbjct: 436 SNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFA 495

Query: 509 EAIDLFDKMIKEKFNPNGVTCVIVL 532
             + +  +M K +  P+  T ++ L
Sbjct: 496 TGLQVTTQMRKSEVVPDMATFLVTL 516

BLAST of Cp4.1LG16g01900 vs. ExPASy Swiss-Prot
Match: Q9ZQ74 (Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E47 PE=3 SV=1)

HSP 1 Score: 251.9 bits (642), Expect = 2.0e-65
Identity = 149/512 (29.10%), Positives = 262/512 (51.17%), Query Frame = 0

Query: 23  SSLFFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHP 82
           S  F    +     SL Q H ++   G   +   ATKL++ Y   G    +  +F  +  
Sbjct: 45  SPCFLLLSKCTNIDSLRQSHGVLTGNGLMGDISIATKLVSLYGFFGYTKDARLVFDQIPE 104

Query: 83  KDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNI 142
            D +LW  +++ +  N + ++    Y  +       +       +  C EL  L++G  I
Sbjct: 105 PDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQDLDNGKKI 164

Query: 143 HGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNE 202
           H   +K+  F  ++ V + L+ MY+KCG  +SA  +FN+IT+++VV WT++I GYV+N+ 
Sbjct: 165 HCQLVKVPSF--DNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGYVKNDL 224

Query: 203 SEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSS 262
            E+GL     M  N    N  T G    AC  L AL +G+  HG  +KSG      + +S
Sbjct: 225 CEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSGIELSSCLVTS 284

Query: 263 ILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPD 322
           +L MY +CG    A R F +    DL+ WT++I  ++  G ++E L LF +M+   I P+
Sbjct: 285 LLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGVEIKPN 344

Query: 323 DIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFH 382
            + I+ +L G G  + +  G+++H   +K     + +  NAL+ MY K    R A  +F 
Sbjct: 345 CVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVA-NALVHMYAKCYQNRDAKYVFE 404

Query: 383 -SFHKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNI 442
               K    WN++I G+S  G   + +  F  M+   + P+  ++ S+ S+C  +G++ +
Sbjct: 405 MESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLGSLAV 464

Query: 443 GRSVHCYAIKNSII--DNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSY 502
           G S+H Y++K   +   +V +  +LLD Y K G+  +A  IF   ++K+ ++W+ +I  Y
Sbjct: 465 GSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIEEKNTITWSAMIGGY 524

Query: 503 KQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVL 532
            + G    +++LF++M+K++  PN  T   +L
Sbjct: 525 GKQGDTIGSLELFEEMLKKQQKPNESTFTSIL 553

BLAST of Cp4.1LG16g01900 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 2.2e-64
Identity = 153/533 (28.71%), Positives = 263/533 (49.34%), Query Frame = 0

Query: 1   NFANALPSLPHLAHFQIPTTLLSSLFFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKL 60
           N  NA+  L     + I    L S+      + + +   +  + I   G   ++   +KL
Sbjct: 76  NLENAVKLLCVSGKWDIDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKL 135

Query: 61  MAFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQ 120
              Y   G    ++++F  V  +    WN ++     +GD+  +   + +M +S    + 
Sbjct: 136 SLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDS 195

Query: 121 FTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN 180
           +T   V  + + L  ++ G  +HG  LK G F   ++VG+SL+  Y K    +SA  +F+
Sbjct: 196 YTFSCVSKSFSSLRSVHGGEQLHGFILKSG-FGERNSVGNSLVAFYLKNQRVDSARKVFD 255

Query: 181 EITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVE 240
           E+T +DV++W ++I GYV N  +EKGL    +M  +G   +  TI   F  C D   +  
Sbjct: 256 EMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISL 315

Query: 241 GRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSK 300
           GR +H + +K+ F   +   +++L MYS+CG  + A   F ++  + ++S+TS+IA +++
Sbjct: 316 GRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAR 375

Query: 301 LGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGIT 360
            GL  E + LF EM+  GI PD   ++ +L     +  + EGK +H WI +         
Sbjct: 376 EGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFV 435

Query: 361 HNALLSMYCKFGLLRMADKIFHSFH-KSSEDWNTMILGYSNMGEKEKCIDFFR-EMHLLG 420
            NAL+ MY K G ++ A+ +F     K    WNT+I GYS      + +  F   +    
Sbjct: 436 SNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKR 495

Query: 421 IEPDLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAW 480
             PD  ++  V+ +C  + A + GR +H Y ++N    +  +ANSL+DMY K G L  A 
Sbjct: 496 FSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAH 555

Query: 481 RIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVL 532
            +F     KD+VSW  +I+ Y   G   EAI LF++M +     + ++ V +L
Sbjct: 556 MLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLL 607

BLAST of Cp4.1LG16g01900 vs. ExPASy Swiss-Prot
Match: O04659 (Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E14 PE=2 SV=2)

HSP 1 Score: 243.4 bits (620), Expect = 7.1e-63
Identity = 175/670 (26.12%), Positives = 313/670 (46.72%), Query Frame = 0

Query: 21  LLSSLFFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLM-AFYACHGQPAFSTQLFRF 80
           LLS L   +    + + +   H  I+T G   +      L+  ++ C    +       F
Sbjct: 6   LLSLLRECTNSTKSLRRIKLVHQRILTLGLRRDVVLCKSLINVYFTCKDHCSARHVFENF 65

Query: 81  VHPKDKFLWNSIIQSHFSNGDYLQAFDFYLE-MRASSSLPNQFTIPMVVSTCAELMMLNH 140
               D ++WNS++  +  N  +    + +   +  S  +P+ FT P V+     L     
Sbjct: 66  DIRSDVYIWNSLMSGYSKNSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREFL 125

Query: 141 GMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYV 200
           G  IH L +K G +V +  V SSL+ MY+K    E++  +F+E+  +DV +W  +I  + 
Sbjct: 126 GRMIHTLVVKSG-YVCDVVVASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFY 185

Query: 201 QNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEV 260
           Q+ E+EK L+    M  +G  PN  ++     AC  L  L  G+ +H   +K GF   E 
Sbjct: 186 QSGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEY 245

Query: 261 VKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASG 320
           V S+++ MY +C   E A   F K+ +K L++W S+I  +   G    C+ +   M   G
Sbjct: 246 VNSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMIIEG 305

Query: 321 IIPDDIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMAD 380
             P    ++ +L+       +  GK +H ++++         + +L+ +Y K G   +A+
Sbjct: 306 TRPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAE 365

Query: 381 KIFHSFHKS-SEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVG 440
            +F    K  +E WN MI  Y ++G   K ++ + +M  +G++PD+ +  SV+ +C  + 
Sbjct: 366 TVFSKTQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSVGVKPDVVTFTSVLPACSQLA 425

Query: 441 AVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLIS 500
           A+  G+ +H    ++ +  +  + ++LLDMY K GN   A+RIF+   +KD+VSW  +IS
Sbjct: 426 ALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEAFRIFNSIPKKDVVSWTVMIS 485

Query: 501 SYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVLIS-----------TFDSLPYMEERIE 560
           +Y   G P EA+  FD+M K    P+GVT + VL +            F S    +  IE
Sbjct: 486 AYGSHGQPREALYQFDEMQKFGLKPDGVTLLAVLSACGHAGLIDEGLKFFSQMRSKYGIE 545

Query: 561 ELTEK-TFATDPNGLHTLYVKSFEIFVTNPKEVISDDVVYATFELTRIDIE-----KVRR 620
            + E  +   D  G     ++++EI    P+   + +++   F    + +E     ++ R
Sbjct: 546 PIIEHYSCMIDILGRAGRLLEAYEIIQQTPETSDNAELLSTLFSACCLHLEHSLGDRIAR 605

Query: 621 RVVATSSSTPRRLTTFMLAFSLVSTCILLYGVFSMAESRNGDGGVELGIALPP-----QA 666
            +V    + P   +T+M+ F+L ++     G    A  R      E+G+   P     + 
Sbjct: 606 LLV---ENYPDDASTYMVLFNLYAS-----GESWDAARRVRLKMKEMGLRKKPGCSWIEM 665

BLAST of Cp4.1LG16g01900 vs. NCBI nr
Match: XP_023513090.1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1032 bits (2668), Expect = 0.0
Identity = 518/550 (94.18%), Positives = 523/550 (95.09%), Query Frame = 0

Query: 26  FFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 85
           FFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK
Sbjct: 37  FFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 96

Query: 86  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 145
           FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL
Sbjct: 97  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 156

Query: 146 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 205
           ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK
Sbjct: 157 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 216

Query: 206 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS 265
           GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS
Sbjct: 217 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS 276

Query: 266 MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV 325
           MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV
Sbjct: 277 MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV 336

Query: 326 ISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFH 385
           ISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFH
Sbjct: 337 ISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFH 396

Query: 386 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 445
           KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSV
Sbjct: 397 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 456

Query: 446 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 505
           HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP
Sbjct: 457 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 516

Query: 506 SEAIDLFDKMIKEKFNPNGVTCVIVL--ISTFDSLPYMEERIEELTEKTFATD---PNGL 565
           SEAIDLFDKMIKEKFNPNGVTCVIVL   S   SL   E+  + + E  F TD      L
Sbjct: 517 SEAIDLFDKMIKEKFNPNGVTCVIVLSACSHLASLDKGEKIHQYIKENGFETDITVRTAL 576

Query: 566 HTLYVKSFEI 570
             +Y K  E+
Sbjct: 577 IDMYAKCGEL 586

BLAST of Cp4.1LG16g01900 vs. NCBI nr
Match: XP_022986468.1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 1019 bits (2636), Expect = 0.0
Identity = 505/550 (91.82%), Positives = 522/550 (94.91%), Query Frame = 0

Query: 26  FFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 85
           FFFSKQNLTFQSLLQFHSL+ITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK
Sbjct: 37  FFFSKQNLTFQSLLQFHSLVITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 96

Query: 86  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 145
           FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL
Sbjct: 97  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 156

Query: 146 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 205
           ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN+ITVKDVVAWTALIIGYVQNNESEK
Sbjct: 157 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNDITVKDVVAWTALIIGYVQNNESEK 216

Query: 206 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS 265
           GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS
Sbjct: 217 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS 276

Query: 266 MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV 325
           MYSRCGSPEEAYRCFFKLEQKDLISWTSI+AVHSKLGLMSECLHLFWEMQASGIIPDDIV
Sbjct: 277 MYSRCGSPEEAYRCFFKLEQKDLISWTSIMAVHSKLGLMSECLHLFWEMQASGIIPDDIV 336

Query: 326 ISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFH 385
           ISCMLLGFGNFDRISEGKA HAWILKQCCA+SGITHNALLSMYCKFGLLR ADKIFHSFH
Sbjct: 337 ISCMLLGFGNFDRISEGKAFHAWILKQCCAVSGITHNALLSMYCKFGLLRTADKIFHSFH 396

Query: 386 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 445
           KSSEDWNTMILGYSNMGEKEKCIDFFREM+LLGIEPDLNSLVSVISSCLPVGAVNIGRSV
Sbjct: 397 KSSEDWNTMILGYSNMGEKEKCIDFFREMYLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 456

Query: 446 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 505
           HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDI+SWNTLIS+YKQSGHP
Sbjct: 457 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIISWNTLISTYKQSGHP 516

Query: 506 SEAIDLFDKMIKEKFNPNGVTCVIVL-----ISTFDSLPYMEERIEELTEKTFATDPNGL 565
           SEAIDLFDKMIKEKFNPNGVTC+IVL     +++ D    + + I+E   +T  T    L
Sbjct: 517 SEAIDLFDKMIKEKFNPNGVTCIIVLSACAHLASLDKGERIHQYIKENGFETDITVRTAL 576

Query: 566 HTLYVKSFEI 570
             +Y K  E+
Sbjct: 577 IDMYAKCGEL 586

BLAST of Cp4.1LG16g01900 vs. NCBI nr
Match: XP_022944467.1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 1008 bits (2607), Expect = 0.0
Identity = 505/550 (91.82%), Positives = 517/550 (94.00%), Query Frame = 0

Query: 26  FFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 85
           FFFSKQNL+FQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK
Sbjct: 37  FFFSKQNLSFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 96

Query: 86  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 145
           FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL
Sbjct: 97  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 156

Query: 146 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 205
           ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK
Sbjct: 157 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 216

Query: 206 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS 265
           GLKCLFEMHRNGCTPNYRTIGGGFQACVDL+ALVEGRCLHGLALKSGFLCFEVVKSSILS
Sbjct: 217 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILS 276

Query: 266 MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV 325
           MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV
Sbjct: 277 MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV 336

Query: 326 ISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFH 385
           ISCMLLGFGNFDRISEG A HAWILKQC A+SGITHNALLSMYCKFGLLR ADKIFHSFH
Sbjct: 337 ISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFH 396

Query: 386 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 445
           KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPV AVNIGRSV
Sbjct: 397 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSV 456

Query: 446 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 505
           HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP
Sbjct: 457 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 516

Query: 506 SEAIDLFDKMIKEKFNPNGVTCVIVL-----ISTFDSLPYMEERIEELTEKTFATDPNGL 565
           SEAIDLFDKMIKEKFNPN VTCVI L     +++ D    + + I+E   +T  T    L
Sbjct: 517 SEAIDLFDKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTAL 576

Query: 566 HTLYVKSFEI 570
             +Y K  E+
Sbjct: 577 IDMYAKCGEL 586

BLAST of Cp4.1LG16g01900 vs. NCBI nr
Match: KAG6571177.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 954 bits (2465), Expect = 0.0
Identity = 474/515 (92.04%), Positives = 485/515 (94.17%), Query Frame = 0

Query: 61  MAFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQ 120
           MAFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQ
Sbjct: 1   MAFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQ 60

Query: 121 FTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN 180
           FTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN
Sbjct: 61  FTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN 120

Query: 181 EITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVE 240
           EITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDL+ALVE
Sbjct: 121 EITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLDALVE 180

Query: 241 GRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSK 300
           GRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSK
Sbjct: 181 GRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSK 240

Query: 301 LGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGIT 360
           LGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKA HAWILKQCCAMSGIT
Sbjct: 241 LGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKAFHAWILKQCCAMSGIT 300

Query: 361 HNALLSMYCKFGLLRMADKIFHSFHKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIE 420
           HNALLSMYCKFGLLR ADKIFHSFH+SSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIE
Sbjct: 301 HNALLSMYCKFGLLRTADKIFHSFHRSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIE 360

Query: 421 PDLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRI 480
           PDLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGK GNLTAA RI
Sbjct: 361 PDLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKRGNLTAASRI 420

Query: 481 FHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVL-----ISTF 540
           FHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVL     +++ 
Sbjct: 421 FHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVLSACAHLASL 480

Query: 541 DSLPYMEERIEELTEKTFATDPNGLHTLYVKSFEI 570
           D    + + I+E   +T  T    L  +Y K  E+
Sbjct: 481 DKGEKIHQYIKENGFETDITVRTALIDMYAKCGEL 515

BLAST of Cp4.1LG16g01900 vs. NCBI nr
Match: KAG7010985.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 952 bits (2462), Expect = 0.0
Identity = 473/515 (91.84%), Positives = 485/515 (94.17%), Query Frame = 0

Query: 61  MAFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQ 120
           MAFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQ
Sbjct: 1   MAFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQ 60

Query: 121 FTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN 180
           FTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN
Sbjct: 61  FTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN 120

Query: 181 EITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVE 240
           EITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDL+ALVE
Sbjct: 121 EITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLDALVE 180

Query: 241 GRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSK 300
           GRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSK
Sbjct: 181 GRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSK 240

Query: 301 LGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGIT 360
           LGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKA HAWILKQCCAMSGIT
Sbjct: 241 LGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKAFHAWILKQCCAMSGIT 300

Query: 361 HNALLSMYCKFGLLRMADKIFHSFHKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIE 420
           HNALLSMYCKFGLLR ADKIFHSFH+SSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIE
Sbjct: 301 HNALLSMYCKFGLLRTADKIFHSFHRSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIE 360

Query: 421 PDLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRI 480
           PDLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGK GNLTAA RI
Sbjct: 361 PDLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKRGNLTAASRI 420

Query: 481 FHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVL-----ISTF 540
           FHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEK+NPNGVTCVIVL     +++ 
Sbjct: 421 FHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKYNPNGVTCVIVLSACAHLASL 480

Query: 541 DSLPYMEERIEELTEKTFATDPNGLHTLYVKSFEI 570
           D    + + I+E   +T  T    L  +Y K  E+
Sbjct: 481 DKGEKIHQYIKENGFETDITVRTALIDMYAKCGEL 515
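
The percentages on each HSP statistics line follow directly from the two counts and the alignment length. The following is a minimal sketch, not part of the database output, that parses such a line and recomputes both values. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical choices made for illustration.

```python
import re

# Matches the per-HSP statistics line. "Pos(?:i)?tives" accepts both the
# "Postives" and the "Positives" spelling of the label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    # The reported percentages (groups 3 and 6) are ignored and recomputed.
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 473/515 (91.84%), Postives = 485/515 (94.17%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

The recomputed values can be compared with the percentages printed in parentheses as a quick consistency check on a parsed report.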

BLAST of Cp4.1LG16g01900 vs. ExPASy TrEMBL
Match: A0A6J1JGK6 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111484200 PE=4 SV=1)

HSP 1 Score: 1019 bits (2636), Expect = 0.0
Identity = 505/550 (91.82%), Positives = 522/550 (94.91%), Query Frame = 0

Query: 26  FFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 85
           FFFSKQNLTFQSLLQFHSL+ITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK
Sbjct: 37  FFFSKQNLTFQSLLQFHSLVITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 96

Query: 86  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 145
           FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL
Sbjct: 97  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 156

Query: 146 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 205
           ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN+ITVKDVVAWTALIIGYVQNNESEK
Sbjct: 157 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNDITVKDVVAWTALIIGYVQNNESEK 216

Query: 206 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS 265
           GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS
Sbjct: 217 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS 276

Query: 266 MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV 325
           MYSRCGSPEEAYRCFFKLEQKDLISWTSI+AVHSKLGLMSECLHLFWEMQASGIIPDDIV
Sbjct: 277 MYSRCGSPEEAYRCFFKLEQKDLISWTSIMAVHSKLGLMSECLHLFWEMQASGIIPDDIV 336

Query: 326 ISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFH 385
           ISCMLLGFGNFDRISEGKA HAWILKQCCA+SGITHNALLSMYCKFGLLR ADKIFHSFH
Sbjct: 337 ISCMLLGFGNFDRISEGKAFHAWILKQCCAVSGITHNALLSMYCKFGLLRTADKIFHSFH 396

Query: 386 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 445
           KSSEDWNTMILGYSNMGEKEKCIDFFREM+LLGIEPDLNSLVSVISSCLPVGAVNIGRSV
Sbjct: 397 KSSEDWNTMILGYSNMGEKEKCIDFFREMYLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 456

Query: 446 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 505
           HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDI+SWNTLIS+YKQSGHP
Sbjct: 457 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIISWNTLISTYKQSGHP 516

Query: 506 SEAIDLFDKMIKEKFNPNGVTCVIVL-----ISTFDSLPYMEERIEELTEKTFATDPNGL 565
           SEAIDLFDKMIKEKFNPNGVTC+IVL     +++ D    + + I+E   +T  T    L
Sbjct: 517 SEAIDLFDKMIKEKFNPNGVTCIIVLSACAHLASLDKGERIHQYIKENGFETDITVRTAL 576

Query: 566 HTLYVKSFEI 570
             +Y K  E+
Sbjct: 577 IDMYAKCGEL 586
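
In the alignment blocks, the line between Query and Sbjct marks identical residues with the residue letter, conservative substitutions with `+`, and all other columns (mismatches and gaps) with a space. The sketch below, an illustration rather than part of the BLAST output, counts identities and positives for one 60-column block copied from the alignment above (query positions 146 to 205). The helper name `midline_counts` is hypothetical, and the midline must keep its original spacing for the column count to be right.

```python
def midline_counts(midline):
    """Count identities, positives and columns in one BLAST midline string."""
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include the identities plus the '+' columns.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Midline of the block covering query residues 146..205 (E aligned to D at one column).
MIDLINE = "ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN+ITVKDVVAWTALIIGYVQNNESEK"

identities, positives, columns = midline_counts(MIDLINE)
print(identities, positives, columns)
```

Summing these per-block counts over every block of an HSP gives the numerators reported on its Identity and Positives line.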

BLAST of Cp4.1LG16g01900 vs. ExPASy TrEMBL
Match: A0A6J1FVR1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111448916 PE=4 SV=1)

HSP 1 Score: 1008 bits (2607), Expect = 0.0
Identity = 505/550 (91.82%), Positives = 517/550 (94.00%), Query Frame = 0

Query: 26  FFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 85
           FFFSKQNL+FQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK
Sbjct: 37  FFFSKQNLSFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDK 96

Query: 86  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 145
           FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL
Sbjct: 97  FLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 156

Query: 146 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 205
           ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK
Sbjct: 157 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 216

Query: 206 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILS 265
           GLKCLFEMHRNGCTPNYRTIGGGFQACVDL+ALVEGRCLHGLALKSGFLCFEVVKSSILS
Sbjct: 217 GLKCLFEMHRNGCTPNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILS 276

Query: 266 MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV 325
           MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV
Sbjct: 277 MYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIV 336

Query: 326 ISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFH 385
           ISCMLLGFGNFDRISEG A HAWILKQC A+SGITHNALLSMYCKFGLLR ADKIFHSFH
Sbjct: 337 ISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFH 396

Query: 386 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 445
           KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPV AVNIGRSV
Sbjct: 397 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSV 456

Query: 446 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 505
           HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP
Sbjct: 457 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 516

Query: 506 SEAIDLFDKMIKEKFNPNGVTCVIVL-----ISTFDSLPYMEERIEELTEKTFATDPNGL 565
           SEAIDLFDKMIKEKFNPN VTCVI L     +++ D    + + I+E   +T  T    L
Sbjct: 517 SEAIDLFDKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTAL 576

Query: 566 HTLYVKSFEI 570
             +Y K  E+
Sbjct: 577 IDMYAKCGEL 586

BLAST of Cp4.1LG16g01900 vs. ExPASy TrEMBL
Match: A0A1S4E2D5 (pentatricopeptide repeat-containing protein At4g39952, mitochondrial isoform X2 OS=Cucumis melo OX=3656 GN=LOC103498381 PE=4 SV=1)

HSP 1 Score: 875 bits (2261), Expect = 1.91e-311
Identity = 442/574 (77.00%), Positives = 491/574 (85.54%), Query Frame = 0

Query: 2   FANALPSLPHLAHFQIPTTLLSSLFFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLM 61
           F++   SLP   H+  P   L S  FFSK +LTFQSLLQFHSLIITTGNS+N FFATKLM
Sbjct: 25  FSSTFTSLPD-PHY--PNNCLHS--FFSKPSLTFQSLLQFHSLIITTGNSDNVFFATKLM 84

Query: 62  AFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQF 121
           AFYA H QPAFST LFR +H KD FLWNSIIQSHFSNGDY +AFDFYL+MRASSSLPNQF
Sbjct: 85  AFYASHRQPAFSTHLFRLIHSKDIFLWNSIIQSHFSNGDYQRAFDFYLQMRASSSLPNQF 144

Query: 122 TIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNE 181
           T+PMVVSTCAELMM NHGMNIHGL  KLGLFV NSA+GSS IYMYSKCG+ ESASLMF+E
Sbjct: 145 TVPMVVSTCAELMMFNHGMNIHGLTSKLGLFVSNSAIGSSFIYMYSKCGHVESASLMFSE 204

Query: 182 ITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEG 241
           ITVKDVVAWTALI+GYVQNNES +GLKCLFEMHR G TPNY+TIG GFQACVDL+ALVEG
Sbjct: 205 ITVKDVVAWTALIVGYVQNNESGRGLKCLFEMHRIGGTPNYKTIGSGFQACVDLDALVEG 264

Query: 242 RCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKL 301
           +CLHGLALK+GFLCF+VVKS+ILSMYSRCGSPEEAYRCF KL+QKDLISWTSIIAVHSK 
Sbjct: 265 KCLHGLALKNGFLCFKVVKSTILSMYSRCGSPEEAYRCFCKLDQKDLISWTSIIAVHSKF 324

Query: 302 GLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITH 361
           GLMSECLHLFWEMQ S IIPD+IVISCML+GFGN  RI EGKA HAWILKQCCAM+GITH
Sbjct: 325 GLMSECLHLFWEMQDSEIIPDEIVISCMLMGFGNSGRIFEGKAFHAWILKQCCAMNGITH 384

Query: 362 NALLSMYCKFGLLRMADKIFHSFHKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEP 421
           NALLSMYCKFG L  A+KIFHSFHKSSEDW+TMILGYSNMG+KE CI F REM LLG EP
Sbjct: 385 NALLSMYCKFGHLGTANKIFHSFHKSSEDWSTMILGYSNMGQKENCISFLREMLLLGREP 444

Query: 422 DLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIF 481
           DLNSLVSVISSC  VGA+NIGRS+HCYAIKNSII+NVSIANSL+DMYGKSG++TA WRIF
Sbjct: 445 DLNSLVSVISSCSQVGAINIGRSIHCYAIKNSIIENVSIANSLMDMYGKSGHVTATWRIF 504

Query: 482 HRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVL-----ISTFD 541
           HRTQQ+D++SWNTLISSYKQSG+ +EAI LFDKM+KEK  PN VTCVIVL     +++ D
Sbjct: 505 HRTQQRDVISWNTLISSYKQSGNLAEAIILFDKMVKEKVYPNKVTCVIVLSVCAHLASLD 564

Query: 542 SLPYMEERIEELTEKTFATDPNGLHTLYVKSFEI 570
               + + I+E   ++  T    L  +Y K  E+
Sbjct: 565 KGEKIHQYIKENGFESNITIRTALIDMYAKCGEL 593

BLAST of Cp4.1LG16g01900 vs. ExPASy TrEMBL
Match: A0A0A0LRH3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G439160 PE=4 SV=1)

HSP 1 Score: 875 bits (2260), Expect = 2.70e-311
Identity = 434/549 (79.05%), Positives = 480/549 (87.43%), Query Frame = 0

Query: 27  FFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDKF 86
           FFSK NLTFQSLLQFHSLIITTGNSNN FFATKLMAFYA H +PAFST LFR +H KD F
Sbjct: 45  FFSKPNLTFQSLLQFHSLIITTGNSNNVFFATKLMAFYAYHRKPAFSTHLFRLIHSKDIF 104

Query: 87  LWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLA 146
           LWNSIIQSHFSNGDY +AFDFYL+MRASSSLPNQFT+PMVVSTCAELMM NHGMNIHGL 
Sbjct: 105 LWNSIIQSHFSNGDYQRAFDFYLQMRASSSLPNQFTVPMVVSTCAELMMFNHGMNIHGLT 164

Query: 147 LKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKG 206
            KLGLFVGNSA+GSS IYMYSKCG+ ESAS+MF+EITVKDVV WTALI+GYVQNNES +G
Sbjct: 165 SKLGLFVGNSAIGSSFIYMYSKCGHVESASIMFSEITVKDVVTWTALIVGYVQNNESGRG 224

Query: 207 LKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILSM 266
           LKCLFEMHR G TPNY+TIG GFQACVDL+ALVEG+CLHGLALK+GFLCFEVVKS+ILSM
Sbjct: 225 LKCLFEMHRIGGTPNYKTIGSGFQACVDLDALVEGKCLHGLALKNGFLCFEVVKSTILSM 284

Query: 267 YSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVI 326
           YSRCGSPEEAYRCF KL+QKDLISWTSIIAVHSK GLMSECLHLFWEMQAS IIPD+IVI
Sbjct: 285 YSRCGSPEEAYRCFCKLDQKDLISWTSIIAVHSKFGLMSECLHLFWEMQASEIIPDEIVI 344

Query: 327 SCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFHK 386
           SCML+GFGN DRI EGKA HA ILKQCCA+SGITHNALLSMYCKFG L  A+KIFHSFHK
Sbjct: 345 SCMLMGFGNSDRIFEGKAFHARILKQCCALSGITHNALLSMYCKFGHLGTANKIFHSFHK 404

Query: 387 SSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSVH 446
           SSEDW+TMILGYSNMG+KEKCI F REM LLG EPDLNSLVSVISSC  VGA+NIGRS+H
Sbjct: 405 SSEDWSTMILGYSNMGQKEKCISFLREMLLLGREPDLNSLVSVISSCSQVGAINIGRSIH 464

Query: 447 CYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPS 506
           CYAIKNSII+NVS+ANSL+DMYGKSG++TA WRIFHRT Q+D++SWNTLISSYKQSG  +
Sbjct: 465 CYAIKNSIIENVSVANSLMDMYGKSGHVTATWRIFHRTLQRDVISWNTLISSYKQSGILA 524

Query: 507 EAIDLFDKMIKEKFNPNGVTCVIVL-----ISTFDSLPYMEERIEELTEKTFATDPNGLH 566
           EAI LFDKM+KEK  PN VTC+IVL     +++ D    + + I+E   ++  T    L 
Sbjct: 525 EAIILFDKMVKEKVYPNKVTCIIVLSACAHLASLDEGEKIHQYIKENGFESNITIRTALI 584

Query: 567 TLYVKSFEI 570
            +Y K  E+
Sbjct: 585 DMYAKCGEL 593

BLAST of Cp4.1LG16g01900 vs. ExPASy TrEMBL
Match: A0A1S3C9K9 (pentatricopeptide repeat-containing protein At4g39952, mitochondrial isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498381 PE=4 SV=1)

HSP 1 Score: 875 bits (2261), Expect = 9.35e-311
Identity = 442/574 (77.00%), Positives = 491/574 (85.54%), Query Frame = 0

Query: 2   FANALPSLPHLAHFQIPTTLLSSLFFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLM 61
           F++   SLP   H+  P   L S  FFSK +LTFQSLLQFHSLIITTGNS+N FFATKLM
Sbjct: 70  FSSTFTSLPD-PHY--PNNCLHS--FFSKPSLTFQSLLQFHSLIITTGNSDNVFFATKLM 129

Query: 62  AFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQF 121
           AFYA H QPAFST LFR +H KD FLWNSIIQSHFSNGDY +AFDFYL+MRASSSLPNQF
Sbjct: 130 AFYASHRQPAFSTHLFRLIHSKDIFLWNSIIQSHFSNGDYQRAFDFYLQMRASSSLPNQF 189

Query: 122 TIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNE 181
           T+PMVVSTCAELMM NHGMNIHGL  KLGLFV NSA+GSS IYMYSKCG+ ESASLMF+E
Sbjct: 190 TVPMVVSTCAELMMFNHGMNIHGLTSKLGLFVSNSAIGSSFIYMYSKCGHVESASLMFSE 249

Query: 182 ITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEG 241
           ITVKDVVAWTALI+GYVQNNES +GLKCLFEMHR G TPNY+TIG GFQACVDL+ALVEG
Sbjct: 250 ITVKDVVAWTALIVGYVQNNESGRGLKCLFEMHRIGGTPNYKTIGSGFQACVDLDALVEG 309

Query: 242 RCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKL 301
           +CLHGLALK+GFLCF+VVKS+ILSMYSRCGSPEEAYRCF KL+QKDLISWTSIIAVHSK 
Sbjct: 310 KCLHGLALKNGFLCFKVVKSTILSMYSRCGSPEEAYRCFCKLDQKDLISWTSIIAVHSKF 369

Query: 302 GLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITH 361
           GLMSECLHLFWEMQ S IIPD+IVISCML+GFGN  RI EGKA HAWILKQCCAM+GITH
Sbjct: 370 GLMSECLHLFWEMQDSEIIPDEIVISCMLMGFGNSGRIFEGKAFHAWILKQCCAMNGITH 429

Query: 362 NALLSMYCKFGLLRMADKIFHSFHKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEP 421
           NALLSMYCKFG L  A+KIFHSFHKSSEDW+TMILGYSNMG+KE CI F REM LLG EP
Sbjct: 430 NALLSMYCKFGHLGTANKIFHSFHKSSEDWSTMILGYSNMGQKENCISFLREMLLLGREP 489

Query: 422 DLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIF 481
           DLNSLVSVISSC  VGA+NIGRS+HCYAIKNSII+NVSIANSL+DMYGKSG++TA WRIF
Sbjct: 490 DLNSLVSVISSCSQVGAINIGRSIHCYAIKNSIIENVSIANSLMDMYGKSGHVTATWRIF 549

Query: 482 HRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVL-----ISTFD 541
           HRTQQ+D++SWNTLISSYKQSG+ +EAI LFDKM+KEK  PN VTCVIVL     +++ D
Sbjct: 550 HRTQQRDVISWNTLISSYKQSGNLAEAIILFDKMVKEKVYPNKVTCVIVLSVCAHLASLD 609

Query: 542 SLPYMEERIEELTEKTFATDPNGLHTLYVKSFEI 570
               + + I+E   ++  T    L  +Y K  E+
Sbjct: 610 KGEKIHQYIKENGFESNITIRTALIDMYAKCGEL 638

BLAST of Cp4.1LG16g01900 vs. TAIR 10
Match: AT4G39952.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 478.0 bits (1229), Expect = 1.2e-134
Identity = 242/508 (47.64%), Positives = 339/508 (66.73%), Query Frame = 0

Query: 31  QNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHPKDKFLWNS 90
           Q+L+ +SL + ++LIIT G S N F A+KL++ YA +G+P  S+++F  V  +D FLWNS
Sbjct: 36  QSLSLESLRKHNALIITGGLSENIFVASKLISSYASYGKPNLSSRVFHLVTRRDIFLWNS 95

Query: 91  IIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLG 150
           II++HFSNGDY ++  F+  M  S   P+ FT PMVVS CAEL+  + G  +HGL LK G
Sbjct: 96  IIKAHFSNGDYARSLCFFFSMLLSGQSPDHFTAPMVVSACAELLWFHVGTFVHGLVLKHG 155

Query: 151 LFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCL 210
            F  N+AVG+S +Y YSKCG  + A L+F+E+  +DVVAWTA+I G+VQN ESE GL  L
Sbjct: 156 GFDRNTAVGASFVYFYSKCGFLQDACLVFDEMPDRDVVAWTAIISGHVQNGESEGGLGYL 215

Query: 211 FEMHRNGC---TPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILSMY 270
            +MH  G     PN RT+  GFQAC +L AL EGRCLHG A+K+G    + V+SS+ S Y
Sbjct: 216 CKMHSAGSDVDKPNPRTLECGFQACSNLGALKEGRCLHGFAVKNGLASSKFVQSSMFSFY 275

Query: 271 SRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVIS 330
           S+ G+P EAY  F +L  +D+ SWTSIIA  ++ G M E   +FWEMQ  G+ PD +VIS
Sbjct: 276 SKSGNPSEAYLSFRELGDEDMFSWTSIIASLARSGDMEESFDMFWEMQNKGMHPDGVVIS 335

Query: 331 CMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIF--HSFH 390
           C++   G    + +GKA H ++++ C ++     N+LLSMYCKF LL +A+K+F   S  
Sbjct: 336 CLINELGKMMLVPQGKAFHGFVIRHCFSLDSTVCNSLLSMYCKFELLSVAEKLFCRISEE 395

Query: 391 KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSV 450
            + E WNTM+ GY  M    KCI+ FR++  LGIE D  S  SVISSC  +GAV +G+S+
Sbjct: 396 GNKEAWNTMLKGYGKMKCHVKCIELFRKIQNLGIEIDSASATSVISSCSHIGAVLLGKSL 455

Query: 451 HCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHP 510
           HCY +K S+   +S+ NSL+D+YGK G+LT AWR+F      ++++WN +I+SY      
Sbjct: 456 HCYVVKTSLDLTISVVNSLIDLYGKMGDLTVAWRMFCEA-DTNVITWNAMIASYVHCEQS 515

Query: 511 SEAIDLFDKMIKEKFNPNGVTCVIVLIS 534
            +AI LFD+M+ E F P+ +T V +L++
Sbjct: 516 EKAIALFDRMVSENFKPSSITLVTLLMA 542

BLAST of Cp4.1LG16g01900 vs. TAIR 10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 253.1 bits (645), Expect = 6.4e-67
Identity = 152/505 (30.10%), Positives = 275/505 (54.46%), Query Frame = 0

Query: 29  SKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHP-KDKFL 88
           S  NL    L + H+L+I+ G  ++ FF+ KL+  Y+   +PA S  +FR V P K+ +L
Sbjct: 16  SSSNL--NELRRIHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVSPAKNVYL 75

Query: 89  WNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLAL 148
           WNSII++   NG + +A +FY ++R S   P+++T P V+  CA L     G  ++   L
Sbjct: 76  WNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQIL 135

Query: 149 KLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGL 208
            +G F  +  VG++L+ MYS+ G    A  +F+E+ V+D+V+W +LI GY  +   E+ L
Sbjct: 136 DMG-FESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEAL 195

Query: 209 KCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSSILSMY 268
           +   E+  +   P+  T+     A  +L  + +G+ LHG ALKSG     VV + +++MY
Sbjct: 196 EIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMY 255

Query: 269 SRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVIS 328
            +   P +A R F +++ +D +S+ ++I  + KL ++ E + +F E       PD + +S
Sbjct: 256 LKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLE-NLDQFKPDLLTVS 315

Query: 329 CMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFHSFH-K 388
            +L   G+   +S  K ++ ++LK    +     N L+ +Y K G +  A  +F+S   K
Sbjct: 316 SVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSMECK 375

Query: 389 SSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNIGRSVH 448
            +  WN++I GY   G+  + +  F+ M ++  + D  + + +IS    +  +  G+ +H
Sbjct: 376 DTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLH 435

Query: 449 CYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPS 508
              IK+ I  ++S++N+L+DMY K G +  + +IF      D V+WNT+IS+  + G  +
Sbjct: 436 SNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFA 495

Query: 509 EAIDLFDKMIKEKFNPNGVTCVIVL 532
             + +  +M K +  P+  T ++ L
Sbjct: 496 TGLQVTTQMRKSEVVPDMATFLVTL 516

BLAST of Cp4.1LG16g01900 vs. TAIR 10
Match: AT2G03380.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 251.9 bits (642), Expect = 1.4e-66
Identity = 149/512 (29.10%), Positives = 262/512 (51.17%), Query Frame = 0

Query: 23  SSLFFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLMAFYACHGQPAFSTQLFRFVHP 82
           S  F    +     SL Q H ++   G   +   ATKL++ Y   G    +  +F  +  
Sbjct: 45  SPCFLLLSKCTNIDSLRQSHGVLTGNGLMGDISIATKLVSLYGFFGYTKDARLVFDQIPE 104

Query: 83  KDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNI 142
            D +LW  +++ +  N + ++    Y  +       +       +  C EL  L++G  I
Sbjct: 105 PDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQDLDNGKKI 164

Query: 143 HGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNE 202
           H   +K+  F  ++ V + L+ MY+KCG  +SA  +FN+IT+++VV WT++I GYV+N+ 
Sbjct: 165 HCQLVKVPSF--DNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGYVKNDL 224

Query: 203 SEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEVVKSS 262
            E+GL     M  N    N  T G    AC  L AL +G+  HG  +KSG      + +S
Sbjct: 225 CEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSGIELSSCLVTS 284

Query: 263 ILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPD 322
           +L MY +CG    A R F +    DL+ WT++I  ++  G ++E L LF +M+   I P+
Sbjct: 285 LLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGVEIKPN 344

Query: 323 DIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMADKIFH 382
            + I+ +L G G  + +  G+++H   +K     + +  NAL+ MY K    R A  +F 
Sbjct: 345 CVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVA-NALVHMYAKCYQNRDAKYVFE 404

Query: 383 -SFHKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVGAVNI 442
               K    WN++I G+S  G   + +  F  M+   + P+  ++ S+ S+C  +G++ +
Sbjct: 405 MESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLGSLAV 464

Query: 443 GRSVHCYAIKNSII--DNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSY 502
           G S+H Y++K   +   +V +  +LLD Y K G+  +A  IF   ++K+ ++W+ +I  Y
Sbjct: 465 GSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIEEKNTITWSAMIGGY 524

Query: 503 KQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVL 532
            + G    +++LF++M+K++  PN  T   +L
Sbjct: 525 GKQGDTIGSLELFEEMLKKQQKPNESTFTSIL 553

BLAST of Cp4.1LG16g01900 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 248.4 bits (633), Expect = 1.6e-65
Identity = 153/533 (28.71%), Positives = 263/533 (49.34%), Query Frame = 0

Query: 1   NFANALPSLPHLAHFQIPTTLLSSLFFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKL 60
           N  NA+  L     + I    L S+      + + +   +  + I   G   ++   +KL
Sbjct: 76  NLENAVKLLCVSGKWDIDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKL 135

Query: 61  MAFYACHGQPAFSTQLFRFVHPKDKFLWNSIIQSHFSNGDYLQAFDFYLEMRASSSLPNQ 120
              Y   G    ++++F  V  +    WN ++     +GD+  +   + +M +S    + 
Sbjct: 136 SLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDS 195

Query: 121 FTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFN 180
           +T   V  + + L  ++ G  +HG  LK G F   ++VG+SL+  Y K    +SA  +F+
Sbjct: 196 YTFSCVSKSFSSLRSVHGGEQLHGFILKSG-FGERNSVGNSLVAFYLKNQRVDSARKVFD 255

Query: 181 EITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVE 240
           E+T +DV++W ++I GYV N  +EKGL    +M  +G   +  TI   F  C D   +  
Sbjct: 256 EMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISL 315

Query: 241 GRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSK 300
           GR +H + +K+ F   +   +++L MYS+CG  + A   F ++  + ++S+TS+IA +++
Sbjct: 316 GRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAR 375

Query: 301 LGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGIT 360
            GL  E + LF EM+  GI PD   ++ +L     +  + EGK +H WI +         
Sbjct: 376 EGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFV 435

Query: 361 HNALLSMYCKFGLLRMADKIFHSFH-KSSEDWNTMILGYSNMGEKEKCIDFFR-EMHLLG 420
            NAL+ MY K G ++ A+ +F     K    WNT+I GYS      + +  F   +    
Sbjct: 436 SNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKR 495

Query: 421 IEPDLNSLVSVISSCLPVGAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAW 480
             PD  ++  V+ +C  + A + GR +H Y ++N    +  +ANSL+DMY K G L  A 
Sbjct: 496 FSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAH 555

Query: 481 RIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVL 532
            +F     KD+VSW  +I+ Y   G   EAI LF++M +     + ++ V +L
Sbjct: 556 MLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLL 607

BLAST of Cp4.1LG16g01900 vs. TAIR 10
Match: AT5G27110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 243.4 bits (620), Expect = 5.1e-64
Identity = 175/670 (26.12%), Positives = 313/670 (46.72%), Query Frame = 0

Query: 21  LLSSLFFFSKQNLTFQSLLQFHSLIITTGNSNNAFFATKLM-AFYACHGQPAFSTQLFRF 80
           LLS L   +    + + +   H  I+T G   +      L+  ++ C    +       F
Sbjct: 6   LLSLLRECTNSTKSLRRIKLVHQRILTLGLRRDVVLCKSLINVYFTCKDHCSARHVFENF 65

Query: 81  VHPKDKFLWNSIIQSHFSNGDYLQAFDFYLE-MRASSSLPNQFTIPMVVSTCAELMMLNH 140
               D ++WNS++  +  N  +    + +   +  S  +P+ FT P V+     L     
Sbjct: 66  DIRSDVYIWNSLMSGYSKNSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREFL 125

Query: 141 GMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYV 200
           G  IH L +K G +V +  V SSL+ MY+K    E++  +F+E+  +DV +W  +I  + 
Sbjct: 126 GRMIHTLVVKSG-YVCDVVVASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFY 185

Query: 201 QNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLEALVEGRCLHGLALKSGFLCFEV 260
           Q+ E+EK L+    M  +G  PN  ++     AC  L  L  G+ +H   +K GF   E 
Sbjct: 186 QSGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEY 245

Query: 261 VKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASG 320
           V S+++ MY +C   E A   F K+ +K L++W S+I  +   G    C+ +   M   G
Sbjct: 246 VNSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMIIEG 305

Query: 321 IIPDDIVISCMLLGFGNFDRISEGKALHAWILKQCCAMSGITHNALLSMYCKFGLLRMAD 380
             P    ++ +L+       +  GK +H ++++         + +L+ +Y K G   +A+
Sbjct: 306 TRPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAE 365

Query: 381 KIFHSFHKS-SEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVG 440
            +F    K  +E WN MI  Y ++G   K ++ + +M  +G++PD+ +  SV+ +C  + 
Sbjct: 366 TVFSKTQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSVGVKPDVVTFTSVLPACSQLA 425

Query: 441 AVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLIS 500
           A+  G+ +H    ++ +  +  + ++LLDMY K GN   A+RIF+   +KD+VSW  +IS
Sbjct: 426 ALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEAFRIFNSIPKKDVVSWTVMIS 485

Query: 501 SYKQSGHPSEAIDLFDKMIKEKFNPNGVTCVIVLIS-----------TFDSLPYMEERIE 560
           +Y   G P EA+  FD+M K    P+GVT + VL +            F S    +  IE
Sbjct: 486 AYGSHGQPREALYQFDEMQKFGLKPDGVTLLAVLSACGHAGLIDEGLKFFSQMRSKYGIE 545

Query: 561 ELTEK-TFATDPNGLHTLYVKSFEIFVTNPKEVISDDVVYATFELTRIDIE-----KVRR 620
            + E  +   D  G     ++++EI    P+   + +++   F    + +E     ++ R
Sbjct: 546 PIIEHYSCMIDILGRAGRLLEAYEIIQQTPETSDNAELLSTLFSACCLHLEHSLGDRIAR 605

Query: 621 RVVATSSSTPRRLTTFMLAFSLVSTCILLYGVFSMAESRNGDGGVELGIALPP-----QA 666
            +V    + P   +T+M+ F+L ++     G    A  R      E+G+   P     + 
Sbjct: 606 LLV---ENYPDDASTYMVLFNLYAS-----GESWDAARRVRLKMKEMGLRKKPGCSWIEM 665

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
Q3E9N1          1.7e-133   47.64     Pentatricopeptide repeat-containing protein At4g39952, mitochondrial OS=Arabidop...
Q9SS60          9.0e-66    30.10     Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX...
Q9ZQ74          2.0e-65    29.10     Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop...
Q9SN39          2.2e-64    28.71     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
O04659          7.1e-63    26.12     Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX...

Match Name      E-value    Identity  Description
XP_023513090.1  0.0        94.18     LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mito...
XP_022986468.1  0.0        91.82     LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mito...
XP_022944467.1  0.0        91.82     LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mito...
KAG6571177.1    0.0        92.04     Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
KAG7010985.1    0.0        91.84     Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...

Match Name      E-value    Identity  Description
A0A6J1JGK6      0.0        91.82     LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mito...
A0A6J1FVR1      0.0        91.82     LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mito...
A0A1S4E2D5      1.91e-311  77.00     pentatricopeptide repeat-containing protein At4g39952, mitochondrial isoform X2 ...
A0A0A0LRH3      2.70e-311  79.05     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G439160 PE=4 SV=1
A0A1S3C9K9      9.35e-311  77.00     pentatricopeptide repeat-containing protein At4g39952, mitochondrial isoform X1 ...

Match Name      E-value    Identity  Description
AT4G39952.1     1.2e-134   47.64     Pentatricopeptide repeat (PPR) superfamily protein
AT3G03580.1     6.4e-67    30.10     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G03380.1     1.4e-66    29.10     Pentatricopeptide repeat (PPR) superfamily protein
AT4G18750.1     1.6e-65    28.71     Pentatricopeptide repeat (PPR) superfamily protein
AT5G27110.1     5.1e-64    26.12     Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 147..242  e-value: 2.1E-14  score: 55.2
    coord: 13..143  e-value: 7.0E-8  score: 34.0
    coord: 445..558  e-value: 2.2E-20  score: 74.8
    coord: 243..337  e-value: 6.2E-15  score: 57.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 338..444  e-value: 3.9E-15  score: 58.0
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 487..531  e-value: 8.8E-11  score: 41.8
    coord: 185..225  e-value: 3.9E-7  score: 30.2
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 87..119  e-value: 1.3E-4  score: 19.9
    coord: 391..422  e-value: 2.9E-6  score: 25.1
    coord: 359..387  e-value: 0.0026  score: 15.8
    coord: 490..523  e-value: 1.2E-8  score: 32.6
    coord: 289..322  e-value: 1.7E-4  score: 19.6
    coord: 188..221  e-value: 3.6E-7  score: 28.0
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 359..384  e-value: 0.0018  score: 18.4
    coord: 87..113  e-value: 0.0028  score: 17.8
    coord: 289..319  e-value: 2.2E-5  score: 24.4
    coord: 261..287  e-value: 0.041  score: 14.1
    coord: 391..419  e-value: 8.9E-5  score: 22.5
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 488..522  score: 12.813817
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 84..118  score: 9.437737
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 186..220  score: 10.709248
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 287..321  score: 10.599635
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 387..421  score: 10.544828
None  No IPR available  PANTHER  PTHR47924  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 20..221
    coord: 204..531
None  No IPR available  PANTHER  PTHR47924:SF18  BNAC05G27170D PROTEIN
    coord: 204..531
None  No IPR available  PANTHER  PTHR47924:SF18  BNAC05G27170D PROTEIN
    coord: 20..221
    coord: 155..362
None  No IPR available  PANTHER  PTHR47924  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 155..362

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG16g01900.1  Cp4.1LG16g01900.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding