CmUC01G014220 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC01G014220
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Pentatricopeptide repeat-containing protein
Location: CmU531Chr01: 27673790 .. 27675565 (-)
RNA-Seq Expression: CmUC01G014220
Synteny: CmUC01G014220
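
The Location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the GFF3 convention; this page does not state it), and the variable names are illustrative only.

```python
# Coordinates copied from the Location field above.
# Assumption: 1-based, inclusive start and end, as in GFF3.
start, end, strand = 27673790, 27675565, "-"

span_nt = end - start + 1               # inclusive span on the chromosome
codons, remainder = divmod(span_nt, 3)  # whole codons, if the span is entirely coding

print(span_nt)    # 1776
print(codons)     # 592
print(remainder)  # 0
```

Under that assumption the span is 1776 nt, which is 592 codons. That is consistent with a 591-residue protein plus one stop codon, and with the alignment lengths of 591 reported for the full-length BLAST hits.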
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTCTTTATTTTCATCTCGTTCGCCCATTCATTTTTAATTCAAAATCCACCAAATTACAACATTCAATAGCTTTGAGAATTTCTCACAAGTCTTTCATTTCGAAATCGGACAACTCCTCGGTGAAGCTAGAAGATTTCTATGCCAGTCTTTTGAATCGGTGTGTTCAGACCTCCGATTCCCGCCATGGATCTGCAATTCATGCGAAGTTCCTCAAAGGGTTTCTTCCATTTTCTCTTTTCTTCCACAATCATGTACTTAATTTGTATGTCAAATGTGGCGGTCTATCATATGGCCTGCAACTGTTCGACGAAATGCCTGAGAGAAATGTTGTGTCCTGGTCTGCAATCATTGCTGGGTTCGTTCAACATGGACGACCCAACGAAGCTCTCTCTCTATTTGGCCGTATGCATTGCGATGGCACGATAATGCCCAACGAATTCACCCTTGTAAGTGCCCTCCATGCTTGTTCTTTAACTCAGAGGCTGATATGTTCAAACCAAATTTATTCATTAATTGTTCGCTTAGGATATGGGTCGAATGTTTTCCTCATGAATGCGTTCTTAACTACTTTAATTAGGCATGATAAATTGCTAGAGGCTTTAGAAGTTTTCGAGAGTTTTTCATCCAAAGACACTGTATCTTGGAATGCCATGATGGCTGGTTATTTGCAATTAGCATATTTTGAACTGCCGAAGTTTTGGCACAGGATGAATCTCGAGGGCGTTAAGCCTGATAATTTTACATTTGCTAGTATCTTAACTGGGTTGGCTGCTCTCTCTGAATTTAGGCTGGGATTGCAAGTTCATGGACAGCTTGTGAAAGCTGGATATGGGAATGATATTTGTGTAGGGAATTCCTTGTGTGATATGTACATTAAGAACCAGAAGTTGTTAGACGGTTTTAAAGCTTTTGATGAAATGCCTTCAAGTGATGTGTGTTCTTGGACCCAGATGGCTGCAGGTTGCCTCCAGTGTGGGGAACCAATGAAAGCACTCGAGGTTGTTTACGAGATGAAAAATGTTGGCGTGAGACTAAATAAGTTCACCCTTGCAACTGCCTTGAATGCTTGTGCCAATTTGGCCTCCATGGAAGAAGGAAAGAAATTCCATGGATTGAGAATTAAACTTGGAACTGATATTGATGTTTGTGTTGATAATGCTCTACTTGATATGTATGCAAAATGTGGATGTATGGCCAGTGCAAATGTCGTCTTTCGTTCAATGGATGAACGATCTGTCGTCTCGTGGACTACGATGATTATGGGATTTGCACATAATGGCCAAGCAAGAGAAGCCCTTCAAATCTTTGATGAAATGAGAAAAGGGGAAGCTGAACCTAACCACATCACTTTTATTTGTGTTCTTTATGCCTGTAGCCAAGGAGGTTTTGTTGATGAAGCATGGAAATACTTCTCTTCCATGAGTGCTGACCATGGGATTTCACCTTCAGAAGATCATTATGTGTGTATGGTGAATCTATTAGGCCGAGCTGGATGTATAAAAGAAGCCGAGGATTTGATCCTACAAATGCCGTTTCGACCTGGTTCGTTGGTCTGGCAAACATTGCTCGGTGCTTGCTTAGTTCATGGCGACTTAGAGACAGGAAAACGAGCAGCCGAGCATGCGTTGAATTTGGATCGAAATGATCCATCGACTTACGTTTTGTTATCAAACATGCTTGCTGGTGGTAATAACTGGGACAGTGTTGGAAGTTTGAGAGAACTAATGGAAACTAGAGATGTAAAGAAAGTACCTGGATCAAGTTGGATGTAA

mRNA sequence

ATGCCTCTTTATTTTCATCTCGTTCGCCCATTCATTTTTAATTCAAAATCCACCAAATTACAACATTCAATAGCTTTGAGAATTTCTCACAAGTCTTTCATTTCGAAATCGGACAACTCCTCGGTGAAGCTAGAAGATTTCTATGCCAGTCTTTTGAATCGGTGTGTTCAGACCTCCGATTCCCGCCATGGATCTGCAATTCATGCGAAGTTCCTCAAAGGGTTTCTTCCATTTTCTCTTTTCTTCCACAATCATGTACTTAATTTGTATGTCAAATGTGGCGGTCTATCATATGGCCTGCAACTGTTCGACGAAATGCCTGAGAGAAATGTTGTGTCCTGGTCTGCAATCATTGCTGGGTTCGTTCAACATGGACGACCCAACGAAGCTCTCTCTCTATTTGGCCGTATGCATTGCGATGGCACGATAATGCCCAACGAATTCACCCTTGTAAGTGCCCTCCATGCTTGTTCTTTAACTCAGAGGCTGATATGTTCAAACCAAATTTATTCATTAATTGTTCGCTTAGGATATGGGTCGAATGTTTTCCTCATGAATGCGTTCTTAACTACTTTAATTAGGCATGATAAATTGCTAGAGGCTTTAGAAGTTTTCGAGAGTTTTTCATCCAAAGACACTGTATCTTGGAATGCCATGATGGCTGGTTATTTGCAATTAGCATATTTTGAACTGCCGAAGTTTTGGCACAGGATGAATCTCGAGGGCGTTAAGCCTGATAATTTTACATTTGCTAGTATCTTAACTGGGTTGGCTGCTCTCTCTGAATTTAGGCTGGGATTGCAAGTTCATGGACAGCTTGTGAAAGCTGGATATGGGAATGATATTTGTGTAGGGAATTCCTTGTGTGATATGTACATTAAGAACCAGAAGTTGTTAGACGGTTTTAAAGCTTTTGATGAAATGCCTTCAAGTGATGTGTGTTCTTGGACCCAGATGGCTGCAGGTTGCCTCCAGTGTGGGGAACCAATGAAAGCACTCGAGGTTGTTTACGAGATGAAAAATGTTGGCGTGAGACTAAATAAGTTCACCCTTGCAACTGCCTTGAATGCTTGTGCCAATTTGGCCTCCATGGAAGAAGGAAAGAAATTCCATGGATTGAGAATTAAACTTGGAACTGATATTGATGTTTGTGTTGATAATGCTCTACTTGATATGTATGCAAAATGTGGATGTATGGCCAGTGCAAATGTCGTCTTTCGTTCAATGGATGAACGATCTGTCGTCTCGTGGACTACGATGATTATGGGATTTGCACATAATGGCCAAGCAAGAGAAGCCCTTCAAATCTTTGATGAAATGAGAAAAGGGGAAGCTGAACCTAACCACATCACTTTTATTTGTGTTCTTTATGCCTGTAGCCAAGGAGGTTTTGTTGATGAAGCATGGAAATACTTCTCTTCCATGAGTGCTGACCATGGGATTTCACCTTCAGAAGATCATTATGTGTGTATGGTGAATCTATTAGGCCGAGCTGGATGTATAAAAGAAGCCGAGGATTTGATCCTACAAATGCCGTTTCGACCTGGTTCGTTGGTCTGGCAAACATTGCTCGGTGCTTGCTTAGTTCATGGCGACTTAGAGACAGGAAAACGAGCAGCCGAGCATGCGTTGAATTTGGATCGAAATGATCCATCGACTTACGTTTTGTTATCAAACATGCTTGCTGGTGGTAATAACTGGGACAGTGTTGGAAGTTTGAGAGAACTAATGGAAACTAGAGATGTAAAGAAAGTACCTGGATCAAGTTGGATGTAA

Coding sequence (CDS)

ATGCCTCTTTATTTTCATCTCGTTCGCCCATTCATTTTTAATTCAAAATCCACCAAATTACAACATTCAATAGCTTTGAGAATTTCTCACAAGTCTTTCATTTCGAAATCGGACAACTCCTCGGTGAAGCTAGAAGATTTCTATGCCAGTCTTTTGAATCGGTGTGTTCAGACCTCCGATTCCCGCCATGGATCTGCAATTCATGCGAAGTTCCTCAAAGGGTTTCTTCCATTTTCTCTTTTCTTCCACAATCATGTACTTAATTTGTATGTCAAATGTGGCGGTCTATCATATGGCCTGCAACTGTTCGACGAAATGCCTGAGAGAAATGTTGTGTCCTGGTCTGCAATCATTGCTGGGTTCGTTCAACATGGACGACCCAACGAAGCTCTCTCTCTATTTGGCCGTATGCATTGCGATGGCACGATAATGCCCAACGAATTCACCCTTGTAAGTGCCCTCCATGCTTGTTCTTTAACTCAGAGGCTGATATGTTCAAACCAAATTTATTCATTAATTGTTCGCTTAGGATATGGGTCGAATGTTTTCCTCATGAATGCGTTCTTAACTACTTTAATTAGGCATGATAAATTGCTAGAGGCTTTAGAAGTTTTCGAGAGTTTTTCATCCAAAGACACTGTATCTTGGAATGCCATGATGGCTGGTTATTTGCAATTAGCATATTTTGAACTGCCGAAGTTTTGGCACAGGATGAATCTCGAGGGCGTTAAGCCTGATAATTTTACATTTGCTAGTATCTTAACTGGGTTGGCTGCTCTCTCTGAATTTAGGCTGGGATTGCAAGTTCATGGACAGCTTGTGAAAGCTGGATATGGGAATGATATTTGTGTAGGGAATTCCTTGTGTGATATGTACATTAAGAACCAGAAGTTGTTAGACGGTTTTAAAGCTTTTGATGAAATGCCTTCAAGTGATGTGTGTTCTTGGACCCAGATGGCTGCAGGTTGCCTCCAGTGTGGGGAACCAATGAAAGCACTCGAGGTTGTTTACGAGATGAAAAATGTTGGCGTGAGACTAAATAAGTTCACCCTTGCAACTGCCTTGAATGCTTGTGCCAATTTGGCCTCCATGGAAGAAGGAAAGAAATTCCATGGATTGAGAATTAAACTTGGAACTGATATTGATGTTTGTGTTGATAATGCTCTACTTGATATGTATGCAAAATGTGGATGTATGGCCAGTGCAAATGTCGTCTTTCGTTCAATGGATGAACGATCTGTCGTCTCGTGGACTACGATGATTATGGGATTTGCACATAATGGCCAAGCAAGAGAAGCCCTTCAAATCTTTGATGAAATGAGAAAAGGGGAAGCTGAACCTAACCACATCACTTTTATTTGTGTTCTTTATGCCTGTAGCCAAGGAGGTTTTGTTGATGAAGCATGGAAATACTTCTCTTCCATGAGTGCTGACCATGGGATTTCACCTTCAGAAGATCATTATGTGTGTATGGTGAATCTATTAGGCCGAGCTGGATGTATAAAAGAAGCCGAGGATTTGATCCTACAAATGCCGTTTCGACCTGGTTCGTTGGTCTGGCAAACATTGCTCGGTGCTTGCTTAGTTCATGGCGACTTAGAGACAGGAAAACGAGCAGCCGAGCATGCGTTGAATTTGGATCGAAATGATCCATCGACTTACGTTTTGTTATCAAACATGCTTGCTGGTGGTAATAACTGGGACAGTGTTGGAAGTTTGAGAGAACTAATGGAAACTAGAGATGTAAAGAAAGTACCTGGATCAAGTTGGATGTAA

Protein sequence

MPLYFHLVRPFIFNSKSTKLQHSIALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNLEGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNKFTLATALNACANLASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTMIMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM
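
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch using the standard genetic code (NCBI translation table 1); the function name `translate` and the two short fragments are illustrative, with the fragments copied from the first 30 and last 24 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with unknown bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCCTCTTTATTTTCATCTCGTTCGCCCA"  # first 30 nt of the CDS
cds_end = "GTACCTGGATCAAGTTGGATGTAA"          # last 24 nt of the CDS, in frame

print(translate(cds_start))  # MPLYFHLVRP
print(translate(cds_end))    # VPGSSWM
```

The two outputs match the first ten and last seven residues of the protein sequence above. Passing the full CDS string to `translate` should reproduce the whole protein in the same way.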
Homology
BLAST of CmUC01G014220 vs. NCBI nr
Match: XP_038894950.1 (pentatricopeptide repeat-containing protein At4g33170-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1149.4 bits (2972), Expect = 0.0e+00
Identity = 555/591 (93.91%), Positives = 572/591 (96.79%), Query Frame = 0

Query: 1   MPLYFHLVRPFIFNSKSTKLQHSIALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSD 60
           MPL FHLVRP  FNSKSTK+QHSIALRISHKSFISKS+NSS +LEDFY SLL+RCVQT+D
Sbjct: 1   MPLCFHLVRPLFFNSKSTKIQHSIALRISHKSFISKSENSSGELEDFYVSLLHRCVQTTD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCG LSYGLQLFDEMPERNVVSWSAII G
Sbjct: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGRLSYGLQLFDEMPERNVVSWSAIIVG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGS 180
           FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICS QIY+LIVRLGYGS
Sbjct: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYALIVRLGYGS 180

Query: 181 NVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNL 240
           NVFLMNAFLT LIRH+KLLEALEVFES SSKDTVSWNAMMAGYLQLAYFELPKFW RMNL
Sbjct: 181 NVFLMNAFLTALIRHEKLLEALEVFESCSSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240

Query: 241 EGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLD 300
           EGVKPDNFTFAS+ TGLAALSEFRLGLQVHGQLVK GYG+DICVGNSLCDMYIKNQKLLD
Sbjct: 241 EGVKPDNFTFASVFTGLAALSEFRLGLQVHGQLVKCGYGSDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNKFTLATALNACAN 360
           GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALE++YEMKNVG+ LNKFTLATALNACAN
Sbjct: 301 GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEIIYEMKNVGLSLNKFTLATALNACAN 360

Query: 361 LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTM 420
           LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCM SANVVFRSMDERSVVSWTTM
Sbjct: 361 LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420

Query: 421 IMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHG 480
           IMGFAHN Q +EALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSAD+G
Sbjct: 421 IMGFAHNSQTKEALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADYG 480

Query: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAA 540
           ISPSEDHYVCMVNLLGRAGCIKEAEDLIL+MPFRPGSLVWQTLLGACLVHGDLETGKRAA
Sbjct: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLILRMPFRPGSLVWQTLLGACLVHGDLETGKRAA 540

Query: 541 EHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM 591
           EHALNLDRNDPSTYVLLSNM AGG+NWD+V SLRELMETRDVKKVPGSSWM
Sbjct: 541 EHALNLDRNDPSTYVLLSNMFAGGSNWDNVRSLRELMETRDVKKVPGSSWM 591
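
The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them for the hit above. The regular expression and the helper name `recompute` are illustrative and not part of any BLAST tool, and the second label is matched leniently so that spelling variants of it are accepted.

```python
import re

# Summary line for the first NCBI nr hit (XP_038894950.1).
hsp_summary = (
    "Identity = 555/591 (93.91%), Positives = 572/591 (96.79%), Query Frame = 0"
)

# Capture "<label> = <matches>/<alignment length>" pairs.
PATTERN = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")


def recompute(line: str) -> dict:
    """Return the percentage implied by each matches/length pair in the line."""
    result = {}
    for label, matches, length in PATTERN.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        result[key] = round(100 * int(matches) / int(length), 2)
    return result


print(recompute(hsp_summary))  # {'Identity': 93.91, 'Positives': 96.79}
```

The recomputed values agree with the percentages printed in the summary line. The same function can be applied to the summary lines of the other hits.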

BLAST of CmUC01G014220 vs. NCBI nr
Match: XP_008467246.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo] >XP_016903099.1 PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo] >XP_016903114.1 PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo])

HSP 1 Score: 1138.3 bits (2943), Expect = 0.0e+00
Identity = 550/591 (93.06%), Positives = 567/591 (95.94%), Query Frame = 0

Query: 1   MPLYFHLVRPFIFNSKSTKLQHSIALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSD 60
           MPL FHL RP I  SKST LQ SIALRISHKSFISKS++SSVKLEDFY S L RCVQTSD
Sbjct: 1   MPLCFHLARPLILISKSTDLQKSIALRISHKSFISKSEDSSVKLEDFYVSFLQRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLPFSLFFHNHVLNLY+KCG LSYGLQLFDEMPERNVVSWSAIIAG
Sbjct: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYLKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGS 180
           FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICS QIY+ IVRLGYGS
Sbjct: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180

Query: 181 NVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNL 240
           NVFLMNAFLT LIRH+KLLEALEVFES  SKDTVSWNAMMAGYLQLAYFELPKFW RMNL
Sbjct: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240

Query: 241 EGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLD 300
           E VKPDNFTFASILTGLAALSEFRLGLQVHGQLVK+GYGNDICVGNSLCDMYIKNQKLLD
Sbjct: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNKFTLATALNACAN 360
           GFKAFDEM SSDVCSWTQMA+GCLQCGEPMKALEV+YEMKNVGVRLNKFTLATALN+CAN
Sbjct: 301 GFKAFDEMSSSDVCSWTQMASGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360

Query: 361 LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTM 420
           LAS+EEGKKFHGLRIKLGTD+DVCVDNALLDMYAKCGCM SANVVFRSMDERSVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420

Query: 421 IMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHG 480
           IMGFAHNGQ +EALQIFDEMRKGEAEPNHITFICVL ACSQGGF+DEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAA 540
           I+PSEDHYVCMVNLLGRAGCIKEAEDLILQMPF+PGSLVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540

Query: 541 EHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM 591
           EHALNLDRNDPSTY+LLSNM AGGNNWD VGSLRELMETRDVKKVPGSSWM
Sbjct: 541 EHALNLDRNDPSTYILLSNMFAGGNNWDGVGSLRELMETRDVKKVPGSSWM 591

BLAST of CmUC01G014220 vs. NCBI nr
Match: XP_011650978.1 (putative pentatricopeptide repeat-containing protein At3g15130 [Cucumis sativus] >KGN64206.1 hypothetical protein Csa_013410 [Cucumis sativus])

HSP 1 Score: 1130.2 bits (2922), Expect = 0.0e+00
Identity = 546/591 (92.39%), Positives = 563/591 (95.26%), Query Frame = 0

Query: 1   MPLYFHLVRPFIFNSKSTKLQHSIALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSD 60
           MPL FHL RP    SKST LQ SIALRIS KSF+SKS+NSSVKLEDFY S L RCV TSD
Sbjct: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLPFSLFFHNHVLN YVKCG LSYGLQLFDEMPERNVVSWSAIIAG
Sbjct: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGS 180
           FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICS QIY+ IVRLGYGS
Sbjct: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180

Query: 181 NVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNL 240
           NVFLMNAFLT LIRH+KLLEALEVFES  SKDTVSWNAMMAGYLQLAYFELPKFW RMNL
Sbjct: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240

Query: 241 EGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLD 300
           E VKPDNFTFASILTGLAALSEFRLGLQVHGQLVK+GYGNDICVGNSLCDMY+KNQKLLD
Sbjct: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300

Query: 301 GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNKFTLATALNACAN 360
           GFKAFDEM SSDVCSWTQMAAGCLQCGEPMKALEV+YEMKNVGVRLNKFTLATALN+CAN
Sbjct: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360

Query: 361 LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTM 420
           LAS+EEGKKFHGLRIKLGTD+DVCVDNALLDMYAKCGCM SANVVFRSMDERSVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420

Query: 421 IMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHG 480
           IMGFAHNGQ +EALQIFDEMRKGEAEPNHITFICVL ACSQGGF+DEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAA 540
           I+PSEDHYVCMVNLLGRAGCIKEAEDLILQMPF+PGSLVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540

Query: 541 EHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM 591
           EHALNLDRNDPSTY+LLSNM AGG+NWDSVG LRELMETRDVKKVPGSSWM
Sbjct: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWM 591

BLAST of CmUC01G014220 vs. NCBI nr
Match: TYJ99062.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1108.2 bits (2865), Expect = 0.0e+00
Identity = 532/567 (93.83%), Positives = 550/567 (97.00%), Query Frame = 0

Query: 25  ALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHN 84
           +LRISHKSFISKS++SSVKLEDFY S L RCVQTSDSRHGSAIHAKFLKGFLPFSLFFHN
Sbjct: 152 SLRISHKSFISKSEDSSVKLEDFYVSFLQRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHN 211

Query: 85  HVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIM 144
           HVLNLY+KCG LSYGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIM
Sbjct: 212 HVLNLYLKCGRLSYGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIM 271

Query: 145 PNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEV 204
           PNEFTLVSALHACSLTQRLICS QIY+ IVRLGYGSNVFLMNAFLT LIRH+KLLEALEV
Sbjct: 272 PNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEV 331

Query: 205 FESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNLEGVKPDNFTFASILTGLAALSEFR 264
           FES  SKDTVSWNAMMAGYLQLAYFELPKFW RMNLE VKPDNFTFASILTGLAALSEFR
Sbjct: 332 FESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFR 391

Query: 265 LGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCL 324
           LGLQVHGQLVK+GYGNDICVGNSLCDMYIKNQKLLDGFKAFDEM SSDVCSWTQMA+GCL
Sbjct: 392 LGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLLDGFKAFDEMSSSDVCSWTQMASGCL 451

Query: 325 QCGEPMKALEVVYEMKNVGVRLNKFTLATALNACANLASMEEGKKFHGLRIKLGTDIDVC 384
           QCGEPMKALEV+YEMKNVGVRLNKFTLATALN+CANLAS+EEGKKFHGLRIKLGTD+DVC
Sbjct: 452 QCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVC 511

Query: 385 VDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTMIMGFAHNGQAREALQIFDEMRKGE 444
           VDNALLDMYAKCGCM SANVVFRSMDERSVVSWTTMIMGFAHNGQ +EALQIFDEMRKGE
Sbjct: 512 VDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGE 571

Query: 445 AEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEA 504
           AEPNHITFICVL ACSQGGF+DEAWKYFSSMSADHGI+PSEDHYVCMVNLLGRAGCIKEA
Sbjct: 572 AEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEA 631

Query: 505 EDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGG 564
           EDLILQMPF+PGSLVWQTLLGACLVHGD+ETGKRAAEHALNLDRNDPSTY+LLSNM AGG
Sbjct: 632 EDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGG 691

Query: 565 NNWDSVGSLRELMETRDVKKVPGSSWM 591
           NNWD VGSLRELMETRDVKKVPGSSWM
Sbjct: 692 NNWDGVGSLRELMETRDVKKVPGSSWM 718

BLAST of CmUC01G014220 vs. NCBI nr
Match: KAG7012171.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1103.2 bits (2852), Expect = 0.0e+00
Identity = 532/591 (90.02%), Positives = 562/591 (95.09%), Query Frame = 0

Query: 1   MPLYFHLVRPFIFNSKSTKLQHSIALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSD 60
           MPL  H+VRP IF SK TK +HSIALRISHKSFISKS+ SSVKLEDFY +LL+RCVQTSD
Sbjct: 1   MPL--HIVRPLIFVSKLTKSRHSIALRISHKSFISKSEISSVKLEDFYVNLLHRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLP+SLFFHNHVLN YVKCG LS GLQLFDEMPERNVVSWSA+IAG
Sbjct: 61  SRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLSCGLQLFDEMPERNVVSWSAVIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGS 180
           FVQHGRPNEALSLF RMHCDGTI+PNEFTLVSALHACSLTQRLICS QIY+L++RLGYGS
Sbjct: 121 FVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQIYALVLRLGYGS 180

Query: 181 NVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNL 240
           N+FLMNAFLT LIRH+KLL+ALEVFES SSKD VSWNAMMAGYLQLAY ELPKFW RMNL
Sbjct: 181 NIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQLAYLELPKFWRRMNL 240

Query: 241 EGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLD 300
           E +KPDNFTFASILTGLAALSEF+LGLQVHGQLVK+GYGNDICVGNSLCDMYIKNQKLLD
Sbjct: 241 EDIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNKFTLATALNACAN 360
           GFKAFDEMPSSDVCSWTQMAAGCL CGEPMKALEV+Y+MKN+GVRLNKFTLATALNA AN
Sbjct: 301 GFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIYDMKNIGVRLNKFTLATALNASAN 360

Query: 361 LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTM 420
           LAS+EEGKKFHGLRIKLG DIDVCVDNALLDMYAKCGCM+SANVVFRSMDE+SVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGADIDVCVDNALLDMYAKCGCMSSANVVFRSMDEQSVVSWTTM 420

Query: 421 IMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHG 480
           IMGFAHNGQA+EALQIFDEMRK  AEPNHITFICVLYACSQGGF+DEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQAKEALQIFDEMRKEGAEPNHITFICVLYACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAA 540
           ISPSEDHYVCMVNLLGRAGCIKEAEDLI +MPF+PGSLVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLIRRMPFKPGSLVWQTLLGACLVHGDVETGKRAA 540

Query: 541 EHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM 591
           EHALNLD+NDPSTYVLLSNM AG +NWD VGSLRELMETRDVKKVPG SWM
Sbjct: 541 EHALNLDQNDPSTYVLLSNMFAGRSNWDGVGSLRELMETRDVKKVPGFSWM 589

BLAST of CmUC01G014220 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 382.1 bits (980), Expect = 1.1e-104
Identity = 219/633 (34.60%), Positives = 333/633 (52.61%), Query Frame = 0

Query: 31  KSFIS-KSDNSSVKLEDFYASLLNRCVQTSDSR-HGSAIHAKFLKGFLPFSLFFHNHVLN 90
           KSF+   +D SS      +A LL+ C+++  S  +   +HA  +K      +F  N +++
Sbjct: 4   KSFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLID 63

Query: 91  LYVKCGGLSYGLQLFDEMPERNVVSWSAI------------------------------- 150
            Y KCG L  G Q+FD+MP+RN+ +W+++                               
Sbjct: 64  AYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSM 123

Query: 151 IAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLG 210
           ++GF QH R  EAL  F  MH +G ++ NE++  S L ACS    +    Q++SLI +  
Sbjct: 124 VSGFAQHDRCEEALCYFAMMHKEGFVL-NEYSFASVLSACSGLNDMNKGVQVHSLIAKSP 183

Query: 211 YGSNVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLA-YFELPKFWH 270
           + S+V++ +A +    +   + +A  VF+    ++ VSWN+++  + Q     E    + 
Sbjct: 184 FLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQ 243

Query: 271 RMNLEGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAG-YGNDICVGNSLCDMYIKN 330
            M    V+PD  T AS+++  A+LS  ++G +VHG++VK     NDI + N+  DMY K 
Sbjct: 244 MMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKC 303

Query: 331 QKLLDGFKAFDEMP-------------------------------SSDVCSWTQMAAGCL 390
            ++ +    FD MP                                 +V SW  + AG  
Sbjct: 304 SRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYT 363

Query: 391 QCGEPMKALEVVYEMKNVGVRLNKFTLATALNACANLASMEEGKKF------HGLRIKLG 450
           Q GE  +AL +   +K   V    ++ A  L ACA+LA +  G +       HG + + G
Sbjct: 364 QNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSG 423

Query: 451 TDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTMIMGFAHNGQAREALQIFD 510
            + D+ V N+L+DMY KCGC+    +VFR M ER  VSW  MI+GFA NG   EAL++F 
Sbjct: 424 EEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFR 483

Query: 511 EMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRA 570
           EM +   +P+HIT I VL AC   GFV+E   YFSSM+ D G++P  DHY CMV+LLGRA
Sbjct: 484 EMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRA 543

Query: 571 GCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAAEHALNLDRNDPSTYVLLS 592
           G ++EA+ +I +MP +P S++W +LL AC VH ++  GK  AE  L ++ ++   YVLLS
Sbjct: 544 GFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLS 603

BLAST of CmUC01G014220 vs. ExPASy Swiss-Prot
Match: Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 375.2 bits (962), Expect = 1.4e-102
Identity = 200/543 (36.83%), Positives = 311/543 (57.27%), Query Frame = 0

Query: 51  LLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERN 110
           +L   V+      G  +H   LK  L   L   N ++N+Y K     +   +FD M ER+
Sbjct: 321 MLATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERD 380

Query: 111 VVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHAC-SLTQRLICSNQI 170
           ++SW+++IAG  Q+G   EA+ LF ++   G + P+++T+ S L A  SL + L  S Q+
Sbjct: 381 LISWNSVIAGIAQNGLEVEAVCLFMQLLRCG-LKPDQYTMTSVLKAASSLPEGLSLSKQV 440

Query: 171 YSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQL-AY 230
           +   +++   S+ F+  A +    R+  + EA  +FE  +  D V+WNAMMAGY Q    
Sbjct: 441 HVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFERHNF-DLVAWNAMMAGYTQSHDG 500

Query: 231 FELPKFWHRMNLEGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSL 290
            +  K +  M+ +G + D+FT A++      L     G QVH   +K+GY  D+ V + +
Sbjct: 501 HKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGI 560

Query: 291 CDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNK 350
            DMY+K   +     AFD +P  D  +WT M +GC++ GE  +A  V  +M+ +GV  ++
Sbjct: 561 LDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDE 620

Query: 351 FTLATALNACANLASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRS 410
           FT+AT   A + L ++E+G++ H   +KL    D  V  +L+DMYAKCG +  A  +F+ 
Sbjct: 621 FTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKR 680

Query: 411 MDERSVVSWTTMIMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEA 470
           ++  ++ +W  M++G A +G+ +E LQ+F +M+    +P+ +TFI VL ACS  G V EA
Sbjct: 681 IEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEA 740

Query: 471 WKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACL 530
           +K+  SM  D+GI P  +HY C+ + LGRAG +K+AE+LI  M     + +++TLL AC 
Sbjct: 741 YKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACR 800

Query: 531 VHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGS 590
           V GD ETGKR A   L L+  D S YVLLSNM A  + WD +   R +M+   VKK PG 
Sbjct: 801 VQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGF 860

Query: 591 SWM 592
           SW+
Sbjct: 861 SWI 861

BLAST of CmUC01G014220 vs. ExPASy Swiss-Prot
Match: Q9SS83 (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E88 PE=2 SV=1)

HSP 1 Score: 370.5 bits (950), Expect = 3.4e-101
Identity = 208/557 (37.34%), Positives = 307/557 (55.12%), Query Frame = 0

Query: 39  NSSVKLEDF-YASLLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLS 98
           +S   ++DF + SLL+ C  + D   GS  H+  +K  L  +LF  N ++++Y KCG L 
Sbjct: 421 SSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAKCGALE 480

Query: 99  YGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHAC 158
              Q+F+ M +R+ V+W+ II  +VQ    +EA  LF RM+  G I+ +   L S L AC
Sbjct: 481 DARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCG-IVSDGACLASTLKAC 540

Query: 159 SLTQRLICSNQIYSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWN 218
           +    L    Q++ L V+ G   ++   ++ +    +   + +A +VF S      VS N
Sbjct: 541 THVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMN 600

Query: 219 AMMAGYLQLAYFELPKFWHRMNLEGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAG 278
           A++AGY Q    E    +  M   GV P   TFA+I+          LG Q HGQ+ K G
Sbjct: 601 ALIAGYSQNNLEEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRG 660

Query: 279 YGND-ICVGNSLCDMYIKNQKLLDGFKAFDEMPS-SDVCSWTQMAAGCLQCGEPMKALEV 338
           + ++   +G SL  MY+ ++ + +    F E+ S   +  WT M +G  Q G   +AL+ 
Sbjct: 661 FSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWTGMMSGHSQNGFYEEALKF 720

Query: 339 VYEMKNVGVRLNKFTLATALNACANLASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAK 398
             EM++ GV  ++ T  T L  C+ L+S+ EG+  H L   L  D+D    N L+DMYAK
Sbjct: 721 YKEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHLAHDLDELTSNTLIDMYAK 780

Query: 399 CGCMASANVVFRSMDERS-VVSWTTMIMGFAHNGQAREALQIFDEMRKGEAEPNHITFIC 458
           CG M  ++ VF  M  RS VVSW ++I G+A NG A +AL+IFD MR+    P+ ITF+ 
Sbjct: 781 CGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKIFDSMRQSHIMPDEITFLG 840

Query: 459 VLYACSQGGFVDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFR 518
           VL ACS  G V +  K F  M   +GI    DH  CMV+LLGR G ++EA+D I     +
Sbjct: 841 VLTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDLLGRWGYLQEADDFIEAQNLK 900

Query: 519 PGSLVWQTLLGACLVHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLR 578
           P + +W +LLGAC +HGD   G+ +AE  + L+  + S YVLLSN+ A    W+   +LR
Sbjct: 901 PDARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAYVLLSNIYASQGCWEKANALR 960

Query: 579 ELMETRDVKKVPGSSWM 592
           ++M  R VKKVPG SW+
Sbjct: 961 KVMRDRGVKKVPGYSWI 976

BLAST of CmUC01G014220 vs. ExPASy Swiss-Prot
Match: P0C898 (Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H86 PE=3 SV=1)

HSP 1 Score: 368.2 bits (944), Expect = 1.7e-100
Identity = 189/547 (34.55%), Positives = 303/547 (55.39%), Query Frame = 0

Query: 50  SLLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPER 109
           S+L  C +   S  G  +H   LK     +L   N+++++Y KC       ++FD MPER
Sbjct: 11  SILRVCTRKGLSDQGGQVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPER 70

Query: 110 NVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQI 169
           NVVSWSA+++G V +G    +LSLF  M   G I PNEFT  + L AC L   L    QI
Sbjct: 71  NVVSWSALMSGHVLNGDLKGSLSLFSEMGRQG-IYPNEFTFSTNLKACGLLNALEKGLQI 130

Query: 170 YSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYF 229
           +   +++G+   V + N+ +    +  ++ EA +VF     +  +SWNAM+AG++   Y 
Sbjct: 131 HGFCLKIGFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGYG 190

Query: 230 ELPKFWHRMNLEG---VKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGY--GNDICV 289
                   M  E     +PD FT  S+L   ++      G Q+HG LV++G+   +   +
Sbjct: 191 SKALDTFGMMQEANIKERPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSATI 250

Query: 290 GNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGV 349
             SL D+Y+K   L    KAFD++    + SW+ +  G  Q GE ++A+ +   ++ +  
Sbjct: 251 TGSLVDLYVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLFKRLQELNS 310

Query: 350 RLNKFTLATALNACANLASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANV 409
           +++ F L++ +   A+ A + +GK+   L +KL + ++  V N+++DMY KCG +  A  
Sbjct: 311 QIDSFALSSIIGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKCGLVDEAEK 370

Query: 410 VFRSMDERSVVSWTTMIMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGF 469
            F  M  + V+SWT +I G+  +G  +++++IF EM +   EP+ + ++ VL ACS  G 
Sbjct: 371 CFAEMQLKDVISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVLSACSHSGM 430

Query: 470 VDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLL 529
           + E  + FS +   HGI P  +HY C+V+LLGRAG +KEA+ LI  MP +P   +WQTLL
Sbjct: 431 IKEGEELFSKLLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPNVGIWQTLL 490

Query: 530 GACLVHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKK 589
             C VHGD+E GK   +  L +D  +P+ YV++SN+      W+  G+ REL   + +KK
Sbjct: 491 SLCRVHGDIELGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNARELGNIKGLKK 550

Query: 590 VPGSSWM 592
             G SW+
Sbjct: 551 EAGMSWV 556

BLAST of CmUC01G014220 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 363.6 bits (932), Expect = 4.2e-99
Identity = 197/547 (36.01%), Positives = 311/547 (56.86%), Query Frame = 0

Query: 48  YASLLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMP 107
           +A+ L    +      G  +H   +K  L  ++   N ++NLY+KCG +     LFD+  
Sbjct: 197 FAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSLINLYLKCGNVRKARILFDKTE 256

Query: 108 ERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSN 167
            ++VV+W+++I+G+  +G   EAL +F  M  +  +  +E +  S +  C+  + L  + 
Sbjct: 257 VKSVVTWNSMISGYAANGLDLEALGMFYSMRLN-YVRLSESSFASVIKLCANLKELRFTE 316

Query: 168 QIYSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEVFESFSS-KDTVSWNAMMAGYLQL 227
           Q++  +V+ G+  +  +  A +    +   +L+AL +F+      + VSW AM++G+LQ 
Sbjct: 317 QLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKEIGCVGNVVSWTAMISGFLQN 376

Query: 228 -AYFELPKFWHRMNLEGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVG 287
               E    +  M  +GV+P+ FT++ ILT L  +S      +VH Q+VK  Y     VG
Sbjct: 377 DGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPVISP----SEVHAQVVKTNYERSSTVG 436

Query: 288 NSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVR 347
            +L D Y+K  K+ +  K F  +   D+ +W+ M AG  Q GE   A+++  E+   G++
Sbjct: 437 TALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTGETEAAIKMFGELTKGGIK 496

Query: 348 LNKFTLATALNAC-ANLASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANV 407
            N+FT ++ LN C A  ASM +GK+FHG  IK   D  +CV +ALL MYAK G + SA  
Sbjct: 497 PNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVSSALLTMYAKKGNIESAEE 556

Query: 408 VFRSMDERSVVSWTTMIMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGF 467
           VF+   E+ +VSW +MI G+A +GQA +AL +F EM+K + + + +TFI V  AC+  G 
Sbjct: 557 VFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVKMDGVTFIGVFAACTHAGL 616

Query: 468 VDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLL 527
           V+E  KYF  M  D  I+P+++H  CMV+L  RAG +++A  +I  MP   GS +W+T+L
Sbjct: 617 VEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLEKAMKVIENMPNPAGSTIWRTIL 676

Query: 528 GACLVHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKK 587
            AC VH   E G+ AAE  + +   D + YVLLSNM A   +W     +R+LM  R+VKK
Sbjct: 677 AACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGDWQERAKVRKLMNERNVKK 736

Query: 588 VPGSSWM 592
            PG SW+
Sbjct: 737 EPGYSWI 738

BLAST of CmUC01G014220 vs. ExPASy TrEMBL
Match: A0A1S4E4G2 (pentatricopeptide repeat-containing protein At2g13600-like OS=Cucumis melo OX=3656 GN=LOC103504641 PE=4 SV=1)

HSP 1 Score: 1138.3 bits (2943), Expect = 0.0e+00
Identity = 550/591 (93.06%), Positives = 567/591 (95.94%), Query Frame = 0

Query: 1   MPLYFHLVRPFIFNSKSTKLQHSIALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSD 60
           MPL FHL RP I  SKST LQ SIALRISHKSFISKS++SSVKLEDFY S L RCVQTSD
Sbjct: 1   MPLCFHLARPLILISKSTDLQKSIALRISHKSFISKSEDSSVKLEDFYVSFLQRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLPFSLFFHNHVLNLY+KCG LSYGLQLFDEMPERNVVSWSAIIAG
Sbjct: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYLKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGS 180
           FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICS QIY+ IVRLGYGS
Sbjct: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180

Query: 181 NVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNL 240
           NVFLMNAFLT LIRH+KLLEALEVFES  SKDTVSWNAMMAGYLQLAYFELPKFW RMNL
Sbjct: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240

Query: 241 EGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLD 300
           E VKPDNFTFASILTGLAALSEFRLGLQVHGQLVK+GYGNDICVGNSLCDMYIKNQKLLD
Sbjct: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNKFTLATALNACAN 360
           GFKAFDEM SSDVCSWTQMA+GCLQCGEPMKALEV+YEMKNVGVRLNKFTLATALN+CAN
Sbjct: 301 GFKAFDEMSSSDVCSWTQMASGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360

Query: 361 LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTM 420
           LAS+EEGKKFHGLRIKLGTD+DVCVDNALLDMYAKCGCM SANVVFRSMDERSVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420

Query: 421 IMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHG 480
           IMGFAHNGQ +EALQIFDEMRKGEAEPNHITFICVL ACSQGGF+DEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAA 540
           I+PSEDHYVCMVNLLGRAGCIKEAEDLILQMPF+PGSLVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540

Query: 541 EHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM 592
           EHALNLDRNDPSTY+LLSNM AGGNNWD VGSLRELMETRDVKKVPGSSWM
Sbjct: 541 EHALNLDRNDPSTYILLSNMFAGGNNWDGVGSLRELMETRDVKKVPGSSWM 591
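
The Identity and Positives figures in an HSP header can be recomputed from the middle line of the alignment, where a letter marks an identical residue and `+` marks a conservative substitution. The following is a minimal Python sketch under that assumption; the helper names `alignment_stats` and `percent` are hypothetical and not part of any BLAST tool.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one aligned block.

    Assumes BLAST's protein midline convention: a letter marks an identical
    residue, '+' a conservative substitution, and a space a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * count / total, 2)


# Last block of the A0A1S4E4G2 alignment (query line and midline only).
query = "EHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM"
midline = "EHALNLDRNDPSTY+LLSNM AGGNNWD VGSLRELMETRDVKKVPGSSWM"

identities, positives, columns = alignment_stats(query, midline)
print(identities, positives, columns)  # 48 49 51

# Whole-alignment totals copied from the HSP header above.
print(percent(550, 591), percent(567, 591))  # 93.06 95.94
```

Summing the per-block counts over every block of an alignment should reproduce the header totals, provided the midline is copied with its original spacing intact.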

BLAST of CmUC01G014220 vs. ExPASy TrEMBL
Match: A0A0A0LVZ1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043090 PE=4 SV=1)

HSP 1 Score: 1130.2 bits (2922), Expect = 0.0e+00
Identity = 546/591 (92.39%), Positives = 563/591 (95.26%), Query Frame = 0

Query: 1   MPLYFHLVRPFIFNSKSTKLQHSIALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSD 60
           MPL FHL RP    SKST LQ SIALRIS KSF+SKS+NSSVKLEDFY S L RCV TSD
Sbjct: 1   MPLCFHLARPLFLISKSTDLQKSIALRISRKSFVSKSENSSVKLEDFYVSFLQRCVLTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLPFSLFFHNHVLN YVKCG LSYGLQLFDEMPERNVVSWSAIIAG
Sbjct: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYGLQLFDEMPERNVVSWSAIIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGS 180
           FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICS QIY+ IVRLGYGS
Sbjct: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGS 180

Query: 181 NVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNL 240
           NVFLMNAFLT LIRH+KLLEALEVFES  SKDTVSWNAMMAGYLQLAYFELPKFW RMNL
Sbjct: 181 NVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNL 240

Query: 241 EGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLD 300
           E VKPDNFTFASILTGLAALSEFRLGLQVHGQLVK+GYGNDICVGNSLCDMY+KNQKLLD
Sbjct: 241 ESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYGNDICVGNSLCDMYVKNQKLLD 300

Query: 301 GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNKFTLATALNACAN 360
           GFKAFDEM SSDVCSWTQMAAGCLQCGEPMKALEV+YEMKNVGVRLNKFTLATALN+CAN
Sbjct: 301 GFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCAN 360

Query: 361 LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTM 420
           LAS+EEGKKFHGLRIKLGTD+DVCVDNALLDMYAKCGCM SANVVFRSMDERSVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGTDVDVCVDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTM 420

Query: 421 IMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHG 480
           IMGFAHNGQ +EALQIFDEMRKGEAEPNHITFICVL ACSQGGF+DEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQTKEALQIFDEMRKGEAEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAA 540
           I+PSEDHYVCMVNLLGRAGCIKEAEDLILQMPF+PGSLVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 IAPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAA 540

Query: 541 EHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM 592
           EHALNLDRNDPSTY+LLSNM AGG+NWDSVG LRELMETRDVKKVPGSSWM
Sbjct: 541 EHALNLDRNDPSTYILLSNMFAGGDNWDSVGILRELMETRDVKKVPGSSWM 591

BLAST of CmUC01G014220 vs. ExPASy TrEMBL
Match: A0A5D3BH26 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G002660 PE=4 SV=1)

HSP 1 Score: 1108.2 bits (2865), Expect = 0.0e+00
Identity = 532/567 (93.83%), Positives = 550/567 (97.00%), Query Frame = 0

Query: 25  ALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHN 84
           +LRISHKSFISKS++SSVKLEDFY S L RCVQTSDSRHGSAIHAKFLKGFLPFSLFFHN
Sbjct: 152 SLRISHKSFISKSEDSSVKLEDFYVSFLQRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHN 211

Query: 85  HVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIM 144
           HVLNLY+KCG LSYGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIM
Sbjct: 212 HVLNLYLKCGRLSYGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIM 271

Query: 145 PNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEV 204
           PNEFTLVSALHACSLTQRLICS QIY+ IVRLGYGSNVFLMNAFLT LIRH+KLLEALEV
Sbjct: 272 PNEFTLVSALHACSLTQRLICSYQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEV 331

Query: 205 FESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNLEGVKPDNFTFASILTGLAALSEFR 264
           FES  SKDTVSWNAMMAGYLQLAYFELPKFW RMNLE VKPDNFTFASILTGLAALSEFR
Sbjct: 332 FESCLSKDTVSWNAMMAGYLQLAYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFR 391

Query: 265 LGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCL 324
           LGLQVHGQLVK+GYGNDICVGNSLCDMYIKNQKLLDGFKAFDEM SSDVCSWTQMA+GCL
Sbjct: 392 LGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLLDGFKAFDEMSSSDVCSWTQMASGCL 451

Query: 325 QCGEPMKALEVVYEMKNVGVRLNKFTLATALNACANLASMEEGKKFHGLRIKLGTDIDVC 384
           QCGEPMKALEV+YEMKNVGVRLNKFTLATALN+CANLAS+EEGKKFHGLRIKLGTD+DVC
Sbjct: 452 QCGEPMKALEVIYEMKNVGVRLNKFTLATALNSCANLASIEEGKKFHGLRIKLGTDVDVC 511

Query: 385 VDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTMIMGFAHNGQAREALQIFDEMRKGE 444
           VDNALLDMYAKCGCM SANVVFRSMDERSVVSWTTMIMGFAHNGQ +EALQIFDEMRKGE
Sbjct: 512 VDNALLDMYAKCGCMTSANVVFRSMDERSVVSWTTMIMGFAHNGQTKEALQIFDEMRKGE 571

Query: 445 AEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEA 504
           AEPNHITFICVL ACSQGGF+DEAWKYFSSMSADHGI+PSEDHYVCMVNLLGRAGCIKEA
Sbjct: 572 AEPNHITFICVLNACSQGGFIDEAWKYFSSMSADHGIAPSEDHYVCMVNLLGRAGCIKEA 631

Query: 505 EDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGG 564
           EDLILQMPF+PGSLVWQTLLGACLVHGD+ETGKRAAEHALNLDRNDPSTY+LLSNM AGG
Sbjct: 632 EDLILQMPFQPGSLVWQTLLGACLVHGDIETGKRAAEHALNLDRNDPSTYILLSNMFAGG 691

Query: 565 NNWDSVGSLRELMETRDVKKVPGSSWM 592
           NNWD VGSLRELMETRDVKKVPGSSWM
Sbjct: 692 NNWDGVGSLRELMETRDVKKVPGSSWM 718

BLAST of CmUC01G014220 vs. ExPASy TrEMBL
Match: A0A6J1GRX4 (pentatricopeptide repeat-containing protein At2g13600-like OS=Cucurbita moschata OX=3662 GN=LOC111456896 PE=4 SV=1)

HSP 1 Score: 1100.9 bits (2846), Expect = 0.0e+00
Identity = 532/591 (90.02%), Positives = 561/591 (94.92%), Query Frame = 0

Query: 1   MPLYFHLVRPFIFNSKSTKLQHSIALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSD 60
           MPL  H+VRP IF SKSTK +HSIALRISHKSFISKS+ S VKLEDFY +LL+RCVQTSD
Sbjct: 1   MPL--HIVRPLIFVSKSTKTRHSIALRISHKSFISKSEISYVKLEDFYVNLLHRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLP+SLFFHNHVLN YVKCG LS GLQLFDEMPERNVVSWSA+IAG
Sbjct: 61  SRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLSCGLQLFDEMPERNVVSWSAVIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGS 180
           FVQHGRPNEALSLF RMHCDGTI+PNEFTLVSALHACSLTQRLICS QIY+L++RLGYGS
Sbjct: 121 FVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQIYALVLRLGYGS 180

Query: 181 NVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNL 240
           N+FLMNAFLT LIRH+KLLEALEVF S SSKD VSWNAMMAGYLQL+Y ELPKFW RMNL
Sbjct: 181 NIFLMNAFLTALIRHEKLLEALEVFGSSSSKDIVSWNAMMAGYLQLSYLELPKFWRRMNL 240

Query: 241 EGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLD 300
           E +KPDNFTFASILTGLAALSEF+LGLQVHGQLVK+GYGNDICVGNSLCDMYIKNQKLLD
Sbjct: 241 ENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNKFTLATALNACAN 360
           GFKAFDEMPSSDVCSWTQMAAGCL CGEPMKALEV+Y+MKNVGVRLNKFTLATALNA AN
Sbjct: 301 GFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIYDMKNVGVRLNKFTLATALNASAN 360

Query: 361 LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTM 420
           LAS+EEGKKFHGLRIKLG DIDVCVDNALLDMYAKCGCM+SANVVFRSMDE+SVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGADIDVCVDNALLDMYAKCGCMSSANVVFRSMDEQSVVSWTTM 420

Query: 421 IMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHG 480
           IMGFAHNGQA+EALQIFDEMRK  AEPNHITFICVLYACSQGGF+DEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQAKEALQIFDEMRKEGAEPNHITFICVLYACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAA 540
           ISPSEDHYVCMVNLLGRAGCIKEAEDLI +MPF+PGSLVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLIGRMPFKPGSLVWQTLLGACLVHGDVETGKRAA 540

Query: 541 EHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM 592
           EHALNLD+NDPSTYVLLSNM AG +NWD VGSLRELMETRDVKKVPG SWM
Sbjct: 541 EHALNLDQNDPSTYVLLSNMFAGRSNWDGVGSLRELMETRDVKKVPGFSWM 589

BLAST of CmUC01G014220 vs. ExPASy TrEMBL
Match: A0A6J1K498 (pentatricopeptide repeat-containing protein At4g33170-like OS=Cucurbita maxima OX=3661 GN=LOC111490529 PE=4 SV=1)

HSP 1 Score: 1095.5 bits (2832), Expect = 0.0e+00
Identity = 530/591 (89.68%), Positives = 560/591 (94.75%), Query Frame = 0

Query: 1   MPLYFHLVRPFIFNSKSTKLQHSIALRISHKSFISKSDNSSVKLEDFYASLLNRCVQTSD 60
           MPL  H+ RP IF SKSTK +HSIALRISHKSFISKS+ SSVKLEDFY +LL+RCVQTSD
Sbjct: 1   MPL--HIARPLIFVSKSTKTRHSIALRISHKSFISKSEISSVKLEDFYVNLLHRCVQTSD 60

Query: 61  SRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERNVVSWSAIIAG 120
           SRHGSAIHAKFLKGFLP+SLFFHNHVLN YVKCG LS GLQLFDEMPERNVVSWSA+IAG
Sbjct: 61  SRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLSCGLQLFDEMPERNVVSWSAVIAG 120

Query: 121 FVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLGYGS 180
           FVQHGRPNEALSLF RMHCDGTI+PNEFTLVSALHACSLTQRLICS QIY+L++RLGYGS
Sbjct: 121 FVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQIYALVLRLGYGS 180

Query: 181 NVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYFELPKFWHRMNL 240
           N+FLMNAFLT LIRH+KLLEALEVFE+ SSKD VSWNAMMAGYLQL+YFELPKFW RMNL
Sbjct: 181 NIFLMNAFLTALIRHEKLLEALEVFENSSSKDIVSWNAMMAGYLQLSYFELPKFWRRMNL 240

Query: 241 EGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSLCDMYIKNQKLLD 300
           E +KPDNFTFASILTGLAALSEF+LGLQVHG LVK+GYGNDICVGNSLCDMYIKNQKLLD
Sbjct: 241 EDIKPDNFTFASILTGLAALSEFKLGLQVHGLLVKSGYGNDICVGNSLCDMYIKNQKLLD 300

Query: 301 GFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNKFTLATALNACAN 360
           GFKAFDEMPSSDVCSWTQMAAGCL CGEPMKALEV+Y+MKNVGVRLNKFTLATALNA AN
Sbjct: 301 GFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIYDMKNVGVRLNKFTLATALNASAN 360

Query: 361 LASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTM 420
           LAS+EEGKKFHGLRIKLG DIDVCVDNALLDMYAKCGCM+SANVVFRSMDE+SVVSWTTM
Sbjct: 361 LASIEEGKKFHGLRIKLGADIDVCVDNALLDMYAKCGCMSSANVVFRSMDEQSVVSWTTM 420

Query: 421 IMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHG 480
           IMGFAHNGQA+EALQIFDEMRK  AEPNHITFICVLYACSQGGF+DEAWKYFSSMSADHG
Sbjct: 421 IMGFAHNGQAKEALQIFDEMRKEGAEPNHITFICVLYACSQGGFIDEAWKYFSSMSADHG 480

Query: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAA 540
           ISPSEDHYVCMVNLLGRAGCIKEAEDLI +M F+PGSLVWQTLLGACLVHGD+ETGKRAA
Sbjct: 481 ISPSEDHYVCMVNLLGRAGCIKEAEDLIGRMLFKPGSLVWQTLLGACLVHGDVETGKRAA 540

Query: 541 EHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGSSWM 592
           EHALNLD+ND STYVLLSNM AG +NWD VGSLRELMETRDVKKVPG SWM
Sbjct: 541 EHALNLDQNDSSTYVLLSNMFAGRSNWDGVGSLRELMETRDVKKVPGFSWM 589

BLAST of CmUC01G014220 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 382.1 bits (980), Expect = 8.1e-106
Identity = 219/633 (34.60%), Positives = 333/633 (52.61%), Query Frame = 0

Query: 31  KSFIS-KSDNSSVKLEDFYASLLNRCVQTSDSR-HGSAIHAKFLKGFLPFSLFFHNHVLN 90
           KSF+   +D SS      +A LL+ C+++  S  +   +HA  +K      +F  N +++
Sbjct: 4   KSFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLID 63

Query: 91  LYVKCGGLSYGLQLFDEMPERNVVSWSAI------------------------------- 150
            Y KCG L  G Q+FD+MP+RN+ +W+++                               
Sbjct: 64  AYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSM 123

Query: 151 IAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQIYSLIVRLG 210
           ++GF QH R  EAL  F  MH +G ++ NE++  S L ACS    +    Q++SLI +  
Sbjct: 124 VSGFAQHDRCEEALCYFAMMHKEGFVL-NEYSFASVLSACSGLNDMNKGVQVHSLIAKSP 183

Query: 211 YGSNVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLA-YFELPKFWH 270
           + S+V++ +A +    +   + +A  VF+    ++ VSWN+++  + Q     E    + 
Sbjct: 184 FLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQ 243

Query: 271 RMNLEGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAG-YGNDICVGNSLCDMYIKN 330
            M    V+PD  T AS+++  A+LS  ++G +VHG++VK     NDI + N+  DMY K 
Sbjct: 244 MMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKC 303

Query: 331 QKLLDGFKAFDEMP-------------------------------SSDVCSWTQMAAGCL 390
            ++ +    FD MP                                 +V SW  + AG  
Sbjct: 304 SRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYT 363

Query: 391 QCGEPMKALEVVYEMKNVGVRLNKFTLATALNACANLASMEEGKKF------HGLRIKLG 450
           Q GE  +AL +   +K   V    ++ A  L ACA+LA +  G +       HG + + G
Sbjct: 364 QNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSG 423

Query: 451 TDIDVCVDNALLDMYAKCGCMASANVVFRSMDERSVVSWTTMIMGFAHNGQAREALQIFD 510
            + D+ V N+L+DMY KCGC+    +VFR M ER  VSW  MI+GFA NG   EAL++F 
Sbjct: 424 EEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFR 483

Query: 511 EMRKGEAEPNHITFICVLYACSQGGFVDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRA 570
           EM +   +P+HIT I VL AC   GFV+E   YFSSM+ D G++P  DHY CMV+LLGRA
Sbjct: 484 EMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRA 543

Query: 571 GCIKEAEDLILQMPFRPGSLVWQTLLGACLVHGDLETGKRAAEHALNLDRNDPSTYVLLS 592
           G ++EA+ +I +MP +P S++W +LL AC VH ++  GK  AE  L ++ ++   YVLLS
Sbjct: 544 GFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLS 603
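
The bit score and Expect value in an HSP header are linked by the standard BLAST relation E = N * 2^(-S'), where S' is the bit score and N is the effective search space. As a rough Python sketch (the function name is hypothetical, and because the reported values are rounded the result is only an order-of-magnitude estimate), the search space implied by the AT2G13600.1 hit can be backed out as follows:

```python
def implied_search_space(bit_score: float, e_value: float) -> float:
    """Solve E = N * 2**(-S') for N, the effective search space.

    Only meaningful for hits with a non-zero reported E-value; a displayed
    value of 0.0e+00 is an underflow and carries no usable information.
    """
    if e_value <= 0.0:
        raise ValueError("E-value must be positive")
    return e_value * 2.0 ** bit_score


# Values copied from the AT2G13600.1 HSP header above.
space = implied_search_space(382.1, 8.1e-106)
print(f"{space:.1e}")  # on the order of 10**9 to 10**10
```

Hits against the same database should imply a similar search space, which gives a quick consistency check on score and E-value pairs copied by hand.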

BLAST of CmUC01G014220 vs. TAIR 10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 375.2 bits (962), Expect = 9.9e-104
Identity = 200/543 (36.83%), Positives = 311/543 (57.27%), Query Frame = 0

Query: 51  LLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPERN 110
           +L   V+      G  +H   LK  L   L   N ++N+Y K     +   +FD M ER+
Sbjct: 321 MLATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERD 380

Query: 111 VVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHAC-SLTQRLICSNQI 170
           ++SW+++IAG  Q+G   EA+ LF ++   G + P+++T+ S L A  SL + L  S Q+
Sbjct: 381 LISWNSVIAGIAQNGLEVEAVCLFMQLLRCG-LKPDQYTMTSVLKAASSLPEGLSLSKQV 440

Query: 171 YSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQL-AY 230
           +   +++   S+ F+  A +    R+  + EA  +FE  +  D V+WNAMMAGY Q    
Sbjct: 441 HVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFERHNF-DLVAWNAMMAGYTQSHDG 500

Query: 231 FELPKFWHRMNLEGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVGNSL 290
            +  K +  M+ +G + D+FT A++      L     G QVH   +K+GY  D+ V + +
Sbjct: 501 HKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGI 560

Query: 291 CDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVRLNK 350
            DMY+K   +     AFD +P  D  +WT M +GC++ GE  +A  V  +M+ +GV  ++
Sbjct: 561 LDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDE 620

Query: 351 FTLATALNACANLASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANVVFRS 410
           FT+AT   A + L ++E+G++ H   +KL    D  V  +L+DMYAKCG +  A  +F+ 
Sbjct: 621 FTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKR 680

Query: 411 MDERSVVSWTTMIMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGFVDEA 470
           ++  ++ +W  M++G A +G+ +E LQ+F +M+    +P+ +TFI VL ACS  G V EA
Sbjct: 681 IEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEA 740

Query: 471 WKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLLGACL 530
           +K+  SM  D+GI P  +HY C+ + LGRAG +K+AE+LI  M     + +++TLL AC 
Sbjct: 741 YKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACR 800

Query: 531 VHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKKVPGS 590
           V GD ETGKR A   L L+  D S YVLLSNM A  + WD +   R +M+   VKK PG 
Sbjct: 801 VQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGF 860

Query: 591 SWM 592
           SW+
Sbjct: 861 SWI 861

BLAST of CmUC01G014220 vs. TAIR 10
Match: AT3G09040.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 370.5 bits (950), Expect = 2.4e-102
Identity = 208/557 (37.34%), Positives = 307/557 (55.12%), Query Frame = 0

Query: 39  NSSVKLEDF-YASLLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLS 98
           +S   ++DF + SLL+ C  + D   GS  H+  +K  L  +LF  N ++++Y KCG L 
Sbjct: 421 SSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAKCGALE 480

Query: 99  YGLQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHAC 158
              Q+F+ M +R+ V+W+ II  +VQ    +EA  LF RM+  G I+ +   L S L AC
Sbjct: 481 DARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCG-IVSDGACLASTLKAC 540

Query: 159 SLTQRLICSNQIYSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWN 218
           +    L    Q++ L V+ G   ++   ++ +    +   + +A +VF S      VS N
Sbjct: 541 THVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMN 600

Query: 219 AMMAGYLQLAYFELPKFWHRMNLEGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAG 278
           A++AGY Q    E    +  M   GV P   TFA+I+          LG Q HGQ+ K G
Sbjct: 601 ALIAGYSQNNLEEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRG 660

Query: 279 YGND-ICVGNSLCDMYIKNQKLLDGFKAFDEMPS-SDVCSWTQMAAGCLQCGEPMKALEV 338
           + ++   +G SL  MY+ ++ + +    F E+ S   +  WT M +G  Q G   +AL+ 
Sbjct: 661 FSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWTGMMSGHSQNGFYEEALKF 720

Query: 339 VYEMKNVGVRLNKFTLATALNACANLASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAK 398
             EM++ GV  ++ T  T L  C+ L+S+ EG+  H L   L  D+D    N L+DMYAK
Sbjct: 721 YKEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHLAHDLDELTSNTLIDMYAK 780

Query: 399 CGCMASANVVFRSMDERS-VVSWTTMIMGFAHNGQAREALQIFDEMRKGEAEPNHITFIC 458
           CG M  ++ VF  M  RS VVSW ++I G+A NG A +AL+IFD MR+    P+ ITF+ 
Sbjct: 781 CGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKIFDSMRQSHIMPDEITFLG 840

Query: 459 VLYACSQGGFVDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFR 518
           VL ACS  G V +  K F  M   +GI    DH  CMV+LLGR G ++EA+D I     +
Sbjct: 841 VLTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDLLGRWGYLQEADDFIEAQNLK 900

Query: 519 PGSLVWQTLLGACLVHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLR 578
           P + +W +LLGAC +HGD   G+ +AE  + L+  + S YVLLSN+ A    W+   +LR
Sbjct: 901 PDARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAYVLLSNIYASQGCWEKANALR 960

Query: 579 ELMETRDVKKVPGSSWM 592
           ++M  R VKKVPG SW+
Sbjct: 961 KVMRDRGVKKVPGYSWI 976

BLAST of CmUC01G014220 vs. TAIR 10
Match: AT3G15130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 368.2 bits (944), Expect = 1.2e-101
Identity = 189/547 (34.55%), Positives = 303/547 (55.39%), Query Frame = 0

Query: 50  SLLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMPER 109
           S+L  C +   S  G  +H   LK     +L   N+++++Y KC       ++FD MPER
Sbjct: 11  SILRVCTRKGLSDQGGQVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPER 70

Query: 110 NVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSNQI 169
           NVVSWSA+++G V +G    +LSLF  M   G I PNEFT  + L AC L   L    QI
Sbjct: 71  NVVSWSALMSGHVLNGDLKGSLSLFSEMGRQG-IYPNEFTFSTNLKACGLLNALEKGLQI 130

Query: 170 YSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEVFESFSSKDTVSWNAMMAGYLQLAYF 229
           +   +++G+   V + N+ +    +  ++ EA +VF     +  +SWNAM+AG++   Y 
Sbjct: 131 HGFCLKIGFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGYG 190

Query: 230 ELPKFWHRMNLEG---VKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGY--GNDICV 289
                   M  E     +PD FT  S+L   ++      G Q+HG LV++G+   +   +
Sbjct: 191 SKALDTFGMMQEANIKERPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSATI 250

Query: 290 GNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGV 349
             SL D+Y+K   L    KAFD++    + SW+ +  G  Q GE ++A+ +   ++ +  
Sbjct: 251 TGSLVDLYVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLFKRLQELNS 310

Query: 350 RLNKFTLATALNACANLASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANV 409
           +++ F L++ +   A+ A + +GK+   L +KL + ++  V N+++DMY KCG +  A  
Sbjct: 311 QIDSFALSSIIGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKCGLVDEAEK 370

Query: 410 VFRSMDERSVVSWTTMIMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGF 469
            F  M  + V+SWT +I G+  +G  +++++IF EM +   EP+ + ++ VL ACS  G 
Sbjct: 371 CFAEMQLKDVISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVLSACSHSGM 430

Query: 470 VDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLL 529
           + E  + FS +   HGI P  +HY C+V+LLGRAG +KEA+ LI  MP +P   +WQTLL
Sbjct: 431 IKEGEELFSKLLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPNVGIWQTLL 490

Query: 530 GACLVHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKK 589
             C VHGD+E GK   +  L +D  +P+ YV++SN+      W+  G+ REL   + +KK
Sbjct: 491 SLCRVHGDIELGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNARELGNIKGLKK 550

Query: 590 VPGSSWM 592
             G SW+
Sbjct: 551 EAGMSWV 556

BLAST of CmUC01G014220 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 363.6 bits (932), Expect = 3.0e-100
Identity = 197/547 (36.01%), Positives = 311/547 (56.86%), Query Frame = 0

Query: 48  YASLLNRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYVKCGGLSYGLQLFDEMP 107
           +A+ L    +      G  +H   +K  L  ++   N ++NLY+KCG +     LFD+  
Sbjct: 197 FAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSLINLYLKCGNVRKARILFDKTE 256

Query: 108 ERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSLTQRLICSN 167
            ++VV+W+++I+G+  +G   EAL +F  M  +  +  +E +  S +  C+  + L  + 
Sbjct: 257 VKSVVTWNSMISGYAANGLDLEALGMFYSMRLN-YVRLSESSFASVIKLCANLKELRFTE 316

Query: 168 QIYSLIVRLGYGSNVFLMNAFLTTLIRHDKLLEALEVFESFSS-KDTVSWNAMMAGYLQL 227
           Q++  +V+ G+  +  +  A +    +   +L+AL +F+      + VSW AM++G+LQ 
Sbjct: 317 QLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKEIGCVGNVVSWTAMISGFLQN 376

Query: 228 -AYFELPKFWHRMNLEGVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKAGYGNDICVG 287
               E    +  M  +GV+P+ FT++ ILT L  +S      +VH Q+VK  Y     VG
Sbjct: 377 DGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPVISP----SEVHAQVVKTNYERSSTVG 436

Query: 288 NSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLQCGEPMKALEVVYEMKNVGVR 347
            +L D Y+K  K+ +  K F  +   D+ +W+ M AG  Q GE   A+++  E+   G++
Sbjct: 437 TALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSAMLAGYAQTGETEAAIKMFGELTKGGIK 496

Query: 348 LNKFTLATALNAC-ANLASMEEGKKFHGLRIKLGTDIDVCVDNALLDMYAKCGCMASANV 407
            N+FT ++ LN C A  ASM +GK+FHG  IK   D  +CV +ALL MYAK G + SA  
Sbjct: 497 PNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSSLCVSSALLTMYAKKGNIESAEE 556

Query: 408 VFRSMDERSVVSWTTMIMGFAHNGQAREALQIFDEMRKGEAEPNHITFICVLYACSQGGF 467
           VF+   E+ +VSW +MI G+A +GQA +AL +F EM+K + + + +TFI V  AC+  G 
Sbjct: 557 VFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKKRKVKMDGVTFIGVFAACTHAGL 616

Query: 468 VDEAWKYFSSMSADHGISPSEDHYVCMVNLLGRAGCIKEAEDLILQMPFRPGSLVWQTLL 527
           V+E  KYF  M  D  I+P+++H  CMV+L  RAG +++A  +I  MP   GS +W+T+L
Sbjct: 617 VEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLEKAMKVIENMPNPAGSTIWRTIL 676

Query: 528 GACLVHGDLETGKRAAEHALNLDRNDPSTYVLLSNMLAGGNNWDSVGSLRELMETRDVKK 587
            AC VH   E G+ AAE  + +   D + YVLLSNM A   +W     +R+LM  R+VKK
Sbjct: 677 AACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYAESGDWQERAKVRKLMNERNVKK 736

Query: 588 VPGSSWM 592
            PG SW+
Sbjct: 737 EPGYSWI 738

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038894950.1  0.0e+00   93.91     pentatricopeptide repeat-containing protein At4g33170-like isoform X1 [Benincasa...
XP_008467246.1  0.0e+00   93.06     PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis m...
XP_011650978.1  0.0e+00   92.39     putative pentatricopeptide repeat-containing protein At3g15130 [Cucumis sativus]...
TYJ99062.1      0.0e+00   93.83     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
KAG7012171.1    0.0e+00   90.02     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...

Match Name      E-value   Identity  Description
Q9SIT7          1.1e-104  34.60     Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9SMZ2          1.4e-102  36.83     Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX...
Q9SS83          3.4e-101  37.34     Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop...
P0C898          1.7e-100  34.55     Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis th...
Q9ZUW3          4.2e-99   36.01     Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
A0A1S4E4G2      0.0e+00   93.06     pentatricopeptide repeat-containing protein At2g13600-like OS=Cucumis melo OX=36...
A0A0A0LVZ1      0.0e+00   92.39     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043090 PE=4 SV=1
A0A5D3BH26      0.0e+00   93.83     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1GRX4      0.0e+00   90.02     pentatricopeptide repeat-containing protein At2g13600-like OS=Cucurbita moschata...
A0A6J1K498      0.0e+00   89.68     pentatricopeptide repeat-containing protein At4g33170-like OS=Cucurbita maxima O...

Match Name      E-value   Identity  Description
AT2G13600.1     8.1e-106  34.60     Pentatricopeptide repeat (PPR) superfamily protein
AT4G33170.1     9.9e-104  36.83     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G09040.1     2.4e-102  37.34     Pentatricopeptide repeat (PPR) superfamily protein
AT3G15130.1     1.2e-101  34.55     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G27610.1     3.0e-100  36.01     Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                                    Source   Source Term       Source Description                                       Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10        Tetratricopeptide repeat domain                          coord: 461..566, e-value: 1.5E-13, score: 52.5
    coord: 371..460, e-value: 7.8E-22, score: 79.5
    coord: 262..370, e-value: 5.4E-18, score: 67.0
    coord: 166..261, e-value: 3.3E-12, score: 48.1
    coord: 18..165, e-value: 1.5E-20, score: 75.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756                                                coord: 450..484, e-value: 6.5E-5, score: 20.8
    coord: 415..448, e-value: 5.2E-7, score: 27.4
    coord: 112..142, e-value: 2.5E-6, score: 25.3
    coord: 315..347, e-value: 0.0027, score: 15.7
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                                                      coord: 315..344, e-value: 0.0021, score: 18.2
    coord: 488..511, e-value: 0.044, score: 14.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                                                    coord: 109..157, e-value: 6.6E-8, score: 32.6
    coord: 413..459, e-value: 1.2E-11, score: 44.6
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                                      coord: 448..483, score: 8.527949
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                                      coord: 312..346, score: 9.426776
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                                      coord: 413..447, score: 11.794416
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                                      coord: 110..144, score: 10.500983
None       No IPR available                                   PANTHER  PTHR24015         OS07G0578800 PROTEIN-RELATED                             coord: 31..591
None       No IPR available                                   PANTHER  PTHR24015:SF1725  TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN  coord: 31..591
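
The coordinate ranges listed for each signature can be merged to see how much of the protein a given source covers. The following Python sketch does this for the five GENE3D ranges listed above; the function name `covered_residues` is hypothetical, and the protein length of 591 residues is an assumption taken from the PANTHER range 31..591 rather than a value stated in the InterPro section.

```python
def covered_residues(intervals: list[tuple[int, int]]) -> int:
    """Count residues covered by at least one inclusive (start, end) range."""
    covered = 0
    last_end = 0  # highest residue already counted
    for start, end in sorted(intervals):
        start = max(start, last_end + 1)  # skip any part already counted
        if start <= end:
            covered += end - start + 1
            last_end = end
    return covered


# GENE3D 1.25.40.10 ranges copied from the InterPro rows above.
gene3d = [(461, 566), (371, 460), (262, 370), (166, 261), (18, 165)]

# Assumed protein length, inferred from the PANTHER range 31..591.
PROTEIN_LENGTH = 591

n = covered_residues(gene3d)
print(n)  # 549
print(f"{100.0 * n / PROTEIN_LENGTH:.1f}% of the assumed length")
```

Sorting the ranges first lets overlapping or nested ranges from other sources, such as the PFAM and PROSITE hits, be merged with the same function without double counting.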

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CmUC01G014220.1  CmUC01G014220.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009451      RNA modification
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003723      RNA binding