HG10004370 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10004370
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionpentatricopeptide repeat-containing protein At1g26460, mitochondrial
LocationChr08: 16373463 .. 16381929 (+)
RNA-Seq ExpressionHG10004370
SyntenyHG10004370
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTCCAAAATGGCGATTCTCTCTAGAACTCACACTCTAATCAGAACCACAAACCTCAACAATGTCTGCTTCTTCAAGCCCATTTCCACTTTCACATTTCTCTCTCAACAACCTCAGCTCGCTAATGAACCAGTGGACATTCCACCTTCAACTCCACTTCCTCCGAATCCCGCCTCTGGTAGCCCACTTTACAAGGAGAACTGGCGGAATCCAATCCCCAACTATTCTATGGCTCCGTCTCTGGTCCCCCTTGGTCTCCTTAGCCAATCCCCGAGCTCTCGCATTGAAGCATTGTCTCAAACGCTCGATGTTCAGAGCTTGTTGAACGTTTTTGCTGATTGGATGGCTTCCCAGCGCTGGGAAGATATGAAGCAGTTGTTTGAGTTTTGGATTCGGTCGTTGGATAAAGATGGCAAGCCTAATAAGCCAGATGTTAACTTGTATAACAATTATTTGAGGGCTAACTTGATGGTTAACGCCACGGCTGGGGAGCTTCTGGATATAGTGGCTCAAATGGAGGACTATGCGATCACACCCAACACTGCATCATACAATTTAGTATTAAAGGCGATGTACCAGGCTAGAGAGACCGAGGCTGCTGAAAAATTGATTGAAAGGTTAGATTTTGAATTTTATTCGTCATTCAATCTTCAACAAACATGTTGTTTTCTTTTACCTGTCAACTGCTTTTGTTCCGTGTGCCTCTAGATTAACATCTTTATGGAATTCCTATATTATTGCTTGTTCATTGGGAATTGAACGTTTAACGTTCTTTGGGTCTAGCATCGATCCTTATAGGAAGTTAGAATTCTTACGGATATAATTTAGACTGAATAGTTTCTAGACTAGGTCCTCCCCACAAAACTTGATGATAATAAAATGCTTCTGGGTGACAATAATGTTAGTTGTTTCAAGTGAATATCTGGAAAAACATTTCAATTTTCAGGGATCATTACTACTTTTAGTTATTGGTCTTGATTGTAATAATCTCTCAAGACTCTTGAGCGTTTTCTTACTAATCAATGTGACTGATAAGTCTTATTATTATCATTGTTGTTGTTGTTATTTTGAGTTCAGCATGGGGTGGAACACAACACCACTTTGCTATGCTCACTTTGACGATACAAGTCTTATTTACAAACTGACAAAGAAAACTAAGTGTAGTAAAACTGGAACACAACTAATTACAACAGCTTGTTAACTAATCAAGTAATAAAATAATGATTAGTTATCATCCCCCTCAAATGTATGCTTATAGATGCAAGAGTTTCCATCATTGAAACAGAAAACGAGCACTTGGATGTGGCCTAGTGAATATATCAGCATGCTGTTGAAGTAGTAGAAATGTATTGAAGTTCAATATCTTTACGAATGACATGCTCTCATATAAAGTGAAAATCAATTTCCACAGGCACATGCTTCGTTCAATTATGAAAAATCAGGTTGGGAGCTAGTTGAATTGCTGAAACATTATCACATAACAATGGTGTTTTAAGACTTAAAAATAAAATATGAAGATCTCATAAGATTTGCCTTAGCCATGAAAATTCTACGGCCATGGAAGCAAGAGCTTGATATTCAACCTCAATGGAGCTTGGAGATACTGTGTTTGCTTCTTAGCACTCCAAGATATAGGACTAGTACCTAAGAAATTGACAAATCCTGGAGTAAAGCACTTATCAATCAGGTCCAATAAGAGTTTGAATAGATATGTAAACGCAAAAAAACACCTTGACATCTGCAAAACAATAATCCAAGATCAGTAGTACCATTGAGGTATCTTATCTTAACACTCATTTGGCAGCAACTACGTGATTCTCATGAGTATGCATATGCTAAGATACTTTACATGACATATTAGGCTGGCTGAAGGTAAGATATTGCAAAACACCAATTAAAGCACGATAACTTTGAGCACCAGCAGAAGAGCAAAGAGCTCCACTATCAGAAGTTTGTAGTGGAAGAGGAGTGGAGCAAAACTTAGCACCAATCATGCCAAAATGTTTCACCATGTCCTCGGTAAATTGAGCTTGATGGACAAATATGCTAGCCTTTGTATAAGAGATCTCAAGGCCAAAAAGGTACTTTAAAACACCTAGATCAATCATGTCAAAATGAAGCTTAAAATAAGAGCACTGGCATAAGAAGGATCAATCCCAGTTAGAATGAGATCATCAACATACAACAGATAACAAATATGTAATCGAACCTCCATGACTTCATACAAAAAGAGAGGAATTGGCCAAAGAGGCTTGAAAGCCAAGGGTAAGCAAATAGGAAGTAAGACAATCAAACCAAGTACGAAGGACTTGTCATAAGCCTCTTCTTGAAGCTCTCCATGAAGAATCACATTTATAACTTAAGCTGACGTAGTTGTCAACCATATTAGGCAGCTGGGAAAGAATAACTTGTACTGTAGGCTTCTTAACCTTAGGGCTAAAACTCTCATCATAAACCACACCCTCTTTTTCAATGATACCCTTTATCAATGAGTCGTGCTTTGTATTGAGCAACACTACCATTCGGATTTATCCTAGTAAGATGCACCTATTTACACTCTATTGCAATCTTGCTTGGGGGTAAAGGAAGGAGAGACCGAGTATTCTGTTTCACAGTACCAACTTTGCTCATTGCTTGTTGCCAAACATCCAATTTTGAAGCCTCTGTAAAAGGGGAAGGTTCCTCTTGAATAGAATCAACAGTAACTGGAAAGATCTTCTTTGAAATACCAAGTTTACTGTTGGTTTGCATGATTGCATCAAATGAGCATTATGAGTAACATTGTTTAAAACACCGACAGGCTTAACGCAGACAATGGTACAATCACCATTTACCAAGAACATACTCAGCACTAACAACAAAAATAGATGGGGCTGCACATGGGGAGAAGCATTAGTTGTAGCATCACCAAATGTTTCTACAGATTGTGGTGTTGAATCACAGGATGAAAGAAAAAGTGGTGAGGGAAAAGAGGAAGGTGAATTAGAAACAAAGGAGGAAGTAACTTTAGGGGTAGCAAAAGGGAATACGTTGTCATAAAAAGGAACGTGACGAGATGTGTAAAGATGACCATTTCTGATGTTGTAACAAAGATAGCCTTTACAATCAAATGGATAACCAAGCAAGACAAATTGGATGTATTTAGGCTTAAGTTTGTAGGAGTTGTAAGACTTGAGAAGAGGATAACAAGCACAATCAAAGGTTTTTAGAAAAAGAATATTCAAGGGCCTTATGAAAGTAGCTCAAATGGAGATTTTTCACCAAGATTTGAAGAAGGAAGGGGATTAAGGACAAACATCTAGTAACTCCCTTTTCGCTATTTGTTGGATCCTTCTTCTACAAGCGAGTCAATCTTTTCTACTCTATGGAGGATTAAGATTTCAGAGAAAGTTAGGTTTTTTACTTGGCAAGTTCTACATGGGCGAGTGAACACTTTGGATCGGCTATCGAGGAGGATGCCTTTGTTAGTGGGCCTTTTTTGTTGTATTCTCTGTTGGGAGGTGGAGGAAGACCTAGATCACATCCTTTGGAGATGTGAGTTCGTGCATTCTGTTTGGAATTTGTCCTTCCAGACCTTTGGTTTAGTGCTTGCTTGTTAGACGGATAATAGTGCTATGATCGAGGAGTTCCTCCTTCATCAGCCTTTTTCTGAGAAAGGTTGCTTCTTGTGGTTGGCGGGGGTGTGTGCTATCATGTGGAATCTTTAGGGAGAAAGGAACAACCGAGTGTTTAGAGGGTTGGATAAGGAGCCTAGTGATTTTTGGAACCTTATTAAGTTTCATGTTTCCCTTTGAGCTTTGACTTTGAAGTTTTTTTGTAATTATTCTACATACACTATTTTGCTTGATTGGGCCTGTTTTTGGGGGCTTATTTATTTATTTATTCATTAATCTTTTAACGCCCTTGTATTCTTTCATTTTTTTTCTCAATGAAAGTGGTTGCATTTGCAGAAATAATAATAATAGTGATGATAATAATGAACACAACAGTGGAAAAAGGAAAAGGTAAAAAAATTAACAGGCAAGGAAGTTGTAGACGTAAGAGATAATGCAATTTCAACAACATGACAATGTTTTCTTCCAATGACACCATTTTGTTGGAGGTGTTAGGTAGTAGTACAATTTTTTTTTTTAATTTAAAATTATTATTATTATTATTTTTTAATTTTAATGGACAATACCATTGCTTTGTAGAAAAGATCCAAGAGAATGATTGATAACTACATGCACCCTCCATCTCTTATGAGAAATGAGAACATTGATTCTAGAGTTGAGAAGATTTTCAGTAAGATGTTTAAACAACAGATTTAAGAGTTGTAGGATAAAACCAAGTCTACATTGAATAGTCATCAACAGAAAAAAAACGTAGTATTCGATGCCATTTGTTGAAATTTCAGGAGTAGGGCCCCATACATCAACATGAACCAATGGTTACAAGGAAACAGAGGAAGAAATAGGATAGGGTAATCTATACATTTCGCCTTGTGCAGTGTTCACAAATACAAGAAGAAGAAGTTGTACAAATACAAAAATGTAAATTCAAAAGAACAATGAAGAATGTTGAGGGTGGGGGTACATGGATTGCCTAACCAGTTAGGACACGGTGAGAATGAAGACATCTTATTTCCAACGTGAGCAACAGTGGTAGTAGGGATTGAAAAGGAAACTGAAGTGGGGAAAAATGATATAAGTTGTTAGCATCAGGGCCTTGAAAGAGTATCTTGCTAAAAAATTTGTTCCGGAAAAAGAACTTATCATCATCAAAAACTATATAGCAATTATTATCAATATAGAGTTGATGAACAAAAAGAATATTGGATGAGATGTGGGGAACATGCAAGGGATTTTTCAGGAGGAGAGGATTTAGAAGTATGAAGAACACCAGAACCTGTATGGGCAGTTGACTAGATTTGACCACTATGAACTCCAACTTTCTCTTCACCTCAATTTTCTCTTTGGAAGCAACATTAACTTGATTGTAATTGGCTGTTATGTGGGCATTACAACCAAAATTTGTGAGCTAAGAAGAGTTGTAGTAGTAAGATGAGCTTGGTTGTGATTAGCAACCTTGGTTGCCAATTGGATAGTAGGATGGTGGCCGCAAAAACTATACTTCATGCGATTGTAACAGTCAAGAGCAGAATGCCCAGGCTTCCGACAAATTTGACAAACAATACACGAAGAATTTCCAGCATGGAGACCTTTTCTAAGAGTAGGACCAAGAGGAAGAATACCATGACTTCCTCTTTTAGTAGATTGATTCTTACTAGGAAAACCTCAACCACGACCCAAATTCATATTAAATTTTGGTTAAAAGGTCCATTGTCGCTCCAAATGATTCTCAGAGCAAAAGGCGCCGAGATAGTCAAAGAAGACATGGAGGCACCAAAACAGAATTAGGTTTAGGAAGGAAACCATCAACATAACCAAAAAAATTACGAGCTTTAAGTACGGATATCAAAACTTTTTACAGGACAAAATTTGTTCCGTCAAGATGAATCGAAATCAAGTTGTAGATATTGGAAGGAAGATGCTTCAAGGTAGAAGAGTCCGCCATTAAAGAAAGTAGCAAACGGAAGGAAACATACAAGAATTGCAAACTTATTCATAGACTGAAAGAAAACTAAGTGCAGTAACAGAATGGGAAAAACTGATACTCAACTACTTACCACAGCTTAGTAACTAATAAAATAATGATTAGCTATTATTGACCTTGTCCTTTTATTGGAGATTAACTGTCAGGATTTCAAACTACTGGGCAGGAACATGAACATGAGTTTAATTGTCATTGCAATTTATTGAGTTTTCTTTTTACCTGATTTACTATAAAGTGTACTTTTCTCTACAAGAACAATGGCCTTTAAACATCGTTGTCACAAGTTGTAGCCTTTGAACATGATTTTCTTTCCTGTTATCAGTATGTTGCAGACAGGAGAAGAGTCAATGCCTGATGATGAATCATATGATATAGTAATTGGAATGCTGTTGTCAACAGATCAGATGGATGCTGTCTTGAAATATATTGACTTGACTTTAAAATCAGGTCATATTCTTTCATTGAAAGTGTTCTCTGAATGTGTGCGGAGCTGTGTTGAAAAGGGCAGGCTGGACACGCTGGTCGCGGTAATTGATAGGTGCAAGGTAATAAATCAGTTCAATTGATGGCTAAGAAAAGAGGCTTTTTTTTTGTGGTTTTTAGTGCTTATTGCTTCATTACTTTTCCTGATGGTGTAACAGACAACTGTTCAGAACAAAGCTCTTTGCCCGCCGTGGAACTTGTGTAATTATATTGCTGAAGTAGCAACGCAAGAGGATAATAGCAAACTAGCTTATTATGCTCTAGAATTTATAGCCAAATGGATTGCTCGAGGTGAAAATGCAAGGCCTCCAGTTCATCTTTCAGTAGATGAAGGACTAGTTGTATCAGCTCTTGGAACTGCAGGTAGAACCCATAGCTCTTCTCTTTTAGATGCAGCTTGGGCAATTCTAAAGCGCTCATTGCGACAAAAGAAGGTTCCAAATCCTGAATCTTACCTCGGAAAGATTTATGCTCTTGCATCGTTTGGGGATCTGCAGAGGGCCTTTACTACCCTACATGAGTTTGAAGAGGCTTATAGGACTTTTGATGATGGAGCTGGTGAAGAAATGCTTTCTCCATTCACTTCTTTATATCCATTAGTTGTGGCATGCTCTAAAAAGGGTTTTGAGACCCTGGACACGGTATAACTTTTTACCAGTAAAAATTACCAATATGCAATTTCCATATTTCTATTTATAAGTAATCTAGTATAAAATATAATATGGTACTATGCATCTTTTATTGGTCGAGCAATGTATTTACAGTATTCAAATAACAAGAAATCATTAGGCCTTCTGCTAAGTTTTTATATATAACCACGACATGAAGTGTTCCAATAAAATTTAAACTTGTTAAAAAATTACTGTACAAACAACATATTCTTATTGTATTCTTATTGTTTGAGTTTGACTTGAGCAGTCTAGCTTGAACAATGAACATTCAGTATCAATATGAGTGGTGGGAGACAGGTGGTAAATACACTTTTAAAAATGGAATGTTAAAAACATATGGTATTTTCTATATCATTTCATCTTAAAGAATAATATTGTATTTTTCTCCTTACAAATCTGGGAGATAGCTTTCTTTGTCAGAGCCATATATGGGTGTTCATCAAGTAATAAATTTTGTTGTGGGAGGTCTAGAAGAATATTAACTATTACTATTACTATGATAAAAGGTCCCGTGAGATTAGTCGAGGTGTGATATGATGGCCTAGACATTCACGGATATTAAAAAATAATCAACTGAGGCAAGTAGAGTGCAATTGTCTCTCCCACAGGACATGAACACAAATGGTTATGTGATGTGTTTCATACTAGATATCTGGCATGCTTCCTTTGTTGAAAGATGATATTCAGTGACTAAATTAAGAAGTTCATGGTAGCTGAGTATTTTTGTTCCTTTCCTCTAATTCATCAGAGTTACCGATCCAATTTTCTCTCCTTTTATAGGTTTATTTTCAACTGGAAAACTTGAGCAGTGCAGATCCTCCTTACAAGTCTGTTGCTGCATTGAATTGTGTGATTTTAGGCTGTGCAAATATATGGGACCTTGATCGTGCATACCAAACGTTTGAAGCAATTGGTTCCAGTTTTGGTTTGACCCCTGATATCCATTCGTACAACGCTCTGATGTACGCATTTGGAAGGCTGAAGAAGGTCAGCCTTTAACCTTTTATACCACTCCCTTATTCACTCACAAAAATACACGTGCAATTAGCTGCTGAAATTCCATTGACTGTACTGATGGTTGGTGGCTGAAAAAAGATAATAATTGACAAGTATCATCAATGGGTTAGATTGCTATGCTTTTGTTGCAAGAGGGGACTGAATACGCGTAGTGAAGCCATACTACTCTACATGCACGCACTTCAGGGGTTCAATGTGTCTTAAGTGTTTCTAACGGATATCTCACATATTTCCATATAAACTGAAAATTTTGCTGACTTACTATGGTCACTTGGCAGACATTTGAAGCTGCAAGGGTGTTTGAACACTTGGTAGGTTTGGGCATCAAACCAAATGCAAGATCTTATTCATTGCTTGTTGACGCTCATCTCATTAACCGTGATCCCAAATCTGCTCTCTCTGCGATTGATGATATGGTAAAATTGTTGTCTCAATTGACTTGTATATGTTTGTATATTTGTGTTTCTTCTCATGGTGTTGCTAATTGCGTCACAAACATTTTTTGGAAGGTAACTGCTGGATTTGTACCTTCAAGAGAAATGCTGAAAAAGGTGAGAAGGCGATGCATTCGAGAGCAAGATTACGATAGTAATGATAGGGTGGATTACTTCGCCAAAATTTTCAGAATTCGAATGGGAACAGACAAACGTAGGGATATTCTGTTCATGCTGGACTACGGCACCGATTATGTTGCATAG

mRNA sequence

ATGGCGTCCAAAATGGCGATTCTCTCTAGAACTCACACTCTAATCAGAACCACAAACCTCAACAATGTCTGCTTCTTCAAGCCCATTTCCACTTTCACATTTCTCTCTCAACAACCTCAGCTCGCTAATGAACCAGTGGACATTCCACCTTCAACTCCACTTCCTCCGAATCCCGCCTCTGGTAGCCCACTTTACAAGGAGAACTGGCGGAATCCAATCCCCAACTATTCTATGGCTCCGTCTCTGGTCCCCCTTGGTCTCCTTAGCCAATCCCCGAGCTCTCGCATTGAAGCATTGTCTCAAACGCTCGATGTTCAGAGCTTGTTGAACGTTTTTGCTGATTGGATGGCTTCCCAGCGCTGGGAAGATATGAAGCAGTTGTTTGAGTTTTGGATTCGGTCGTTGGATAAAGATGGCAAGCCTAATAAGCCAGATGTTAACTTGTATAACAATTATTTGAGGGCTAACTTGATGGTTAACGCCACGGCTGGGGAGCTTCTGGATATAGTGGCTCAAATGGAGGACTATGCGATCACACCCAACACTGCATCATACAATTTAGTATTAAAGGCGATGTACCAGGCTAGAGAGACCGAGGCTGCTGAAAAATTGATTGAAAGTATGTTGCAGACAGGAGAAGAGTCAATGCCTGATGATGAATCATATGATATAGTAATTGGAATGCTGTTGTCAACAGATCAGATGGATGCTGTCTTGAAATATATTGACTTGACTTTAAAATCAGGTCATATTCTTTCATTGAAAGTGTTCTCTGAATGTGTGCGGAGCTGTGTTGAAAAGGGCAGGCTGGACACGCTGGTCGCGGTAATTGATAGGTGCAAGACAACTGTTCAGAACAAAGCTCTTTGCCCGCCGTGGAACTTGTGTAATTATATTGCTGAAGTAGCAACGCAAGAGGATAATAGCAAACTAGCTTATTATGCTCTAGAATTTATAGCCAAATGGATTGCTCGAGGTGAAAATGCAAGGCCTCCAGTTCATCTTTCAGTAGATGAAGGACTAGTTGTATCAGCTCTTGGAACTGCAGGTAGAACCCATAGCTCTTCTCTTTTAGATGCAGCTTGGGCAATTCTAAAGCGCTCATTGCGACAAAAGAAGGTTCCAAATCCTGAATCTTACCTCGGAAAGATTTATGCTCTTGCATCGTTTGGGGATCTGCAGAGGGCCTTTACTACCCTACATGAGTTTGAAGAGGCTTATAGGACTTTTGATGATGGAGCTGGTGAAGAAATGCTTTCTCCATTCACTTCTTTATATCCATTAGTTGTGGCATGCTCTAAAAAGGGTTTTGAGACCCTGGACACGGTTTATTTTCAACTGGAAAACTTGAGCAGTGCAGATCCTCCTTACAAGTCTGTTGCTGCATTGAATTGTGTGATTTTAGGCTGTGCAAATATATGGGACCTTGATCGTGCATACCAAACGTTTGAAGCAATTGGTTCCAGTTTTGGTTTGACCCCTGATATCCATTCGTACAACGCTCTGATGTACGCATTTGGAAGGCTGAAGAAGACATTTGAAGCTGCAAGGGTGTTTGAACACTTGGTAGGTTTGGGCATCAAACCAAATGCAAGATCTTATTCATTGCTTGTTGACGCTCATCTCATTAACCGTGATCCCAAATCTGCTCTCTCTGCGATTGATGATATGGTAACTGCTGGATTTGTACCTTCAAGAGAAATGCTGAAAAAGGTGAGAAGGCGATGCATTCGAGAGCAAGATTACGATAGTAATGATAGGGTGGATTACTTCGCCAAAATTTTCAGAATTCGAATGGGAACAGACAAACGTAGGGATATTCTGTTCATGCTGGACTACGGCACCGATTATGTTGCATAG

Coding sequence (CDS)

ATGGCGTCCAAAATGGCGATTCTCTCTAGAACTCACACTCTAATCAGAACCACAAACCTCAACAATGTCTGCTTCTTCAAGCCCATTTCCACTTTCACATTTCTCTCTCAACAACCTCAGCTCGCTAATGAACCAGTGGACATTCCACCTTCAACTCCACTTCCTCCGAATCCCGCCTCTGGTAGCCCACTTTACAAGGAGAACTGGCGGAATCCAATCCCCAACTATTCTATGGCTCCGTCTCTGGTCCCCCTTGGTCTCCTTAGCCAATCCCCGAGCTCTCGCATTGAAGCATTGTCTCAAACGCTCGATGTTCAGAGCTTGTTGAACGTTTTTGCTGATTGGATGGCTTCCCAGCGCTGGGAAGATATGAAGCAGTTGTTTGAGTTTTGGATTCGGTCGTTGGATAAAGATGGCAAGCCTAATAAGCCAGATGTTAACTTGTATAACAATTATTTGAGGGCTAACTTGATGGTTAACGCCACGGCTGGGGAGCTTCTGGATATAGTGGCTCAAATGGAGGACTATGCGATCACACCCAACACTGCATCATACAATTTAGTATTAAAGGCGATGTACCAGGCTAGAGAGACCGAGGCTGCTGAAAAATTGATTGAAAGTATGTTGCAGACAGGAGAAGAGTCAATGCCTGATGATGAATCATATGATATAGTAATTGGAATGCTGTTGTCAACAGATCAGATGGATGCTGTCTTGAAATATATTGACTTGACTTTAAAATCAGGTCATATTCTTTCATTGAAAGTGTTCTCTGAATGTGTGCGGAGCTGTGTTGAAAAGGGCAGGCTGGACACGCTGGTCGCGGTAATTGATAGGTGCAAGACAACTGTTCAGAACAAAGCTCTTTGCCCGCCGTGGAACTTGTGTAATTATATTGCTGAAGTAGCAACGCAAGAGGATAATAGCAAACTAGCTTATTATGCTCTAGAATTTATAGCCAAATGGATTGCTCGAGGTGAAAATGCAAGGCCTCCAGTTCATCTTTCAGTAGATGAAGGACTAGTTGTATCAGCTCTTGGAACTGCAGGTAGAACCCATAGCTCTTCTCTTTTAGATGCAGCTTGGGCAATTCTAAAGCGCTCATTGCGACAAAAGAAGGTTCCAAATCCTGAATCTTACCTCGGAAAGATTTATGCTCTTGCATCGTTTGGGGATCTGCAGAGGGCCTTTACTACCCTACATGAGTTTGAAGAGGCTTATAGGACTTTTGATGATGGAGCTGGTGAAGAAATGCTTTCTCCATTCACTTCTTTATATCCATTAGTTGTGGCATGCTCTAAAAAGGGTTTTGAGACCCTGGACACGGTTTATTTTCAACTGGAAAACTTGAGCAGTGCAGATCCTCCTTACAAGTCTGTTGCTGCATTGAATTGTGTGATTTTAGGCTGTGCAAATATATGGGACCTTGATCGTGCATACCAAACGTTTGAAGCAATTGGTTCCAGTTTTGGTTTGACCCCTGATATCCATTCGTACAACGCTCTGATGTACGCATTTGGAAGGCTGAAGAAGACATTTGAAGCTGCAAGGGTGTTTGAACACTTGGTAGGTTTGGGCATCAAACCAAATGCAAGATCTTATTCATTGCTTGTTGACGCTCATCTCATTAACCGTGATCCCAAATCTGCTCTCTCTGCGATTGATGATATGGTAACTGCTGGATTTGTACCTTCAAGAGAAATGCTGAAAAAGGTGAGAAGGCGATGCATTCGAGAGCAAGATTACGATAGTAATGATAGGGTGGATTACTTCGCCAAAATTTTCAGAATTCGAATGGGAACAGACAAACGTAGGGATATTCTGTTCATGCTGGACTACGGCACCGATTATGTTGCATAG

Protein sequence

MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPASGSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQRWEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITPNTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLKYIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIAEVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDAAWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLSPFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAYQTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDAHLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMGTDKRRDILFMLDYGTDYVA
Homology
BLAST of HG10004370 vs. NCBI nr
Match: XP_023546997.1 (pentatricopeptide repeat-containing protein At1g26460, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1122.5 bits (2902), Expect = 0.0e+00
Identity = 556/619 (89.82%), Postives = 592/619 (95.64%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRT TLIR +NLNNVCFFKPI+TFTFLSQ+PQLANEP DI PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTQTLIRNSNLNNVCFFKPITTFTFLSQEPQLANEPSDI-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN S   S++PLG L+QSPSSRIEALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNASRTLSMIPLGFLNQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           WEDMKQLFEFWIRSLDK+GKPNKPDVNLYNNYLRANLMVNA+AGELLD+VAQMEDYAI+P
Sbjct: 121 WEDMKQLFEFWIRSLDKNGKPNKPDVNLYNNYLRANLMVNASAGELLDLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           NTAS+NLVLKAMYQA+ETEAAEKLIE MLQTGEESMPDDESYD+VIGMLLSTDQ+DA LK
Sbjct: 181 NTASFNLVLKAMYQAKETEAAEKLIERMLQTGEESMPDDESYDLVIGMLLSTDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSGH+LSLKVF+ECVRSCV+KGRLDTLV+VIDRCK TVQNKALCP WNLCNYIA
Sbjct: 241 YIDLTLKSGHMLSLKVFTECVRSCVKKGRLDTLVSVIDRCKATVQNKALCPTWNLCNYIA 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVATQEDNSKLAYYALEF A+WIARGENARPPVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVATQEDNSKLAYYALEFTARWIARGENARPPVHLSVDEGIVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           +WAILKRSL QKKVPNPESYLGKIYALASFG+LQRAFTTL EFEEAYRT DDGAGEEM S
Sbjct: 361 SWAILKRSLGQKKVPNPESYLGKIYALASFGNLQRAFTTLREFEEAYRTSDDGAGEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLVVACSKKGFETLDTVYFQLENLS ADPPYKSV+ALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSRADPPYKSVSALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFEAIGSSFGLTPDIHSYNAL+YAFGRLKKTFEAARVFEHLVGLG+KPNA+SYSLLVDA
Sbjct: 481 QTFEAIGSSFGLTPDIHSYNALIYAFGRLKKTFEAARVFEHLVGLGVKPNAKSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           H+INRDPKSALS ID+MVTAGFVPS+EMLKKVRRRC+RE DY SND+VDY AK F+IRMG
Sbjct: 541 HIINRDPKSALSVIDNMVTAGFVPSKEMLKKVRRRCMREMDYPSNDKVDYLAKNFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ RRD+LF LDYGT+YVA
Sbjct: 601 TENRRDMLFNLDYGTNYVA 618

BLAST of HG10004370 vs. NCBI nr
Match: KAG7029573.1 (Pentatricopeptide repeat-containing protein, mitochondrial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1120.1 bits (2896), Expect = 0.0e+00
Identity = 553/619 (89.34%), Postives = 592/619 (95.64%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRT T+IR +NLNNVCFFKPI+TFTFLSQ+PQLANEP DI PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTQTIIRNSNLNNVCFFKPITTFTFLSQEPQLANEPADI-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN S   S++PLG L+QSPSSRIEALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNASRTLSMIPLGFLNQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           WEDMKQLFEFWIRSLDK+GKPNKPDVNLYNNYLRANLMVNA+AGELLD+VAQMEDYAI+P
Sbjct: 121 WEDMKQLFEFWIRSLDKNGKPNKPDVNLYNNYLRANLMVNASAGELLDLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           NTAS+NLVLKAMYQA+ET+AAEKLIE MLQTGEESMPDDESYD+VIGMLLSTDQ+DA LK
Sbjct: 181 NTASFNLVLKAMYQAKETQAAEKLIERMLQTGEESMPDDESYDLVIGMLLSTDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSGH+LSLKVF+ECVRSCV+KGRLDTLV+VIDRCK TVQNKALCP WNLCN+IA
Sbjct: 241 YIDLTLKSGHMLSLKVFTECVRSCVKKGRLDTLVSVIDRCKATVQNKALCPTWNLCNFIA 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVATQEDNSKLAYYALEF+A+WIARGENARPPVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVATQEDNSKLAYYALEFMARWIARGENARPPVHLSVDEGIVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           +WAILKRSL QKKVPNPESYLGKIYALASFG+LQRAFTTL EFEEAYRT DDGAGEEM S
Sbjct: 361 SWAILKRSLGQKKVPNPESYLGKIYALASFGNLQRAFTTLREFEEAYRTADDGAGEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLVVACSKKGFETLDTVYFQLENLS ADPPYKSV+ALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSRADPPYKSVSALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFEAIGSSFGLTPDIHSYNAL+YAFGRLKKTFEAARVFEHLVGLG+KPNA+SYSLLVDA
Sbjct: 481 QTFEAIGSSFGLTPDIHSYNALIYAFGRLKKTFEAARVFEHLVGLGVKPNAKSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           H+INRDPKSALS ID+MVTAGFVPS+EMLKK RRRC+RE DY SND+VDY AK F+IRMG
Sbjct: 541 HIINRDPKSALSVIDNMVTAGFVPSKEMLKKARRRCMREMDYPSNDKVDYLAKNFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ RRDILF LDYGT+YVA
Sbjct: 601 TENRRDILFNLDYGTNYVA 618

BLAST of HG10004370 vs. NCBI nr
Match: XP_023002111.1 (pentatricopeptide repeat-containing protein At1g26460, mitochondrial [Cucurbita maxima] >XP_023002112.1 pentatricopeptide repeat-containing protein At1g26460, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1119.8 bits (2895), Expect = 0.0e+00
Identity = 555/619 (89.66%), Postives = 588/619 (94.99%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRTHTLIR +NLNNVC FKPI+TFTFLSQ+P LANEP D+ PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTHTLIRNSNLNNVCSFKPITTFTFLSQEPHLANEPADV-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN SM  SL+PLG L+QSPSSRI+ALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNASMTQSLIPLGFLNQSPSSRIQALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           W+DMKQLFE WIRSLDK+GKPNKPDVNLYNNYLRANLMVNATAGELLD+VAQMEDYAI+P
Sbjct: 121 WDDMKQLFESWIRSLDKNGKPNKPDVNLYNNYLRANLMVNATAGELLDLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           N+AS+NLVLKAMYQARETEAAEKLIE MLQTG ESMPDDESYD+VIGMLLSTDQ+DA LK
Sbjct: 181 NSASFNLVLKAMYQARETEAAEKLIERMLQTGGESMPDDESYDLVIGMLLSTDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSGH+LSLKVF+ECVRSCV+KGRLDTLV+VIDRCK TVQNKALCP WNLCNYIA
Sbjct: 241 YIDLTLKSGHMLSLKVFTECVRSCVKKGRLDTLVSVIDRCKATVQNKALCPTWNLCNYIA 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVA QEDNSKLAYYALEF+A+WIARGENARPPVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVAMQEDNSKLAYYALEFMARWIARGENARPPVHLSVDEGIVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           AWAILKRSLRQKKVPNPESYLGKIYALASFG+LQRAFTTLHEFEEAYRT DDGA EEM S
Sbjct: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGNLQRAFTTLHEFEEAYRTSDDGAAEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLVVACSKKGFETLDTVYFQLENLS ADPPYKSVAALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSRADPPYKSVAALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFEAI SSFGLTPDIHSYNALMYAFG+LKKTFEAA+VFEHLVGLG+KPNA SYSLLVDA
Sbjct: 481 QTFEAISSSFGLTPDIHSYNALMYAFGKLKKTFEAAKVFEHLVGLGVKPNATSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           HLINRDPKSALS IDDMVTAGF PS++MLKKVRRRCIRE DYDSNDRVDY AK F+IRMG
Sbjct: 541 HLINRDPKSALSVIDDMVTAGFAPSKQMLKKVRRRCIREMDYDSNDRVDYQAKSFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ RRD+LF LD+GT YVA
Sbjct: 601 TENRRDMLFNLDFGTHYVA 618

BLAST of HG10004370 vs. NCBI nr
Match: XP_023537323.1 (pentatricopeptide repeat-containing protein At1g26460, mitochondrial [Cucurbita pepo subsp. pepo] >XP_023537324.1 pentatricopeptide repeat-containing protein At1g26460, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1119.4 bits (2894), Expect = 0.0e+00
Identity = 553/619 (89.34%), Postives = 589/619 (95.15%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRTHTLIR +NLNNVC FKPI+TFT+LSQ+PQ+ANEP D+ PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTHTLIRNSNLNNVCSFKPITTFTYLSQEPQVANEPADV-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN SM  SL+PLG L+QSPSSRI+ALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNASMTQSLIPLGFLNQSPSSRIQALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           W+DMKQLFE WIRSLDK+GKPNKPDVNLYNNYLRANLMVNATAGELLD+VAQMEDYAI+P
Sbjct: 121 WDDMKQLFESWIRSLDKNGKPNKPDVNLYNNYLRANLMVNATAGELLDLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           N+AS+NLVLKAMYQARETEAAEKLIE MLQTG ESMPDDESYD+VIGMLLSTDQ+DA LK
Sbjct: 181 NSASFNLVLKAMYQARETEAAEKLIERMLQTGGESMPDDESYDLVIGMLLSTDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSGH+LSLKVF+ECVRSCV+KGRLDTLV+VIDRCK TVQNKALCPPWNLCNYIA
Sbjct: 241 YIDLTLKSGHMLSLKVFTECVRSCVKKGRLDTLVSVIDRCKATVQNKALCPPWNLCNYIA 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVA QEDNSKLAYYALEF+A+WIARGENARPPVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVAMQEDNSKLAYYALEFMARWIARGENARPPVHLSVDEGIVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           AWAILKRSLRQKKVPNPESYLGKIYALASFG+LQRAFTTLHEFEEAYRT DDGA EEM S
Sbjct: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGNLQRAFTTLHEFEEAYRTSDDGAAEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLV+ACSKKGFETLDTVYFQLENLS ADPPYKSVAALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVMACSKKGFETLDTVYFQLENLSRADPPYKSVAALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFEAI SSFG TPDIHSYNALMYAFG+LKKTFEAARVFEHLVGLG+KPNA SYSLLVDA
Sbjct: 481 QTFEAISSSFGFTPDIHSYNALMYAFGKLKKTFEAARVFEHLVGLGVKPNATSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           HLINRDPKSALS IDDMV AGF PS++MLKKVRRRCIRE DYDSNDRVDY AK F+IRMG
Sbjct: 541 HLINRDPKSALSVIDDMVIAGFAPSKQMLKKVRRRCIREMDYDSNDRVDYQAKSFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ RRD+LF LD+GT+YVA
Sbjct: 601 TENRRDMLFNLDFGTNYVA 618

BLAST of HG10004370 vs. NCBI nr
Match: KAG7020167.1 (Pentatricopeptide repeat-containing protein, mitochondrial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1117.8 bits (2890), Expect = 0.0e+00
Identity = 553/619 (89.34%), Postives = 589/619 (95.15%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRTHTLIR +NLNNVC FKPI+TFT+LSQ+PQ+ANEP D+ PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTHTLIRNSNLNNVCSFKPITTFTYLSQEPQVANEPADV-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN SM  SL+PLG L+QSPSSRI+ALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNASMTQSLIPLGFLNQSPSSRIQALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           W+DMKQLFE WIRSLDK+GKPNKPDVNLYNNYLRANLMVNATAGELLD+VAQMEDYAI+P
Sbjct: 121 WDDMKQLFESWIRSLDKNGKPNKPDVNLYNNYLRANLMVNATAGELLDLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           N+AS+NLVLKAMYQARETEAAEKLIE MLQTG ESMPDDESYD+VIGMLLSTDQ+DA LK
Sbjct: 181 NSASFNLVLKAMYQARETEAAEKLIERMLQTGGESMPDDESYDLVIGMLLSTDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSGH+LSLKVF++CVRSCV+KGRLDTLV+VIDRCK TVQNKALCP WNLCNYIA
Sbjct: 241 YIDLTLKSGHMLSLKVFTDCVRSCVKKGRLDTLVSVIDRCKATVQNKALCPTWNLCNYIA 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVA QEDNSKLAYYALEF+A+WIARGENARPPVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVAMQEDNSKLAYYALEFMARWIARGENARPPVHLSVDEGIVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           AWAILKRSLRQKKVPNPESYLGKIYALASFG+LQRAFTTLHEFEEAYRT DDGA EEM S
Sbjct: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGNLQRAFTTLHEFEEAYRTSDDGAAEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLVVACSKKGFETLDTVYFQLENLS ADPPYKSVAALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSRADPPYKSVAALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFEAI SSFGLTPDIHSYNALMYAFG+LKKTFEAARVFEHLVGLG+KPNA SYSLLVDA
Sbjct: 481 QTFEAISSSFGLTPDIHSYNALMYAFGKLKKTFEAARVFEHLVGLGVKPNATSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           HLINRDPKSALS IDDMV AGF PS++MLKKVRRRCIRE DYDSNDRVDY AK F+IRMG
Sbjct: 541 HLINRDPKSALSVIDDMVIAGFAPSKQMLKKVRRRCIREMDYDSNDRVDYQAKSFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ RRD+LF LD+GT+YVA
Sbjct: 601 TENRRDMLFNLDFGTNYVA 618

BLAST of HG10004370 vs. ExPASy Swiss-Prot
Match: Q9FZD1 (Pentatricopeptide repeat-containing protein At1g26460, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g26460 PE=1 SV=1)

HSP 1 Score: 823.5 bits (2126), Expect = 1.5e-237
Identity = 414/623 (66.45%), Postives = 498/623 (79.94%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFT---FLSQQPQLANEPVDIPP------S 60
           MAS +   SR  +L++T   N      PI   +   FLSQ P LA E  D  P      S
Sbjct: 1   MASHLFTRSRI-SLLKTLKPNPFTSASPIRAISGTPFLSQDPLLATESTDHDPSNHQSTS 60

Query: 61  TPLPPNPASGSPLYKENWRNPIPNY-SMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLN 120
           TPLPPNPA+GSPLY+ENWR+PIPN  S   SLVPLG L+Q+P+ RI ALS+TLD+ SLLN
Sbjct: 61  TPLPPNPATGSPLYQENWRSPIPNTPSFNQSLVPLGFLNQAPAPRIRALSETLDMNSLLN 120

Query: 121 VFADWMASQRWEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIV 180
           +FADW ASQRW DMKQLFE W+RSLDK+GKPNKPDVNLYN+YLRANLM+ A+AG++LD+V
Sbjct: 121 MFADWTASQRWSDMKQLFEVWVRSLDKNGKPNKPDVNLYNHYLRANLMMGASAGDMLDLV 180

Query: 181 AQMEDYAITPNTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLL 240
           A ME++++ PNTASYNLVLKAMYQARETEAA KL+E ML  G++S+PDDESYD+VIGM  
Sbjct: 181 APMEEFSVEPNTASYNLVLKAMYQARETEAAMKLLERMLLLGKDSLPDDESYDLVIGMHF 240

Query: 241 STDQMDAVLKYIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALC 300
              + D  +K +D  LKSG++LS  VF+ECVRSCV KGR DTLV++I+RCK   +NK+LC
Sbjct: 241 GVGKNDEAMKVMDTALKSGYMLSTSVFTECVRSCVAKGRTDTLVSIIERCKAVDRNKSLC 300

Query: 301 PPWNLCNYIAEVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAG 360
           P W LCNYIAEVA QEDNSKLA+YA EF+ KWI RGE ARP V  SVDEGLVV+ L +A 
Sbjct: 301 PSWILCNYIAEVAIQEDNSKLAFYAFEFMFKWITRGEMARPSVIFSVDEGLVVAGLASAA 360

Query: 361 RTHSSSLLDAAWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTF 420
           RT SSSL++ +W ILK+SLR +K  NP SY+ KI A AS G+LQ+AFT+LHE E AY   
Sbjct: 361 RTCSSSLVEGSWTILKQSLRGRKAANPASYIAKINAYASLGNLQKAFTSLHELESAYADS 420

Query: 421 DDGAGEEMLSPFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGC 480
           +    EEMLSPFTSLYPLVVACSKKGFETLD VYFQLE+LS  D PYKSVAALNC+ILGC
Sbjct: 421 EKEVVEEMLSPFTSLYPLVVACSKKGFETLDEVYFQLESLSQGDTPYKSVAALNCIILGC 480

Query: 481 ANIWDLDRAYQTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPN 540
           AN WDLDRAYQTFEAI +SFGLTP+I SYNAL+YAFG++KKTFEA  VFEHLV +G+KP+
Sbjct: 481 ANTWDLDRAYQTFEAISASFGLTPNIDSYNALLYAFGKVKKTFEATNVFEHLVSIGVKPD 540

Query: 541 ARSYSLLVDAHLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDY 600
           +R+YSLLVDAHLINRDPKSAL+ +DDM+ AGF PSRE LKK+RRRC+RE D +++D+V+ 
Sbjct: 541 SRTYSLLVDAHLINRDPKSALTVVDDMIKAGFEPSRETLKKLRRRCVREMDDENDDQVEA 600

Query: 601 FAKIFRIRMGTDKRRDILFMLDY 614
            AK F+IRMG++ RR++LF +DY
Sbjct: 601 LAKKFQIRMGSENRRNMLFNIDY 622

BLAST of HG10004370 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 5.0e-10
Identity = 85/428 (19.86%), Postives = 167/428 (39.02%), Query Frame = 0

Query: 144 PDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITPNTASYNLVLKAMYQARETEAAEK 203
           P V  YN  L A +          ++  +M +  ++PN  +YN++++    A   + A  
Sbjct: 167 PGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALT 226

Query: 204 LIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLKYIDLTLKSGHILSLKVFSECVRS 263
           L + M   G   +P+  +Y+ +I       ++D   K +      G   +L  ++  +  
Sbjct: 227 LFDKMETKG--CLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVING 286

Query: 264 CVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIAEVATQEDNSKLAYYALE--FIAK 323
              +GR+  +  V+                N   Y  +  T   N+ +  Y  E  F   
Sbjct: 287 LCREGRMKEVSFVLTE-------------MNRRGYSLDEVTY--NTLIKGYCKEGNFHQA 346

Query: 324 WIARGENARPPVHLSVDEGLVVSALGTAGRTHS---SSLLDAAWAILKRSLRQKKVPNPE 383
            +   E  R         GL  S +      HS   +  ++ A   L +   +   PN  
Sbjct: 347 LVMHAEMLR--------HGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNER 406

Query: 384 SYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLSPFTSLYPLVV--ACSKKG 443
           +Y   +   +  G +  A+  L E        D+G      SP    Y  ++   C    
Sbjct: 407 TYTTLVDGFSQKGYMNEAYRVLREMN------DNG-----FSPSVVTYNALINGHCVTGK 466

Query: 444 FETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAYQTFEAIGSSFGLTPDI 503
            E    V   ++    +      V + + V+ G    +D+D A +    +    G+ PD 
Sbjct: 467 MEDAIAVLEDMKEKGLS----PDVVSYSTVLSGFCRSYDVDEALRVKREMVEK-GIKPDT 526

Query: 504 HSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDAHLINRDPKSALSAIDD 563
            +Y++L+  F   ++T EA  ++E ++ +G+ P+  +Y+ L++A+ +  D + AL   ++
Sbjct: 527 ITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNE 553

Query: 564 MVTAGFVP 565
           MV  G +P
Sbjct: 587 MVEKGVLP 553

BLAST of HG10004370 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 2.5e-09
Identity = 35/132 (26.52%), Postives = 66/132 (50.00%), Query Frame = 0

Query: 444  YFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAYQTFEAIGSSFGLTPDIHSYNALM 503
            YF+    S  +P    V   N +I G      L+ A   F  + +S G+TPD+++YN+L+
Sbjct: 983  YFKELKESGLNP---DVVCYNLIINGLGKSHRLEEALVLFNEMKTSRGITPDLYTYNSLI 1042

Query: 504  YAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDAHLINRDPKSALSAIDDMVTAGFV 563
               G      EA +++  +   G++PN  +++ L+  + ++  P+ A +    MVT GF 
Sbjct: 1043 LNLGIAGMVEEAGKIYNEIQRAGLEPNVFTFNALIRGYSLSGKPEHAYAVYQTMVTGGFS 1102

Query: 564  PSREMLKKVRRR 576
            P+    +++  R
Sbjct: 1103 PNTGTYEQLPNR 1111

BLAST of HG10004370 vs. ExPASy Swiss-Prot
Match: Q9M907 (Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX=3702 GN=At3g06920 PE=2 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 7.2e-09
Identity = 86/434 (19.82%), Postives = 162/434 (37.33%), Query Frame = 0

Query: 180 PNTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVL 239
           P+  +YN +L  + +  + + A K+ E M    +++ P+  +Y+I+I ML    ++D   
Sbjct: 341 PSVIAYNCILTCLRKMGKVDEALKVFEEM---KKDAAPNLSTYNILIDMLCRAGKLDTAF 400

Query: 240 KYIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYI 299
           +  D   K+G   +++  +  V    +  +LD   A+ +     V         +L + +
Sbjct: 401 ELRDSMQKAGLFPNVRTVNIMVDRLCKSQKLDEACAMFEEMDYKVCTPDEITFCSLIDGL 460

Query: 300 AEVATQEDNSKLAYYALE------------FIAKWIARG--------------ENARPPV 359
            +V   +D  K+    L+             I  +   G              +N  P +
Sbjct: 461 GKVGRVDDAYKVYEKMLDSDCRTNSIVYTSLIKNFFNHGRKEDGHKIYKDMINQNCSPDL 520

Query: 360 HLSVDEGLVVSALGTAGRTHSSSLLDAAWAILKRSLRQKKVPNPESYLGKIYALASFGDL 419
            L       +  +  AG            A+ +    ++ VP+  SY   I+ L   G  
Sbjct: 521 QLL---NTYMDCMFKAGEPEKGR------AMFEEIKARRFVPDARSYSILIHGLIKAGFA 580

Query: 420 QRAFTTLHEFEEAYRTFDDGAGEEMLSPF------TSLYPLVVACSKKGFE----TLDTV 479
              +   +  +E     D  A   ++  F         Y L+     KGFE    T  +V
Sbjct: 581 NETYELFYSMKEQGCVLDTRAYNIVIDGFCKCGKVNKAYQLLEEMKTKGFEPTVVTYGSV 640

Query: 480 YFQLENLSSADPPYK------------SVAALNCVILGCANIWDLDRAYQTFEAIGSSFG 539
              L  +   D  Y             +V   + +I G   +  +D AY   E +    G
Sbjct: 641 IDGLAKIDRLDEAYMLFEEAKSKRIELNVVIYSSLIDGFGKVGRIDEAYLILEELMQK-G 700

Query: 540 LTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDAHLINRDPKSAL 566
           LTP+++++N+L+ A  + ++  EA   F+ +  L   PN  +Y +L++     R    A 
Sbjct: 701 LTPNLYTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAF 760

BLAST of HG10004370 vs. ExPASy Swiss-Prot
Match: Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 1.4e-07
Identity = 54/226 (23.89%), Postives = 88/226 (38.94%), Query Frame = 0

Query: 358 LDAAWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEF-----EEAYRTFDD 417
           +D+A ++L+   +   VPN   Y   I++L+    +  A   L E           TF+D
Sbjct: 233 IDSALSLLRDMTKHGCVPNSVIYQTLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFND 292

Query: 418 G-AGEEMLSPFTSLYPLVVACSKKGFETLDTVYFQLEN----LSSADP--------PYKS 477
              G            +V     +GF   D  Y  L N    +   D         P   
Sbjct: 293 VILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIPKPE 352

Query: 478 VAALNCVILGCANIWDLDRAYQTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVF 537
           +   N +I G      LD A      + +S+G+ PD+ +YN+L+Y + +      A  V 
Sbjct: 353 IVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVL 412

Query: 538 EHLVGLGIKPNARSYSLLVDAHLINRDPKSALSAIDDMVTAGFVPS 566
             +   G KPN  SY++LVD          A + +++M   G  P+
Sbjct: 413 HDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPN 458

BLAST of HG10004370 vs. ExPASy TrEMBL
Match: A0A6J1KIJ9 (pentatricopeptide repeat-containing protein At1g26460, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111496072 PE=4 SV=1)

HSP 1 Score: 1119.8 bits (2895), Expect = 0.0e+00
Identity = 555/619 (89.66%), Postives = 588/619 (94.99%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRTHTLIR +NLNNVC FKPI+TFTFLSQ+P LANEP D+ PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTHTLIRNSNLNNVCSFKPITTFTFLSQEPHLANEPADV-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN SM  SL+PLG L+QSPSSRI+ALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNASMTQSLIPLGFLNQSPSSRIQALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           W+DMKQLFE WIRSLDK+GKPNKPDVNLYNNYLRANLMVNATAGELLD+VAQMEDYAI+P
Sbjct: 121 WDDMKQLFESWIRSLDKNGKPNKPDVNLYNNYLRANLMVNATAGELLDLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           N+AS+NLVLKAMYQARETEAAEKLIE MLQTG ESMPDDESYD+VIGMLLSTDQ+DA LK
Sbjct: 181 NSASFNLVLKAMYQARETEAAEKLIERMLQTGGESMPDDESYDLVIGMLLSTDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSGH+LSLKVF+ECVRSCV+KGRLDTLV+VIDRCK TVQNKALCP WNLCNYIA
Sbjct: 241 YIDLTLKSGHMLSLKVFTECVRSCVKKGRLDTLVSVIDRCKATVQNKALCPTWNLCNYIA 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVA QEDNSKLAYYALEF+A+WIARGENARPPVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVAMQEDNSKLAYYALEFMARWIARGENARPPVHLSVDEGIVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           AWAILKRSLRQKKVPNPESYLGKIYALASFG+LQRAFTTLHEFEEAYRT DDGA EEM S
Sbjct: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGNLQRAFTTLHEFEEAYRTSDDGAAEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLVVACSKKGFETLDTVYFQLENLS ADPPYKSVAALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSRADPPYKSVAALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFEAI SSFGLTPDIHSYNALMYAFG+LKKTFEAA+VFEHLVGLG+KPNA SYSLLVDA
Sbjct: 481 QTFEAISSSFGLTPDIHSYNALMYAFGKLKKTFEAAKVFEHLVGLGVKPNATSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           HLINRDPKSALS IDDMVTAGF PS++MLKKVRRRCIRE DYDSNDRVDY AK F+IRMG
Sbjct: 541 HLINRDPKSALSVIDDMVTAGFAPSKQMLKKVRRRCIREMDYDSNDRVDYQAKSFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ RRD+LF LD+GT YVA
Sbjct: 601 TENRRDMLFNLDFGTHYVA 618

BLAST of HG10004370 vs. ExPASy TrEMBL
Match: A0A6J1GHH4 (pentatricopeptide repeat-containing protein At1g26460, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111454227 PE=4 SV=1)

HSP 1 Score: 1117.4 bits (2889), Expect = 0.0e+00
Identity = 555/619 (89.66%), Postives = 588/619 (94.99%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRTHTLIR +NLNNVC FKPI+TFTFLSQ+ QLANEPVD+ PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTHTLIRNSNLNNVCSFKPITTFTFLSQESQLANEPVDV-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN SM  SL+PLG L+QSPSSRI+ALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNASMTQSLIPLGFLNQSPSSRIQALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           W+DMKQLFE WIRSLDK+GKPNKPDVNLYNNYLRANLMVNATAGELLD+VAQMEDYAI+P
Sbjct: 121 WDDMKQLFESWIRSLDKNGKPNKPDVNLYNNYLRANLMVNATAGELLDLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           N+AS+NLVLKAMYQARETEAAEKLIE MLQTG ESMPDDESYD+VIGMLLSTDQ+DA LK
Sbjct: 181 NSASFNLVLKAMYQARETEAAEKLIERMLQTGGESMPDDESYDLVIGMLLSTDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSGH+LSLKVF+ECVRSCV+KGRLDTLV+VIDRCK TVQNKALCP WNLCNYIA
Sbjct: 241 YIDLTLKSGHMLSLKVFTECVRSCVKKGRLDTLVSVIDRCKATVQNKALCPTWNLCNYIA 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVA QEDNSKLAYYALEF+A+WIARGENARPPVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVAMQEDNSKLAYYALEFMARWIARGENARPPVHLSVDEGIVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           AWAILKRSLRQKKVPNPESYLGKIYALASFG+LQRAFTTLHEFEEAYRT DDGA EEM S
Sbjct: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGNLQRAFTTLHEFEEAYRTSDDGAAEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLVVACSKKGFETLDTVYFQLENLS ADPPYKSVAALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSRADPPYKSVAALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFEAI SSFGLTPDIHSYNALMYAFG+LKKTFEAARVFEHLV LG+KPNA SYSLLVDA
Sbjct: 481 QTFEAISSSFGLTPDIHSYNALMYAFGKLKKTFEAARVFEHLVSLGVKPNATSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           HLINRDPKSALS IDDMV AGF PS++MLKKVRRRCIRE DYDSNDRVDY AK F+IRMG
Sbjct: 541 HLINRDPKSALSVIDDMVIAGFAPSKQMLKKVRRRCIREMDYDSNDRVDYQAKSFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ RRD+LF LD+GT+YVA
Sbjct: 601 TENRRDMLFNLDFGTNYVA 618

BLAST of HG10004370 vs. ExPASy TrEMBL
Match: A0A6J1HEP6 (pentatricopeptide repeat-containing protein At1g26460, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111462853 PE=4 SV=1)

HSP 1 Score: 1117.4 bits (2889), Expect = 0.0e+00
Identity = 553/619 (89.34%), Postives = 591/619 (95.48%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRT T+IR +NLNNVCFFKPI+TFTFLSQ+PQLANEP DI PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTQTIIRNSNLNNVCFFKPITTFTFLSQEPQLANEPADI-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN S   S++PLG L+QSPSSRIEALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNASRTLSMIPLGFLNQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           WEDMKQLFEFWIRSLDK+GKPNKPDVNLYNNYLRANLMVNA+AGELLD+VAQMEDYAI+P
Sbjct: 121 WEDMKQLFEFWIRSLDKNGKPNKPDVNLYNNYLRANLMVNASAGELLDLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           NTAS+NLVLKAMYQA+ETEAAEKLIE MLQTGEESMPDDESYD+VIGMLLSTDQ+DA LK
Sbjct: 181 NTASFNLVLKAMYQAKETEAAEKLIERMLQTGEESMPDDESYDLVIGMLLSTDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSGH+LSLKVF+ECVRSCV+KGRLDTLV+VIDRCK TVQNKALCP WNLCN+IA
Sbjct: 241 YIDLTLKSGHMLSLKVFTECVRSCVKKGRLDTLVSVIDRCKATVQNKALCPTWNLCNFIA 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVATQEDNSKLAYYALEF+A+WIARGENARPPVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVATQEDNSKLAYYALEFMARWIARGENARPPVHLSVDEGIVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           +WAILKRSL QKKVPN ESYLGKIYALASFG+LQRAFTTL EFEEAYRT DDGAGEEM S
Sbjct: 361 SWAILKRSLGQKKVPNLESYLGKIYALASFGNLQRAFTTLREFEEAYRTADDGAGEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLVVACSKKGFETLDTVYFQLENLS ADPPYKSV+ALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSRADPPYKSVSALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFEAIGSSFGLTPDIHSYNAL+YAFGRLKKTFEAARVFEHLVGLG+KPNA+SYSLLVDA
Sbjct: 481 QTFEAIGSSFGLTPDIHSYNALIYAFGRLKKTFEAARVFEHLVGLGVKPNAKSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           H+INRDPKSALS ID+MVTAGFVPS+EMLKK RRRC+RE DY SND+VDY AK F+IRMG
Sbjct: 541 HIINRDPKSALSVIDNMVTAGFVPSKEMLKKARRRCMREMDYPSNDKVDYLAKNFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ RRDILF LDYGT+YVA
Sbjct: 601 TENRRDILFNLDYGTNYVA 618

BLAST of HG10004370 vs. ExPASy TrEMBL
Match: A0A6J1BQT9 (pentatricopeptide repeat-containing protein At1g26460, mitochondrial OS=Momordica charantia OX=3673 GN=LOC111004896 PE=4 SV=1)

HSP 1 Score: 1115.5 bits (2884), Expect = 0.0e+00
Identity = 551/619 (89.01%), Postives = 590/619 (95.32%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRT TLIRT+NLNNVCFFKPI+TF FLSQ+PQLANEP D+ PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTGTLIRTSNLNNVCFFKPITTFAFLSQEPQLANEPPDL-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN S+ PSL+PLG L+QSPSSRI+ALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNSSLTPSLIPLGFLNQSPSSRIQALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           WEDMKQLFEFWIRSLDK+GKPNKPDV+LYNNYLRANLMVNATAGELLD+VAQMEDYAI+P
Sbjct: 121 WEDMKQLFEFWIRSLDKNGKPNKPDVSLYNNYLRANLMVNATAGELLDLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           NTAS+NLVLKAMY+ARETEAAEKLIE MLQ+GE+SMPDDESYD+VIGMLL+ DQ+DA LK
Sbjct: 181 NTASFNLVLKAMYRARETEAAEKLIERMLQSGEDSMPDDESYDLVIGMLLAMDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSG++LSLKVF+ECVRSC+ KGRLDTLV+VIDRCKTT QNKALCP WNLCNYIA
Sbjct: 241 YIDLTLKSGYMLSLKVFTECVRSCINKGRLDTLVSVIDRCKTTTQNKALCPTWNLCNYIA 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVA QEDNSKLAYYALEF+A+WIARGENARPPVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVAMQEDNSKLAYYALEFMARWIARGENARPPVHLSVDEGMVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           AWAILKRSLRQKKVPNPESYLGKIYALASFG+LQRAFTTLHEFEEAYR  DDGAGEEM S
Sbjct: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGNLQRAFTTLHEFEEAYRNSDDGAGEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLV+ACSKKGFETLDTVYFQLENLS ADPPYKSVAALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVMACSKKGFETLDTVYFQLENLSRADPPYKSVAALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFEAI SSFGLTPDIHSYNALMYAFGRLKKTFEA+RVFEHLV LG+KPNA SYSLLVDA
Sbjct: 481 QTFEAISSSFGLTPDIHSYNALMYAFGRLKKTFEASRVFEHLVSLGVKPNATSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           HLINRDPKSALSAIDDMV AGF PS+EMLKKVRRRCIRE DYD+NDRVDY AK F+IRMG
Sbjct: 541 HLINRDPKSALSAIDDMVIAGFSPSKEMLKKVRRRCIRELDYDTNDRVDYLAKNFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ R+D+LF LDYGT+YVA
Sbjct: 601 TENRKDMLFNLDYGTNYVA 618

BLAST of HG10004370 vs. ExPASy TrEMBL
Match: A0A6J1KCV2 (pentatricopeptide repeat-containing protein At1g26460, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111492112 PE=4 SV=1)

HSP 1 Score: 1112.4 bits (2876), Expect = 0.0e+00
Identity = 551/619 (89.01%), Postives = 589/619 (95.15%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFTFLSQQPQLANEPVDIPPSTPLPPNPAS 60
           MASKMAILSRT TLIR +NLNNVCFFKPI+TFTFLSQ+PQLANEP DI PSTPLPPNPAS
Sbjct: 1   MASKMAILSRTQTLIRNSNLNNVCFFKPITTFTFLSQEPQLANEPADI-PSTPLPPNPAS 60

Query: 61  GSPLYKENWRNPIPNYSMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120
           GSPLY ENWRNPIPN S   S++PLG L+QSPSSRIEALSQTLDVQSLLNVFADWMASQR
Sbjct: 61  GSPLYNENWRNPIPNASRTLSMIPLGFLNQSPSSRIEALSQTLDVQSLLNVFADWMASQR 120

Query: 121 WEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITP 180
           WEDMKQLFEFWIRSLDK+GKPNKPDVNLYNNYLRANLMVNA+AGELL++VAQMEDYAI+P
Sbjct: 121 WEDMKQLFEFWIRSLDKNGKPNKPDVNLYNNYLRANLMVNASAGELLNLVAQMEDYAISP 180

Query: 181 NTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLK 240
           NTAS+NLVLKAMYQA+ETEAAEKLIE MLQTGEESMPDDESYD+VIGMLLSTDQ+DA LK
Sbjct: 181 NTASFNLVLKAMYQAKETEAAEKLIERMLQTGEESMPDDESYDLVIGMLLSTDQIDAALK 240

Query: 241 YIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIA 300
           YIDLTLKSGHILSLKVF+ECVRSCV+KGRLDTLV+VIDRCK TVQNKALCP WNLCNYI 
Sbjct: 241 YIDLTLKSGHILSLKVFTECVRSCVKKGRLDTLVSVIDRCKATVQNKALCPTWNLCNYIV 300

Query: 301 EVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAGRTHSSSLLDA 360
           EVATQEDNSKLAYYALEF+A+WIARGENAR PVHLSVDEG+VVSALGTAGRT+SSSLLDA
Sbjct: 301 EVATQEDNSKLAYYALEFMARWIARGENARSPVHLSVDEGIVVSALGTAGRTYSSSLLDA 360

Query: 361 AWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLS 420
           +WAILKRSL QKKVPNPESYLGKIYALASFG+LQRAFTTL EFEEAYRT DDGAGEEM S
Sbjct: 361 SWAILKRSLGQKKVPNPESYLGKIYALASFGNLQRAFTTLREFEEAYRTSDDGAGEEMFS 420

Query: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAY 480
           PFTSLYPLVVACSKKGFETLDTVYFQLENLS ADPPYKSV+ALNCVILGCANIWDLDRAY
Sbjct: 421 PFTSLYPLVVACSKKGFETLDTVYFQLENLSRADPPYKSVSALNCVILGCANIWDLDRAY 480

Query: 481 QTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDA 540
           QTFE IGSSFGLTPDIHSYNAL+YAFGRLKKTF AARVFEHLVGLG+KPNA+SYSLLVDA
Sbjct: 481 QTFEEIGSSFGLTPDIHSYNALIYAFGRLKKTFAAARVFEHLVGLGVKPNAKSYSLLVDA 540

Query: 541 HLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDYFAKIFRIRMG 600
           H+INR+PKS+LS ID+MVTAGFVPS+EMLKKVRRRC+RE DY SND+VDY AK F+IRMG
Sbjct: 541 HIINRNPKSSLSVIDNMVTAGFVPSKEMLKKVRRRCMREMDYPSNDKVDYLAKNFKIRMG 600

Query: 601 TDKRRDILFMLDYGTDYVA 620
           T+ RRDILF LDYGT+YVA
Sbjct: 601 TENRRDILFNLDYGTNYVA 618

BLAST of HG10004370 vs. TAIR 10
Match: AT1G26460.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 823.5 bits (2126), Expect = 1.1e-238
Identity = 414/623 (66.45%), Postives = 498/623 (79.94%), Query Frame = 0

Query: 1   MASKMAILSRTHTLIRTTNLNNVCFFKPISTFT---FLSQQPQLANEPVDIPP------S 60
           MAS +   SR  +L++T   N      PI   +   FLSQ P LA E  D  P      S
Sbjct: 1   MASHLFTRSRI-SLLKTLKPNPFTSASPIRAISGTPFLSQDPLLATESTDHDPSNHQSTS 60

Query: 61  TPLPPNPASGSPLYKENWRNPIPNY-SMAPSLVPLGLLSQSPSSRIEALSQTLDVQSLLN 120
           TPLPPNPA+GSPLY+ENWR+PIPN  S   SLVPLG L+Q+P+ RI ALS+TLD+ SLLN
Sbjct: 61  TPLPPNPATGSPLYQENWRSPIPNTPSFNQSLVPLGFLNQAPAPRIRALSETLDMNSLLN 120

Query: 121 VFADWMASQRWEDMKQLFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMVNATAGELLDIV 180
           +FADW ASQRW DMKQLFE W+RSLDK+GKPNKPDVNLYN+YLRANLM+ A+AG++LD+V
Sbjct: 121 MFADWTASQRWSDMKQLFEVWVRSLDKNGKPNKPDVNLYNHYLRANLMMGASAGDMLDLV 180

Query: 181 AQMEDYAITPNTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLL 240
           A ME++++ PNTASYNLVLKAMYQARETEAA KL+E ML  G++S+PDDESYD+VIGM  
Sbjct: 181 APMEEFSVEPNTASYNLVLKAMYQARETEAAMKLLERMLLLGKDSLPDDESYDLVIGMHF 240

Query: 241 STDQMDAVLKYIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALC 300
              + D  +K +D  LKSG++LS  VF+ECVRSCV KGR DTLV++I+RCK   +NK+LC
Sbjct: 241 GVGKNDEAMKVMDTALKSGYMLSTSVFTECVRSCVAKGRTDTLVSIIERCKAVDRNKSLC 300

Query: 301 PPWNLCNYIAEVATQEDNSKLAYYALEFIAKWIARGENARPPVHLSVDEGLVVSALGTAG 360
           P W LCNYIAEVA QEDNSKLA+YA EF+ KWI RGE ARP V  SVDEGLVV+ L +A 
Sbjct: 301 PSWILCNYIAEVAIQEDNSKLAFYAFEFMFKWITRGEMARPSVIFSVDEGLVVAGLASAA 360

Query: 361 RTHSSSLLDAAWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEFEEAYRTF 420
           RT SSSL++ +W ILK+SLR +K  NP SY+ KI A AS G+LQ+AFT+LHE E AY   
Sbjct: 361 RTCSSSLVEGSWTILKQSLRGRKAANPASYIAKINAYASLGNLQKAFTSLHELESAYADS 420

Query: 421 DDGAGEEMLSPFTSLYPLVVACSKKGFETLDTVYFQLENLSSADPPYKSVAALNCVILGC 480
           +    EEMLSPFTSLYPLVVACSKKGFETLD VYFQLE+LS  D PYKSVAALNC+ILGC
Sbjct: 421 EKEVVEEMLSPFTSLYPLVVACSKKGFETLDEVYFQLESLSQGDTPYKSVAALNCIILGC 480

Query: 481 ANIWDLDRAYQTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPN 540
           AN WDLDRAYQTFEAI +SFGLTP+I SYNAL+YAFG++KKTFEA  VFEHLV +G+KP+
Sbjct: 481 ANTWDLDRAYQTFEAISASFGLTPNIDSYNALLYAFGKVKKTFEATNVFEHLVSIGVKPD 540

Query: 541 ARSYSLLVDAHLINRDPKSALSAIDDMVTAGFVPSREMLKKVRRRCIREQDYDSNDRVDY 600
           +R+YSLLVDAHLINRDPKSAL+ +DDM+ AGF PSRE LKK+RRRC+RE D +++D+V+ 
Sbjct: 541 SRTYSLLVDAHLINRDPKSALTVVDDMIKAGFEPSRETLKKLRRRCVREMDDENDDQVEA 600

Query: 601 FAKIFRIRMGTDKRRDILFMLDY 614
            AK F+IRMG++ RR++LF +DY
Sbjct: 601 LAKKFQIRMGSENRRNMLFNIDY 622

BLAST of HG10004370 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 67.8 bits (164), Expect = 3.5e-11
Identity = 85/428 (19.86%), Postives = 167/428 (39.02%), Query Frame = 0

Query: 144 PDVNLYNNYLRANLMVNATAGELLDIVAQMEDYAITPNTASYNLVLKAMYQARETEAAEK 203
           P V  YN  L A +          ++  +M +  ++PN  +YN++++    A   + A  
Sbjct: 167 PGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALT 226

Query: 204 LIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVLKYIDLTLKSGHILSLKVFSECVRS 263
           L + M   G   +P+  +Y+ +I       ++D   K +      G   +L  ++  +  
Sbjct: 227 LFDKMETKG--CLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVING 286

Query: 264 CVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYIAEVATQEDNSKLAYYALE--FIAK 323
              +GR+  +  V+                N   Y  +  T   N+ +  Y  E  F   
Sbjct: 287 LCREGRMKEVSFVLTE-------------MNRRGYSLDEVTY--NTLIKGYCKEGNFHQA 346

Query: 324 WIARGENARPPVHLSVDEGLVVSALGTAGRTHS---SSLLDAAWAILKRSLRQKKVPNPE 383
            +   E  R         GL  S +      HS   +  ++ A   L +   +   PN  
Sbjct: 347 LVMHAEMLR--------HGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNER 406

Query: 384 SYLGKIYALASFGDLQRAFTTLHEFEEAYRTFDDGAGEEMLSPFTSLYPLVV--ACSKKG 443
           +Y   +   +  G +  A+  L E        D+G      SP    Y  ++   C    
Sbjct: 407 TYTTLVDGFSQKGYMNEAYRVLREMN------DNG-----FSPSVVTYNALINGHCVTGK 466

Query: 444 FETLDTVYFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAYQTFEAIGSSFGLTPDI 503
            E    V   ++    +      V + + V+ G    +D+D A +    +    G+ PD 
Sbjct: 467 MEDAIAVLEDMKEKGLS----PDVVSYSTVLSGFCRSYDVDEALRVKREMVEK-GIKPDT 526

Query: 504 HSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDAHLINRDPKSALSAIDD 563
            +Y++L+  F   ++T EA  ++E ++ +G+ P+  +Y+ L++A+ +  D + AL   ++
Sbjct: 527 ITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNE 553

Query: 564 MVTAGFVP 565
           MV  G +P
Sbjct: 587 MVEKGVLP 553

BLAST of HG10004370 vs. TAIR 10
Match: AT4G31850.1 (proton gradient regulation 3 )

HSP 1 Score: 65.5 bits (158), Expect = 1.8e-10
Identity = 35/132 (26.52%), Postives = 66/132 (50.00%), Query Frame = 0

Query: 444  YFQLENLSSADPPYKSVAALNCVILGCANIWDLDRAYQTFEAIGSSFGLTPDIHSYNALM 503
            YF+    S  +P    V   N +I G      L+ A   F  + +S G+TPD+++YN+L+
Sbjct: 983  YFKELKESGLNP---DVVCYNLIINGLGKSHRLEEALVLFNEMKTSRGITPDLYTYNSLI 1042

Query: 504  YAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDAHLINRDPKSALSAIDDMVTAGFV 563
               G      EA +++  +   G++PN  +++ L+  + ++  P+ A +    MVT GF 
Sbjct: 1043 LNLGIAGMVEEAGKIYNEIQRAGLEPNVFTFNALIRGYSLSGKPEHAYAVYQTMVTGGFS 1102

Query: 564  PSREMLKKVRRR 576
            P+    +++  R
Sbjct: 1103 PNTGTYEQLPNR 1111

BLAST of HG10004370 vs. TAIR 10
Match: AT3G06920.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 63.9 bits (154), Expect = 5.1e-10
Identity = 86/434 (19.82%), Postives = 162/434 (37.33%), Query Frame = 0

Query: 180 PNTASYNLVLKAMYQARETEAAEKLIESMLQTGEESMPDDESYDIVIGMLLSTDQMDAVL 239
           P+  +YN +L  + +  + + A K+ E M    +++ P+  +Y+I+I ML    ++D   
Sbjct: 341 PSVIAYNCILTCLRKMGKVDEALKVFEEM---KKDAAPNLSTYNILIDMLCRAGKLDTAF 400

Query: 240 KYIDLTLKSGHILSLKVFSECVRSCVEKGRLDTLVAVIDRCKTTVQNKALCPPWNLCNYI 299
           +  D   K+G   +++  +  V    +  +LD   A+ +     V         +L + +
Sbjct: 401 ELRDSMQKAGLFPNVRTVNIMVDRLCKSQKLDEACAMFEEMDYKVCTPDEITFCSLIDGL 460

Query: 300 AEVATQEDNSKLAYYALE------------FIAKWIARG--------------ENARPPV 359
            +V   +D  K+    L+             I  +   G              +N  P +
Sbjct: 461 GKVGRVDDAYKVYEKMLDSDCRTNSIVYTSLIKNFFNHGRKEDGHKIYKDMINQNCSPDL 520

Query: 360 HLSVDEGLVVSALGTAGRTHSSSLLDAAWAILKRSLRQKKVPNPESYLGKIYALASFGDL 419
            L       +  +  AG            A+ +    ++ VP+  SY   I+ L   G  
Sbjct: 521 QLL---NTYMDCMFKAGEPEKGR------AMFEEIKARRFVPDARSYSILIHGLIKAGFA 580

Query: 420 QRAFTTLHEFEEAYRTFDDGAGEEMLSPF------TSLYPLVVACSKKGFE----TLDTV 479
              +   +  +E     D  A   ++  F         Y L+     KGFE    T  +V
Sbjct: 581 NETYELFYSMKEQGCVLDTRAYNIVIDGFCKCGKVNKAYQLLEEMKTKGFEPTVVTYGSV 640

Query: 480 YFQLENLSSADPPYK------------SVAALNCVILGCANIWDLDRAYQTFEAIGSSFG 539
              L  +   D  Y             +V   + +I G   +  +D AY   E +    G
Sbjct: 641 IDGLAKIDRLDEAYMLFEEAKSKRIELNVVIYSSLIDGFGKVGRIDEAYLILEELMQK-G 700

Query: 540 LTPDIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNARSYSLLVDAHLINRDPKSAL 566
           LTP+++++N+L+ A  + ++  EA   F+ +  L   PN  +Y +L++     R    A 
Sbjct: 701 LTPNLYTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAF 760

BLAST of HG10004370 vs. TAIR 10
Match: AT5G64320.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 59.7 bits (143), Expect = 9.6e-09
Identity = 54/226 (23.89%), Postives = 88/226 (38.94%), Query Frame = 0

Query: 358 LDAAWAILKRSLRQKKVPNPESYLGKIYALASFGDLQRAFTTLHEF-----EEAYRTFDD 417
           +D+A ++L+   +   VPN   Y   I++L+    +  A   L E           TF+D
Sbjct: 233 IDSALSLLRDMTKHGCVPNSVIYQTLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFND 292

Query: 418 G-AGEEMLSPFTSLYPLVVACSKKGFETLDTVYFQLEN----LSSADP--------PYKS 477
              G            +V     +GF   D  Y  L N    +   D         P   
Sbjct: 293 VILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIPKPE 352

Query: 478 VAALNCVILGCANIWDLDRAYQTFEAIGSSFGLTPDIHSYNALMYAFGRLKKTFEAARVF 537
           +   N +I G      LD A      + +S+G+ PD+ +YN+L+Y + +      A  V 
Sbjct: 353 IVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVL 412

Query: 538 EHLVGLGIKPNARSYSLLVDAHLINRDPKSALSAIDDMVTAGFVPS 566
             +   G KPN  SY++LVD          A + +++M   G  P+
Sbjct: 413 HDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPN 458

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_023546997.10.0e+0089.82pentatricopeptide repeat-containing protein At1g26460, mitochondrial-like [Cucur... [more]
KAG7029573.10.0e+0089.34Pentatricopeptide repeat-containing protein, mitochondrial [Cucurbita argyrosper... [more]
XP_023002111.10.0e+0089.66pentatricopeptide repeat-containing protein At1g26460, mitochondrial [Cucurbita ... [more]
XP_023537323.10.0e+0089.34pentatricopeptide repeat-containing protein At1g26460, mitochondrial [Cucurbita ... [more]
KAG7020167.10.0e+0089.34Pentatricopeptide repeat-containing protein, mitochondrial [Cucurbita argyrosper... [more]
Match NameE-valueIdentityDescription
Q9FZD11.5e-23766.45Pentatricopeptide repeat-containing protein At1g26460, mitochondrial OS=Arabidop... [more]
Q9FIX35.0e-1019.86Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Q9SZ522.5e-0926.52Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop... [more]
Q9M9077.2e-0919.82Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX... [more]
Q9FMF61.4e-0723.89Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1KIJ90.0e+0089.66pentatricopeptide repeat-containing protein At1g26460, mitochondrial OS=Cucurbit... [more]
A0A6J1GHH40.0e+0089.66pentatricopeptide repeat-containing protein At1g26460, mitochondrial OS=Cucurbit... [more]
A0A6J1HEP60.0e+0089.34pentatricopeptide repeat-containing protein At1g26460, mitochondrial-like OS=Cuc... [more]
A0A6J1BQT90.0e+0089.01pentatricopeptide repeat-containing protein At1g26460, mitochondrial OS=Momordic... [more]
A0A6J1KCV20.0e+0089.01pentatricopeptide repeat-containing protein At1g26460, mitochondrial-like OS=Cuc... [more]
Match NameE-valueIdentityDescription
AT1G26460.11.1e-23866.45Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G39710.13.5e-1119.86Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G31850.11.8e-1026.52proton gradient regulation 3 [more]
AT3G06920.15.1e-1019.82Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G64320.19.6e-0923.89Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 491..539
e-value: 3.8E-4
score: 20.5
coord: 143..192
e-value: 8.4E-4
score: 19.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 533..565
e-value: 0.0027
score: 15.8
coord: 497..531
e-value: 1.8E-5
score: 22.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 530..564
score: 8.977363
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 495..529
score: 10.731171
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 181..215
score: 9.032168
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 115..316
e-value: 4.1E-18
score: 67.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 446..609
e-value: 8.7E-21
score: 76.1
IPR044605Pentatricopeptide repeat-containing protein At1g26460-likePANTHERPTHR47205OS07G0599000 PROTEINcoord: 1..617

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10004370.1HG10004370.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding