HG10022172 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10022172
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr05: 21578192 .. 21580084 (+)
RNA-Seq ExpressionHG10022172
SyntenyHG10022172
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCATCGAACTTCAACTTCCAATCTTCCCAAACTCTCACAGTCCTTCTTCTTCCTCACTTCATTCCCCAAATCAACTTTCTCTTCTGCTTCATCCTCAACATTTCTGCAATCAGTTACTCAATCCGAAACTAAATCAATCGTCGTCAACCCTCTTTACCATTTTCTTCCACAAAACCAAAACCCCTTCAATATCGTCGAACTCGTCTCTTCGCGCCTTAAAACGAGCAACCCGCAGTTAGCTCTTCTTCAATCCGACATTAAAGTGCTTCTTCCCCACTTGGGTCATCGTGAAATCTCCAAGATTTTATTGAGGTGCCAATCTAATTTCGTCTCTGCTCTTACTTTTTTCAATTGGGTTAAATATGATTTGGATATTAGACTTAATTCTCACAATTATTGCTTAATTATCCATATATTGGCTTGGTCTCGACAATTTCCTTTAGCGATGAAATTTTTGTCTGAATTGATTGAATTGTCTAAGGATGTCTCAAGTAGTGAGGATGTTTTCCAGAATTTGGTGTTGTGCACTGAGCACTGTAATTGGAACCCAGTTATCTTTGAGATGCTAGTTAAGGCATATGTGAAAGTGGATATGATTCAAGAAAGTTATTGGAGCTTTAAGAAGATGGTGAAACTGGGTTTTGTCCCAAGTGTGATTGCTTGTAATTGTATTCTGAATGGACTGGCGAAGATGAAGTATGATGGCCAATGTTGGGAGCTCTATGAAGAGATGGGGAGGATTGGAGTTCACTCAAATGCATACACTTTTAATATTTTGACTTATGTTCTGTGTAGAGATGGGGATGTGAATAAGATTAATGAATTCTTGGAAAAGATGGAAGAAGAGGGCTTTGATCCCGACGTTGTGACTTACAATACTTTAATTGATAGCTATTGCAGAAGAGGAAGATTAGATGATGCATTTTACTTGTATAGGATAATGTATAGGAGGGGTGTGATGCCTGATCTTGTTTCATATACTTCCTTGATGAAGGGTCTTTGTAGGTTACGAAGGGTAAAAGAGGCCCATCAGCTATTTCATCGAATGATTGACCGAGGAATGGATCCAGATGTTGTGTCGTATAATACGCTAATTGGTGCATATTGTAAGAATGGAATGCTGCAAGAGGCAAGATCATTGCTACACGATATGATTGGACTTGGTATTCACCCAGATAGTTTCACTTGTAGGATTTTGGTGGAAGGACATGGAAGAGAAGGTAGATTGATCTCAGCTTTGAATTTGGTTGTGGAGCTTCAGAAACTTGGAGTCACTGTTGCTTATGACATCTACGAGTATCTTATCATCTCATTATGTCGGGAAGATCGTCCATTTGCAGCTAAGAGTCTTCTCGAAAGAATTATTAAAGACGGTTTTCAACCTGATTTCAATATCTATAATAAGCTGATTGAATCTTTCTGTAGAGGTGATAATGTGTCTGAGGCACTACTTCTGAAATTGGAAATGATAAACAGGAATTTCAAACCTACCATTGATACATATAAGTCTCTTATATGCTGTATGTCTGAAATCAATAGAAGTGTAGATGGTGAAAGTCTAATGGTGGAAATGGTTGAATCTGGAGTGCTTCCAGATCGTGAAATATGCAGGGCATTGATAAATGGATACTGCAAAGAAGGCAATGCTGATAAAGCAGAATCACTATTGGTCTCATTTGCTAAAGACTTTCAGTTCTTTGACTCTGAAAGTTTCAATGCCCTGGTTAAAGTTTACCACGATGTGGGTAACGAAACAAAGTTGATGGAGCTGCAAGATCGAATGATAAAAGCAGGTTTTCTTCCAAATAGCTTAACGTGTCGATACATTATCCATGGACTATGGAAATCTGCGAGACTCAACAAGCAGAGAGTTCACGCAGTAGCAGTATAA

mRNA sequence

ATGCATCGAACTTCAACTTCCAATCTTCCCAAACTCTCACAGTCCTTCTTCTTCCTCACTTCATTCCCCAAATCAACTTTCTCTTCTGCTTCATCCTCAACATTTCTGCAATCAGTTACTCAATCCGAAACTAAATCAATCGTCGTCAACCCTCTTTACCATTTTCTTCCACAAAACCAAAACCCCTTCAATATCGTCGAACTCGTCTCTTCGCGCCTTAAAACGAGCAACCCGCAGTTAGCTCTTCTTCAATCCGACATTAAAGTGCTTCTTCCCCACTTGGGTCATCGTGAAATCTCCAAGATTTTATTGAGGTGCCAATCTAATTTCGTCTCTGCTCTTACTTTTTTCAATTGGGTTAAATATGATTTGGATATTAGACTTAATTCTCACAATTATTGCTTAATTATCCATATATTGGCTTGGTCTCGACAATTTCCTTTAGCGATGAAATTTTTGTCTGAATTGATTGAATTGTCTAAGGATGTCTCAAGTAGTGAGGATGTTTTCCAGAATTTGGTGTTGTGCACTGAGCACTGTAATTGGAACCCAGTTATCTTTGAGATGCTAGTTAAGGCATATGTGAAAGTGGATATGATTCAAGAAAGTTATTGGAGCTTTAAGAAGATGGTGAAACTGGGTTTTGTCCCAAGTGTGATTGCTTGTAATTGTATTCTGAATGGACTGGCGAAGATGAAGTATGATGGCCAATGTTGGGAGCTCTATGAAGAGATGGGGAGGATTGGAGTTCACTCAAATGCATACACTTTTAATATTTTGACTTATGTTCTGTGTAGAGATGGGGATGTGAATAAGATTAATGAATTCTTGGAAAAGATGGAAGAAGAGGGCTTTGATCCCGACGTTGTGACTTACAATACTTTAATTGATAGCTATTGCAGAAGAGGAAGATTAGATGATGCATTTTACTTGTATAGGATAATGTATAGGAGGGGTGTGATGCCTGATCTTGTTTCATATACTTCCTTGATGAAGGGTCTTTGTAGGTTACGAAGGGTAAAAGAGGCCCATCAGCTATTTCATCGAATGATTGACCGAGGAATGGATCCAGATGTTGTGTCGTATAATACGCTAATTGGTGCATATTGTAAGAATGGAATGCTGCAAGAGGCAAGATCATTGCTACACGATATGATTGGACTTGGTATTCACCCAGATAGTTTCACTTGTAGGATTTTGGTGGAAGGACATGGAAGAGAAGGTAGATTGATCTCAGCTTTGAATTTGGTTGTGGAGCTTCAGAAACTTGGAGTCACTGTTGCTTATGACATCTACGAGTATCTTATCATCTCATTATGTCGGGAAGATCGTCCATTTGCAGCTAAGAGTCTTCTCGAAAGAATTATTAAAGACGGTTTTCAACCTGATTTCAATATCTATAATAAGCTGATTGAATCTTTCTGTAGAGGTGATAATGTGTCTGAGGCACTACTTCTGAAATTGGAAATGATAAACAGGAATTTCAAACCTACCATTGATACATATAAGTCTCTTATATGCTGTATGTCTGAAATCAATAGAAGTGTAGATGGTGAAAGTCTAATGGTGGAAATGGTTGAATCTGGAGTGCTTCCAGATCGTGAAATATGCAGGGCATTGATAAATGGATACTGCAAAGAAGGCAATGCTGATAAAGCAGAATCACTATTGGTCTCATTTGCTAAAGACTTTCAGTTCTTTGACTCTGAAAGTTTCAATGCCCTGGTTAAAGTTTACCACGATGTGGGTAACGAAACAAAGTTGATGGAGCTGCAAGATCGAATGATAAAAGCAGGTTTTCTTCCAAATAGCTTAACGTGTCGATACATTATCCATGGACTATGGAAATCTGCGAGACTCAACAAGCAGAGAGTTCACGCAGTAGCAGTATAA

Coding sequence (CDS)

ATGCATCGAACTTCAACTTCCAATCTTCCCAAACTCTCACAGTCCTTCTTCTTCCTCACTTCATTCCCCAAATCAACTTTCTCTTCTGCTTCATCCTCAACATTTCTGCAATCAGTTACTCAATCCGAAACTAAATCAATCGTCGTCAACCCTCTTTACCATTTTCTTCCACAAAACCAAAACCCCTTCAATATCGTCGAACTCGTCTCTTCGCGCCTTAAAACGAGCAACCCGCAGTTAGCTCTTCTTCAATCCGACATTAAAGTGCTTCTTCCCCACTTGGGTCATCGTGAAATCTCCAAGATTTTATTGAGGTGCCAATCTAATTTCGTCTCTGCTCTTACTTTTTTCAATTGGGTTAAATATGATTTGGATATTAGACTTAATTCTCACAATTATTGCTTAATTATCCATATATTGGCTTGGTCTCGACAATTTCCTTTAGCGATGAAATTTTTGTCTGAATTGATTGAATTGTCTAAGGATGTCTCAAGTAGTGAGGATGTTTTCCAGAATTTGGTGTTGTGCACTGAGCACTGTAATTGGAACCCAGTTATCTTTGAGATGCTAGTTAAGGCATATGTGAAAGTGGATATGATTCAAGAAAGTTATTGGAGCTTTAAGAAGATGGTGAAACTGGGTTTTGTCCCAAGTGTGATTGCTTGTAATTGTATTCTGAATGGACTGGCGAAGATGAAGTATGATGGCCAATGTTGGGAGCTCTATGAAGAGATGGGGAGGATTGGAGTTCACTCAAATGCATACACTTTTAATATTTTGACTTATGTTCTGTGTAGAGATGGGGATGTGAATAAGATTAATGAATTCTTGGAAAAGATGGAAGAAGAGGGCTTTGATCCCGACGTTGTGACTTACAATACTTTAATTGATAGCTATTGCAGAAGAGGAAGATTAGATGATGCATTTTACTTGTATAGGATAATGTATAGGAGGGGTGTGATGCCTGATCTTGTTTCATATACTTCCTTGATGAAGGGTCTTTGTAGGTTACGAAGGGTAAAAGAGGCCCATCAGCTATTTCATCGAATGATTGACCGAGGAATGGATCCAGATGTTGTGTCGTATAATACGCTAATTGGTGCATATTGTAAGAATGGAATGCTGCAAGAGGCAAGATCATTGCTACACGATATGATTGGACTTGGTATTCACCCAGATAGTTTCACTTGTAGGATTTTGGTGGAAGGACATGGAAGAGAAGGTAGATTGATCTCAGCTTTGAATTTGGTTGTGGAGCTTCAGAAACTTGGAGTCACTGTTGCTTATGACATCTACGAGTATCTTATCATCTCATTATGTCGGGAAGATCGTCCATTTGCAGCTAAGAGTCTTCTCGAAAGAATTATTAAAGACGGTTTTCAACCTGATTTCAATATCTATAATAAGCTGATTGAATCTTTCTGTAGAGGTGATAATGTGTCTGAGGCACTACTTCTGAAATTGGAAATGATAAACAGGAATTTCAAACCTACCATTGATACATATAAGTCTCTTATATGCTGTATGTCTGAAATCAATAGAAGTGTAGATGGTGAAAGTCTAATGGTGGAAATGGTTGAATCTGGAGTGCTTCCAGATCGTGAAATATGCAGGGCATTGATAAATGGATACTGCAAAGAAGGCAATGCTGATAAAGCAGAATCACTATTGGTCTCATTTGCTAAAGACTTTCAGTTCTTTGACTCTGAAAGTTTCAATGCCCTGGTTAAAGTTTACCACGATGTGGGTAACGAAACAAAGTTGATGGAGCTGCAAGATCGAATGATAAAAGCAGGTTTTCTTCCAAATAGCTTAACGTGTCGATACATTATCCATGGACTATGGAAATCTGCGAGACTCAACAAGCAGAGAGTTCACGCAGTAGCAGTATAA

Protein sequence

MHRTSTSNLPKLSQSFFFLTSFPKSTFSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQNPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWVKYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHCNWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVVSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVELQKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNVSEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRALINGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGFLPNSLTCRYIIHGLWKSARLNKQRVHAVAV
Homology
BLAST of HG10022172 vs. NCBI nr
Match: XP_038890312.1 (pentatricopeptide repeat-containing protein At5g40400 [Benincasa hispida] >XP_038890313.1 pentatricopeptide repeat-containing protein At5g40400 [Benincasa hispida])

HSP 1 Score: 1173.3 bits (3034), Expect = 0.0e+00
Identity = 578/630 (91.75%), Postives = 604/630 (95.87%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTFSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQ 60
           MHRTSTS LPKLSQS  FLTSFPKSTFSSASSSTFLQS+ QSETKSIVVNPLYHFLPQNQ
Sbjct: 1   MHRTSTSILPKLSQS--FLTSFPKSTFSSASSSTFLQSIPQSETKSIVVNPLYHFLPQNQ 60

Query: 61  NPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWV 120
           NPFNI++L+SS LKT NPQLALLQS+IK LLPHLG REISKILLRCQSNFVSAL FFNWV
Sbjct: 61  NPFNIIDLISSHLKTGNPQLALLQSEIKELLPHLGPREISKILLRCQSNFVSALAFFNWV 120

Query: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHC 180
           KYDLD+RLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVS+ EDVFQNLVLCTEHC
Sbjct: 121 KYDLDLRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSNREDVFQNLVLCTEHC 180

Query: 181 NWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWE 240
           NWNPVI EML+KAYVK+DMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYD +CWE
Sbjct: 181 NWNPVILEMLIKAYVKLDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDEKCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCRDGD+NKINEFLEKMEEEGF+PDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDMNKINEFLEKMEEEGFNPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVV 360
           RR RLD AFYLYRIMYRRGVMPDLVSYTSLMKGLCRL R++EAHQLFHRMIDRGMDPDVV
Sbjct: 301 RRERLDYAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLGRLREAHQLFHRMIDRGMDPDVV 360

Query: 361 SYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVEL 420
           SYNTLIGAYCK+G LQ+ARSLLHDMIG+GIHPDSFTCRILVEGHGREGRLISALNLVVEL
Sbjct: 361 SYNTLIGAYCKDGRLQDARSLLHDMIGIGIHPDSFTCRILVEGHGREGRLISALNLVVEL 420

Query: 421 QKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNV 480
           QKLGVT+AY+IYEYLIISLCREDRPFAAKSLLERIIK+GFQPD +IYNKLIESFCRG+NV
Sbjct: 421 QKLGVTIAYNIYEYLIISLCREDRPFAAKSLLERIIKEGFQPDSDIYNKLIESFCRGNNV 480

Query: 481 SEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRAL 540
           SEALLLKLEMINRNFKP+ID YKSLICCMSEINR+VDGE LMVEMVE GVLPD EICRAL
Sbjct: 481 SEALLLKLEMINRNFKPSIDAYKSLICCMSEINRTVDGEGLMVEMVEFGVLPDHEICRAL 540

Query: 541 INGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGF 600
           INGYCKEGNADKAESLLVSFAKDFQFF+SESFNALVKVY DVGNETKLMELQDRM+KAGF
Sbjct: 541 INGYCKEGNADKAESLLVSFAKDFQFFESESFNALVKVYQDVGNETKLMELQDRMLKAGF 600

Query: 601 LPNSLTCRYIIHGLWKSARLNKQRVHAVAV 631
           LPNSLTCRYIIHGLWKS R NKQRV AV V
Sbjct: 601 LPNSLTCRYIIHGLWKSVRFNKQRVQAVTV 628

BLAST of HG10022172 vs. NCBI nr
Match: XP_004142850.2 (pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >XP_011655402.1 pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >XP_011655403.1 pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >XP_011655404.1 pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >XP_011655405.1 pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >XP_011655406.1 pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >XP_031741426.1 pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >XP_031741427.1 pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >XP_031741428.1 pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >KGN51367.1 hypothetical protein Csa_008891 [Cucumis sativus])

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 561/637 (88.07%), Postives = 598/637 (93.88%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTF-------SSASSSTFLQSVTQSETKSIVVNPLY 60
           M RT+ SNLPKL+QSFFF TSF KST        SS+SSSTFLQS+ +SE K ++VNPLY
Sbjct: 1   MLRTTASNLPKLAQSFFFHTSFSKSTLSSSSSSSSSSSSSTFLQSIPESEAK-LIVNPLY 60

Query: 61  HFLPQNQNPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSA 120
           HFLPQNQNPFNIVELVSS LKT+NP+LALLQS IK L+PHLGHR+ISKILLRCQSNFVSA
Sbjct: 61  HFLPQNQNPFNIVELVSSHLKTNNPRLALLQSHIKELIPHLGHRQISKILLRCQSNFVSA 120

Query: 121 LTFFNWVKYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNL 180
           L FFNWVKYDLDIRL+SHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNL
Sbjct: 121 LAFFNWVKYDLDIRLSSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNL 180

Query: 181 VLCTEHCNWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMK 240
           VLCTEHCNWNPVIFEML+KAYVK+D+I ESYWSFKKMVKLGFVP+VIACNCILNGLAKMK
Sbjct: 181 VLCTEHCNWNPVIFEMLIKAYVKLDLIHESYWSFKKMVKLGFVPNVIACNCILNGLAKMK 240

Query: 241 YDGQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYN 300
            D QCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKIN FLEKMEEEGFDPDVVTYN
Sbjct: 241 SDAQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINGFLEKMEEEGFDPDVVTYN 300

Query: 301 TLIDSYCRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDR 360
           TLIDSY RRGRL+DAFYLY+IMYRRGVMPDLVSYTSLM+GLCRL RV+EAHQLFHRMIDR
Sbjct: 301 TLIDSYVRRGRLEDAFYLYKIMYRRGVMPDLVSYTSLMRGLCRLGRVREAHQLFHRMIDR 360

Query: 361 GMDPDVVSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISA 420
           GMDPDVV YNTLIGAYCK+GMLQEARSLLH+MIG+GIHPDSFTCRILVEG+GREGRLISA
Sbjct: 361 GMDPDVVLYNTLIGAYCKDGMLQEARSLLHEMIGIGIHPDSFTCRILVEGYGREGRLISA 420

Query: 421 LNLVVELQKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIES 480
           LNLVVE+QKLGVTVA+DIY+YLIISLCREDRPFAAKSLLERI++D FQPD +IYNKLIES
Sbjct: 421 LNLVVEIQKLGVTVAHDIYKYLIISLCREDRPFAAKSLLERILEDSFQPDSDIYNKLIES 480

Query: 481 FCRGDNVSEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPD 540
           FCR +NVSEALLLKLEMINRN+KPT DTYKSLI CM EINRSVDGE LMVEMVES V+PD
Sbjct: 481 FCRSNNVSEALLLKLEMINRNYKPTTDTYKSLIHCMCEINRSVDGEGLMVEMVESEVIPD 540

Query: 541 REICRALINGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQD 600
            EICRAL+NGYCKEGNADKAESLLVSFAKDFQFFDSESFN+LVKVY DVGNETKLMELQD
Sbjct: 541 HEICRALVNGYCKEGNADKAESLLVSFAKDFQFFDSESFNSLVKVYRDVGNETKLMELQD 600

Query: 601 RMIKAGFLPNSLTCRYIIHGLWKSARLNKQRVHAVAV 631
           RM+KAGFLPNSLTCRYIIHG+WKS RLNKQRV  VAV
Sbjct: 601 RMLKAGFLPNSLTCRYIIHGIWKSMRLNKQRVQTVAV 636

BLAST of HG10022172 vs. NCBI nr
Match: XP_008458797.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g40400 [Cucumis melo])

HSP 1 Score: 1130.5 bits (2923), Expect = 0.0e+00
Identity = 558/630 (88.57%), Postives = 596/630 (94.60%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTFSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQ 60
           M RT+ SNLPKLSQSFFFL+SF KST SS+SSSTFLQS+ +SE KSI VNPLYHFLPQNQ
Sbjct: 1   MLRTTASNLPKLSQSFFFLSSFSKSTLSSSSSSTFLQSIPESEAKSI-VNPLYHFLPQNQ 60

Query: 61  NPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWV 120
           NPFNIVELVS  LKT+NP+LALLQ++IK L+P+LGH +ISKILLRCQSNFVSAL FFNWV
Sbjct: 61  NPFNIVELVSLHLKTNNPRLALLQANIKGLIPYLGHCQISKILLRCQSNFVSALAFFNWV 120

Query: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHC 180
           KYDLDIRLNSHNYCLIIHILAWSRQFPLAMK LSELIELSKDVSSSEDVFQNLVLCTEHC
Sbjct: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKLLSELIELSKDVSSSEDVFQNLVLCTEHC 180

Query: 181 NWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWE 240
           NWNPVIFEML+KAYVK+D+I ESYWSFKKMVKLGFVPSVIACNCIL+GLAKMK DGQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKLDLIHESYWSFKKMVKLGFVPSVIACNCILHGLAKMKSDGQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVV 360
           RRGRLDDAFYLYRIMYRR VMPDLVSYTSLM+GLCRL RV+EAHQLFHRMIDRGMDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMYRRSVMPDLVSYTSLMRGLCRLGRVREAHQLFHRMIDRGMDPDVV 360

Query: 361 SYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVEL 420
           SYNTLIGAYCK+GMLQEARSLLHDMIG+GIHPD+FTCRILVEG+GREGRLISALNLVVE+
Sbjct: 361 SYNTLIGAYCKDGMLQEARSLLHDMIGIGIHPDNFTCRILVEGYGREGRLISALNLVVEI 420

Query: 421 QKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNV 480
           QKLGVT+A+DIY+YLIISLC+EDRPFAAKSLLERI++D FQPD +IYNKLIESFCR +NV
Sbjct: 421 QKLGVTIAHDIYKYLIISLCQEDRPFAAKSLLERILEDRFQPDSDIYNKLIESFCRSNNV 480

Query: 481 SEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRAL 540
           SEALLLK EMINRNFKPTI TYKSLI CM EINRSVDGE LM EMVES VLPD EICRAL
Sbjct: 481 SEALLLKSEMINRNFKPTIYTYKSLIHCMCEINRSVDGEGLMEEMVESEVLPDHEICRAL 540

Query: 541 INGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGF 600
           +NGYCKEGNADKAESLLVSFAKDFQFFDSESFN+LVKVY D+GNETKLMELQ RM+KAGF
Sbjct: 541 VNGYCKEGNADKAESLLVSFAKDFQFFDSESFNSLVKVYCDMGNETKLMELQTRMLKAGF 600

Query: 601 LPNSLTCRYIIHGLWKSARLNKQRVHAVAV 631
           LPN+LTC+YIIHGLWK  RLN+QRV AV V
Sbjct: 601 LPNNLTCQYIIHGLWKFTRLNEQRVQAVVV 629

BLAST of HG10022172 vs. NCBI nr
Match: KAA0037561.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK03111.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1129.4 bits (2920), Expect = 0.0e+00
Identity = 557/630 (88.41%), Postives = 596/630 (94.60%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTFSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQ 60
           M RT+ SNLPKLSQSFFFL+SF KST SS+SSSTFLQS+ +SE KSI VNPLYHFLPQNQ
Sbjct: 1   MLRTTASNLPKLSQSFFFLSSFSKSTLSSSSSSTFLQSIPESEAKSI-VNPLYHFLPQNQ 60

Query: 61  NPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWV 120
           NPFNIVELVS  LKT+NP+LALLQ++IK L+P+LGH +ISKILLRCQSNFVSAL FFNWV
Sbjct: 61  NPFNIVELVSLHLKTNNPRLALLQANIKGLIPYLGHCQISKILLRCQSNFVSALAFFNWV 120

Query: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHC 180
           KYDLDIRLNSHNYCLIIHILAWSRQFPLAMK LSELIELSKDVSSSEDVFQNLVLCTEHC
Sbjct: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKLLSELIELSKDVSSSEDVFQNLVLCTEHC 180

Query: 181 NWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWE 240
           NWNPVIFEML+KAYVK+D+I ESYWSFK+MVKLGFVPSVIACNCIL+GLAKMK DGQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKLDLIHESYWSFKRMVKLGFVPSVIACNCILHGLAKMKSDGQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVV 360
           RRGRLDDAFYLYRIMYRR VMPDLVSYTSLM+GLCRL RV+EAHQLFHRMIDRGMDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMYRRSVMPDLVSYTSLMRGLCRLGRVREAHQLFHRMIDRGMDPDVV 360

Query: 361 SYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVEL 420
           SYNTLIGAYCK+GMLQEARSLLHDMIG+GIHPD+FTCRILVEG+GREGRLISALNLVVE+
Sbjct: 361 SYNTLIGAYCKDGMLQEARSLLHDMIGIGIHPDNFTCRILVEGYGREGRLISALNLVVEI 420

Query: 421 QKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNV 480
           QKLGVT+A+DIY+YLIISLC+EDRPFAAKSLLERI++D FQPD +IYNKLIESFCR +NV
Sbjct: 421 QKLGVTIAHDIYKYLIISLCQEDRPFAAKSLLERILEDRFQPDSDIYNKLIESFCRSNNV 480

Query: 481 SEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRAL 540
           SEALLLK EMINRNFKPTI TYKSLI CM EINRSVDGE LM EMVES VLPD EICRAL
Sbjct: 481 SEALLLKSEMINRNFKPTIYTYKSLIHCMCEINRSVDGEGLMEEMVESEVLPDHEICRAL 540

Query: 541 INGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGF 600
           +NGYCKEGNADKAESLLVSFAKDFQFFDSESFN+LVKVY D+GNETKLMELQ RM+KAGF
Sbjct: 541 VNGYCKEGNADKAESLLVSFAKDFQFFDSESFNSLVKVYCDMGNETKLMELQTRMLKAGF 600

Query: 601 LPNSLTCRYIIHGLWKSARLNKQRVHAVAV 631
           LPN+LTC+YIIHGLWK  RLN+QRV AV V
Sbjct: 601 LPNNLTCQYIIHGLWKFTRLNEQRVQAVVV 629

BLAST of HG10022172 vs. NCBI nr
Match: XP_022134396.1 (pentatricopeptide repeat-containing protein At5g40400 [Momordica charantia] >XP_022134397.1 pentatricopeptide repeat-containing protein At5g40400 [Momordica charantia] >XP_022134398.1 pentatricopeptide repeat-containing protein At5g40400 [Momordica charantia])

HSP 1 Score: 1054.3 bits (2725), Expect = 4.2e-304
Identity = 514/628 (81.85%), Postives = 565/628 (89.97%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTFSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQ 60
           MHRTSTS+L KLS SFF L   PKS FSSASSSTFLQ + +S+ + ++VNPL+HFLPQNQ
Sbjct: 1   MHRTSTSSLAKLSPSFFCLAFVPKSAFSSASSSTFLQPIPESQAQ-LIVNPLFHFLPQNQ 60

Query: 61  NPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWV 120
           NPFNIVE+V S LKT NP+LA LQS+IK LLPHLGHRE+SK+LLRCQSNF SAL FFNWV
Sbjct: 61  NPFNIVEIVCSHLKTGNPELARLQSEIKELLPHLGHREVSKVLLRCQSNFGSALAFFNWV 120

Query: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHC 180
           KYDL++R NS NYCLIIHILAWSRQ PLAMKFLSELIELS+     EDVFQNLVLCTEHC
Sbjct: 121 KYDLNLRPNSKNYCLIIHILAWSRQLPLAMKFLSELIELSE-----EDVFQNLVLCTEHC 180

Query: 181 NWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWE 240
           NWNPVIFEML+KAY+KV MIQESYWSFKKMV+LGFVPSVIACNCILNGLAKMK DG CWE
Sbjct: 181 NWNPVIFEMLIKAYMKVGMIQESYWSFKKMVRLGFVPSVIACNCILNGLAKMKCDGHCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNK+NEFLEKMEEEGFDPD+VTYNTLI SYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKVNEFLEKMEEEGFDPDIVTYNTLIGSYC 300

Query: 301 RRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVV 360
           RRGRL+DAF+LYRIMYRRGVMPDLVSYTSLM GLC++ RV+EAHQ+FHRMIDRG+DPDVV
Sbjct: 301 RRGRLEDAFHLYRIMYRRGVMPDLVSYTSLMNGLCQIGRVREAHQIFHRMIDRGLDPDVV 360

Query: 361 SYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVEL 420
           SYN LI AYCK G L EAR LLHDMI +G++PDSFTCRI+VEG+GREGRLISALNLVVEL
Sbjct: 361 SYNMLISAYCKVGRLPEARLLLHDMIRIGLYPDSFTCRIMVEGYGREGRLISALNLVVEL 420

Query: 421 QKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNV 480
           QKLGV V  DIY+YLIISLC+EDRPFAAKSLLERII DGF+PD  IY KLIESFCRG++ 
Sbjct: 421 QKLGVAVTCDIYKYLIISLCQEDRPFAAKSLLERIINDGFEPDAGIYGKLIESFCRGNHA 480

Query: 481 SEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRAL 540
           SEALL+K EM NRNF+P +D YKSLICC+ EINRS DGE LMVEMVESG+ PD  ICRAL
Sbjct: 481 SEALLMKSEMENRNFEPGVDIYKSLICCLCEINRSGDGEGLMVEMVESGLPPDHIICRAL 540

Query: 541 INGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGF 600
           ING+CKEGNADKAESLL SFAK+FQFFD+ESFNAL+K YHDVGNETKLMELQDRM+KAGF
Sbjct: 541 INGHCKEGNADKAESLLASFAKEFQFFDTESFNALIKFYHDVGNETKLMELQDRMLKAGF 600

Query: 601 LPNSLTCRYIIHGLWKSARLNKQRVHAV 629
           +PNSLTCRY+IHGLWKSAR  K R+ AV
Sbjct: 601 VPNSLTCRYVIHGLWKSARFGKHRLQAV 622

BLAST of HG10022172 vs. ExPASy Swiss-Prot
Match: Q9FND8 (Pentatricopeptide repeat-containing protein At5g40400 OS=Arabidopsis thaliana OX=3702 GN=At5g40400 PE=2 SV=1)

HSP 1 Score: 631.7 bits (1628), Expect = 8.8e-180
Identity = 314/592 (53.04%), Postives = 431/592 (72.80%), Query Frame = 0

Query: 27  FSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQNPFNIVELVSSRLKTSNPQLAL--LQ 86
           FSS SSS   +    S     ++NPLY+ LPQ+QNP  IV+++ S L  S+  + L  L+
Sbjct: 11  FSSYSSSIVPRC---SNIPKPILNPLYNLLPQSQNPSKIVDVICSTLNHSDYSVLLPNLR 70

Query: 87  SDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWVKYDLDIRLNSHNYCLIIHILAWSR 146
            ++K L+PHLG+ EIS++LLR QS+   A+TFF WVK+DL  R N  NYCL++HIL  S+
Sbjct: 71  DEVKSLIPHLGYPEISRVLLRFQSDASRAITFFKWVKFDLGKRPNVGNYCLLLHILVSSK 130

Query: 147 QFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHCNWNPVIFEMLVKAYVKVDMIQESY 206
           +FPLAM+FL ELIEL+       DVF+ LV  T+ CNW+PV+F+MLVK Y+K+ +++E +
Sbjct: 131 KFPLAMQFLCELIELTSK-KEEVDVFRVLVSATDECNWDPVVFDMLVKGYLKLGLVEEGF 190

Query: 207 WSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWELYEEMGRIGVHSNAYTFNILTYVL 266
             F++++  GF  SV+ CN +LNGL K+     CW++Y  M R+G+H N YTFNILT V 
Sbjct: 191 RVFREVLDSGFSVSVVTCNHLLNGLLKLDLMEDCWQVYSVMCRVGIHPNTYTFNILTNVF 250

Query: 267 CRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMYRRGVMPDL 326
           C D +  ++++FLEKMEEEGF+PD+VTYNTL+ SYCRRGRL +AFYLY+IMYRR V+PDL
Sbjct: 251 CNDSNFREVDDFLEKMEEEGFEPDLVTYNTLVSSYCRRGRLKEAFYLYKIMYRRRVVPDL 310

Query: 327 VSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVVSYNTLIGAYCKNGMLQEARSLLHD 386
           V+YTSL+KGLC+  RV+EAHQ FHRM+DRG+ PD +SYNTLI AYCK GM+Q+++ LLH+
Sbjct: 311 VTYTSLIKGLCKDGRVREAHQTFHRMVDRGIKPDCMSYNTLIYAYCKEGMMQQSKKLLHE 370

Query: 387 MIGLGIHPDSFTCRILVEGHGREGRLISALNLVVELQKLGVTVAYDIYEYLIISLCREDR 446
           M+G  + PD FTC+++VEG  REGRL+SA+N VVEL++L V + +++ ++LI+SLC+E +
Sbjct: 371 MLGNSVVPDRFTCKVIVEGFVREGRLLSAVNFVVELRRLKVDIPFEVCDFLIVSLCQEGK 430

Query: 447 PFAAKSLLERII-KDGFQPDFNIYNKLIESFCRGDNVSEALLLKLEMINRNFKPTIDTYK 506
           PFAAK LL+RII ++G +     YN LIES  R D + EAL+LK ++ N+N      TY+
Sbjct: 431 PFAAKHLLDRIIEEEGHEAKPETYNNLIESLSRCDAIEEALVLKGKLKNQNQVLDAKTYR 490

Query: 507 SLICCMSEINRSVDGESLMVEMVESGVLPDREICRALINGYCKEGNADKAESLLVSFAKD 566
           +LI C+  I R+ + ESLM EM +S V PD  IC AL+ GYCKE + DKAE LL  FA +
Sbjct: 491 ALIGCLCRIGRNREAESLMAEMFDSEVKPDSFICGALVYGYCKELDFDKAERLLSLFAME 550

Query: 567 FQFFDSESFNALVKVYHDVG-NETKLMELQDRMIKAGFLPNSLTCRYIIHGL 615
           F+ FD ES+N+LVK   + G    K +ELQ+RM + GF+PN LTC+Y+I  L
Sbjct: 551 FRIFDPESYNSLVKAVCETGCGYKKALELQERMQRLGFVPNRLTCKYLIQVL 598

BLAST of HG10022172 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 248.4 bits (633), Expect = 2.1e-64
Identity = 153/507 (30.18%), Postives = 243/507 (47.93%), Query Frame = 0

Query: 113 ALTFFNWVKYDLDIRLNS--HNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVF 172
           AL F  WV     +  +      C+  HIL  +R +  A   L EL  +S     S  VF
Sbjct: 53  ALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMS---GKSSFVF 112

Query: 173 QNLVLCTEHCNWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLA 232
             L+     CN NP ++++L++ Y++  MIQ+S   F+ M   GF PSV  CN IL  + 
Sbjct: 113 GALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVV 172

Query: 233 KMKYDGQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVV 292
           K   D   W   +EM +  +  +  TFNIL  VLC +G   K +  ++KME+ G+ P +V
Sbjct: 173 KSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIV 232

Query: 293 TYNTLIDSYCRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRM 352
           TYNT++  YC++GR   A  L   M  +GV  D+ +Y  L+  LCR  R+ + + L   M
Sbjct: 233 TYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDM 292

Query: 353 IDRGMDPDVVSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRL 412
             R + P+ V+YNTLI  +   G +  A  LL++M+  G+ P+  T   L++GH  EG  
Sbjct: 293 RKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNF 352

Query: 413 ISALNLVVELQKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKL 472
             AL +   ++  G+T +   Y  L+  LC+      A+    R+ ++G       Y  +
Sbjct: 353 KEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGM 412

Query: 473 IESFCRGDNVSEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGV 532
           I+  C+   + EA++L  EM      P I TY +LI    ++ R    + ++  +   G+
Sbjct: 413 IDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGL 472

Query: 533 LPDREICRALINGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLME 592
            P+  I   LI   C+ G   +A  +  +   +    D  +FN LV      G   +  E
Sbjct: 473 SPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEE 532

Query: 593 LQDRMIKAGFLPNSLTCRYIIHGLWKS 618
               M   G LPN+++   +I+G   S
Sbjct: 533 FMRCMTSDGILPNTVSFDCLINGYGNS 556

BLAST of HG10022172 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 238.0 bits (606), Expect = 2.8e-61
Identity = 153/535 (28.60%), Postives = 264/535 (49.35%), Query Frame = 0

Query: 100 SKILLRCQSNFVSALTFFNWVKYDLDIRLNSHNY------CLIIHILAWSRQFPLAMKFL 159
           S +LL+ Q++    L F NW         N H +      C+ +HIL   + +  A    
Sbjct: 52  SNLLLKSQNDQALILKFLNWA--------NPHQFFTLRCKCITLHILTKFKLYKTAQILA 111

Query: 160 SELIELSKDVSSSEDVFQNLVLCTEHCNWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKL 219
            ++   + D   +  VF++L    + C     +F+++VK+Y ++ +I ++          
Sbjct: 112 EDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAH 171

Query: 220 GFVPSVIACNCILNGLAKMKYDGQCWE-LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNK 279
           GF+P V++ N +L+   + K +    E +++EM    V  N +T+NIL    C  G+++ 
Sbjct: 172 GFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDV 231

Query: 280 INEFLEKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMK 339
                +KME +G  P+VVTYNTLID YC+  ++DD F L R M  +G+ P+L+SY  ++ 
Sbjct: 232 ALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVIN 291

Query: 340 GLCRLRRVKEAHQLFHRMIDRGMDPDVVSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHP 399
           GLCR  R+KE   +   M  RG   D V+YNTLI  YCK G   +A  +  +M+  G+ P
Sbjct: 292 GLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTP 351

Query: 400 DSFTCRILVEGHGREGRLISALNLVVELQKLGVTVAYDIYEYLIISLCREDRPFAAKSLL 459
              T   L+    + G +  A+  + +++  G+      Y  L+    ++     A  +L
Sbjct: 352 SVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVL 411

Query: 460 ERIIKDGFQPDFNIYNKLIESFCRGDNVSEALLLKLEMINRNFKPTIDTYKSLICCMSEI 519
             +  +GF P    YN LI   C    + +A+ +  +M  +   P + +Y ++   +S  
Sbjct: 412 REMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTV---LSGF 471

Query: 520 NRSVD-GESLMV--EMVESGVLPDREICRALINGYCKEGNADKAESLLVSFAKDFQFFDS 579
            RS D  E+L V  EMVE G+ PD     +LI G+C++    +A  L     +     D 
Sbjct: 472 CRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDE 531

Query: 580 ESFNALVKVYHDVGNETKLMELQDRMIKAGFLPNSLTCRYIIHGLWKSARLNKQR 625
            ++ AL+  Y   G+  K ++L + M++ G LP+ +T   +I+GL K +R  + +
Sbjct: 532 FTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAK 575

BLAST of HG10022172 vs. ExPASy Swiss-Prot
Match: Q9LFC5 (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX=3702 GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 232.6 bits (592), Expect = 1.2e-59
Identity = 146/554 (26.35%), Postives = 259/554 (46.75%), Query Frame = 0

Query: 65  IVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWVKYDL 124
           +VE +   LK  N       ++++  L  L    + ++L RC+++      F + + +  
Sbjct: 54  LVEKICFSLKQGN-------NNVRNHLIRLNPLAVVEVLYRCRNDLTLGQRFVDQLGFHF 113

Query: 125 -DIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHCNWN 184
            + +  S +   +IHIL  S +   A   L  +I  S    S  ++  +L     +C  N
Sbjct: 114 PNFKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSG--VSRLEIVNSLDSTFSNCGSN 173

Query: 185 PVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWELYE 244
             +F++L++ YV+   ++E++ +F  +   GF  S+ ACN ++  L ++ +    W +Y+
Sbjct: 174 DSVFDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQ 233

Query: 245 EMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYCRRG 304
           E+ R GV  N YT NI+   LC+DG + K+  FL +++E+G  PD+VTYNTLI +Y  +G
Sbjct: 234 EISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKG 293

Query: 305 RLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVVSYN 364
            +++AF L   M  +G  P + +Y +++ GLC+  + + A ++F  M+  G+ PD  +Y 
Sbjct: 294 LMEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYR 353

Query: 365 TLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVELQKL 424
           +L+   CK G + E   +  DM    + PD      ++    R G L  AL     +++ 
Sbjct: 354 SLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEA 413

Query: 425 GVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNVSEA 484
           G+     IY  LI   CR+     A +L   +++ G   D   YN ++   C+   + EA
Sbjct: 414 GLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEA 473

Query: 485 LLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRALING 544
             L  EM  R   P   T   LI    ++    +   L  +M E  +  D      L++G
Sbjct: 474 DKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDG 533

Query: 545 YCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGFLPN 604
           + K G+ D A+ +              S++ LV      G+  +   + D MI     P 
Sbjct: 534 FGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPT 593

Query: 605 SLTCRYIIHGLWKS 618
            + C  +I G  +S
Sbjct: 594 VMICNSMIKGYCRS 598

BLAST of HG10022172 vs. ExPASy Swiss-Prot
Match: Q9ZQF1 (Pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g15630 PE=3 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 1.6e-59
Identity = 160/565 (28.32%), Postives = 268/565 (47.43%), Query Frame = 0

Query: 69  VSSRLKTSNPQLALLQSDIKVLLP-------HLGHREISKILLRCQSNFVSALT-----F 128
           +SS  +TS P+  L     ++LL        H+      K+     S  + +L       
Sbjct: 29  LSSLAQTSTPESVLPPITSEILLESIRSSQWHIVEHVADKLTPSLVSTTLLSLVKTPNLA 88

Query: 129 FNWVKYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLC 188
           FN+V +    RL+    CL I +++         + L E++   K  +S  ++F  LVL 
Sbjct: 89  FNFVNHIDLYRLDFQTQCLAIAVISKLSSPKPVTQLLKEVVTSRK--NSIRNLFDELVLA 148

Query: 189 TEHCNW-NPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYD 248
            +     + ++F++LV+   ++ M+ E+   F  M + GF P    CN IL  L+++   
Sbjct: 149 HDRLETKSTILFDLLVRCCCQLRMVDEAIECFYLMKEKGFYPKTETCNHILTLLSRLNRI 208

Query: 249 GQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTL 308
              W  Y +M R+ + SN YTFNI+  VLC++G + K   FL  ME  G  P +VTYNTL
Sbjct: 209 ENAWVFYADMYRMEIKSNVYTFNIMINVLCKEGKLKKAKGFLGIMEVFGIKPTIVTYNTL 268

Query: 309 IDSYCRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGM 368
           +  +  RGR++ A  +   M  +G  PD+ +Y  ++  +C   R   A ++   M + G+
Sbjct: 269 VQGFSLRGRIEGARLIISEMKSKGFQPDMQTYNPILSWMCNEGR---ASEVLREMKEIGL 328

Query: 369 DPDVVSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALN 428
            PD VSYN LI     NG L+ A +   +M+  G+ P  +T   L+ G   E ++ +A  
Sbjct: 329 VPDSVSYNILIRGCSNNGDLEMAFAYRDEMVKQGMVPTFYTYNTLIHGLFMENKIEAAEI 388

Query: 429 LVVELQKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFC 488
           L+ E+++ G+ +    Y  LI   C+      A +L + ++ DG QP    Y  LI   C
Sbjct: 389 LIREIREKGIVLDSVTYNILINGYCQHGDAKKAFALHDEMMTDGIQPTQFTYTSLIYVLC 448

Query: 489 RGDNVSEALLLKLEMINRNFKPTIDTYKSLI---CCMSEINRSVDGESLMVEMVESGVLP 548
           R +   EA  L  +++ +  KP +    +L+   C +  ++R+    SL+ EM    + P
Sbjct: 449 RKNKTREADELFEKVVGKGMKPDLVMMNTLMDGHCAIGNMDRAF---SLLKEMDMMSINP 508

Query: 549 DREICRALINGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQ 608
           D      L+ G C EG  ++A  L+    +     D  S+N L+  Y   G+      ++
Sbjct: 509 DDVTYNCLMRGLCGEGKFEEARELMGEMKRRGIKPDHISYNTLISGYSKKGDTKHAFMVR 568

Query: 609 DRMIKAGFLPNSLTCRYIIHGLWKS 618
           D M+  GF P  LT   ++ GL K+
Sbjct: 569 DEMLSLGFNPTLLTYNALLKGLSKN 585

BLAST of HG10022172 vs. ExPASy TrEMBL
Match: A0A0A0KSF1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G523120 PE=4 SV=1)

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 561/637 (88.07%), Postives = 598/637 (93.88%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTF-------SSASSSTFLQSVTQSETKSIVVNPLY 60
           M RT+ SNLPKL+QSFFF TSF KST        SS+SSSTFLQS+ +SE K ++VNPLY
Sbjct: 1   MLRTTASNLPKLAQSFFFHTSFSKSTLSSSSSSSSSSSSSTFLQSIPESEAK-LIVNPLY 60

Query: 61  HFLPQNQNPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSA 120
           HFLPQNQNPFNIVELVSS LKT+NP+LALLQS IK L+PHLGHR+ISKILLRCQSNFVSA
Sbjct: 61  HFLPQNQNPFNIVELVSSHLKTNNPRLALLQSHIKELIPHLGHRQISKILLRCQSNFVSA 120

Query: 121 LTFFNWVKYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNL 180
           L FFNWVKYDLDIRL+SHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNL
Sbjct: 121 LAFFNWVKYDLDIRLSSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNL 180

Query: 181 VLCTEHCNWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMK 240
           VLCTEHCNWNPVIFEML+KAYVK+D+I ESYWSFKKMVKLGFVP+VIACNCILNGLAKMK
Sbjct: 181 VLCTEHCNWNPVIFEMLIKAYVKLDLIHESYWSFKKMVKLGFVPNVIACNCILNGLAKMK 240

Query: 241 YDGQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYN 300
            D QCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKIN FLEKMEEEGFDPDVVTYN
Sbjct: 241 SDAQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINGFLEKMEEEGFDPDVVTYN 300

Query: 301 TLIDSYCRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDR 360
           TLIDSY RRGRL+DAFYLY+IMYRRGVMPDLVSYTSLM+GLCRL RV+EAHQLFHRMIDR
Sbjct: 301 TLIDSYVRRGRLEDAFYLYKIMYRRGVMPDLVSYTSLMRGLCRLGRVREAHQLFHRMIDR 360

Query: 361 GMDPDVVSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISA 420
           GMDPDVV YNTLIGAYCK+GMLQEARSLLH+MIG+GIHPDSFTCRILVEG+GREGRLISA
Sbjct: 361 GMDPDVVLYNTLIGAYCKDGMLQEARSLLHEMIGIGIHPDSFTCRILVEGYGREGRLISA 420

Query: 421 LNLVVELQKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIES 480
           LNLVVE+QKLGVTVA+DIY+YLIISLCREDRPFAAKSLLERI++D FQPD +IYNKLIES
Sbjct: 421 LNLVVEIQKLGVTVAHDIYKYLIISLCREDRPFAAKSLLERILEDSFQPDSDIYNKLIES 480

Query: 481 FCRGDNVSEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPD 540
           FCR +NVSEALLLKLEMINRN+KPT DTYKSLI CM EINRSVDGE LMVEMVES V+PD
Sbjct: 481 FCRSNNVSEALLLKLEMINRNYKPTTDTYKSLIHCMCEINRSVDGEGLMVEMVESEVIPD 540

Query: 541 REICRALINGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQD 600
            EICRAL+NGYCKEGNADKAESLLVSFAKDFQFFDSESFN+LVKVY DVGNETKLMELQD
Sbjct: 541 HEICRALVNGYCKEGNADKAESLLVSFAKDFQFFDSESFNSLVKVYRDVGNETKLMELQD 600

Query: 601 RMIKAGFLPNSLTCRYIIHGLWKSARLNKQRVHAVAV 631
           RM+KAGFLPNSLTCRYIIHG+WKS RLNKQRV  VAV
Sbjct: 601 RMLKAGFLPNSLTCRYIIHGIWKSMRLNKQRVQTVAV 636

BLAST of HG10022172 vs. ExPASy TrEMBL
Match: A0A1S3C995 (pentatricopeptide repeat-containing protein At5g40400 OS=Cucumis melo OX=3656 GN=LOC103498097 PE=4 SV=1)

HSP 1 Score: 1130.5 bits (2923), Expect = 0.0e+00
Identity = 558/630 (88.57%), Postives = 596/630 (94.60%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTFSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQ 60
           M RT+ SNLPKLSQSFFFL+SF KST SS+SSSTFLQS+ +SE KSI VNPLYHFLPQNQ
Sbjct: 1   MLRTTASNLPKLSQSFFFLSSFSKSTLSSSSSSTFLQSIPESEAKSI-VNPLYHFLPQNQ 60

Query: 61  NPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWV 120
           NPFNIVELVS  LKT+NP+LALLQ++IK L+P+LGH +ISKILLRCQSNFVSAL FFNWV
Sbjct: 61  NPFNIVELVSLHLKTNNPRLALLQANIKGLIPYLGHCQISKILLRCQSNFVSALAFFNWV 120

Query: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHC 180
           KYDLDIRLNSHNYCLIIHILAWSRQFPLAMK LSELIELSKDVSSSEDVFQNLVLCTEHC
Sbjct: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKLLSELIELSKDVSSSEDVFQNLVLCTEHC 180

Query: 181 NWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWE 240
           NWNPVIFEML+KAYVK+D+I ESYWSFKKMVKLGFVPSVIACNCIL+GLAKMK DGQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKLDLIHESYWSFKKMVKLGFVPSVIACNCILHGLAKMKSDGQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVV 360
           RRGRLDDAFYLYRIMYRR VMPDLVSYTSLM+GLCRL RV+EAHQLFHRMIDRGMDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMYRRSVMPDLVSYTSLMRGLCRLGRVREAHQLFHRMIDRGMDPDVV 360

Query: 361 SYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVEL 420
           SYNTLIGAYCK+GMLQEARSLLHDMIG+GIHPD+FTCRILVEG+GREGRLISALNLVVE+
Sbjct: 361 SYNTLIGAYCKDGMLQEARSLLHDMIGIGIHPDNFTCRILVEGYGREGRLISALNLVVEI 420

Query: 421 QKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNV 480
           QKLGVT+A+DIY+YLIISLC+EDRPFAAKSLLERI++D FQPD +IYNKLIESFCR +NV
Sbjct: 421 QKLGVTIAHDIYKYLIISLCQEDRPFAAKSLLERILEDRFQPDSDIYNKLIESFCRSNNV 480

Query: 481 SEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRAL 540
           SEALLLK EMINRNFKPTI TYKSLI CM EINRSVDGE LM EMVES VLPD EICRAL
Sbjct: 481 SEALLLKSEMINRNFKPTIYTYKSLIHCMCEINRSVDGEGLMEEMVESEVLPDHEICRAL 540

Query: 541 INGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGF 600
           +NGYCKEGNADKAESLLVSFAKDFQFFDSESFN+LVKVY D+GNETKLMELQ RM+KAGF
Sbjct: 541 VNGYCKEGNADKAESLLVSFAKDFQFFDSESFNSLVKVYCDMGNETKLMELQTRMLKAGF 600

Query: 601 LPNSLTCRYIIHGLWKSARLNKQRVHAVAV 631
           LPN+LTC+YIIHGLWK  RLN+QRV AV V
Sbjct: 601 LPNNLTCQYIIHGLWKFTRLNEQRVQAVVV 629

BLAST of HG10022172 vs. ExPASy TrEMBL
Match: A0A5D3BTR0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold374G00460 PE=4 SV=1)

HSP 1 Score: 1129.4 bits (2920), Expect = 0.0e+00
Identity = 557/630 (88.41%), Postives = 596/630 (94.60%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTFSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQ 60
           M RT+ SNLPKLSQSFFFL+SF KST SS+SSSTFLQS+ +SE KSI VNPLYHFLPQNQ
Sbjct: 1   MLRTTASNLPKLSQSFFFLSSFSKSTLSSSSSSTFLQSIPESEAKSI-VNPLYHFLPQNQ 60

Query: 61  NPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWV 120
           NPFNIVELVS  LKT+NP+LALLQ++IK L+P+LGH +ISKILLRCQSNFVSAL FFNWV
Sbjct: 61  NPFNIVELVSLHLKTNNPRLALLQANIKGLIPYLGHCQISKILLRCQSNFVSALAFFNWV 120

Query: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHC 180
           KYDLDIRLNSHNYCLIIHILAWSRQFPLAMK LSELIELSKDVSSSEDVFQNLVLCTEHC
Sbjct: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKLLSELIELSKDVSSSEDVFQNLVLCTEHC 180

Query: 181 NWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWE 240
           NWNPVIFEML+KAYVK+D+I ESYWSFK+MVKLGFVPSVIACNCIL+GLAKMK DGQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKLDLIHESYWSFKRMVKLGFVPSVIACNCILHGLAKMKSDGQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVV 360
           RRGRLDDAFYLYRIMYRR VMPDLVSYTSLM+GLCRL RV+EAHQLFHRMIDRGMDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMYRRSVMPDLVSYTSLMRGLCRLGRVREAHQLFHRMIDRGMDPDVV 360

Query: 361 SYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVEL 420
           SYNTLIGAYCK+GMLQEARSLLHDMIG+GIHPD+FTCRILVEG+GREGRLISALNLVVE+
Sbjct: 361 SYNTLIGAYCKDGMLQEARSLLHDMIGIGIHPDNFTCRILVEGYGREGRLISALNLVVEI 420

Query: 421 QKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNV 480
           QKLGVT+A+DIY+YLIISLC+EDRPFAAKSLLERI++D FQPD +IYNKLIESFCR +NV
Sbjct: 421 QKLGVTIAHDIYKYLIISLCQEDRPFAAKSLLERILEDRFQPDSDIYNKLIESFCRSNNV 480

Query: 481 SEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRAL 540
           SEALLLK EMINRNFKPTI TYKSLI CM EINRSVDGE LM EMVES VLPD EICRAL
Sbjct: 481 SEALLLKSEMINRNFKPTIYTYKSLIHCMCEINRSVDGEGLMEEMVESEVLPDHEICRAL 540

Query: 541 INGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGF 600
           +NGYCKEGNADKAESLLVSFAKDFQFFDSESFN+LVKVY D+GNETKLMELQ RM+KAGF
Sbjct: 541 VNGYCKEGNADKAESLLVSFAKDFQFFDSESFNSLVKVYCDMGNETKLMELQTRMLKAGF 600

Query: 601 LPNSLTCRYIIHGLWKSARLNKQRVHAVAV 631
           LPN+LTC+YIIHGLWK  RLN+QRV AV V
Sbjct: 601 LPNNLTCQYIIHGLWKFTRLNEQRVQAVVV 629

BLAST of HG10022172 vs. ExPASy TrEMBL
Match: A0A6J1C1W2 (pentatricopeptide repeat-containing protein At5g40400 OS=Momordica charantia OX=3673 GN=LOC111006668 PE=4 SV=1)

HSP 1 Score: 1054.3 bits (2725), Expect = 2.0e-304
Identity = 514/628 (81.85%), Postives = 565/628 (89.97%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTFSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQ 60
           MHRTSTS+L KLS SFF L   PKS FSSASSSTFLQ + +S+ + ++VNPL+HFLPQNQ
Sbjct: 1   MHRTSTSSLAKLSPSFFCLAFVPKSAFSSASSSTFLQPIPESQAQ-LIVNPLFHFLPQNQ 60

Query: 61  NPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWV 120
           NPFNIVE+V S LKT NP+LA LQS+IK LLPHLGHRE+SK+LLRCQSNF SAL FFNWV
Sbjct: 61  NPFNIVEIVCSHLKTGNPELARLQSEIKELLPHLGHREVSKVLLRCQSNFGSALAFFNWV 120

Query: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHC 180
           KYDL++R NS NYCLIIHILAWSRQ PLAMKFLSELIELS+     EDVFQNLVLCTEHC
Sbjct: 121 KYDLNLRPNSKNYCLIIHILAWSRQLPLAMKFLSELIELSE-----EDVFQNLVLCTEHC 180

Query: 181 NWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWE 240
           NWNPVIFEML+KAY+KV MIQESYWSFKKMV+LGFVPSVIACNCILNGLAKMK DG CWE
Sbjct: 181 NWNPVIFEMLIKAYMKVGMIQESYWSFKKMVRLGFVPSVIACNCILNGLAKMKCDGHCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNK+NEFLEKMEEEGFDPD+VTYNTLI SYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKVNEFLEKMEEEGFDPDIVTYNTLIGSYC 300

Query: 301 RRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVV 360
           RRGRL+DAF+LYRIMYRRGVMPDLVSYTSLM GLC++ RV+EAHQ+FHRMIDRG+DPDVV
Sbjct: 301 RRGRLEDAFHLYRIMYRRGVMPDLVSYTSLMNGLCQIGRVREAHQIFHRMIDRGLDPDVV 360

Query: 361 SYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVEL 420
           SYN LI AYCK G L EAR LLHDMI +G++PDSFTCRI+VEG+GREGRLISALNLVVEL
Sbjct: 361 SYNMLISAYCKVGRLPEARLLLHDMIRIGLYPDSFTCRIMVEGYGREGRLISALNLVVEL 420

Query: 421 QKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNV 480
           QKLGV V  DIY+YLIISLC+EDRPFAAKSLLERII DGF+PD  IY KLIESFCRG++ 
Sbjct: 421 QKLGVAVTCDIYKYLIISLCQEDRPFAAKSLLERIINDGFEPDAGIYGKLIESFCRGNHA 480

Query: 481 SEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRAL 540
           SEALL+K EM NRNF+P +D YKSLICC+ EINRS DGE LMVEMVESG+ PD  ICRAL
Sbjct: 481 SEALLMKSEMENRNFEPGVDIYKSLICCLCEINRSGDGEGLMVEMVESGLPPDHIICRAL 540

Query: 541 INGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGF 600
           ING+CKEGNADKAESLL SFAK+FQFFD+ESFNAL+K YHDVGNETKLMELQDRM+KAGF
Sbjct: 541 INGHCKEGNADKAESLLASFAKEFQFFDTESFNALIKFYHDVGNETKLMELQDRMLKAGF 600

Query: 601 LPNSLTCRYIIHGLWKSARLNKQRVHAV 629
           +PNSLTCRY+IHGLWKSAR  K R+ AV
Sbjct: 601 VPNSLTCRYVIHGLWKSARFGKHRLQAV 622

BLAST of HG10022172 vs. ExPASy TrEMBL
Match: A0A6J1E7L8 (pentatricopeptide repeat-containing protein At5g40400 OS=Cucurbita moschata OX=3662 GN=LOC111430138 PE=4 SV=1)

HSP 1 Score: 1045.4 bits (2702), Expect = 9.4e-302
Identity = 518/628 (82.48%), Postives = 562/628 (89.49%), Query Frame = 0

Query: 1   MHRTSTSNLPKLSQSFFFLTSFPKSTFSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQ 60
           MHR   SNL KLS+ F FL S PKS FSSASSSTFL S+ +SETK ++VNPLYHFLPQNQ
Sbjct: 1   MHRIPASNLTKLSKPFVFLASIPKSNFSSASSSTFLPSIPRSETK-LIVNPLYHFLPQNQ 60

Query: 61  NPFNIVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWV 120
           NPFNIVELVSS LKTSN  L+LLQSDIK LLPHLGHRE+SKI+LRCQSNFVS L FFNWV
Sbjct: 61  NPFNIVELVSSHLKTSNTNLSLLQSDIKELLPHLGHREVSKIILRCQSNFVSVLAFFNWV 120

Query: 121 KYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKD-VSSSEDVFQNLVLCTEH 180
           K+DL I L+S NYCLIIHILAWSRQF +AMKFLSELIELSKD  S SEDVF NLVLCTEH
Sbjct: 121 KFDLGITLSSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEH 180

Query: 181 CNWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCW 240
           CNWNPVIFEML+KAYVKV MIQESY SFKKMVK+GFVPSVIACNCILNGLAKMK D QCW
Sbjct: 181 CNWNPVIFEMLIKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCW 240

Query: 241 ELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSY 300
           ELYEEMGRIGVHSNAYTFNILTYVLCR GDVNK+NEFLEKMEEEGFDPDVVTYNTLI+SY
Sbjct: 241 ELYEEMGRIGVHSNAYTFNILTYVLCRVGDVNKVNEFLEKMEEEGFDPDVVTYNTLINSY 300

Query: 301 CRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDV 360
           CRRGRLDDAFYLYRIM+RRGVMPDLVSYTSLM GLC+L RV+EAHQLFHRMIDR +DPDV
Sbjct: 301 CRRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDV 360

Query: 361 VSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVE 420
           V YNTLI AYCK+G LQEARSLLHDM  +GI PDSFTCRI+VEG+GR GRLISALNLVVE
Sbjct: 361 VLYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGRLISALNLVVE 420

Query: 421 LQKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDN 480
           L+KLG  V Y+IY+YLI+SLC EDRPFAAKS+LERIIKDGFQP+  IYNKLIESFCR  N
Sbjct: 421 LRKLGTIVTYEIYDYLIVSLCLEDRPFAAKSVLERIIKDGFQPNACIYNKLIESFCRVHN 480

Query: 481 VSEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRA 540
           VSEALLLK EM+ RNFK +ID+YK LI C+  INRSVDGE LMVEMVESGVLPD +ICR 
Sbjct: 481 VSEALLLKSEMVKRNFKLSIDSYKPLISCLCGINRSVDGEGLMVEMVESGVLPDHQICRV 540

Query: 541 LINGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAG 600
           LINGYCKEGN  KAESLLVSFAKDF+FFD+ESFNALVK + D GNET+LM+LQDRM+K G
Sbjct: 541 LINGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVG 600

Query: 601 FLPNSLTCRYIIHGLWKSARLNKQRVHA 628
           F+PNSLTCRY+IHGLWKSARL+K+RV A
Sbjct: 601 FVPNSLTCRYVIHGLWKSARLDKRRVQA 627

BLAST of HG10022172 vs. TAIR 10
Match: AT5G40400.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 631.7 bits (1628), Expect = 6.2e-181
Identity = 314/592 (53.04%), Postives = 431/592 (72.80%), Query Frame = 0

Query: 27  FSSASSSTFLQSVTQSETKSIVVNPLYHFLPQNQNPFNIVELVSSRLKTSNPQLAL--LQ 86
           FSS SSS   +    S     ++NPLY+ LPQ+QNP  IV+++ S L  S+  + L  L+
Sbjct: 11  FSSYSSSIVPRC---SNIPKPILNPLYNLLPQSQNPSKIVDVICSTLNHSDYSVLLPNLR 70

Query: 87  SDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWVKYDLDIRLNSHNYCLIIHILAWSR 146
            ++K L+PHLG+ EIS++LLR QS+   A+TFF WVK+DL  R N  NYCL++HIL  S+
Sbjct: 71  DEVKSLIPHLGYPEISRVLLRFQSDASRAITFFKWVKFDLGKRPNVGNYCLLLHILVSSK 130

Query: 147 QFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHCNWNPVIFEMLVKAYVKVDMIQESY 206
           +FPLAM+FL ELIEL+       DVF+ LV  T+ CNW+PV+F+MLVK Y+K+ +++E +
Sbjct: 131 KFPLAMQFLCELIELTSK-KEEVDVFRVLVSATDECNWDPVVFDMLVKGYLKLGLVEEGF 190

Query: 207 WSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWELYEEMGRIGVHSNAYTFNILTYVL 266
             F++++  GF  SV+ CN +LNGL K+     CW++Y  M R+G+H N YTFNILT V 
Sbjct: 191 RVFREVLDSGFSVSVVTCNHLLNGLLKLDLMEDCWQVYSVMCRVGIHPNTYTFNILTNVF 250

Query: 267 CRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMYRRGVMPDL 326
           C D +  ++++FLEKMEEEGF+PD+VTYNTL+ SYCRRGRL +AFYLY+IMYRR V+PDL
Sbjct: 251 CNDSNFREVDDFLEKMEEEGFEPDLVTYNTLVSSYCRRGRLKEAFYLYKIMYRRRVVPDL 310

Query: 327 VSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVVSYNTLIGAYCKNGMLQEARSLLHD 386
           V+YTSL+KGLC+  RV+EAHQ FHRM+DRG+ PD +SYNTLI AYCK GM+Q+++ LLH+
Sbjct: 311 VTYTSLIKGLCKDGRVREAHQTFHRMVDRGIKPDCMSYNTLIYAYCKEGMMQQSKKLLHE 370

Query: 387 MIGLGIHPDSFTCRILVEGHGREGRLISALNLVVELQKLGVTVAYDIYEYLIISLCREDR 446
           M+G  + PD FTC+++VEG  REGRL+SA+N VVEL++L V + +++ ++LI+SLC+E +
Sbjct: 371 MLGNSVVPDRFTCKVIVEGFVREGRLLSAVNFVVELRRLKVDIPFEVCDFLIVSLCQEGK 430

Query: 447 PFAAKSLLERII-KDGFQPDFNIYNKLIESFCRGDNVSEALLLKLEMINRNFKPTIDTYK 506
           PFAAK LL+RII ++G +     YN LIES  R D + EAL+LK ++ N+N      TY+
Sbjct: 431 PFAAKHLLDRIIEEEGHEAKPETYNNLIESLSRCDAIEEALVLKGKLKNQNQVLDAKTYR 490

Query: 507 SLICCMSEINRSVDGESLMVEMVESGVLPDREICRALINGYCKEGNADKAESLLVSFAKD 566
           +LI C+  I R+ + ESLM EM +S V PD  IC AL+ GYCKE + DKAE LL  FA +
Sbjct: 491 ALIGCLCRIGRNREAESLMAEMFDSEVKPDSFICGALVYGYCKELDFDKAERLLSLFAME 550

Query: 567 FQFFDSESFNALVKVYHDVG-NETKLMELQDRMIKAGFLPNSLTCRYIIHGL 615
           F+ FD ES+N+LVK   + G    K +ELQ+RM + GF+PN LTC+Y+I  L
Sbjct: 551 FRIFDPESYNSLVKAVCETGCGYKKALELQERMQRLGFVPNRLTCKYLIQVL 598

BLAST of HG10022172 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 248.4 bits (633), Expect = 1.5e-65
Identity = 153/507 (30.18%), Postives = 243/507 (47.93%), Query Frame = 0

Query: 113 ALTFFNWVKYDLDIRLNS--HNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVF 172
           AL F  WV     +  +      C+  HIL  +R +  A   L EL  +S     S  VF
Sbjct: 93  ALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMS---GKSSFVF 152

Query: 173 QNLVLCTEHCNWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLA 232
             L+     CN NP ++++L++ Y++  MIQ+S   F+ M   GF PSV  CN IL  + 
Sbjct: 153 GALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVV 212

Query: 233 KMKYDGQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVV 292
           K   D   W   +EM +  +  +  TFNIL  VLC +G   K +  ++KME+ G+ P +V
Sbjct: 213 KSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIV 272

Query: 293 TYNTLIDSYCRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRM 352
           TYNT++  YC++GR   A  L   M  +GV  D+ +Y  L+  LCR  R+ + + L   M
Sbjct: 273 TYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDM 332

Query: 353 IDRGMDPDVVSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRL 412
             R + P+ V+YNTLI  +   G +  A  LL++M+  G+ P+  T   L++GH  EG  
Sbjct: 333 RKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNF 392

Query: 413 ISALNLVVELQKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKL 472
             AL +   ++  G+T +   Y  L+  LC+      A+    R+ ++G       Y  +
Sbjct: 393 KEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGM 452

Query: 473 IESFCRGDNVSEALLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGV 532
           I+  C+   + EA++L  EM      P I TY +LI    ++ R    + ++  +   G+
Sbjct: 453 IDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGL 512

Query: 533 LPDREICRALINGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLME 592
            P+  I   LI   C+ G   +A  +  +   +    D  +FN LV      G   +  E
Sbjct: 513 SPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEE 572

Query: 593 LQDRMIKAGFLPNSLTCRYIIHGLWKS 618
               M   G LPN+++   +I+G   S
Sbjct: 573 FMRCMTSDGILPNTVSFDCLINGYGNS 596

BLAST of HG10022172 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 238.0 bits (606), Expect = 2.0e-62
Identity = 153/535 (28.60%), Postives = 264/535 (49.35%), Query Frame = 0

Query: 100 SKILLRCQSNFVSALTFFNWVKYDLDIRLNSHNY------CLIIHILAWSRQFPLAMKFL 159
           S +LL+ Q++    L F NW         N H +      C+ +HIL   + +  A    
Sbjct: 52  SNLLLKSQNDQALILKFLNWA--------NPHQFFTLRCKCITLHILTKFKLYKTAQILA 111

Query: 160 SELIELSKDVSSSEDVFQNLVLCTEHCNWNPVIFEMLVKAYVKVDMIQESYWSFKKMVKL 219
            ++   + D   +  VF++L    + C     +F+++VK+Y ++ +I ++          
Sbjct: 112 EDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAH 171

Query: 220 GFVPSVIACNCILNGLAKMKYDGQCWE-LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNK 279
           GF+P V++ N +L+   + K +    E +++EM    V  N +T+NIL    C  G+++ 
Sbjct: 172 GFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDV 231

Query: 280 INEFLEKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMK 339
                +KME +G  P+VVTYNTLID YC+  ++DD F L R M  +G+ P+L+SY  ++ 
Sbjct: 232 ALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVIN 291

Query: 340 GLCRLRRVKEAHQLFHRMIDRGMDPDVVSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHP 399
           GLCR  R+KE   +   M  RG   D V+YNTLI  YCK G   +A  +  +M+  G+ P
Sbjct: 292 GLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTP 351

Query: 400 DSFTCRILVEGHGREGRLISALNLVVELQKLGVTVAYDIYEYLIISLCREDRPFAAKSLL 459
              T   L+    + G +  A+  + +++  G+      Y  L+    ++     A  +L
Sbjct: 352 SVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVL 411

Query: 460 ERIIKDGFQPDFNIYNKLIESFCRGDNVSEALLLKLEMINRNFKPTIDTYKSLICCMSEI 519
             +  +GF P    YN LI   C    + +A+ +  +M  +   P + +Y ++   +S  
Sbjct: 412 REMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTV---LSGF 471

Query: 520 NRSVD-GESLMV--EMVESGVLPDREICRALINGYCKEGNADKAESLLVSFAKDFQFFDS 579
            RS D  E+L V  EMVE G+ PD     +LI G+C++    +A  L     +     D 
Sbjct: 472 CRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDE 531

Query: 580 ESFNALVKVYHDVGNETKLMELQDRMIKAGFLPNSLTCRYIIHGLWKSARLNKQR 625
            ++ AL+  Y   G+  K ++L + M++ G LP+ +T   +I+GL K +R  + +
Sbjct: 532 FTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAK 575

BLAST of HG10022172 vs. TAIR 10
Match: AT5G01110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 232.6 bits (592), Expect = 8.4e-61
Identity = 146/554 (26.35%), Postives = 259/554 (46.75%), Query Frame = 0

Query: 65  IVELVSSRLKTSNPQLALLQSDIKVLLPHLGHREISKILLRCQSNFVSALTFFNWVKYDL 124
           +VE +   LK  N       ++++  L  L    + ++L RC+++      F + + +  
Sbjct: 54  LVEKICFSLKQGN-------NNVRNHLIRLNPLAVVEVLYRCRNDLTLGQRFVDQLGFHF 113

Query: 125 -DIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLCTEHCNWN 184
            + +  S +   +IHIL  S +   A   L  +I  S    S  ++  +L     +C  N
Sbjct: 114 PNFKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSG--VSRLEIVNSLDSTFSNCGSN 173

Query: 185 PVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYDGQCWELYE 244
             +F++L++ YV+   ++E++ +F  +   GF  S+ ACN ++  L ++ +    W +Y+
Sbjct: 174 DSVFDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQ 233

Query: 245 EMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYCRRG 304
           E+ R GV  N YT NI+   LC+DG + K+  FL +++E+G  PD+VTYNTLI +Y  +G
Sbjct: 234 EISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKG 293

Query: 305 RLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGMDPDVVSYN 364
            +++AF L   M  +G  P + +Y +++ GLC+  + + A ++F  M+  G+ PD  +Y 
Sbjct: 294 LMEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYR 353

Query: 365 TLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALNLVVELQKL 424
           +L+   CK G + E   +  DM    + PD      ++    R G L  AL     +++ 
Sbjct: 354 SLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEA 413

Query: 425 GVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFCRGDNVSEA 484
           G+     IY  LI   CR+     A +L   +++ G   D   YN ++   C+   + EA
Sbjct: 414 GLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEA 473

Query: 485 LLLKLEMINRNFKPTIDTYKSLICCMSEINRSVDGESLMVEMVESGVLPDREICRALING 544
             L  EM  R   P   T   LI    ++    +   L  +M E  +  D      L++G
Sbjct: 474 DKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDG 533

Query: 545 YCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQDRMIKAGFLPN 604
           + K G+ D A+ +              S++ LV      G+  +   + D MI     P 
Sbjct: 534 FGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPT 593

Query: 605 SLTCRYIIHGLWKS 618
            + C  +I G  +S
Sbjct: 594 VMICNSMIKGYCRS 598

BLAST of HG10022172 vs. TAIR 10
Match: AT2G15630.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 232.3 bits (591), Expect = 1.1e-60
Identity = 160/565 (28.32%), Postives = 268/565 (47.43%), Query Frame = 0

Query: 69  VSSRLKTSNPQLALLQSDIKVLLP-------HLGHREISKILLRCQSNFVSALT-----F 128
           +SS  +TS P+  L     ++LL        H+      K+     S  + +L       
Sbjct: 29  LSSLAQTSTPESVLPPITSEILLESIRSSQWHIVEHVADKLTPSLVSTTLLSLVKTPNLA 88

Query: 129 FNWVKYDLDIRLNSHNYCLIIHILAWSRQFPLAMKFLSELIELSKDVSSSEDVFQNLVLC 188
           FN+V +    RL+    CL I +++         + L E++   K  +S  ++F  LVL 
Sbjct: 89  FNFVNHIDLYRLDFQTQCLAIAVISKLSSPKPVTQLLKEVVTSRK--NSIRNLFDELVLA 148

Query: 189 TEHCNW-NPVIFEMLVKAYVKVDMIQESYWSFKKMVKLGFVPSVIACNCILNGLAKMKYD 248
            +     + ++F++LV+   ++ M+ E+   F  M + GF P    CN IL  L+++   
Sbjct: 149 HDRLETKSTILFDLLVRCCCQLRMVDEAIECFYLMKEKGFYPKTETCNHILTLLSRLNRI 208

Query: 249 GQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTL 308
              W  Y +M R+ + SN YTFNI+  VLC++G + K   FL  ME  G  P +VTYNTL
Sbjct: 209 ENAWVFYADMYRMEIKSNVYTFNIMINVLCKEGKLKKAKGFLGIMEVFGIKPTIVTYNTL 268

Query: 309 IDSYCRRGRLDDAFYLYRIMYRRGVMPDLVSYTSLMKGLCRLRRVKEAHQLFHRMIDRGM 368
           +  +  RGR++ A  +   M  +G  PD+ +Y  ++  +C   R   A ++   M + G+
Sbjct: 269 VQGFSLRGRIEGARLIISEMKSKGFQPDMQTYNPILSWMCNEGR---ASEVLREMKEIGL 328

Query: 369 DPDVVSYNTLIGAYCKNGMLQEARSLLHDMIGLGIHPDSFTCRILVEGHGREGRLISALN 428
            PD VSYN LI     NG L+ A +   +M+  G+ P  +T   L+ G   E ++ +A  
Sbjct: 329 VPDSVSYNILIRGCSNNGDLEMAFAYRDEMVKQGMVPTFYTYNTLIHGLFMENKIEAAEI 388

Query: 429 LVVELQKLGVTVAYDIYEYLIISLCREDRPFAAKSLLERIIKDGFQPDFNIYNKLIESFC 488
           L+ E+++ G+ +    Y  LI   C+      A +L + ++ DG QP    Y  LI   C
Sbjct: 389 LIREIREKGIVLDSVTYNILINGYCQHGDAKKAFALHDEMMTDGIQPTQFTYTSLIYVLC 448

Query: 489 RGDNVSEALLLKLEMINRNFKPTIDTYKSLI---CCMSEINRSVDGESLMVEMVESGVLP 548
           R +   EA  L  +++ +  KP +    +L+   C +  ++R+    SL+ EM    + P
Sbjct: 449 RKNKTREADELFEKVVGKGMKPDLVMMNTLMDGHCAIGNMDRAF---SLLKEMDMMSINP 508

Query: 549 DREICRALINGYCKEGNADKAESLLVSFAKDFQFFDSESFNALVKVYHDVGNETKLMELQ 608
           D      L+ G C EG  ++A  L+    +     D  S+N L+  Y   G+      ++
Sbjct: 509 DDVTYNCLMRGLCGEGKFEEARELMGEMKRRGIKPDHISYNTLISGYSKKGDTKHAFMVR 568

Query: 609 DRMIKAGFLPNSLTCRYIIHGLWKS 618
           D M+  GF P  LT   ++ GL K+
Sbjct: 569 DEMLSLGFNPTLLTYNALLKGLSKN 585

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038890312.10.0e+0091.75pentatricopeptide repeat-containing protein At5g40400 [Benincasa hispida] >XP_03... [more]
XP_004142850.20.0e+0088.07pentatricopeptide repeat-containing protein At5g40400 [Cucumis sativus] >XP_0116... [more]
XP_008458797.10.0e+0088.57PREDICTED: pentatricopeptide repeat-containing protein At5g40400 [Cucumis melo][more]
KAA0037561.10.0e+0088.41pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK03111... [more]
XP_022134396.14.2e-30481.85pentatricopeptide repeat-containing protein At5g40400 [Momordica charantia] >XP_... [more]
Match NameE-valueIdentityDescription
Q9FND88.8e-18053.04Pentatricopeptide repeat-containing protein At5g40400 OS=Arabidopsis thaliana OX... [more]
Q9LVQ52.1e-6430.18Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX... [more]
Q9FIX32.8e-6128.60Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Q9LFC51.2e-5926.35Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX... [more]
Q9ZQF11.6e-5928.32Pentatricopeptide repeat-containing protein At2g15630, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KSF10.0e+0088.07Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G523120 PE=4 SV=1[more]
A0A1S3C9950.0e+0088.57pentatricopeptide repeat-containing protein At5g40400 OS=Cucumis melo OX=3656 GN... [more]
A0A5D3BTR00.0e+0088.41Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1C1W22.0e-30481.85pentatricopeptide repeat-containing protein At5g40400 OS=Momordica charantia OX=... [more]
A0A6J1E7L89.4e-30282.48pentatricopeptide repeat-containing protein At5g40400 OS=Cucurbita moschata OX=3... [more]
Match NameE-valueIdentityDescription
AT5G40400.16.2e-18153.04Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G55840.11.5e-6530.18Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G39710.12.0e-6228.60Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G01110.18.4e-6126.35Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G15630.11.1e-6028.32Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 217..266
e-value: 4.4E-9
score: 36.4
coord: 462..508
e-value: 2.2E-7
score: 31.0
coord: 287..336
e-value: 7.3E-19
score: 67.7
coord: 357..404
e-value: 2.7E-14
score: 53.1
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 568..611
e-value: 0.0017
score: 18.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 187..215
e-value: 0.49
score: 10.8
coord: 433..460
e-value: 0.36
score: 11.2
coord: 539..557
e-value: 9.9E-4
score: 19.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 290..323
e-value: 5.1E-11
score: 40.0
coord: 431..463
e-value: 0.0012
score: 16.8
coord: 360..394
e-value: 1.3E-10
score: 38.7
coord: 501..533
e-value: 4.4E-4
score: 18.2
coord: 536..557
e-value: 7.9E-4
score: 17.4
coord: 255..289
e-value: 7.5E-6
score: 23.8
coord: 466..498
e-value: 7.7E-6
score: 23.8
coord: 325..359
e-value: 1.7E-10
score: 38.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 358..392
score: 13.745527
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 323..357
score: 13.635915
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 393..427
score: 8.769097
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 498..532
score: 8.801982
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 253..287
score: 11.202506
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 463..497
score: 10.325603
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 288..322
score: 14.458013
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 568..602
score: 9.339086
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 183..217
score: 9.777537
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 304..438
e-value: 1.9E-37
score: 131.4
coord: 103..301
e-value: 4.6E-30
score: 107.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 439..552
e-value: 1.6E-23
score: 85.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 561..626
e-value: 1.4E-5
score: 26.7
NoneNo IPR availablePANTHERPTHR47941:SF6OS01G0546700 PROTEINcoord: 23..620
NoneNo IPR availablePANTHERPTHR47941PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 3, MITOCHONDRIALcoord: 23..620
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 194..391

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10022172.1HG10022172.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding