Moc02g02020 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc02g02020
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Locationchr2: 1516373 .. 1518871 (+)
RNA-Seq ExpressionMoc02g02020
SyntenyMoc02g02020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTACATTCATGCGCTCGAAGCACCGGTCTTCACTTCGCCTCACCATTCCAATTCAGTTTTCAGACCACTCGATTCCAATACAGAAGCTTCCTTCACCCCGTTCTCGGTTCAGCTTCATCGTCCCCACTTCCTCTTCCAAGTACAGAGCCAGTTTCTTCCGTTCACGTTGACCTCCTCACTCTGTGTTCCAACGCTCAATCTCTTCGTCAAACCAAACAACTCCATGCTCTTTGCCTCCTCAATGGCTTACTTCCCCACAGCGTCTCGCTCTGCGCTTCCCTTATTCTCGATTACGCCACGTTCCGAGACCCAGAATCGTTTCGTATTCTGTTCCATCAAACTGTCCAGAATTGCCGCACTGCGTTCCTGTGGAATACCTTGATTCGGGCTCACTCCATAGCCGGCAACGGGATGATTGATGGGCTCCAGACGTATAACAGGATGGTTCGAAGCGGTGTTCAACTCGATGACCATACATTTCCTTTCGTTCTCAAGTTGTGTTCCGATTCGTTAGATATTTGCAAAGGTATGGAGGTTCATGGGGTTGTTTCCAAGTTGGGCTTTGATTCAGATGTCTATGTTGGGAATACGCTGTTGATGCTGTATGGGAATTGTGGGGTTGTAAGTGATGCGAGAAGGGTGTTCGACGAAATGCCGGAAAGAGATGTTGTTTCATGGAATACGATTCTTGGGCTCCTTTCGGTTAATGGGGATTATAGGGAGGCTCGTAATTATTACTTTTGGATGACTTTGAGGTCTGGGATTCAACCAAATGTGGTGAGTGTCGTTATTTTGTTACCCATTTCTGCTGCCCTCGAAGATGAGGAGATGACAAGGCGAATTCATTGTTACACCATGAAAGCTGGTTTGGATTCTCAGGTAACTACTTGCAATGCACTTGTCGATGCATATGGGAAATGTGGGAATGTGAAAGCTTCGTGGCAAGTTTTTGATGAGATGGTTGAGAGGAATGAAGTTTCATGGAATGCAATAATCAATGGTCTAGCTTGTAAGGGTCATTTCTGTGATGCCTTGGATGTTTTCAGGATGATGATTGATGCAGAAGTTAAGCCAAACTCTGTCACCATTTCTAGTATTCTACCTGTGCTGGTTGAACTTGAGTATTTCAAAGCAGGGAAAGAAATTCATGGGTTCAGTATGAGGATGGGAACAGAAACTGATATTTTCATTGCAAATTCATTGATTGATATGTATGCAAAATCTGGCCATTCAACTGAGGCATCTTGCATATTCCACAACATGGATAGAAGGAATGTTGTTTCTTGGAATGCTATGATCGCCAATTATGCTCTAAATGGGCTTGCGTTAGAAGCAATAAGACAAGTAATACTAATGCAAGAGACTGGAGAAAATCCAAATGCAGTGACCTTCACCAATGTTCTTCCCGCTTGTGCACGTTTGGGTTTCCTTGGACCTGGCAAAGAAATACATGCCATTGGCATTCGTTTAGGACTAACAGCCGATATGTTTGTTTCCAATGCTCTGACAGACATGTATGCGAAATGCGGTTGCCTTCATTCTGCTCGAAATGTCTTCAACACCTCCCATAAAGATGAAGTTTCTTATAACATATTGATTCTTGGATATTCTGAAACTAGCGATTGCTTGGAGTCACTGAATTTGTTCTCGGAGATGAGGTTGCTTGGTAGGAAGCCGGATGTTGTTTCCTTTGTGGGTGTCATATCAGCTTGTGCAAATCTAGCTGCAGTCAAGCAAGGAAAAGAGATCCACGGTGTAGCATTAAGAAATCATTTTTACTCTCATCTTTTTGTCTCAAACTCCCTTTTGGACTTCTATACAAAATGTGCAAGAATTGATCTTGCCTGTAAGGTCTTTAACCAAATTCTATTCAAAGATGTAGCCTCTTGGAATACTATGATTTTAGGATACGGAATGATAGGAGAGTTGGAGACTGCAATTACTATGTTTGAAGAAATGAGAGGTGATACTGTGCAATATGACTTAGTTTCATATATTGCAGTTCTCTCAGCTTGCAGTCATGGAGGACTAGTTGAACAAGGTTGGAAATACTTCAGTGAGATGCTAGCTCAAAATCTTGAACCAACCCAAATGCACTATGCATGTATAGTCGATCTACTCGGCCGTGCTGGTTTTGTGGAAGAGGCAGCAGAGCTTATCCGGTGCCTTCCTATAGCCCCAGATGCAAATATTTGGGGAGCTCTACTCGGGGCTTGCCGAATTTACCGAAATGTTGAGCTAGGGTGTTGGGCAGCAGAACATTTATTTGAGATAAAGCCTCAGCATTGTGGATACTATATTCTTCTTTCAAATATGTATGCTGAAGCAGGAAGATGGGATGAGGTAAACAGGGTTAGGGACCTGATGAAGTCTAGAGGAGCGAAGAAGAACCCTGGCTGTAGCTGGGTTCAGATTCACGATCAGGTGCATACTTTTATGGCTGAAGAGAGAGCAGAGGGATTTGAATCAGGTGGTTGGTTAGCAGAATCCTTCTGA

mRNA sequence

ATGTTACATTCATGCGCTCGAAGCACCGGTCTTCACTTCGCCTCACCATTCCAATTCAGTTTTCAGACCACTCGATTCCAATACAGAAGCTTCCTTCACCCCGTTCTCGGTTCAGCTTCATCGTCCCCACTTCCTCTTCCAAGTACAGAGCCAGTTTCTTCCGTTCACGTTGACCTCCTCACTCTGTGTTCCAACGCTCAATCTCTTCGTCAAACCAAACAACTCCATGCTCTTTGCCTCCTCAATGGCTTACTTCCCCACAGCGTCTCGCTCTGCGCTTCCCTTATTCTCGATTACGCCACGTTCCGAGACCCAGAATCGTTTCGTATTCTGTTCCATCAAACTGTCCAGAATTGCCGCACTGCGTTCCTGTGGAATACCTTGATTCGGGCTCACTCCATAGCCGGCAACGGGATGATTGATGGGCTCCAGACGTATAACAGGATGGTTCGAAGCGGTGTTCAACTCGATGACCATACATTTCCTTTCGTTCTCAAGTTGTGTTCCGATTCGTTAGATATTTGCAAAGGTATGGAGGTTCATGGGGTTGTTTCCAAGTTGGGCTTTGATTCAGATGTCTATGTTGGGAATACGCTGTTGATGCTGTATGGGAATTGTGGGGTTGTAAGTGATGCGAGAAGGGTGTTCGACGAAATGCCGGAAAGAGATGTTGTTTCATGGAATACGATTCTTGGGCTCCTTTCGGTTAATGGGGATTATAGGGAGGCTCGTAATTATTACTTTTGGATGACTTTGAGGTCTGGGATTCAACCAAATGTGGTGAGTGTCGTTATTTTGTTACCCATTTCTGCTGCCCTCGAAGATGAGGAGATGACAAGGCGAATTCATTGTTACACCATGAAAGCTGGTTTGGATTCTCAGGTAACTACTTGCAATGCACTTGTCGATGCATATGGGAAATGTGGGAATGTGAAAGCTTCGTGGCAAGTTTTTGATGAGATGGTTGAGAGGAATGAAGTTTCATGGAATGCAATAATCAATGGTCTAGCTTGTAAGGGTCATTTCTGTGATGCCTTGGATGTTTTCAGGATGATGATTGATGCAGAAGTTAAGCCAAACTCTGTCACCATTTCTAGTATTCTACCTGTGCTGGTTGAACTTGAGTATTTCAAAGCAGGGAAAGAAATTCATGGGTTCAGTATGAGGATGGGAACAGAAACTGATATTTTCATTGCAAATTCATTGATTGATATGTATGCAAAATCTGGCCATTCAACTGAGGCATCTTGCATATTCCACAACATGGATAGAAGGAATGTTGTTTCTTGGAATGCTATGATCGCCAATTATGCTCTAAATGGGCTTGCGTTAGAAGCAATAAGACAAGTAATACTAATGCAAGAGACTGGAGAAAATCCAAATGCAGTGACCTTCACCAATGTTCTTCCCGCTTGTGCACGTTTGGGTTTCCTTGGACCTGGCAAAGAAATACATGCCATTGGCATTCGTTTAGGACTAACAGCCGATATGTTTGTTTCCAATGCTCTGACAGACATGTATGCGAAATGCGGTTGCCTTCATTCTGCTCGAAATGTCTTCAACACCTCCCATAAAGATGAAGTTTCTTATAACATATTGATTCTTGGATATTCTGAAACTAGCGATTGCTTGGAGTCACTGAATTTGTTCTCGGAGATGAGGTTGCTTGGTAGGAAGCCGGATGTTGTTTCCTTTGTGGGTGTCATATCAGCTTGTGCAAATCTAGCTGCAGTCAAGCAAGGAAAAGAGATCCACGGTGTAGCATTAAGAAATCATTTTTACTCTCATCTTTTTGTCTCAAACTCCCTTTTGGACTTCTATACAAAATGTGCAAGAATTGATCTTGCCTGTAAGGTCTTTAACCAAATTCTATTCAAAGATGTAGCCTCTTGGAATACTATGATTTTAGGATACGGAATGATAGGAGAGTTGGAGACTGCAATTACTATGTTTGAAGAAATGAGAGGTGATACTGTGCAATATGACTTAGTTTCATATATTGCAGTTCTCTCAGCTTGCAGTCATGGAGGACTAGTTGAACAAGGTTGGAAATACTTCAGTGAGATGCTAGCTCAAAATCTTGAACCAACCCAAATGCACTATGCATGTATAGTCGATCTACTCGGCCGTGCTGGTTTTGTGGAAGAGGCAGCAGAGCTTATCCGGTGCCTTCCTATAGCCCCAGATGCAAATATTTGGGGAGCTCTACTCGGGGCTTGCCGAATTTACCGAAATGTTGAGCTAGGGTGTTGGGCAGCAGAACATTTATTTGAGATAAAGCCTCAGCATTGTGGATACTATATTCTTCTTTCAAATATGTATGCTGAAGCAGGAAGATGGGATGAGGTAAACAGGGTTAGGGACCTGATGAAGTCTAGAGGAGCGAAGAAGAACCCTGGCTGTAGCTGGGTTCAGATTCACGATCAGGTGCATACTTTTATGGCTGAAGAGAGAGCAGAGGGATTTGAATCAGGTGGTTGGTTAGCAGAATCCTTCTGA

Coding sequence (CDS)

ATGTTACATTCATGCGCTCGAAGCACCGGTCTTCACTTCGCCTCACCATTCCAATTCAGTTTTCAGACCACTCGATTCCAATACAGAAGCTTCCTTCACCCCGTTCTCGGTTCAGCTTCATCGTCCCCACTTCCTCTTCCAAGTACAGAGCCAGTTTCTTCCGTTCACGTTGACCTCCTCACTCTGTGTTCCAACGCTCAATCTCTTCGTCAAACCAAACAACTCCATGCTCTTTGCCTCCTCAATGGCTTACTTCCCCACAGCGTCTCGCTCTGCGCTTCCCTTATTCTCGATTACGCCACGTTCCGAGACCCAGAATCGTTTCGTATTCTGTTCCATCAAACTGTCCAGAATTGCCGCACTGCGTTCCTGTGGAATACCTTGATTCGGGCTCACTCCATAGCCGGCAACGGGATGATTGATGGGCTCCAGACGTATAACAGGATGGTTCGAAGCGGTGTTCAACTCGATGACCATACATTTCCTTTCGTTCTCAAGTTGTGTTCCGATTCGTTAGATATTTGCAAAGGTATGGAGGTTCATGGGGTTGTTTCCAAGTTGGGCTTTGATTCAGATGTCTATGTTGGGAATACGCTGTTGATGCTGTATGGGAATTGTGGGGTTGTAAGTGATGCGAGAAGGGTGTTCGACGAAATGCCGGAAAGAGATGTTGTTTCATGGAATACGATTCTTGGGCTCCTTTCGGTTAATGGGGATTATAGGGAGGCTCGTAATTATTACTTTTGGATGACTTTGAGGTCTGGGATTCAACCAAATGTGGTGAGTGTCGTTATTTTGTTACCCATTTCTGCTGCCCTCGAAGATGAGGAGATGACAAGGCGAATTCATTGTTACACCATGAAAGCTGGTTTGGATTCTCAGGTAACTACTTGCAATGCACTTGTCGATGCATATGGGAAATGTGGGAATGTGAAAGCTTCGTGGCAAGTTTTTGATGAGATGGTTGAGAGGAATGAAGTTTCATGGAATGCAATAATCAATGGTCTAGCTTGTAAGGGTCATTTCTGTGATGCCTTGGATGTTTTCAGGATGATGATTGATGCAGAAGTTAAGCCAAACTCTGTCACCATTTCTAGTATTCTACCTGTGCTGGTTGAACTTGAGTATTTCAAAGCAGGGAAAGAAATTCATGGGTTCAGTATGAGGATGGGAACAGAAACTGATATTTTCATTGCAAATTCATTGATTGATATGTATGCAAAATCTGGCCATTCAACTGAGGCATCTTGCATATTCCACAACATGGATAGAAGGAATGTTGTTTCTTGGAATGCTATGATCGCCAATTATGCTCTAAATGGGCTTGCGTTAGAAGCAATAAGACAAGTAATACTAATGCAAGAGACTGGAGAAAATCCAAATGCAGTGACCTTCACCAATGTTCTTCCCGCTTGTGCACGTTTGGGTTTCCTTGGACCTGGCAAAGAAATACATGCCATTGGCATTCGTTTAGGACTAACAGCCGATATGTTTGTTTCCAATGCTCTGACAGACATGTATGCGAAATGCGGTTGCCTTCATTCTGCTCGAAATGTCTTCAACACCTCCCATAAAGATGAAGTTTCTTATAACATATTGATTCTTGGATATTCTGAAACTAGCGATTGCTTGGAGTCACTGAATTTGTTCTCGGAGATGAGGTTGCTTGGTAGGAAGCCGGATGTTGTTTCCTTTGTGGGTGTCATATCAGCTTGTGCAAATCTAGCTGCAGTCAAGCAAGGAAAAGAGATCCACGGTGTAGCATTAAGAAATCATTTTTACTCTCATCTTTTTGTCTCAAACTCCCTTTTGGACTTCTATACAAAATGTGCAAGAATTGATCTTGCCTGTAAGGTCTTTAACCAAATTCTATTCAAAGATGTAGCCTCTTGGAATACTATGATTTTAGGATACGGAATGATAGGAGAGTTGGAGACTGCAATTACTATGTTTGAAGAAATGAGAGGTGATACTGTGCAATATGACTTAGTTTCATATATTGCAGTTCTCTCAGCTTGCAGTCATGGAGGACTAGTTGAACAAGGTTGGAAATACTTCAGTGAGATGCTAGCTCAAAATCTTGAACCAACCCAAATGCACTATGCATGTATAGTCGATCTACTCGGCCGTGCTGGTTTTGTGGAAGAGGCAGCAGAGCTTATCCGGTGCCTTCCTATAGCCCCAGATGCAAATATTTGGGGAGCTCTACTCGGGGCTTGCCGAATTTACCGAAATGTTGAGCTAGGGTGTTGGGCAGCAGAACATTTATTTGAGATAAAGCCTCAGCATTGTGGATACTATATTCTTCTTTCAAATATGTATGCTGAAGCAGGAAGATGGGATGAGGTAAACAGGGTTAGGGACCTGATGAAGTCTAGAGGAGCGAAGAAGAACCCTGGCTGTAGCTGGGTTCAGATTCACGATCAGGTGCATACTTTTATGGCTGAAGAGAGAGCAGAGGGATTTGAATCAGGTGGTTGGTTAGCAGAATCCTTCTGA

Protein sequence

MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLLTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCRTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDYREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAELIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF
Homology
BLAST of Moc02g02020 vs. NCBI nr
Match: XP_022153017.1 (pentatricopeptide repeat-containing protein At4g14170-like [Momordica charantia])

HSP 1 Score: 1705.3 bits (4415), Expect = 0.0e+00
Identity = 832/832 (100.00%), Postives = 832/832 (100.00%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
           MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL
Sbjct: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60

Query: 61  TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
           TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR
Sbjct: 61  TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120

Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
           TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180

Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
           HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY
Sbjct: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240

Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
           REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA
Sbjct: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300

Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
           LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN
Sbjct: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360

Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
           SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH
Sbjct: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420

Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
           NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP
Sbjct: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480

Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
           GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET
Sbjct: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540

Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
           SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS
Sbjct: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600

Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
           NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ
Sbjct: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660

Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
           YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720

Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
           IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW
Sbjct: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780

Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF 833
           DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF
Sbjct: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF 832

BLAST of Moc02g02020 vs. NCBI nr
Match: XP_038901996.1 (pentatricopeptide repeat-containing protein At4g14170-like [Benincasa hispida])

HSP 1 Score: 1461.0 bits (3781), Expect = 0.0e+00
Identity = 712/832 (85.58%), Postives = 761/832 (91.47%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLP-STEPVSSVHVDL 60
           ML  C RS+     SPF F  Q TRFQYR F  P+  SA  SP PLP STEP SS+H++L
Sbjct: 1   MLQFCIRSS---IVSPFPFISQITRFQYRHFHQPIFVSALQSPFPLPRSTEPNSSIHINL 60

Query: 61  LTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNC 120
           LTLCSNAQSL QTKQLHALCLLNGLLP SVSLC+SLIL+YA F+ PESF  LFHQTVQNC
Sbjct: 61  LTLCSNAQSLPQTKQLHALCLLNGLLPRSVSLCSSLILNYAKFQHPESFCSLFHQTVQNC 120

Query: 121 RTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGME 180
           RTAFLWNTLIRAHSIAGNG  DG +TYNRMVR GVQLDDHTFPF+LKLCSDS DI KGME
Sbjct: 121 RTAFLWNTLIRAHSIAGNGTRDGFETYNRMVRVGVQLDDHTFPFLLKLCSDSFDIWKGME 180

Query: 181 VHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGD 240
           VHGVV KLGFD+DVYVGNTLLMLYGNC  ++DARRVFDEMPERDVVSWNTI+GL SVNGD
Sbjct: 181 VHGVVFKLGFDTDVYVGNTLLMLYGNCRFLNDARRVFDEMPERDVVSWNTIIGLFSVNGD 240

Query: 241 YREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCN 300
           YREARNYYFWM LRSGI+PN+VSV+ LLPISA LEDEEMTRRIHCYT+K GLDS VT CN
Sbjct: 241 YREARNYYFWMNLRSGIKPNLVSVITLLPISAGLEDEEMTRRIHCYTVKVGLDSHVTICN 300

Query: 301 ALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKP 360
           ALVDAYGKCGNVKA WQVFDE+ ERNEVSWNA+INGLACKG   DAL+VF+MMIDA  KP
Sbjct: 301 ALVDAYGKCGNVKALWQVFDEIFERNEVSWNAMINGLACKGRCWDALNVFKMMIDAGAKP 360

Query: 361 NSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIF 420
           NS+T+SSILPVLVELE FKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSG STEAS IF
Sbjct: 361 NSITVSSILPVLVELECFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGRSTEASSIF 420

Query: 421 HNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLG 480
           HNM RRN+VSWNAMIANYALNGLALEAIR +ILMQETGE PNAVTFTNVLPACARLG LG
Sbjct: 421 HNMGRRNIVSWNAMIANYALNGLALEAIRFIILMQETGERPNAVTFTNVLPACARLGLLG 480

Query: 481 PGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSE 540
           PGKEIHA+ +RLG T+D+FVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILI+GYSE
Sbjct: 481 PGKEIHAMAVRLGPTSDLFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILIIGYSE 540

Query: 541 TSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFV 600
           ++DCLESLNLFSEMRLLG+KPDVVSFVGVISACANLAAVKQGKEIHGVALRN  YSHLFV
Sbjct: 541 SNDCLESLNLFSEMRLLGKKPDVVSFVGVISACANLAAVKQGKEIHGVALRNLLYSHLFV 600

Query: 601 SNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTV 660
           SNSLLDFYTKC RIDLACKVFNQILFKDVASWNTMILGYGMIGELETAI+MFE MR DTV
Sbjct: 601 SNSLLDFYTKCGRIDLACKVFNQILFKDVASWNTMILGYGMIGELETAISMFEAMRNDTV 660

Query: 661 QYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAE 720
           QYDLVSYIAVLSACSHGGLVE+GW+YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAE
Sbjct: 661 QYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAE 720

Query: 721 LIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGR 780
           LI+ LPIAPDANIWGALLGACRIY NV+LGC AAEHLFE+KPQHCGYYILLSN+YAE GR
Sbjct: 721 LIQQLPIAPDANIWGALLGACRIYGNVKLGCRAAEHLFELKPQHCGYYILLSNIYAETGR 780

Query: 781 WDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAES 832
           WDEVNR+R+LMKSRGAKKNPGCSWVQI+DQ+H F+AEERAEGFESGGWLAES
Sbjct: 781 WDEVNRIRELMKSRGAKKNPGCSWVQIYDQLHAFVAEERAEGFESGGWLAES 829

BLAST of Moc02g02020 vs. NCBI nr
Match: KAG6570435.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1425.6 bits (3689), Expect = 0.0e+00
Identity = 689/830 (83.01%), Postives = 750/830 (90.36%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
           ML  C RS   HFA       Q  RF +R+++               STE +SSVH++LL
Sbjct: 1   MLQFCIRSIRFHFA-------QIARFHFRNYVR--------------STEQISSVHINLL 60

Query: 61  TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
           TLC NAQSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF  LFHQTVQNCR
Sbjct: 61  TLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120

Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
           TAFLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGTLDGLETYNRMVRLGVQLDDHTFPFVLKICSDSLDICKGMEV 180

Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
           HGVV KLGFDS VYVGNTLLMLYGNCG ++DA++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSHVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240

Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
           REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300

Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
           LVDAYGKCG+VK SWQVFDE++E+NEVSWN+IINGLA KGHF DALDVFRMMIDA  KPN
Sbjct: 301 LVDAYGKCGSVKTSWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPN 360

Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
           SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGHSTEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFH 420

Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
           NMD RN+VSWNAMIANY LNG++LEAIR VIL+QE+GE PNAVTFTNVLPACARLG LGP
Sbjct: 421 NMDGRNIVSWNAMIANYVLNGVSLEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGP 480

Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
           GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC  SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540

Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
           +DCLESLNLFSEMRLLG+KPDVVSF+GVISACANLAAVKQGKEIHGVALRNH  SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLGKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVS 600

Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
           NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660

Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
           YDLVSYIAVLSACSHGGLVE+GW+YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720

Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
           IR LPIAPD+NIWGALLGACRIY NVELGC AAEHLFE+KPQHCGYYILL+N++AE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHLFELKPQHCGYYILLANIHAETGRW 780

Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
           DEVNR+R+LMKSRGAKK+PGCSWVQIHDQ H F+ ++RAEGFESGG LAE
Sbjct: 781 DEVNRIRELMKSRGAKKSPGCSWVQIHDQPHAFVVDDRAEGFESGGLLAE 809

BLAST of Moc02g02020 vs. NCBI nr
Match: XP_022985648.1 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1423.7 bits (3684), Expect = 0.0e+00
Identity = 687/830 (82.77%), Postives = 752/830 (90.60%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
           ML  C RS   HFA       Q  RFQ+R+++               STEP SSVH++LL
Sbjct: 1   MLQFCIRSIRFHFA-------QIARFQFRNYVR--------------STEPNSSVHINLL 60

Query: 61  TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
           TLC N+QSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF  LFHQTVQNCR
Sbjct: 61  TLCFNSQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120

Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
           T FLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TTFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEV 180

Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
           HGVV KLGFDSDVYVGNTLLMLYGNCG ++ A++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSDVYVGNTLLMLYGNCGFLNGAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240

Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
           REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300

Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
           LVDAYGKCG+VKASWQVFDE++E+NEVSWN+IINGLA KGHF DAL+VFRMMIDA  KPN
Sbjct: 301 LVDAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFWDALEVFRMMIDAGTKPN 360

Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
           SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGH TEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHLTEASSIFH 420

Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
           NMD RN+VSWNAMIANYALNG+ALEAIR VIL+QE+GE PNAVTFTNVLPACARLG LGP
Sbjct: 421 NMDGRNIVSWNAMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGP 480

Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
           GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC  SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540

Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
           +DCLESLNLFSEMRLLG+KPDVVSF+GV+SACANLAAVKQGKEIHGVALRNH  SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLGKKPDVVSFMGVLSACANLAAVKQGKEIHGVALRNHLNSHLFVS 600

Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
           NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660

Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
           YD+VSYIAVLSACSHGGLVE+G +YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDVVSYIAVLSACSHGGLVERGCQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720

Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
           IR LPIAPD+NIWGALLGACRIY N++LGC AAEHLFE+KPQHCGYYILLSNMYAE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNIDLGCKAAEHLFELKPQHCGYYILLSNMYAETGRW 780

Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
           D+VNR+R+LMKSRGAKK+PGCSWVQIHDQ+H F+ ++RAEGFESGG+LAE
Sbjct: 781 DDVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGFLAE 809

BLAST of Moc02g02020 vs. NCBI nr
Match: XP_022944177.1 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 1421.4 bits (3678), Expect = 0.0e+00
Identity = 690/830 (83.13%), Postives = 748/830 (90.12%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
           ML  C RS   HFA       Q  RFQ+R+F+                TEP SSVH++LL
Sbjct: 1   MLQFCIRSFRFHFA-------QIARFQFRNFVR--------------RTEPNSSVHINLL 60

Query: 61  TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
           TLC NAQSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF  LFHQTVQNCR
Sbjct: 61  TLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120

Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
           TAFLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEV 180

Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
           HGVV KLGFDSDVYVGNTLLMLYGNCG ++DA++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240

Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
           REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300

Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
           LVDAY KCG+VKASWQVFDE++E+NEVSWN+IINGLA KGHF DALDVFRMMIDA  KPN
Sbjct: 301 LVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPN 360

Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
           SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGHSTEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFH 420

Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
           NMD RN+VSWNAMIANY LNG+ALEAIR VIL+QE+GE PNAVTFTNVLPACAR G LGP
Sbjct: 421 NMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGP 480

Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
           GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC  SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540

Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
           +DCLESLNLFSEMRLL +KPDVVSF+GVISACANLAAVKQGKEIHGVALRNH  SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVS 600

Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
           NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660

Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
           YDLVSYIAVLSACSHGGLVE+GW+Y SEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720

Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
           IR LPIAPD+NIWGALLGACRIY NVELGC AAE LFE+KPQHCGYYILL+NM+AE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRW 780

Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
           DEVNR+R+LMKSRGAKK+PGCSWVQIHDQ+H F+ ++RAEGFESGG LAE
Sbjct: 781 DEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGLLAE 809

BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 490.0 bits (1260), Expect = 5.5e-137
Identity = 258/663 (38.91%), Postives = 395/663 (59.58%), Query Frame = 0

Query: 156 LDDHTFPFVLKLCSDSLDICKGMEVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRV 215
           +D  T   VL+LC+DS  +  G EV   +   GF  D  +G+ L ++Y NCG + +A RV
Sbjct: 92  IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRV 151

Query: 216 FDEMPERDVVSWNTILGLLSVNGDYREARNYYFWMTLRSGIQPNVVSVVILLPISAALED 275
           FDE+     + WN ++  L+ +GD+  +   +  M + SG++ +  +   +    ++L  
Sbjct: 152 FDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKM-MSSGVEMDSYTFSCVSKSFSSLRS 211

Query: 276 EEMTRRIHCYTMKAGLDSQVTTCNALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIING 335
                ++H + +K+G   + +  N+LV  Y K   V ++ +VFDEM ER+ +SWN+IING
Sbjct: 212 VHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIING 271

Query: 336 LACKGHFCDALDVFRMMIDAEVKPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETD 395
               G     L VF  M+ + ++ +  TI S+     +      G+ +H   ++     +
Sbjct: 272 YVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSRE 331

Query: 396 IFIANSLIDMYAKSGHSTEASCIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQE 455
               N+L+DMY+K G    A  +F  M  R+VVS+ +MIA YA  GLA EA++    M+E
Sbjct: 332 DRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEE 391

Query: 456 TGENPNAVTFTNVLPACARLGFLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHS 515
            G +P+  T T VL  CAR   L  GK +H       L  D+FVSNAL DMYAKCG +  
Sbjct: 392 EGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQE 451

Query: 516 ARNVFNTSH-KDEVSYNILILGYSETSDCLESLNLFSEMRLLGR-KPDVVSFVGVISACA 575
           A  VF+    KD +S+N +I GYS+     E+L+LF+ +    R  PD  +   V+ ACA
Sbjct: 452 AELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACA 511

Query: 576 NLAAVKQGKEIHGVALRNHFYSHLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNT 635
           +L+A  +G+EIHG  +RN ++S   V+NSL+D Y KC  + LA  +F+ I  KD+ SW  
Sbjct: 512 SLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTV 571

Query: 636 MILGYGMIGELETAITMFEEMRGDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ- 695
           MI GYGM G  + AI +F +MR   ++ D +S++++L ACSH GLV++GW++F+ M  + 
Sbjct: 572 MIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHEC 631

Query: 696 NLEPTQMHYACIVDLLGRAGFVEEAAELIRCLPIAPDANIWGALLGACRIYRNVELGCWA 755
            +EPT  HYACIVD+L R G + +A   I  +PI PDA IWGALL  CRI+ +V+L    
Sbjct: 632 KIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKV 691

Query: 756 AEHLFEIKPQHCGYYILLSNMYAEAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHT 815
           AE +FE++P++ GYY+L++N+YAEA +W++V R+R  +  RG +KNPGCSW++I  +V+ 
Sbjct: 692 AEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNI 751

BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match: Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 473.4 bits (1217), Expect = 5.3e-132
Identity = 265/753 (35.19%), Postives = 429/753 (56.97%), Query Frame = 0

Query: 64  SNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCRTAF 123
           S++ +L + +++HAL +  G L  S      LI  Y+ FR+P S   +F + V   +  +
Sbjct: 15  SSSSNLNELRRIHALVISLG-LDSSDFFSGKLIDKYSHFREPASSLSVFRR-VSPAKNVY 74

Query: 124 LWNTLIRAHSIAGNGMI-DGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEVHG 183
           LWN++IRA S   NG+  + L+ Y ++  S V  D +TFP V+K C+   D   G  V+ 
Sbjct: 75  LWNSIIRAFS--KNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYE 134

Query: 184 VVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDYRE 243
            +  +GF+SD++VGN L+ +Y   G+++ AR+VFDEMP RD+VSWN+++   S +G Y E
Sbjct: 135 QILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEE 194

Query: 244 ARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNALV 303
           A   Y  +   S I P+  +V  +LP    L   +  + +H + +K+G++S V   N LV
Sbjct: 195 ALEIYHELK-NSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLV 254

Query: 304 DAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPNSV 363
             Y K      + +VFDEM  R+ VS+N +I G        +++ +F   +D + KP+ +
Sbjct: 255 AMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLENLD-QFKPDLL 314

Query: 364 TISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFHNM 423
           T+SS+L     L      K I+ + ++ G   +  + N LID+YAK G    A  +F++M
Sbjct: 315 TVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSM 374

Query: 424 DRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGPGK 483
           + ++ VSWN++I+ Y  +G  +EA++   +M    E  + +T+  ++    RL  L  GK
Sbjct: 375 ECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGK 434

Query: 484 EIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNT-SHKDEVSYNILILGYSETS 543
            +H+ GI+ G+  D+ VSNAL DMYAKCG +  +  +F++    D V++N +I       
Sbjct: 435 GLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFG 494

Query: 544 DCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVSN 603
           D    L + ++MR     PD+ +F+  +  CA+LAA + GKEIH   LR  + S L + N
Sbjct: 495 DFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGN 554

Query: 604 SLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQY 663
           +L++ Y+KC  ++ + +VF ++  +DV +W  MI  YGM GE E A+  F +M    +  
Sbjct: 555 ALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVP 614

Query: 664 DLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ-NLEPTQMHYACIVDLLGRAGFVEEAAEL 723
           D V +IA++ ACSH GLV++G   F +M     ++P   HYAC+VDLL R+  + +A E 
Sbjct: 615 DSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEF 674

Query: 724 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 783
           I+ +PI PDA+IW ++L ACR   ++E     +  + E+ P   GY IL SN YA   +W
Sbjct: 675 IQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKW 734

Query: 784 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTF 814
           D+V+ +R  +K +   KNPG SW+++   VH F
Sbjct: 735 DKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVF 761

BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match: Q0WN60 (Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H8 PE=2 SV=2)

HSP 1 Score: 471.9 bits (1213), Expect = 1.5e-131
Identity = 263/776 (33.89%), Postives = 437/776 (56.31%), Query Frame = 0

Query: 59  LLTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQN 118
           LL      + +   +++H L   +  L +   LC  +I  YA    P+  R +F      
Sbjct: 90  LLQASGKRKDIEMGRKIHQLVSGSTRLRNDDVLCTRIITMYAMCGSPDDSRFVF--DALR 149

Query: 119 CRTAFLWNTLIRAHSIAGNGMIDG-LQTYNRMVRSGVQLDDH-TFPFVLKLCSDSLDICK 178
            +  F WN +I ++S   N + D  L+T+  M+ +   L DH T+P V+K C+   D+  
Sbjct: 150 SKNLFQWNAVISSYS--RNELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGI 209

Query: 179 GMEVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSV 238
           G+ VHG+V K G   DV+VGN L+  YG  G V+DA ++FD MPER++VSWN+++ + S 
Sbjct: 210 GLAVHGLVVKTGLVEDVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSD 269

Query: 239 NGDYREARNYYFWMTLRSG---IQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDS 298
           NG   E+      M   +G     P+V ++V +LP+ A   +  + + +H + +K  LD 
Sbjct: 270 NGFSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDK 329

Query: 299 QVTTCNALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMI 358
           ++   NAL+D Y KCG +  +  +F     +N VSWN ++ G + +G      DV R M+
Sbjct: 330 ELVLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQML 389

Query: 359 --DAEVKPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGH 418
               +VK + VTI + +PV     +  + KE+H +S++     +  +AN+ +  YAK G 
Sbjct: 390 AGGEDVKADEVTILNAVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGS 449

Query: 419 STEASCIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPA 478
            + A  +FH +  + V SWNA+I  +A +     ++   + M+ +G  P++ T  ++L A
Sbjct: 450 LSYAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLSA 509

Query: 479 CARLGFLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNT-SHKDEVSY 538
           C++L  L  GKE+H   IR  L  D+FV  ++  +Y  CG L + + +F+    K  VS+
Sbjct: 510 CSKLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSW 569

Query: 539 NILILGYSETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALR 598
           N +I GY +      +L +F +M L G +   +S + V  AC+ L +++ G+E H  AL+
Sbjct: 570 NTVITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALK 629

Query: 599 NHFYSHLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITM 658
           +      F++ SL+D Y K   I  + KVFN +  K  ASWN MI+GYG+ G  + AI +
Sbjct: 630 HLLEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKL 689

Query: 659 FEEMRGDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ-NLEPTQMHYACIVDLLG 718
           FEEM+      D ++++ VL+AC+H GL+ +G +Y  +M +   L+P   HYAC++D+LG
Sbjct: 690 FEEMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLG 749

Query: 719 RAGFVEEAAELI-RCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYI 778
           RAG +++A  ++   +    D  IW +LL +CRI++N+E+G   A  LFE++P+    Y+
Sbjct: 750 RAGQLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYV 809

Query: 779 LLSNMYAEAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEER-AEGFE 824
           LLSN+YA  G+W++V +VR  M     +K+ GCSW++++ +V +F+  ER  +GFE
Sbjct: 810 LLSNLYAGLGKWEDVRKVRQRMNEMSLRKDAGCSWIELNRKVFSFVVGERFLDGFE 861

BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match: Q9M9E2 (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 2.2e-130
Identity = 237/689 (34.40%), Postives = 389/689 (56.46%), Query Frame = 0

Query: 132 HSIAGNGMI-DGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEVHGVVSKLGFD 191
           H +  NG + + ++  N M    V +D+  F  +++LC       +G +V+ +       
Sbjct: 67  HGLCANGKLEEAMKLLNSMQELRVAVDEDVFVALVRLCEWKRAQEEGSKVYSIALSSMSS 126

Query: 192 SDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDYREARNYYFWM 251
             V +GN  L ++   G + DA  VF +M ER++ SWN ++G  +  G + EA   Y  M
Sbjct: 127 LGVELGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRM 186

Query: 252 TLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNALVDAYGKCGN 311
               G++P+V +   +L     + D    + +H + ++ G +  +   NAL+  Y KCG+
Sbjct: 187 LWVGGVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGD 246

Query: 312 VKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPNSVTISSILPV 371
           VK++  +FD M  R+ +SWNA+I+G    G   + L++F  M    V P+ +T++S++  
Sbjct: 247 VKSARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISA 306

Query: 372 LVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFHNMDRRNVVSW 431
              L   + G++IH + +  G   DI + NSL  MY  +G   EA  +F  M+R+++VSW
Sbjct: 307 CELLGDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSW 366

Query: 432 NAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGPGKEIHAIGIR 491
             MI+ Y  N L  +AI    +M +    P+ +T   VL ACA LG L  G E+H + I+
Sbjct: 367 TTMISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIK 426

Query: 492 LGLTADMFVSNALTDMYAKCGCLHSARNVF-NTSHKDEVSYNILILGYSETSDCLESLNL 551
             L + + V+N L +MY+KC C+  A ++F N   K+ +S+  +I G    + C E+L  
Sbjct: 427 ARLISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIF 486

Query: 552 FSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVSNSLLDFYTK 611
             +M++   +P+ ++    ++ACA + A+  GKEIH   LR       F+ N+LLD Y +
Sbjct: 487 LRQMKMT-LQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVR 546

Query: 612 CARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQYDLVSYIAV 671
           C R++ A   FN    KDV SWN ++ GY   G+    + +F+ M    V+ D +++I++
Sbjct: 547 CGRMNTAWSQFNS-QKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISL 606

Query: 672 LSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAELIRCLPIAPD 731
           L  CS   +V QG  YFS+M    + P   HYAC+VDLLGRAG ++EA + I+ +P+ PD
Sbjct: 607 LCGCSKSQMVRQGLMYFSKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPD 666

Query: 732 ANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRWDEVNRVRDL 791
             +WGALL ACRI+  ++LG  +A+H+FE+  +  GYYILL N+YA+ G+W EV +VR +
Sbjct: 667 PAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRM 726

Query: 792 MKSRGAKKNPGCSWVQIHDQVHTFMAEER 819
           MK  G   + GCSWV++  +VH F+++++
Sbjct: 727 MKENGLTVDAGCSWVEVKGKVHAFLSDDK 753

BLAST of Moc02g02020 vs. ExPASy Swiss-Prot
Match: Q9C507 (Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E66 PE=3 SV=1)

HSP 1 Score: 453.4 bits (1165), Expect = 5.7e-126
Identity = 252/762 (33.07%), Postives = 429/762 (56.30%), Query Frame = 0

Query: 60  LTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNC 119
           + L  +  SLR   QLHA  L+ G L         LI  YA    P+S R++F       
Sbjct: 5   MPLFRSCSSLRLVSQLHAHLLVTGRLRRDPLPVTKLIESYAFMGSPDSSRLVFEAFPY-- 64

Query: 120 RTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLD-ICKGM 179
             +F++  LI+  ++  + +   +  Y+R+V    Q+    FP VL+ C+ S + +  G 
Sbjct: 65  PDSFMYGVLIKC-NVWCHLLDAAIDLYHRLVSETTQISKFVFPSVLRACAGSREHLSVGG 124

Query: 180 EVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNG 239
           +VHG + K G D D  +  +LL +YG  G +SDA +VFD MP RD+V+W+T++     NG
Sbjct: 125 KVHGRIIKGGVDDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENG 184

Query: 240 DYREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTC 299
           +  +A   +  M +  G++P+ V+++ ++   A L    + R +H    +   D   T C
Sbjct: 185 EVVKALRMFKCM-VDDGVEPDAVTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLC 244

Query: 300 NALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCD-ALDVFRMMIDAEV 359
           N+L+  Y KCG++ +S ++F+++ ++N VSW A+I+    +G F + AL  F  MI + +
Sbjct: 245 NSLLTMYSKCGDLLSSERIFEKIAKKNAVSWTAMISSYN-RGEFSEKALRSFSEMIKSGI 304

Query: 360 KPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDI-FIANSLIDMYAKSGHSTEAS 419
           +PN VT+ S+L     +   + GK +HGF++R   + +   ++ +L+++YA+ G  ++  
Sbjct: 305 EPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYESLSLALVELYAECGKLSDCE 364

Query: 420 CIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLG 479
            +   +  RN+V+WN++I+ YA  G+ ++A+     M      P+A T  + + AC   G
Sbjct: 365 TVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAG 424

Query: 480 FLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFN-TSHKDEVSYNILIL 539
            +  GK+IH   IR  + +D FV N+L DMY+K G + SA  VFN   H+  V++N ++ 
Sbjct: 425 LVPLGKQIHGHVIRTDV-SDEFVQNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLC 484

Query: 540 GYSETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYS 599
           G+S+  + +E+++LF  M     + + V+F+ VI AC+++ ++++GK +H   + +    
Sbjct: 485 GFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGL-K 544

Query: 600 HLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMR 659
            LF   +L+D Y KC  ++ A  VF  +  + + SW++MI  YGM G + +AI+ F +M 
Sbjct: 545 DLFTDTALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMV 604

Query: 660 GDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVE 719
               + + V ++ VLSAC H G VE+G  YF+ M +  + P   H+AC +DLL R+G ++
Sbjct: 605 ESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMKSFGVSPNSEHFACFIDLLSRSGDLK 664

Query: 720 EAAELIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYA 779
           EA   I+ +P   DA++WG+L+  CRI++ +++       L +I     GYY LLSN+YA
Sbjct: 665 EAYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYA 724

Query: 780 EAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEE 818
           E G W+E  R+R  MKS   KK PG S ++I  +V  F A E
Sbjct: 725 EEGEWEEFRRLRSAMKSSNLKKVPGYSAIEIDQKVFRFGAGE 759

BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match: A0A6J1DHT8 (pentatricopeptide repeat-containing protein At4g14170-like OS=Momordica charantia OX=3673 GN=LOC111020621 PE=4 SV=1)

HSP 1 Score: 1705.3 bits (4415), Expect = 0.0e+00
Identity = 832/832 (100.00%), Postives = 832/832 (100.00%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
           MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL
Sbjct: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60

Query: 61  TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
           TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR
Sbjct: 61  TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120

Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
           TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180

Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
           HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY
Sbjct: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240

Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
           REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA
Sbjct: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300

Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
           LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN
Sbjct: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360

Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
           SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH
Sbjct: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420

Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
           NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP
Sbjct: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480

Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
           GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET
Sbjct: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540

Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
           SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS
Sbjct: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600

Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
           NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ
Sbjct: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660

Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
           YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720

Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
           IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW
Sbjct: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780

Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF 833
           DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF
Sbjct: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAESF 832

BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match: A0A6J1JE86 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483644 PE=4 SV=1)

HSP 1 Score: 1423.7 bits (3684), Expect = 0.0e+00
Identity = 687/830 (82.77%), Postives = 752/830 (90.60%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
           ML  C RS   HFA       Q  RFQ+R+++               STEP SSVH++LL
Sbjct: 1   MLQFCIRSIRFHFA-------QIARFQFRNYVR--------------STEPNSSVHINLL 60

Query: 61  TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
           TLC N+QSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF  LFHQTVQNCR
Sbjct: 61  TLCFNSQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120

Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
           T FLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TTFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEV 180

Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
           HGVV KLGFDSDVYVGNTLLMLYGNCG ++ A++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSDVYVGNTLLMLYGNCGFLNGAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240

Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
           REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300

Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
           LVDAYGKCG+VKASWQVFDE++E+NEVSWN+IINGLA KGHF DAL+VFRMMIDA  KPN
Sbjct: 301 LVDAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFWDALEVFRMMIDAGTKPN 360

Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
           SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGH TEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHLTEASSIFH 420

Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
           NMD RN+VSWNAMIANYALNG+ALEAIR VIL+QE+GE PNAVTFTNVLPACARLG LGP
Sbjct: 421 NMDGRNIVSWNAMIANYALNGVALEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGP 480

Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
           GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC  SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540

Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
           +DCLESLNLFSEMRLLG+KPDVVSF+GV+SACANLAAVKQGKEIHGVALRNH  SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLGKKPDVVSFMGVLSACANLAAVKQGKEIHGVALRNHLNSHLFVS 600

Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
           NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660

Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
           YD+VSYIAVLSACSHGGLVE+G +YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDVVSYIAVLSACSHGGLVERGCQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720

Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
           IR LPIAPD+NIWGALLGACRIY N++LGC AAEHLFE+KPQHCGYYILLSNMYAE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNIDLGCKAAEHLFELKPQHCGYYILLSNMYAETGRW 780

Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
           D+VNR+R+LMKSRGAKK+PGCSWVQIHDQ+H F+ ++RAEGFESGG+LAE
Sbjct: 781 DDVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGFLAE 809

BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match: A0A6J1FV31 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448704 PE=4 SV=1)

HSP 1 Score: 1421.4 bits (3678), Expect = 0.0e+00
Identity = 690/830 (83.13%), Postives = 748/830 (90.12%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPLPLPSTEPVSSVHVDLL 60
           ML  C RS   HFA       Q  RFQ+R+F+                TEP SSVH++LL
Sbjct: 1   MLQFCIRSFRFHFA-------QIARFQFRNFVR--------------RTEPNSSVHINLL 60

Query: 61  TLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCR 120
           TLC NAQSLRQTK++HA+CLLNGLLPHSVSLCASLIL+YA F+ PESF  LFHQTVQNCR
Sbjct: 61  TLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCR 120

Query: 121 TAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEV 180
           TAFLWNTLIRAHSIAGNG +DGL+TYNRMVR GVQLDDHTFPFVLK+CSDSLDICKGMEV
Sbjct: 121 TAFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEV 180

Query: 181 HGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDY 240
           HGVV KLGFDSDVYVGNTLLMLYGNCG ++DA++VFDEM ERDVVSWNT++GLLSVNGDY
Sbjct: 181 HGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDY 240

Query: 241 REARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNA 300
           REARNYYFWMTLRSGIQPN+VSV+ LLPISA LEDEEMTRRIHCY +K GLDS VT+CNA
Sbjct: 241 REARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNA 300

Query: 301 LVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPN 360
           LVDAY KCG+VKASWQVFDE++E+NEVSWN+IINGLA KGHF DALDVFRMMIDA  KPN
Sbjct: 301 LVDAYWKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPN 360

Query: 361 SVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFH 420
           SVTISSILPV VELE FKAGKEIHGFSMRMGTETD+FIANSLIDMYAKSGHSTEAS IFH
Sbjct: 361 SVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFH 420

Query: 421 NMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGP 480
           NMD RN+VSWNAMIANY LNG+ALEAIR VIL+QE+GE PNAVTFTNVLPACAR G LGP
Sbjct: 421 NMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGP 480

Query: 481 GKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYSET 540
           GKEIHA+G+RLGLT+D+FV+NALTDMYAKCGC  SARNVFNTSHKDEVSYNILI GYSET
Sbjct: 481 GKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSET 540

Query: 541 SDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVS 600
           +DCLESLNLFSEMRLL +KPDVVSF+GVISACANLAAVKQGKEIHGVALRNH  SHLFVS
Sbjct: 541 NDCLESLNLFSEMRLLRKKPDVVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVS 600

Query: 601 NSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQ 660
           NSLLDFYTKC RIDLACK+FNQILFKDVASWNTMILGYGMIGELETAI MFE MR D VQ
Sbjct: 601 NSLLDFYTKCGRIDLACKIFNQILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQ 660

Query: 661 YDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAEL 720
           YDLVSYIAVLSACSHGGLVE+GW+Y SEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAAEL
Sbjct: 661 YDLVSYIAVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAEL 720

Query: 721 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 780
           IR LPIAPD+NIWGALLGACRIY NVELGC AAE LFE+KPQHCGYYILL+NM+AE GRW
Sbjct: 721 IRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRW 780

Query: 781 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAE 831
           DEVNR+R+LMKSRGAKK+PGCSWVQIHDQ+H F+ ++RAEGFESGG LAE
Sbjct: 781 DEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGLLAE 809

BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match: A0A5A7V0P3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold212G001270 PE=4 SV=1)

HSP 1 Score: 1417.5 bits (3668), Expect = 0.0e+00
Identity = 693/833 (83.19%), Postives = 754/833 (90.52%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPL--PLPSTEPVSSVHVD 60
           ML  C RS+ +    PF F  Q TRF+YR+F  P+L SA  SPL  P  ST+  S +H++
Sbjct: 1   MLELCIRSS-IVSPLPFPFPSQITRFRYRNFHQPILVSALQSPLSPPSSSTDSNSFIHIN 60

Query: 61  LLTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQN 120
           LLTLCS  QSL QTKQ+HAL +LNG LP SVSLCASLIL+YA F+ PESF  LF+QT QN
Sbjct: 61  LLTLCSKVQSLLQTKQVHALGILNGFLPRSVSLCASLILNYAKFQHPESFCSLFNQTFQN 120

Query: 121 CRTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGM 180
           CRTAFLWNTLIRAHSIA NG +DG +TYNRMVR GVQLDDHTFPFVLKLCSDS DICKGM
Sbjct: 121 CRTAFLWNTLIRAHSIAWNGTLDGFETYNRMVRLGVQLDDHTFPFVLKLCSDSFDICKGM 180

Query: 181 EVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNG 240
           EVHGVV KLGFD+DVYVGNTLLMLYGNCG ++DARRVFDEMPERDVVSWNT++GLLSVNG
Sbjct: 181 EVHGVVFKLGFDTDVYVGNTLLMLYGNCGFLNDARRVFDEMPERDVVSWNTVIGLLSVNG 240

Query: 241 DYREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTC 300
           DY+EARNYYFWM LRSGI+PN+VSV+ LLPISAALEDEEMTRRIHC+++K GLDSQVTTC
Sbjct: 241 DYKEARNYYFWMILRSGIKPNLVSVISLLPISAALEDEEMTRRIHCFSVKVGLDSQVTTC 300

Query: 301 NALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVK 360
           NALVDAYGKCG+VKA WQVF+EMVERNEVSWN+IINGLACKG   DAL  FRMMIDA  K
Sbjct: 301 NALVDAYGKCGSVKALWQVFNEMVERNEVSWNSIINGLACKGRCWDALKAFRMMIDAGAK 360

Query: 361 PNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCI 420
           PNSVTISSILPVLVELE FKAGKEIHGFSMR+GTETDIFIANSLIDMYAKSG STEAS I
Sbjct: 361 PNSVTISSILPVLVELECFKAGKEIHGFSMRIGTETDIFIANSLIDMYAKSGRSTEASTI 420

Query: 421 FHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFL 480
           FHN+DRRNVV+WNAMIANYALN L LEAIR VI MQETGE PNAVTFTNVLPACARLGFL
Sbjct: 421 FHNLDRRNVVTWNAMIANYALNRLPLEAIRFVIQMQETGECPNAVTFTNVLPACARLGFL 480

Query: 481 GPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYS 540
           GPGKEIHA+ +R+GLT+D+FVSN+L DMYAKCG L SARN+FNTSHKDEVSYNILI+GYS
Sbjct: 481 GPGKEIHAMVVRIGLTSDLFVSNSLIDMYAKCGSLCSARNLFNTSHKDEVSYNILIIGYS 540

Query: 541 ETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLF 600
           ET+DC +SLNLFSEMRLLG+KPDVVSFVGVISACANLAA+KQGKEIHGVALRNH YSHLF
Sbjct: 541 ETNDCFQSLNLFSEMRLLGKKPDVVSFVGVISACANLAALKQGKEIHGVALRNHLYSHLF 600

Query: 601 VSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDT 660
           VSNSLLDFYTKC RID+AC+VFNQILFKDVASWNTMILGYGMIGELETAI+MFE MR DT
Sbjct: 601 VSNSLLDFYTKCGRIDIACRVFNQILFKDVASWNTMILGYGMIGELETAISMFEAMRDDT 660

Query: 661 VQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAA 720
           VQYDLVSYIAVLSACSHGGLVE+GW+YFSEMLAQ+LEPT+MHY C+VDLLGRAGFVEEAA
Sbjct: 661 VQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEMHYTCMVDLLGRAGFVEEAA 720

Query: 721 ELIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAG 780
           ELI+ LPIAPDANIWGALLGACRIY NVELGC AAEHLFE+KPQHCGYYILLSN+YAE G
Sbjct: 721 ELIQRLPIAPDANIWGALLGACRIYGNVELGCRAAEHLFELKPQHCGYYILLSNIYAETG 780

Query: 781 RWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAES 832
           RWDE NR+R+LMKSRGAKKNPGCSWVQI DQVH+F+AEER EGFESG WLAES
Sbjct: 781 RWDEANRIRELMKSRGAKKNPGCSWVQICDQVHSFVAEERVEGFESGDWLAES 832

BLAST of Moc02g02020 vs. ExPASy TrEMBL
Match: A0A0A0KEH6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G428560 PE=4 SV=1)

HSP 1 Score: 1417.5 bits (3668), Expect = 0.0e+00
Identity = 691/833 (82.95%), Postives = 751/833 (90.16%), Query Frame = 0

Query: 1   MLHSCARSTGLHFASPFQFSFQTTRFQYRSFLHPVLGSASSSPL--PLPSTEPVSSVHVD 60
           ML  C RS+ +    PF F  Q  RFQYR+FL P+L SA  SPL  P  ST+P S +H++
Sbjct: 1   MLEFCIRSS-IVSPLPFPFPSQIIRFQYRNFLQPILVSALQSPLSPPSSSTDPNSYIHIN 60

Query: 61  LLTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQN 120
           LLTLCS  QSL QTKQ+HAL +LNG LP SVSLCASLIL+YA F+ P SF  LF+QT QN
Sbjct: 61  LLTLCSKVQSLLQTKQVHALGILNGFLPRSVSLCASLILNYAKFQHPGSFCSLFNQTFQN 120

Query: 121 CRTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGM 180
           CRTAFLWNTLIRAHSIA NG  DG +TYNRMVR GVQLDDHTFPFVLKLCSDS DICKGM
Sbjct: 121 CRTAFLWNTLIRAHSIAWNGTFDGFETYNRMVRRGVQLDDHTFPFVLKLCSDSFDICKGM 180

Query: 181 EVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNG 240
           EVHGVV KLGFD+DVYVGNTLLMLYGNCG ++DARR+FDEMPERDVVSWNTI+GLLSVNG
Sbjct: 181 EVHGVVFKLGFDTDVYVGNTLLMLYGNCGFLNDARRLFDEMPERDVVSWNTIIGLLSVNG 240

Query: 241 DYREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTC 300
           DY EARNYYFWM LRS I+PN+VSV+ LLPISAALEDEEMTRRIHCY++K GLDSQVTTC
Sbjct: 241 DYTEARNYYFWMILRSVIKPNLVSVISLLPISAALEDEEMTRRIHCYSVKVGLDSQVTTC 300

Query: 301 NALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVK 360
           NALVDAYGKCG+VKA WQVF+E VE+NEVSWN+IINGLACKG   DAL+ FRMMIDA  +
Sbjct: 301 NALVDAYGKCGSVKALWQVFNETVEKNEVSWNSIINGLACKGRCWDALNAFRMMIDAGAQ 360

Query: 361 PNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCI 420
           PNSVTISSILPVLVELE FKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEAS I
Sbjct: 361 PNSVTISSILPVLVELECFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASTI 420

Query: 421 FHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFL 480
           FHN+DRRN+VSWNAMIANYALN L LEAIR VI MQETGE PNAVTFTNVLPACARLGFL
Sbjct: 421 FHNLDRRNIVSWNAMIANYALNRLPLEAIRFVIQMQETGECPNAVTFTNVLPACARLGFL 480

Query: 481 GPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNTSHKDEVSYNILILGYS 540
           GPGKEIHA+G+R+GLT+D+FVSN+L DMYAKCGCLHSARNVFNTS KDEVSYNILI+GYS
Sbjct: 481 GPGKEIHAMGVRIGLTSDLFVSNSLIDMYAKCGCLHSARNVFNTSRKDEVSYNILIIGYS 540

Query: 541 ETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLF 600
           ET DCL+SLNLFSEMRLLG+KPDVVSFVGVISACANLAA+KQGKE+HGVALRNH YSHLF
Sbjct: 541 ETDDCLQSLNLFSEMRLLGKKPDVVSFVGVISACANLAALKQGKEVHGVALRNHLYSHLF 600

Query: 601 VSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDT 660
           VSNSLLDFYTKC RID+AC++FNQILFKDVASWNTMILGYGMIGELETAI+MFE MR DT
Sbjct: 601 VSNSLLDFYTKCGRIDIACRLFNQILFKDVASWNTMILGYGMIGELETAISMFEAMRDDT 660

Query: 661 VQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAA 720
           VQYDLVSYIAVLSACSHGGLVE+GW+YFSEMLAQ LEPT+MHY C+VDLLGRAGFVEEAA
Sbjct: 661 VQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQRLEPTEMHYTCMVDLLGRAGFVEEAA 720

Query: 721 ELIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAG 780
           +LI+ LPIAPDANIWGALLGACRIY NVELG  AAEHLFE+KPQHCGYYILLSN+YAE G
Sbjct: 721 KLIQQLPIAPDANIWGALLGACRIYGNVELGRRAAEHLFELKPQHCGYYILLSNIYAETG 780

Query: 781 RWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEERAEGFESGGWLAES 832
           RWDE N++R+LMKSRGAKKNPGCSWVQI+DQVH F+AEER EGFE G WLAES
Sbjct: 781 RWDEANKIRELMKSRGAKKNPGCSWVQIYDQVHAFVAEERVEGFELGDWLAES 832

BLAST of Moc02g02020 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 490.0 bits (1260), Expect = 3.9e-138
Identity = 258/663 (38.91%), Postives = 395/663 (59.58%), Query Frame = 0

Query: 156 LDDHTFPFVLKLCSDSLDICKGMEVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRV 215
           +D  T   VL+LC+DS  +  G EV   +   GF  D  +G+ L ++Y NCG + +A RV
Sbjct: 92  IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRV 151

Query: 216 FDEMPERDVVSWNTILGLLSVNGDYREARNYYFWMTLRSGIQPNVVSVVILLPISAALED 275
           FDE+     + WN ++  L+ +GD+  +   +  M + SG++ +  +   +    ++L  
Sbjct: 152 FDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKM-MSSGVEMDSYTFSCVSKSFSSLRS 211

Query: 276 EEMTRRIHCYTMKAGLDSQVTTCNALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIING 335
                ++H + +K+G   + +  N+LV  Y K   V ++ +VFDEM ER+ +SWN+IING
Sbjct: 212 VHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIING 271

Query: 336 LACKGHFCDALDVFRMMIDAEVKPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETD 395
               G     L VF  M+ + ++ +  TI S+     +      G+ +H   ++     +
Sbjct: 272 YVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSRE 331

Query: 396 IFIANSLIDMYAKSGHSTEASCIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQE 455
               N+L+DMY+K G    A  +F  M  R+VVS+ +MIA YA  GLA EA++    M+E
Sbjct: 332 DRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEE 391

Query: 456 TGENPNAVTFTNVLPACARLGFLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHS 515
            G +P+  T T VL  CAR   L  GK +H       L  D+FVSNAL DMYAKCG +  
Sbjct: 392 EGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQE 451

Query: 516 ARNVFNTSH-KDEVSYNILILGYSETSDCLESLNLFSEMRLLGR-KPDVVSFVGVISACA 575
           A  VF+    KD +S+N +I GYS+     E+L+LF+ +    R  PD  +   V+ ACA
Sbjct: 452 AELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACA 511

Query: 576 NLAAVKQGKEIHGVALRNHFYSHLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNT 635
           +L+A  +G+EIHG  +RN ++S   V+NSL+D Y KC  + LA  +F+ I  KD+ SW  
Sbjct: 512 SLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTV 571

Query: 636 MILGYGMIGELETAITMFEEMRGDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ- 695
           MI GYGM G  + AI +F +MR   ++ D +S++++L ACSH GLV++GW++F+ M  + 
Sbjct: 572 MIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHEC 631

Query: 696 NLEPTQMHYACIVDLLGRAGFVEEAAELIRCLPIAPDANIWGALLGACRIYRNVELGCWA 755
            +EPT  HYACIVD+L R G + +A   I  +PI PDA IWGALL  CRI+ +V+L    
Sbjct: 632 KIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKV 691

Query: 756 AEHLFEIKPQHCGYYILLSNMYAEAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHT 815
           AE +FE++P++ GYY+L++N+YAEA +W++V R+R  +  RG +KNPGCSW++I  +V+ 
Sbjct: 692 AEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNI 751

BLAST of Moc02g02020 vs. TAIR 10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 473.4 bits (1217), Expect = 3.8e-133
Identity = 265/753 (35.19%), Postives = 429/753 (56.97%), Query Frame = 0

Query: 64  SNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNCRTAF 123
           S++ +L + +++HAL +  G L  S      LI  Y+ FR+P S   +F + V   +  +
Sbjct: 15  SSSSNLNELRRIHALVISLG-LDSSDFFSGKLIDKYSHFREPASSLSVFRR-VSPAKNVY 74

Query: 124 LWNTLIRAHSIAGNGMI-DGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEVHG 183
           LWN++IRA S   NG+  + L+ Y ++  S V  D +TFP V+K C+   D   G  V+ 
Sbjct: 75  LWNSIIRAFS--KNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYE 134

Query: 184 VVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDYRE 243
            +  +GF+SD++VGN L+ +Y   G+++ AR+VFDEMP RD+VSWN+++   S +G Y E
Sbjct: 135 QILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEE 194

Query: 244 ARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNALV 303
           A   Y  +   S I P+  +V  +LP    L   +  + +H + +K+G++S V   N LV
Sbjct: 195 ALEIYHELK-NSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLV 254

Query: 304 DAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPNSV 363
             Y K      + +VFDEM  R+ VS+N +I G        +++ +F   +D + KP+ +
Sbjct: 255 AMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLENLD-QFKPDLL 314

Query: 364 TISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFHNM 423
           T+SS+L     L      K I+ + ++ G   +  + N LID+YAK G    A  +F++M
Sbjct: 315 TVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSM 374

Query: 424 DRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGPGK 483
           + ++ VSWN++I+ Y  +G  +EA++   +M    E  + +T+  ++    RL  L  GK
Sbjct: 375 ECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGK 434

Query: 484 EIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNT-SHKDEVSYNILILGYSETS 543
            +H+ GI+ G+  D+ VSNAL DMYAKCG +  +  +F++    D V++N +I       
Sbjct: 435 GLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFG 494

Query: 544 DCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVSN 603
           D    L + ++MR     PD+ +F+  +  CA+LAA + GKEIH   LR  + S L + N
Sbjct: 495 DFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGN 554

Query: 604 SLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQY 663
           +L++ Y+KC  ++ + +VF ++  +DV +W  MI  YGM GE E A+  F +M    +  
Sbjct: 555 ALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVP 614

Query: 664 DLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ-NLEPTQMHYACIVDLLGRAGFVEEAAEL 723
           D V +IA++ ACSH GLV++G   F +M     ++P   HYAC+VDLL R+  + +A E 
Sbjct: 615 DSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEF 674

Query: 724 IRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRW 783
           I+ +PI PDA+IW ++L ACR   ++E     +  + E+ P   GY IL SN YA   +W
Sbjct: 675 IQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKW 734

Query: 784 DEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTF 814
           D+V+ +R  +K +   KNPG SW+++   VH F
Sbjct: 735 DKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVF 761

BLAST of Moc02g02020 vs. TAIR 10
Match: AT1G18485.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 471.9 bits (1213), Expect = 1.1e-132
Identity = 263/776 (33.89%), Postives = 437/776 (56.31%), Query Frame = 0

Query: 59  LLTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQN 118
           LL      + +   +++H L   +  L +   LC  +I  YA    P+  R +F      
Sbjct: 90  LLQASGKRKDIEMGRKIHQLVSGSTRLRNDDVLCTRIITMYAMCGSPDDSRFVF--DALR 149

Query: 119 CRTAFLWNTLIRAHSIAGNGMIDG-LQTYNRMVRSGVQLDDH-TFPFVLKLCSDSLDICK 178
            +  F WN +I ++S   N + D  L+T+  M+ +   L DH T+P V+K C+   D+  
Sbjct: 150 SKNLFQWNAVISSYS--RNELYDEVLETFIEMISTTDLLPDHFTYPCVIKACAGMSDVGI 209

Query: 179 GMEVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSV 238
           G+ VHG+V K G   DV+VGN L+  YG  G V+DA ++FD MPER++VSWN+++ + S 
Sbjct: 210 GLAVHGLVVKTGLVEDVFVGNALVSFYGTHGFVTDALQLFDIMPERNLVSWNSMIRVFSD 269

Query: 239 NGDYREARNYYFWMTLRSG---IQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDS 298
           NG   E+      M   +G     P+V ++V +LP+ A   +  + + +H + +K  LD 
Sbjct: 270 NGFSEESFLLLGEMMEENGDGAFMPDVATLVTVLPVCAREREIGLGKGVHGWAVKLRLDK 329

Query: 299 QVTTCNALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMI 358
           ++   NAL+D Y KCG +  +  +F     +N VSWN ++ G + +G      DV R M+
Sbjct: 330 ELVLNNALMDMYSKCGCITNAQMIFKMNNNKNVVSWNTMVGGFSAEGDTHGTFDVLRQML 389

Query: 359 --DAEVKPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGH 418
               +VK + VTI + +PV     +  + KE+H +S++     +  +AN+ +  YAK G 
Sbjct: 390 AGGEDVKADEVTILNAVPVCFHESFLPSLKELHCYSLKQEFVYNELVANAFVASYAKCGS 449

Query: 419 STEASCIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPA 478
            + A  +FH +  + V SWNA+I  +A +     ++   + M+ +G  P++ T  ++L A
Sbjct: 450 LSYAQRVFHGIRSKTVNSWNALIGGHAQSNDPRLSLDAHLQMKISGLLPDSFTVCSLLSA 509

Query: 479 CARLGFLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFNT-SHKDEVSY 538
           C++L  L  GKE+H   IR  L  D+FV  ++  +Y  CG L + + +F+    K  VS+
Sbjct: 510 CSKLKSLRLGKEVHGFIIRNWLERDLFVYLSVLSLYIHCGELCTVQALFDAMEDKSLVSW 569

Query: 539 NILILGYSETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALR 598
           N +I GY +      +L +F +M L G +   +S + V  AC+ L +++ G+E H  AL+
Sbjct: 570 NTVITGYLQNGFPDRALGVFRQMVLYGIQLCGISMMPVFGACSLLPSLRLGREAHAYALK 629

Query: 599 NHFYSHLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITM 658
           +      F++ SL+D Y K   I  + KVFN +  K  ASWN MI+GYG+ G  + AI +
Sbjct: 630 HLLEDDAFIACSLIDMYAKNGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKL 689

Query: 659 FEEMRGDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQ-NLEPTQMHYACIVDLLG 718
           FEEM+      D ++++ VL+AC+H GL+ +G +Y  +M +   L+P   HYAC++D+LG
Sbjct: 690 FEEMQRTGHNPDDLTFLGVLTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLG 749

Query: 719 RAGFVEEAAELI-RCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYI 778
           RAG +++A  ++   +    D  IW +LL +CRI++N+E+G   A  LFE++P+    Y+
Sbjct: 750 RAGQLDKALRVVAEEMSEEADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYV 809

Query: 779 LLSNMYAEAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEER-AEGFE 824
           LLSN+YA  G+W++V +VR  M     +K+ GCSW++++ +V +F+  ER  +GFE
Sbjct: 810 LLSNLYAGLGKWEDVRKVRQRMNEMSLRKDAGCSWIELNRKVFSFVVGERFLDGFE 861

BLAST of Moc02g02020 vs. TAIR 10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 468.0 bits (1203), Expect = 1.6e-131
Identity = 237/689 (34.40%), Postives = 389/689 (56.46%), Query Frame = 0

Query: 132 HSIAGNGMI-DGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLDICKGMEVHGVVSKLGFD 191
           H +  NG + + ++  N M    V +D+  F  +++LC       +G +V+ +       
Sbjct: 67  HGLCANGKLEEAMKLLNSMQELRVAVDEDVFVALVRLCEWKRAQEEGSKVYSIALSSMSS 126

Query: 192 SDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNGDYREARNYYFWM 251
             V +GN  L ++   G + DA  VF +M ER++ SWN ++G  +  G + EA   Y  M
Sbjct: 127 LGVELGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRM 186

Query: 252 TLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTCNALVDAYGKCGN 311
               G++P+V +   +L     + D    + +H + ++ G +  +   NAL+  Y KCG+
Sbjct: 187 LWVGGVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGD 246

Query: 312 VKASWQVFDEMVERNEVSWNAIINGLACKGHFCDALDVFRMMIDAEVKPNSVTISSILPV 371
           VK++  +FD M  R+ +SWNA+I+G    G   + L++F  M    V P+ +T++S++  
Sbjct: 247 VKSARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISA 306

Query: 372 LVELEYFKAGKEIHGFSMRMGTETDIFIANSLIDMYAKSGHSTEASCIFHNMDRRNVVSW 431
              L   + G++IH + +  G   DI + NSL  MY  +G   EA  +F  M+R+++VSW
Sbjct: 307 CELLGDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSW 366

Query: 432 NAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLGFLGPGKEIHAIGIR 491
             MI+ Y  N L  +AI    +M +    P+ +T   VL ACA LG L  G E+H + I+
Sbjct: 367 TTMISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIK 426

Query: 492 LGLTADMFVSNALTDMYAKCGCLHSARNVF-NTSHKDEVSYNILILGYSETSDCLESLNL 551
             L + + V+N L +MY+KC C+  A ++F N   K+ +S+  +I G    + C E+L  
Sbjct: 427 ARLISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIF 486

Query: 552 FSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYSHLFVSNSLLDFYTK 611
             +M++   +P+ ++    ++ACA + A+  GKEIH   LR       F+ N+LLD Y +
Sbjct: 487 LRQMKMT-LQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVR 546

Query: 612 CARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMRGDTVQYDLVSYIAV 671
           C R++ A   FN    KDV SWN ++ GY   G+    + +F+ M    V+ D +++I++
Sbjct: 547 CGRMNTAWSQFNS-QKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISL 606

Query: 672 LSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVEEAAELIRCLPIAPD 731
           L  CS   +V QG  YFS+M    + P   HYAC+VDLLGRAG ++EA + I+ +P+ PD
Sbjct: 607 LCGCSKSQMVRQGLMYFSKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPD 666

Query: 732 ANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYAEAGRWDEVNRVRDL 791
             +WGALL ACRI+  ++LG  +A+H+FE+  +  GYYILL N+YA+ G+W EV +VR +
Sbjct: 667 PAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRM 726

Query: 792 MKSRGAKKNPGCSWVQIHDQVHTFMAEER 819
           MK  G   + GCSWV++  +VH F+++++
Sbjct: 727 MKENGLTVDAGCSWVEVKGKVHAFLSDDK 753

BLAST of Moc02g02020 vs. TAIR 10
Match: AT1G69350.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 453.4 bits (1165), Expect = 4.0e-127
Identity = 252/762 (33.07%), Postives = 429/762 (56.30%), Query Frame = 0

Query: 60  LTLCSNAQSLRQTKQLHALCLLNGLLPHSVSLCASLILDYATFRDPESFRILFHQTVQNC 119
           + L  +  SLR   QLHA  L+ G L         LI  YA    P+S R++F       
Sbjct: 5   MPLFRSCSSLRLVSQLHAHLLVTGRLRRDPLPVTKLIESYAFMGSPDSSRLVFEAFPY-- 64

Query: 120 RTAFLWNTLIRAHSIAGNGMIDGLQTYNRMVRSGVQLDDHTFPFVLKLCSDSLD-ICKGM 179
             +F++  LI+  ++  + +   +  Y+R+V    Q+    FP VL+ C+ S + +  G 
Sbjct: 65  PDSFMYGVLIKC-NVWCHLLDAAIDLYHRLVSETTQISKFVFPSVLRACAGSREHLSVGG 124

Query: 180 EVHGVVSKLGFDSDVYVGNTLLMLYGNCGVVSDARRVFDEMPERDVVSWNTILGLLSVNG 239
           +VHG + K G D D  +  +LL +YG  G +SDA +VFD MP RD+V+W+T++     NG
Sbjct: 125 KVHGRIIKGGVDDDAVIETSLLCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENG 184

Query: 240 DYREARNYYFWMTLRSGIQPNVVSVVILLPISAALEDEEMTRRIHCYTMKAGLDSQVTTC 299
           +  +A   +  M +  G++P+ V+++ ++   A L    + R +H    +   D   T C
Sbjct: 185 EVVKALRMFKCM-VDDGVEPDAVTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLC 244

Query: 300 NALVDAYGKCGNVKASWQVFDEMVERNEVSWNAIINGLACKGHFCD-ALDVFRMMIDAEV 359
           N+L+  Y KCG++ +S ++F+++ ++N VSW A+I+    +G F + AL  F  MI + +
Sbjct: 245 NSLLTMYSKCGDLLSSERIFEKIAKKNAVSWTAMISSYN-RGEFSEKALRSFSEMIKSGI 304

Query: 360 KPNSVTISSILPVLVELEYFKAGKEIHGFSMRMGTETDI-FIANSLIDMYAKSGHSTEAS 419
           +PN VT+ S+L     +   + GK +HGF++R   + +   ++ +L+++YA+ G  ++  
Sbjct: 305 EPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYESLSLALVELYAECGKLSDCE 364

Query: 420 CIFHNMDRRNVVSWNAMIANYALNGLALEAIRQVILMQETGENPNAVTFTNVLPACARLG 479
            +   +  RN+V+WN++I+ YA  G+ ++A+     M      P+A T  + + AC   G
Sbjct: 365 TVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAG 424

Query: 480 FLGPGKEIHAIGIRLGLTADMFVSNALTDMYAKCGCLHSARNVFN-TSHKDEVSYNILIL 539
            +  GK+IH   IR  + +D FV N+L DMY+K G + SA  VFN   H+  V++N ++ 
Sbjct: 425 LVPLGKQIHGHVIRTDV-SDEFVQNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLC 484

Query: 540 GYSETSDCLESLNLFSEMRLLGRKPDVVSFVGVISACANLAAVKQGKEIHGVALRNHFYS 599
           G+S+  + +E+++LF  M     + + V+F+ VI AC+++ ++++GK +H   + +    
Sbjct: 485 GFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGL-K 544

Query: 600 HLFVSNSLLDFYTKCARIDLACKVFNQILFKDVASWNTMILGYGMIGELETAITMFEEMR 659
            LF   +L+D Y KC  ++ A  VF  +  + + SW++MI  YGM G + +AI+ F +M 
Sbjct: 545 DLFTDTALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMV 604

Query: 660 GDTVQYDLVSYIAVLSACSHGGLVEQGWKYFSEMLAQNLEPTQMHYACIVDLLGRAGFVE 719
               + + V ++ VLSAC H G VE+G  YF+ M +  + P   H+AC +DLL R+G ++
Sbjct: 605 ESGTKPNEVVFMNVLSACGHSGSVEEGKYYFNLMKSFGVSPNSEHFACFIDLLSRSGDLK 664

Query: 720 EAAELIRCLPIAPDANIWGALLGACRIYRNVELGCWAAEHLFEIKPQHCGYYILLSNMYA 779
           EA   I+ +P   DA++WG+L+  CRI++ +++       L +I     GYY LLSN+YA
Sbjct: 665 EAYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYA 724

Query: 780 EAGRWDEVNRVRDLMKSRGAKKNPGCSWVQIHDQVHTFMAEE 818
           E G W+E  R+R  MKS   KK PG S ++I  +V  F A E
Sbjct: 725 EEGEWEEFRRLRSAMKSSNLKKVPGYSAIEIDQKVFRFGAGE 759

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022153017.10.0e+00100.00pentatricopeptide repeat-containing protein At4g14170-like [Momordica charantia][more]
XP_038901996.10.0e+0085.58pentatricopeptide repeat-containing protein At4g14170-like [Benincasa hispida][more]
KAG6570435.10.0e+0083.01Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb... [more]
XP_022985648.10.0e+0082.77pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita... [more]
XP_022944177.10.0e+0083.13pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita... [more]
Match NameE-valueIdentityDescription
Q9SN395.5e-13738.91Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q9SS605.3e-13235.19Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX... [more]
Q0WN601.5e-13133.89Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX... [more]
Q9M9E22.2e-13034.40Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop... [more]
Q9C5075.7e-12633.07Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS... [more]
Match NameE-valueIdentityDescription
A0A6J1DHT80.0e+00100.00pentatricopeptide repeat-containing protein At4g14170-like OS=Momordica charanti... [more]
A0A6J1JE860.0e+0082.77pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbi... [more]
A0A6J1FV310.0e+0083.13pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbi... [more]
A0A5A7V0P30.0e+0083.19Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A0A0KEH60.0e+0082.95Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G428560 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G18750.13.9e-13838.91Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G03580.13.8e-13335.19Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G18485.11.1e-13233.89Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G15510.11.6e-13134.40Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G69350.14.0e-12733.07Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 525..574
e-value: 3.4E-7
score: 30.4
coord: 325..368
e-value: 2.7E-8
score: 33.9
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 770..794
e-value: 1.2
score: 9.6
coord: 400..427
e-value: 0.0031
score: 17.6
coord: 428..455
e-value: 0.077
score: 13.3
coord: 197..223
e-value: 0.015
score: 15.5
coord: 664..691
e-value: 0.032
score: 14.5
coord: 225..247
e-value: 0.35
score: 11.2
coord: 630..656
e-value: 1.7E-5
score: 24.7
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 630..656
e-value: 6.9E-5
score: 20.8
coord: 297..325
e-value: 5.2E-7
score: 27.4
coord: 664..697
e-value: 6.2E-5
score: 20.9
coord: 400..428
e-value: 0.002
score: 16.2
coord: 528..562
e-value: 0.0028
score: 15.7
coord: 327..360
e-value: 2.9E-6
score: 25.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 223..258
score: 9.339086
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 662..696
score: 10.577712
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 627..661
score: 9.898111
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 121..156
score: 9.021208
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 526..560
score: 10.336563
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 325..359
score: 11.103854
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 395..429
score: 9.415814
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 294..324
score: 10.117337
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 601..830
e-value: 3.5E-33
score: 117.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 383..479
e-value: 2.4E-18
score: 68.1
coord: 164..279
e-value: 6.3E-18
score: 66.8
coord: 280..382
e-value: 4.2E-23
score: 83.6
coord: 484..587
e-value: 1.1E-15
score: 59.4
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 481..816
coord: 51..478
coord: 458..572

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc02g02020.1Moc02g02020.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding