Homology
BLAST of HG10020254 vs. NCBI nr
Match:
XP_038903429.1 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial isoform X1 [Benincasa hispida])
HSP 1 Score: 1520.8 bits (3936), Expect = 0.0e+00
Identity = 740/792 (93.43%), Postives = 764/792 (96.46%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAFSPFPVT 60
M FIRTLNSIYFVTLASDA VFF PFSPT S KNFK +LHLALSHIRLNT LAFSP PVT
Sbjct: 1 MGFIRTLNSIYFVTLASDASVFFTPFSPTASIKNFKSRLHLALSHIRLNTHLAFSPCPVT 60
Query: 61 DHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKI 120
+HYSHDDK+ISLCKKN HREALQAFDIFQKCS+SPLKSITYTHLINACSSLRSLEHGR+I
Sbjct: 61 EHYSHDDKIISLCKKNLHREALQAFDIFQKCSSSPLKSITYTHLINACSSLRSLEHGREI 120
Query: 121 HRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQE 180
HRHMLTFN QPDMILQNHILNMYGKCGSLKEARNIF+AMPLKNVVSWTSMISGYSHYGQE
Sbjct: 121 HRHMLTFNCQPDMILQNHILNMYGKCGSLKEARNIFNAMPLKNVVSWTSMISGYSHYGQE 180
Query: 181 DNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNAL 240
DNAI +YVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFG HLIAQNAL
Sbjct: 181 DNAITMYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGGHLIAQNAL 240
Query: 241 ISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPN 300
ISMYTK S++ADA NVFSHI+IKDLISWGSMIAGFSQLGYELEALCHFREM+SQPVYQPN
Sbjct: 241 ISMYTKFSRIADATNVFSHIVIKDLISWGSMIAGFSQLGYELEALCHFREMVSQPVYQPN 300
Query: 301 EFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
EFVFGSAFSACSKLLEPN GRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPNCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
Query: 361 HIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
HIEKPDLVAWN+IIAGFASVGDAKESLSFFS+MRHTGLVPNDVTVLSLLCACSEPVMLNQ
Sbjct: 361 HIEKPDLVAWNAIIAGFASVGDAKESLSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
Query: 421 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 480
GMQVHSY+V+MGFDLDIPV N+LL MYSKCSNL DAL IFEGIG KADIVSWNT+LTACL
Sbjct: 421 GMQVHSYVVKMGFDLDIPVSNTLLGMYSKCSNLTDALHIFEGIGTKADIVSWNTLLTACL 480
Query: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS
Sbjct: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
Query: 541 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 600
VSNALIN YTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAF LFRTMR LG
Sbjct: 541 VSNALINTYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFVLFRTMRSLG 600
Query: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 660
VKPNEITFVGIL ACSHIGMVEEGLKLY+TMQ+EY ISPTKEHCSCMVDLLARAGCLDGA
Sbjct: 601 VKPNEITFVGILIACSHIGMVEEGLKLYKTMQKEYGISPTKEHCSCMVDLLARAGCLDGA 660
Query: 661 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
E+FIK MPF+PDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS
Sbjct: 661 ENFIKQMPFNPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
Query: 721 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 780
G WKDFAQ RSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYT+LEELMLQ+
Sbjct: 721 GHWKDFAQCRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTILEELMLQI 780
Query: 781 LDDGCDPIKVTK 793
LDDGCDPI+VTK
Sbjct: 781 LDDGCDPIQVTK 792
BLAST of HG10020254 vs. NCBI nr
Match:
XP_004137966.1 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucumis sativus])
HSP 1 Score: 1481.8 bits (3835), Expect = 0.0e+00
Identity = 720/790 (91.14%), Postives = 757/790 (95.82%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAFSPFPVT 60
MAFIRT +SIYF+TLASDAIVFFNPFSPT+S KNFKPQLHLALSHIRLNTQLAFSP P+T
Sbjct: 1 MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60
Query: 61 DHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKI 120
HY HDDK+ISLCKKN HREAL+AFDIFQKCS+SPLKS+TYTHLINACSSLRSLEHGRKI
Sbjct: 61 VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120
Query: 121 HRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQE 180
HRHMLT NYQPDMILQNHIL+MYGKCGSLKEARN+FD+MPLKNVVSWTSMISGYS YG+E
Sbjct: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180
Query: 181 DNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNAL 240
DNAI LYVQMLRSG+IPDHFTFGS+VKSCSGLDDF LARQLHAHVLKSEFG+ LIAQNAL
Sbjct: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240
Query: 241 ISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPN 300
ISMYTK SQMADA NVFS IIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQ VYQPN
Sbjct: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
Query: 301 EFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
EFVFGSAFSACSKLLEP+ GRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFL SARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360
Query: 361 HIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
HIEKPDLVAWN+IIAGFASV +AKES SFFS+MRHTGLVPNDVTVLSLLCACSEPVMLN
Sbjct: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420
Query: 421 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 480
G+QVHSYIV+MGF+LDIPVCNSLLSMYSKCSNLNDALQ+FE IGNKADIVSWNT+LTACL
Sbjct: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480
Query: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
QQNQAGEVLRLTKLM AS IKPDHVTLTNVLVSSGQIASYEVGSQ+HCFIMKSGLNLD S
Sbjct: 481 QQNQAGEVLRLTKLMFASRIKPDHVTLTNVLVSSGQIASYEVGSQIHCFIMKSGLNLDIS 540
Query: 541 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 600
VSNALINMYTKCGSL CARKMFDSI NPDIISWSSLIVGYAQAGCG+EAFELFRTMRGLG
Sbjct: 541 VSNALINMYTKCGSLECARKMFDSIGNPDIISWSSLIVGYAQAGCGKEAFELFRTMRGLG 600
Query: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 660
VKPNEITFVGILTACSHIGMVEEGLKLYRTMQE+Y ISPTKEHCSCMVDLLARAGCLD A
Sbjct: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEDYRISPTKEHCSCMVDLLARAGCLDVA 660
Query: 661 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
EDFIK MPF PD+VVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAA+VMLCNIHASS
Sbjct: 661 EDFIKQMPFVPDVVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAAVVMLCNIHASS 720
Query: 721 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 780
G WKDFA+ RSSMR+M+V KVPGQSWIEIKDKVHVFLAED+LHPERGKIYTMLEELMLQ+
Sbjct: 721 GHWKDFARLRSSMRRMDVGKVPGQSWIEIKDKVHVFLAEDNLHPERGKIYTMLEELMLQI 780
Query: 781 LDDGCDPIKV 791
LDDGCDP+++
Sbjct: 781 LDDGCDPLQM 790
BLAST of HG10020254 vs. NCBI nr
Match:
XP_008442662.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucumis melo] >KAA0044026.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK25114.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 1472.6 bits (3811), Expect = 0.0e+00
Identity = 718/790 (90.89%), Postives = 750/790 (94.94%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAFSPFPVT 60
MA IRT NSIYF+TLASDAIVFFNPFSP+TSTKNFKPQLHLA SHIRL TQLAFSP PVT
Sbjct: 1 MAPIRTFNSIYFLTLASDAIVFFNPFSPSTSTKNFKPQLHLAHSHIRLGTQLAFSPCPVT 60
Query: 61 DHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKI 120
HY HDDK+ISLCKKN HREALQAFDIF+KCS+SPLKSITYTHLINACSSLRSLEHGRKI
Sbjct: 61 VHYPHDDKIISLCKKNLHREALQAFDIFRKCSSSPLKSITYTHLINACSSLRSLEHGRKI 120
Query: 121 HRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQE 180
HRHMLT NYQPDMILQNHIL+MYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYS YGQE
Sbjct: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSRYGQE 180
Query: 181 DNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNAL 240
DNAI LY+QMLRSGYIPDHFTFGS+VKSCSGLDDFMLARQLHAHVLK EFG HLIAQNAL
Sbjct: 181 DNAITLYIQMLRSGYIPDHFTFGSIVKSCSGLDDFMLARQLHAHVLKFEFGGHLIAQNAL 240
Query: 241 ISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPN 300
ISMYTK SQMADA NVFS IIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQ VYQPN
Sbjct: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
Query: 301 EFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
EFVFGSAFSACSKLLEP+ GRQIHGLCIK GLGSD+FAGCSLCDMYAKCGFL SARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKLGLGSDIFAGCSLCDMYAKCGFLESARTVFY 360
Query: 361 HIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
HIEKPDLVAWN+IIAGFASV +AKESLSFFS+MRH G+VPNDVTVLSLLCACSEPVMLN
Sbjct: 361 HIEKPDLVAWNAIIAGFASVSNAKESLSFFSQMRHRGVVPNDVTVLSLLCACSEPVMLNN 420
Query: 421 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 480
GMQVHSYIV+MGF+LDIPVCNSLLSMYSKCSNLNDALQ+FEGIGNKADIVSWNT+LTACL
Sbjct: 421 GMQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEGIGNKADIVSWNTLLTACL 480
Query: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
QQNQAGEVLRLTKLM AS I PDHVTLTNVLVSSGQIASYEVGSQ+HCFIMKSGLNLDTS
Sbjct: 481 QQNQAGEVLRLTKLMFASRIMPDHVTLTNVLVSSGQIASYEVGSQIHCFIMKSGLNLDTS 540
Query: 541 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 600
VSNALINMYTKCGSL CARKMFDSI NPDIISWSSLIVGYAQAGCG+EAFELFRTMRGLG
Sbjct: 541 VSNALINMYTKCGSLECARKMFDSIGNPDIISWSSLIVGYAQAGCGKEAFELFRTMRGLG 600
Query: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 660
VKPNEITF+GILTACSHIGMVEEGLKLYRTMQE+ ISPTKEHCSC+VDLLARAGCLD A
Sbjct: 601 VKPNEITFLGILTACSHIGMVEEGLKLYRTMQEDCRISPTKEHCSCIVDLLARAGCLDVA 660
Query: 661 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
E+FIK MPF PD+VVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAA+VMLCNIHASS
Sbjct: 661 EEFIKQMPFVPDVVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAAVVMLCNIHASS 720
Query: 721 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 780
G WKD AQ R SMR+M+V KVPGQSWIEIKDKVHVFLAED+LHPERGKIYTMLEELMLQ
Sbjct: 721 GHWKDLAQLRISMRRMDVGKVPGQSWIEIKDKVHVFLAEDNLHPERGKIYTMLEELMLQT 780
Query: 781 LDDGCDPIKV 791
LDD CDP+++
Sbjct: 781 LDDSCDPLQM 790
BLAST of HG10020254 vs. NCBI nr
Match:
XP_023526527.1 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucurbita pepo subsp. pepo] >XP_023526528.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucurbita pepo subsp. pepo] >XP_023526529.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1413.7 bits (3658), Expect = 0.0e+00
Identity = 687/790 (86.96%), Postives = 734/790 (92.91%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAFSPFPVT 60
MAFIRTL+SIYFVT +S+AIVFFNP SP S KNFKPQLHLALSHIRLN+Q+AFSP PV
Sbjct: 29 MAFIRTLHSIYFVTYSSEAIVFFNPCSPANSIKNFKPQLHLALSHIRLNSQIAFSPSPVA 88
Query: 61 DHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKI 120
+H S+DD +ISLCKK HREALQAFDIFQKCSNSPL SITYTHLI+ACSSLRSLEHGRKI
Sbjct: 89 EH-SYDDNIISLCKKKLHREALQAFDIFQKCSNSPLNSITYTHLIHACSSLRSLEHGRKI 148
Query: 121 HRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQE 180
H HM TFNYQPD+ILQNHILNMYGKCGSLKEARNIFDAMPLKN VSWTSMISGYSHYGQ+
Sbjct: 149 HCHMSTFNYQPDLILQNHILNMYGKCGSLKEARNIFDAMPLKNAVSWTSMISGYSHYGQD 208
Query: 181 DNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNAL 240
DNAI LYVQMLRSG+IPDHFTFGSVVKSCSGLDD MLARQLHAHVLKSEFG + IAQNAL
Sbjct: 209 DNAITLYVQMLRSGHIPDHFTFGSVVKSCSGLDDLMLARQLHAHVLKSEFGGNPIAQNAL 268
Query: 241 ISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPN 300
ISMYTK SQ+ADA NVFSHII K+LISWGSMIAGFSQLGYE+EALCHFREMLSQP+YQPN
Sbjct: 269 ISMYTKFSQIADATNVFSHIITKNLISWGSMIAGFSQLGYEIEALCHFREMLSQPIYQPN 328
Query: 301 EFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
EFVFGSAFSACS L EPN GRQIHGLCIKFGLG D FAGCSLCDMYAKCGFLGSARTVF
Sbjct: 329 EFVFGSAFSACSNLSEPNCGRQIHGLCIKFGLGRDRFAGCSLCDMYAKCGFLGSARTVFC 388
Query: 361 HIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
IEKPDLVAWN+IIAGFASVGDAKESLSFFS+MRHTGL NDVTVLSLLCACSEP+MLNQ
Sbjct: 389 QIEKPDLVAWNAIIAGFASVGDAKESLSFFSQMRHTGLASNDVTVLSLLCACSEPMMLNQ 448
Query: 421 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 480
GMQVHSYIV+ GFDL++PVCN LLSMYSKCS+LND+L+IFE IGNKAD+VSWNTMLT C
Sbjct: 449 GMQVHSYIVKTGFDLEVPVCNGLLSMYSKCSDLNDSLKIFEDIGNKADVVSWNTMLTVCR 508
Query: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
QNQAGEVLRL KLMLAS IK D VTLTNVLVSSGQIASYEVGSQVHCFIMKSG NLDTS
Sbjct: 509 LQNQAGEVLRLMKLMLASRIKSDRVTLTNVLVSSGQIASYEVGSQVHCFIMKSGQNLDTS 568
Query: 541 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 600
VSNALINMY KCGSLGCA+KMFDSID+PD+ISWSSLIVGYAQAGCGEEAF+LFRTMRGLG
Sbjct: 569 VSNALINMYMKCGSLGCAQKMFDSIDDPDVISWSSLIVGYAQAGCGEEAFKLFRTMRGLG 628
Query: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 660
++PNEITF+GILTAC HIGMVEEGL+LYRTMQE+ ISPTKEH SC+VDLLARAGCLD A
Sbjct: 629 IRPNEITFLGILTACCHIGMVEEGLRLYRTMQEQDGISPTKEHYSCIVDLLARAGCLDAA 688
Query: 661 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
EDFIK MPF+PDIVVW TLLAACK+HGNLEVGKRAAENVLK DPSNSAALVMLCNIHASS
Sbjct: 689 EDFIKKMPFEPDIVVWMTLLAACKLHGNLEVGKRAAENVLKNDPSNSAALVMLCNIHASS 748
Query: 721 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 780
G WKDFA+ R SMR+M+VSKVPGQSWIEIKD+VHVF AEDSLHPERG+IYTMLEELMLQV
Sbjct: 749 GRWKDFARIRRSMRRMDVSKVPGQSWIEIKDRVHVFFAEDSLHPERGRIYTMLEELMLQV 808
Query: 781 LDDGCDPIKV 791
LDD CDP+++
Sbjct: 809 LDDSCDPLQM 817
BLAST of HG10020254 vs. NCBI nr
Match:
XP_022983903.1 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucurbita maxima] >XP_022983904.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucurbita maxima] >XP_022983905.1 pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucurbita maxima])
HSP 1 Score: 1409.0 bits (3646), Expect = 0.0e+00
Identity = 685/790 (86.71%), Postives = 732/790 (92.66%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAFSPFPVT 60
MAFIRTLNSIYFVTL+S+AIVFFNP SP TS KNFKPQLHLALSHIR ++ LA P PVT
Sbjct: 29 MAFIRTLNSIYFVTLSSEAIVFFNPCSPATSIKNFKPQLHLALSHIRFSSLLASPPSPVT 88
Query: 61 DHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKI 120
+H SHDD +ISLCKK HREALQAFDIF KCS+SPL SITYTHLI+ACSSLR LEHGRKI
Sbjct: 89 EH-SHDDNIISLCKKKLHREALQAFDIFHKCSSSPLNSITYTHLIHACSSLRFLEHGRKI 148
Query: 121 HRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQE 180
H HM TFNYQPD+ILQNHILNMYGKCGSLKEARNIFDAMPLKN VSWTSMISGYSHYG++
Sbjct: 149 HCHMSTFNYQPDLILQNHILNMYGKCGSLKEARNIFDAMPLKNAVSWTSMISGYSHYGED 208
Query: 181 DNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNAL 240
DNAI LYVQMLRSG+IPDHFTFGSVVKSCSGLDD MLARQLHAHVLKSEFG + IAQNAL
Sbjct: 209 DNAITLYVQMLRSGHIPDHFTFGSVVKSCSGLDDLMLARQLHAHVLKSEFGGNPIAQNAL 268
Query: 241 ISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPN 300
ISMYTK SQ+ADA NVFSHII KDLISWGSMIAGFSQLG E+EALCHFREMLSQP+YQPN
Sbjct: 269 ISMYTKFSQIADATNVFSHIITKDLISWGSMIAGFSQLGCEIEALCHFREMLSQPIYQPN 328
Query: 301 EFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
EFVFGSAFSACS L EPN GRQIHGLCIKFGLGSD FAGCSLCDMYAKCGFLGSARTVF
Sbjct: 329 EFVFGSAFSACSNLSEPNCGRQIHGLCIKFGLGSDRFAGCSLCDMYAKCGFLGSARTVFC 388
Query: 361 HIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
IEKPDLVAWN+IIAGFASVGDAKESLSFFS+MRHTGL NDVTVLSLLCACSEP+MLNQ
Sbjct: 389 QIEKPDLVAWNAIIAGFASVGDAKESLSFFSQMRHTGLASNDVTVLSLLCACSEPMMLNQ 448
Query: 421 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 480
GMQVHSYIV+ GFDL++ VCN LLSMYSKCS+LND+L+IFE IGNKAD+VSWNTMLTAC
Sbjct: 449 GMQVHSYIVKTGFDLEVLVCNGLLSMYSKCSDLNDSLKIFEDIGNKADVVSWNTMLTACR 508
Query: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
+NQAGEVLRL KLMLASHIK D VTLTNVLVSSG IASYEVGSQVHCFIMKSG NLDTS
Sbjct: 509 LRNQAGEVLRLMKLMLASHIKSDRVTLTNVLVSSGLIASYEVGSQVHCFIMKSGQNLDTS 568
Query: 541 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 600
VSNALINMY KCGSLGCARKMFDSID+PD+ISWSSLIVGYAQAGCGEEAFELFRTMRGLG
Sbjct: 569 VSNALINMYMKCGSLGCARKMFDSIDDPDVISWSSLIVGYAQAGCGEEAFELFRTMRGLG 628
Query: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 660
++PNEITF+GILTAC H+GMVEEGL+LYRTMQE+ DISPTKEHCSC+VDLLARAGCLD A
Sbjct: 629 IRPNEITFLGILTACCHVGMVEEGLRLYRTMQEQDDISPTKEHCSCIVDLLARAGCLDAA 688
Query: 661 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
EDFIK MPF+PDIVVW TLLAACKVHGNLEVGKRAAENVL+ DPSNSAALVMLCNIHASS
Sbjct: 689 EDFIKKMPFEPDIVVWMTLLAACKVHGNLEVGKRAAENVLRNDPSNSAALVMLCNIHASS 748
Query: 721 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 780
G WKDFA+ R SMR+M+VSKVPGQSWIEIKD+VHVF AEDSLHPERG+IY MLEELMLQ+
Sbjct: 749 GHWKDFARLRRSMRRMDVSKVPGQSWIEIKDRVHVFFAEDSLHPERGRIYIMLEELMLQL 808
Query: 781 LDDGCDPIKV 791
LDD CDP+++
Sbjct: 809 LDDSCDPLQM 817
BLAST of HG10020254 vs. ExPASy Swiss-Prot
Match:
Q9LFI1 (Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E86 PE=2 SV=1)
HSP 1 Score: 896.0 bits (2314), Expect = 3.1e-259
Identity = 426/722 (59.00%), Postives = 552/722 (76.45%), Query Frame = 0
Query: 66 DDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKIHRHML 125
+D + SLCK N +REAL+AFD QK S+ ++ TY LI ACSS RSL GRKIH H+L
Sbjct: 35 NDHINSLCKSNFYREALEAFDFAQKNSSFKIRLRTYISLICACSSSRSLAQGRKIHDHIL 94
Query: 126 TFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQEDNAIA 185
N + D IL NHIL+MYGKCGSL++AR +FD MP +N+VS+TS+I+GYS GQ AI
Sbjct: 95 NSNCKYDTILNNHILSMYGKCGSLRDAREVFDFMPERNLVSYTSVITGYSQNGQGAEAIR 154
Query: 186 LYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNALISMYT 245
LY++ML+ +PD F FGS++K+C+ D L +QLHA V+K E SHLIAQNALI+MY
Sbjct: 155 LYLKMLQEDLVPDQFAFGSIIKACASSSDVGLGKQLHAQVIKLESSSHLIAQNALIAMYV 214
Query: 246 KLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPNEFVFG 305
+ +QM+DA VF I +KDLISW S+IAGFSQLG+E EAL H +EMLS V+ PNE++FG
Sbjct: 215 RFNQMSDASRVFYGIPMKDLISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFG 274
Query: 306 SAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFYHIEKP 365
S+ ACS LL P+YG QIHGLCIK L + AGCSLCDMYA+CGFL SAR VF IE+P
Sbjct: 275 SSLKACSSLLRPDYGSQIHGLCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQIERP 334
Query: 366 DLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQGMQVH 425
D +WN IIAG A+ G A E++S FS+MR +G +P+ +++ SLLCA ++P+ L+QGMQ+H
Sbjct: 335 DTASWNVIIAGLANNGYADEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIH 394
Query: 426 SYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACLQQNQA 485
SYI++ GF D+ VCNSLL+MY+ CS+L +FE N AD VSWNT+LTACLQ Q
Sbjct: 395 SYIIKWGFLADLTVCNSLLTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQP 454
Query: 486 GEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTSVSNAL 545
E+LRL KLML S +PDH+T+ N+L +I+S ++GSQVHC+ +K+GL + + N L
Sbjct: 455 VEMLRLFKLMLVSECEPDHITMGNLLRGCVEISSLKLGSQVHCYSLKTGLAPEQFIKNGL 514
Query: 546 INMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLGVKPNE 605
I+MY KCGSLG AR++FDS+DN D++SWS+LIVGYAQ+G GEEA LF+ M+ G++PN
Sbjct: 515 IDMYAKCGSLGQARRIFDSMDNRDVVSWSTLIVGYAQSGFGEEALILFKEMKSAGIEPNH 574
Query: 606 ITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGAEDFIK 665
+TFVG+LTACSH+G+VEEGLKLY TMQ E+ ISPTKEHCSC+VDLLARAG L+ AE FI
Sbjct: 575 VTFVGVLTACSHVGLVEEGLKLYATMQTEHGISPTKEHCSCVVDLLARAGRLNEAERFID 634
Query: 666 HMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASSGCWKD 725
M +PD+VVWKTLL+ACK GN+ + ++AAEN+LKIDP NS A V+LC++HASSG W++
Sbjct: 635 EMKLEPDVVVWKTLLSACKTQGNVHLAQKAAENILKIDPFNSTAHVLLCSMHASSGNWEN 694
Query: 726 FAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQVLDDGC 785
A RSSM++ +V K+PGQSWIEI+DK+H+F AED HPER IYT+L + Q+LD+ C
Sbjct: 695 AALLRSSMKKHDVKKIPGQSWIEIEDKIHIFFAEDIFHPERDDIYTVLHNIWSQMLDE-C 754
Query: 786 DP 788
+P
Sbjct: 755 NP 755
BLAST of HG10020254 vs. ExPASy Swiss-Prot
Match:
Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)
HSP 1 Score: 471.9 bits (1213), Expect = 1.5e-131
Identity = 237/730 (32.47%), Postives = 401/730 (54.93%), Query Frame = 0
Query: 58 PVTDHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHG 117
P + Y + + + K EAL+ + ++ SP K T+ +I AC+ L E G
Sbjct: 67 PAKNVYLWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDK-YTFPSVIKACAGLFDAEMG 126
Query: 118 RKIHRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHY 177
++ +L ++ D+ + N +++MY + G L AR +FD MP++++VSW S+ISGYS +
Sbjct: 127 DLVYEQILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSH 186
Query: 178 GQEDNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQ 237
G + A+ +Y ++ S +PD FT SV+ + L + LH LKS S ++
Sbjct: 187 GYYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVN 246
Query: 238 NALISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVY 297
N L++MY K + DA+ VF + ++D +S+ +MI G+ +L E++ F E L Q +
Sbjct: 247 NGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLENLDQ--F 306
Query: 298 QPNEFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSART 357
+P+ S AC L + + + I+ +K G + L D+YAKCG + +AR
Sbjct: 307 KPDLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARD 366
Query: 358 VFYHIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVM 417
VF +E D V+WNSII+G+ GD E++ F M + +T L L+ +
Sbjct: 367 VFNSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLAD 426
Query: 418 LNQGMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLT 477
L G +HS ++ G +D+ V N+L+ MY+KC + D+L+IF +G D V+WNT+++
Sbjct: 427 LKFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGT-GDTVTWNTVIS 486
Query: 478 ACLQQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNL 537
AC++ L++T M S + PD T L +A+ +G ++HC +++ G
Sbjct: 487 ACVRFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYES 546
Query: 538 DTSVSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMR 597
+ + NALI MY+KCG L + ++F+ + D+++W+ +I Y G GE+A E F M
Sbjct: 547 ELQIGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADME 606
Query: 598 GLGVKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCL 657
G+ P+ + F+ I+ ACSH G+V+EGL + M+ Y I P EH +C+VDLL+R+ +
Sbjct: 607 KSGIVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKI 666
Query: 658 DGAEDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIH 717
AE+FI+ MP PD +W ++L AC+ G++E +R + +++++P + ++ N +
Sbjct: 667 SKAEEFIQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAY 726
Query: 718 ASSGCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELM 777
A+ W + R S++ +++K PG SWIE+ VHVF + D P+ IY LE L
Sbjct: 727 AALRKWDKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQSEAIYKSLEILY 786
Query: 778 LQVLDDGCDP 788
+ +G P
Sbjct: 787 SLMAKEGYIP 792
BLAST of HG10020254 vs. ExPASy Swiss-Prot
Match:
Q5G1T1 (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)
HSP 1 Score: 471.1 bits (1211), Expect = 2.5e-131
Identity = 248/719 (34.49%), Postives = 410/719 (57.02%), Query Frame = 0
Query: 79 REALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKIHRHMLTFNYQPDMILQNH 138
R A+ A D+ + P+ S+T++ L+ +C R G+ +H ++ F+ +PD +L N
Sbjct: 43 RGAVSALDLMARDGIRPMDSVTFSSLLKSCIRARDFRLGKLVHARLIEFDIEPDSVLYNS 102
Query: 139 ILNMYGKCGSLKEARNIFDAM---PLKNVVSWTSMISGYSHYGQEDNAIALYVQMLRSGY 198
++++Y K G +A ++F+ M ++VVSW++M++ Y + G+E +AI ++V+ L G
Sbjct: 103 LISLYSKSGDSAKAEDVFETMRRFGKRDVVSWSAMMACYGNNGRELDAIKVFVEFLELGL 162
Query: 199 IPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKS-EFGSHLIAQNALISMYTK-LSQMADA 258
+P+ + + +V+++CS D + R ++K+ F S + +LI M+ K + +A
Sbjct: 163 VPNDYCYTAVIRACSNSDFVGVGRVTLGFLMKTGHFESDVCVGCSLIDMFVKGENSFENA 222
Query: 259 KNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPNEFVFGSAFSACSK 318
VF + ++++W MI Q+G+ EA+ F +M+ ++ ++F S FSAC++
Sbjct: 223 YKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG-FESDKFTLSSVFSACAE 282
Query: 319 LLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKC---GFLGSARTVFYHIEKPDLVAW 378
L + G+Q+H I+ GL D+ CSL DMYAKC G + R VF +E +++W
Sbjct: 283 LENLSLGKQLHSWAIRSGLVDDV--ECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSW 342
Query: 379 NSIIAGF-ASVGDAKESLSFFSRMRHTGLV-PNDVTVLSLLCACSEPVMLNQGMQVHSYI 438
++I G+ + A E+++ FS M G V PN T S AC G QV
Sbjct: 343 TALITGYMKNCNLATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQA 402
Query: 439 VRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACLQQNQAGEV 498
+ G + V NS++SM+ K + DA + FE + K ++VS+NT L + +
Sbjct: 403 FKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLSEK-NLVSYNTFLDGTCRNLNFEQA 462
Query: 499 LRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTSVSNALINM 558
+L + + T ++L + S G Q+H ++K GL+ + V NALI+M
Sbjct: 463 FKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALISM 522
Query: 559 YTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLGVKPNEITF 618
Y+KCGS+ A ++F+ ++N ++ISW+S+I G+A+ G E F M GVKPNE+T+
Sbjct: 523 YSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVTY 582
Query: 619 VGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGAEDFIKHMP 678
V IL+ACSH+G+V EG + + +M E++ I P EH +CMVDLL RAG L A +FI MP
Sbjct: 583 VAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTMP 642
Query: 679 FDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASSGCWKDFAQ 738
F D++VW+T L AC+VH N E+GK AA +L++DP+ AA + L NI+A +G W++ +
Sbjct: 643 FQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEESTE 702
Query: 739 YRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQVLDDGCDP 788
R M++ N+ K G SWIE+ DK+H F D+ HP +IY L+ L+ ++ G P
Sbjct: 703 MRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYVP 757
BLAST of HG10020254 vs. ExPASy Swiss-Prot
Match:
Q9SS83 (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E88 PE=2 SV=1)
HSP 1 Score: 469.2 bits (1206), Expect = 9.5e-131
Identity = 245/712 (34.41%), Postives = 390/712 (54.78%), Query Frame = 0
Query: 74 KKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKIHRHMLTFNYQPDM 133
K+ C A++ F +K S +S T +++A + +L+ G +H + ++
Sbjct: 304 KRGCETVAIEYFFNMRKSSVKSTRS-TLGSVLSAIGIVANLDLGLVVHAEAIKLGLASNI 363
Query: 134 ILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQEDNAIALYVQMLRS 193
+ + +++MY KC ++ A +F+A+ KN V W +MI GY+H G+ + L++ M S
Sbjct: 364 YVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMELFMDMKSS 423
Query: 194 GYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNALISMYTKLSQMADA 253
GY D FTF S++ +C+ D + Q H+ ++K + +L NAL+ MY K + DA
Sbjct: 424 GYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAKCGALEDA 483
Query: 254 KNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPNEFVFGSAFSACSK 313
+ +F + +D ++W ++I + Q E EA F+ M + S AC+
Sbjct: 484 RQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCGIVSDGA-CLASTLKACTH 543
Query: 314 LLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFYHIEKPDLVAWNSI 373
+ G+Q+H L +K GL DL G SL DMY+KCG + AR VF + + +V+ N++
Sbjct: 544 VHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMNAL 603
Query: 374 IAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQGMQVHSYIVRMGF 433
IAG+ S + +E++ F M G+ P+++T +++ AC +P L G Q H I + GF
Sbjct: 604 IAGY-SQNNLEEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRGF 663
Query: 434 DLDIPVCN-SLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACLQQNQAGEVLRLT 493
+ SLL MY + +A +F + + IV W M++ Q E L+
Sbjct: 664 SSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWTGMMSGHSQNGFYEEALKFY 723
Query: 494 KLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTSVSNALINMYTKC 553
K M + PD T VL ++S G +H I +LD SN LI+MY KC
Sbjct: 724 KEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHLAHDLDELTSNTLIDMYAKC 783
Query: 554 GSLGCARKMFDSI-DNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLGVKPNEITFVGI 613
G + + ++FD + +++SW+SLI GYA+ G E+A ++F +MR + P+EITF+G+
Sbjct: 784 GDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKIFDSMRQSHIMPDEITFLGV 843
Query: 614 LTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGAEDFIKHMPFDP 673
LTACSH G V +G K++ M +Y I +H +CMVDLL R G L A+DFI+ P
Sbjct: 844 LTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDLLGRWGYLQEADDFIEAQNLKP 903
Query: 674 DIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASSGCWKDFAQYRS 733
D +W +LL AC++HG+ G+ +AE +++++P NS+A V+L NI+AS GCW+ R
Sbjct: 904 DARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAYVLLSNIYASQGCWEKANALRK 963
Query: 734 SMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQVLDD 784
MR V KVPG SWI+++ + H+F A D H E GKI LE+L + DD
Sbjct: 964 VMRDRGVKKVPGYSWIDVEQRTHIFAAGDKSHSEIGKIEMFLEDLYDLMKDD 1012
BLAST of HG10020254 vs. ExPASy Swiss-Prot
Match:
Q9SVA5 (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E52 PE=3 SV=1)
HSP 1 Score: 465.7 bits (1197), Expect = 1.0e-129
Identity = 249/720 (34.58%), Postives = 408/720 (56.67%), Query Frame = 0
Query: 69 LISLCKKN-CHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGR----KIHRH 128
++S C + + E+L F F + + I ACS L GR ++
Sbjct: 116 MVSACNHHGIYEESLVVFLEFWRTRKDSPNEYILSSFIQACSGLDG--RGRWMVFQLQSF 175
Query: 129 MLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQEDNA 188
++ + D+ + +++ Y K G++ AR +FDA+P K+ V+WT+MISG G+ +
Sbjct: 176 LVKSGFDRDVYVGTLLIDFYLKDGNIDYARLVFDALPEKSTVTWTTMISGCVKMGRSYVS 235
Query: 189 IALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNALISM 248
+ L+ Q++ +PD + +V+ +CS L +Q+HAH+L+ N LI
Sbjct: 236 LQLFYQLMEDNVVPDGYILSTVLSACSILPFLEGGKQIHAHILRYGLEMDASLMNVLIDS 295
Query: 249 YTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPNEFV 308
Y K ++ A +F+ + K++ISW ++++G+ Q EA+ F M S+ +P+ +
Sbjct: 296 YVKCGRVIAAHKLFNGMPNKNIISWTTLLSGYKQNALHKEAMELFTSM-SKFGLKPDMYA 355
Query: 309 FGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFYHIE 368
S ++C+ L +G Q+H IK LG+D + SL DMYAKC L AR VF
Sbjct: 356 CSSILTSCASLHALGFGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFA 415
Query: 369 KPDLVAWNSIIAGFASVG---DAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 428
D+V +N++I G++ +G + E+L+ F MR + P+ +T +SLL A + L
Sbjct: 416 AADVVLFNAMIEGYSRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGL 475
Query: 429 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 488
Q+H + + G +LDI ++L+ +YS C L D+ +F+ + K D+V WN+M +
Sbjct: 476 SKQIHGLMFKYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVK-DLVIWNSMFAGYV 535
Query: 489 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 548
QQ++ E L L + S +PD T N++ ++G +AS ++G + HC ++K GL +
Sbjct: 536 QQSENEEALNLFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPY 595
Query: 549 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 608
++NAL++MY KCGS A K FDS + D++ W+S+I YA G G++A ++ M G
Sbjct: 596 ITNALLDMYAKCGSPEDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEKMMSEG 655
Query: 609 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 668
++PN ITFVG+L+ACSH G+VE+GLK + M + I P EH CMV LL RAG L+ A
Sbjct: 656 IEPNYITFVGVLSACSHAGLVEDGLKQFELML-RFGIEPETEHYVCMVSLLGRAGRLNKA 715
Query: 669 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 728
+ I+ MP P +VW++LL+ C GN+E+ + AAE + DP +S + ML NI+AS
Sbjct: 716 RELIEKMPTKPAAIVWRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSNIYASK 775
Query: 729 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 781
G W + + R M+ V K PG+SWI I +VH+FL++D H + +IY +L++L++Q+
Sbjct: 776 GMWTEAKKVRERMKVEGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDDLLVQI 830
BLAST of HG10020254 vs. ExPASy TrEMBL
Match:
A0A0A0LDF3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736910 PE=4 SV=1)
HSP 1 Score: 1481.8 bits (3835), Expect = 0.0e+00
Identity = 720/790 (91.14%), Postives = 757/790 (95.82%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAFSPFPVT 60
MAFIRT +SIYF+TLASDAIVFFNPFSPT+S KNFKPQLHLALSHIRLNTQLAFSP P+T
Sbjct: 1 MAFIRTFSSIYFLTLASDAIVFFNPFSPTSSIKNFKPQLHLALSHIRLNTQLAFSPCPLT 60
Query: 61 DHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKI 120
HY HDDK+ISLCKKN HREAL+AFDIFQKCS+SPLKS+TYTHLINACSSLRSLEHGRKI
Sbjct: 61 VHYPHDDKIISLCKKNLHREALKAFDIFQKCSSSPLKSVTYTHLINACSSLRSLEHGRKI 120
Query: 121 HRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQE 180
HRHMLT NYQPDMILQNHIL+MYGKCGSLKEARN+FD+MPLKNVVSWTSMISGYS YG+E
Sbjct: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNMFDSMPLKNVVSWTSMISGYSRYGEE 180
Query: 181 DNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNAL 240
DNAI LYVQMLRSG+IPDHFTFGS+VKSCSGLDDF LARQLHAHVLKSEFG+ LIAQNAL
Sbjct: 181 DNAITLYVQMLRSGHIPDHFTFGSIVKSCSGLDDFKLARQLHAHVLKSEFGADLIAQNAL 240
Query: 241 ISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPN 300
ISMYTK SQMADA NVFS IIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQ VYQPN
Sbjct: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
Query: 301 EFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
EFVFGSAFSACSKLLEP+ GRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFL SARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLESARTVFY 360
Query: 361 HIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
HIEKPDLVAWN+IIAGFASV +AKES SFFS+MRHTGLVPNDVTVLSLLCACSEPVMLN
Sbjct: 361 HIEKPDLVAWNAIIAGFASVSNAKESSSFFSQMRHTGLVPNDVTVLSLLCACSEPVMLNH 420
Query: 421 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 480
G+QVHSYIV+MGF+LDIPVCNSLLSMYSKCSNLNDALQ+FE IGNKADIVSWNT+LTACL
Sbjct: 421 GIQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEDIGNKADIVSWNTLLTACL 480
Query: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
QQNQAGEVLRLTKLM AS IKPDHVTLTNVLVSSGQIASYEVGSQ+HCFIMKSGLNLD S
Sbjct: 481 QQNQAGEVLRLTKLMFASRIKPDHVTLTNVLVSSGQIASYEVGSQIHCFIMKSGLNLDIS 540
Query: 541 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 600
VSNALINMYTKCGSL CARKMFDSI NPDIISWSSLIVGYAQAGCG+EAFELFRTMRGLG
Sbjct: 541 VSNALINMYTKCGSLECARKMFDSIGNPDIISWSSLIVGYAQAGCGKEAFELFRTMRGLG 600
Query: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 660
VKPNEITFVGILTACSHIGMVEEGLKLYRTMQE+Y ISPTKEHCSCMVDLLARAGCLD A
Sbjct: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEDYRISPTKEHCSCMVDLLARAGCLDVA 660
Query: 661 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
EDFIK MPF PD+VVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAA+VMLCNIHASS
Sbjct: 661 EDFIKQMPFVPDVVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAAVVMLCNIHASS 720
Query: 721 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 780
G WKDFA+ RSSMR+M+V KVPGQSWIEIKDKVHVFLAED+LHPERGKIYTMLEELMLQ+
Sbjct: 721 GHWKDFARLRSSMRRMDVGKVPGQSWIEIKDKVHVFLAEDNLHPERGKIYTMLEELMLQI 780
Query: 781 LDDGCDPIKV 791
LDDGCDP+++
Sbjct: 781 LDDGCDPLQM 790
BLAST of HG10020254 vs. ExPASy TrEMBL
Match:
A0A5A7TPL0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G002390 PE=4 SV=1)
HSP 1 Score: 1472.6 bits (3811), Expect = 0.0e+00
Identity = 718/790 (90.89%), Postives = 750/790 (94.94%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAFSPFPVT 60
MA IRT NSIYF+TLASDAIVFFNPFSP+TSTKNFKPQLHLA SHIRL TQLAFSP PVT
Sbjct: 1 MAPIRTFNSIYFLTLASDAIVFFNPFSPSTSTKNFKPQLHLAHSHIRLGTQLAFSPCPVT 60
Query: 61 DHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKI 120
HY HDDK+ISLCKKN HREALQAFDIF+KCS+SPLKSITYTHLINACSSLRSLEHGRKI
Sbjct: 61 VHYPHDDKIISLCKKNLHREALQAFDIFRKCSSSPLKSITYTHLINACSSLRSLEHGRKI 120
Query: 121 HRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQE 180
HRHMLT NYQPDMILQNHIL+MYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYS YGQE
Sbjct: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSRYGQE 180
Query: 181 DNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNAL 240
DNAI LY+QMLRSGYIPDHFTFGS+VKSCSGLDDFMLARQLHAHVLK EFG HLIAQNAL
Sbjct: 181 DNAITLYIQMLRSGYIPDHFTFGSIVKSCSGLDDFMLARQLHAHVLKFEFGGHLIAQNAL 240
Query: 241 ISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPN 300
ISMYTK SQMADA NVFS IIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQ VYQPN
Sbjct: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
Query: 301 EFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
EFVFGSAFSACSKLLEP+ GRQIHGLCIK GLGSD+FAGCSLCDMYAKCGFL SARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKLGLGSDIFAGCSLCDMYAKCGFLESARTVFY 360
Query: 361 HIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
HIEKPDLVAWN+IIAGFASV +AKESLSFFS+MRH G+VPNDVTVLSLLCACSEPVMLN
Sbjct: 361 HIEKPDLVAWNAIIAGFASVSNAKESLSFFSQMRHRGVVPNDVTVLSLLCACSEPVMLNN 420
Query: 421 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 480
GMQVHSYIV+MGF+LDIPVCNSLLSMYSKCSNLNDALQ+FEGIGNKADIVSWNT+LTACL
Sbjct: 421 GMQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEGIGNKADIVSWNTLLTACL 480
Query: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
QQNQAGEVLRLTKLM AS I PDHVTLTNVLVSSGQIASYEVGSQ+HCFIMKSGLNLDTS
Sbjct: 481 QQNQAGEVLRLTKLMFASRIMPDHVTLTNVLVSSGQIASYEVGSQIHCFIMKSGLNLDTS 540
Query: 541 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 600
VSNALINMYTKCGSL CARKMFDSI NPDIISWSSLIVGYAQAGCG+EAFELFRTMRGLG
Sbjct: 541 VSNALINMYTKCGSLECARKMFDSIGNPDIISWSSLIVGYAQAGCGKEAFELFRTMRGLG 600
Query: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 660
VKPNEITF+GILTACSHIGMVEEGLKLYRTMQE+ ISPTKEHCSC+VDLLARAGCLD A
Sbjct: 601 VKPNEITFLGILTACSHIGMVEEGLKLYRTMQEDCRISPTKEHCSCIVDLLARAGCLDVA 660
Query: 661 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
E+FIK MPF PD+VVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAA+VMLCNIHASS
Sbjct: 661 EEFIKQMPFVPDVVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAAVVMLCNIHASS 720
Query: 721 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 780
G WKD AQ R SMR+M+V KVPGQSWIEIKDKVHVFLAED+LHPERGKIYTMLEELMLQ
Sbjct: 721 GHWKDLAQLRISMRRMDVGKVPGQSWIEIKDKVHVFLAEDNLHPERGKIYTMLEELMLQT 780
Query: 781 LDDGCDPIKV 791
LDD CDP+++
Sbjct: 781 LDDSCDPLQM 790
BLAST of HG10020254 vs. ExPASy TrEMBL
Match:
A0A1S3B684 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103486463 PE=4 SV=1)
HSP 1 Score: 1472.6 bits (3811), Expect = 0.0e+00
Identity = 718/790 (90.89%), Postives = 750/790 (94.94%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAFSPFPVT 60
MA IRT NSIYF+TLASDAIVFFNPFSP+TSTKNFKPQLHLA SHIRL TQLAFSP PVT
Sbjct: 1 MAPIRTFNSIYFLTLASDAIVFFNPFSPSTSTKNFKPQLHLAHSHIRLGTQLAFSPCPVT 60
Query: 61 DHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKI 120
HY HDDK+ISLCKKN HREALQAFDIF+KCS+SPLKSITYTHLINACSSLRSLEHGRKI
Sbjct: 61 VHYPHDDKIISLCKKNLHREALQAFDIFRKCSSSPLKSITYTHLINACSSLRSLEHGRKI 120
Query: 121 HRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQE 180
HRHMLT NYQPDMILQNHIL+MYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYS YGQE
Sbjct: 121 HRHMLTCNYQPDMILQNHILSMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSRYGQE 180
Query: 181 DNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNAL 240
DNAI LY+QMLRSGYIPDHFTFGS+VKSCSGLDDFMLARQLHAHVLK EFG HLIAQNAL
Sbjct: 181 DNAITLYIQMLRSGYIPDHFTFGSIVKSCSGLDDFMLARQLHAHVLKFEFGGHLIAQNAL 240
Query: 241 ISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPN 300
ISMYTK SQMADA NVFS IIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQ VYQPN
Sbjct: 241 ISMYTKFSQMADAINVFSRIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQSVYQPN 300
Query: 301 EFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
EFVFGSAFSACSKLLEP+ GRQIHGLCIK GLGSD+FAGCSLCDMYAKCGFL SARTVFY
Sbjct: 301 EFVFGSAFSACSKLLEPDCGRQIHGLCIKLGLGSDIFAGCSLCDMYAKCGFLESARTVFY 360
Query: 361 HIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
HIEKPDLVAWN+IIAGFASV +AKESLSFFS+MRH G+VPNDVTVLSLLCACSEPVMLN
Sbjct: 361 HIEKPDLVAWNAIIAGFASVSNAKESLSFFSQMRHRGVVPNDVTVLSLLCACSEPVMLNN 420
Query: 421 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 480
GMQVHSYIV+MGF+LDIPVCNSLLSMYSKCSNLNDALQ+FEGIGNKADIVSWNT+LTACL
Sbjct: 421 GMQVHSYIVKMGFNLDIPVCNSLLSMYSKCSNLNDALQVFEGIGNKADIVSWNTLLTACL 480
Query: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
QQNQAGEVLRLTKLM AS I PDHVTLTNVLVSSGQIASYEVGSQ+HCFIMKSGLNLDTS
Sbjct: 481 QQNQAGEVLRLTKLMFASRIMPDHVTLTNVLVSSGQIASYEVGSQIHCFIMKSGLNLDTS 540
Query: 541 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 600
VSNALINMYTKCGSL CARKMFDSI NPDIISWSSLIVGYAQAGCG+EAFELFRTMRGLG
Sbjct: 541 VSNALINMYTKCGSLECARKMFDSIGNPDIISWSSLIVGYAQAGCGKEAFELFRTMRGLG 600
Query: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 660
VKPNEITF+GILTACSHIGMVEEGLKLYRTMQE+ ISPTKEHCSC+VDLLARAGCLD A
Sbjct: 601 VKPNEITFLGILTACSHIGMVEEGLKLYRTMQEDCRISPTKEHCSCIVDLLARAGCLDVA 660
Query: 661 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
E+FIK MPF PD+VVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAA+VMLCNIHASS
Sbjct: 661 EEFIKQMPFVPDVVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAAVVMLCNIHASS 720
Query: 721 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 780
G WKD AQ R SMR+M+V KVPGQSWIEIKDKVHVFLAED+LHPERGKIYTMLEELMLQ
Sbjct: 721 GHWKDLAQLRISMRRMDVGKVPGQSWIEIKDKVHVFLAEDNLHPERGKIYTMLEELMLQT 780
Query: 781 LDDGCDPIKV 791
LDD CDP+++
Sbjct: 781 LDDSCDPLQM 790
BLAST of HG10020254 vs. ExPASy TrEMBL
Match:
A0A6J1J911 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111482386 PE=4 SV=1)
HSP 1 Score: 1409.0 bits (3646), Expect = 0.0e+00
Identity = 685/790 (86.71%), Postives = 732/790 (92.66%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAFSPFPVT 60
MAFIRTLNSIYFVTL+S+AIVFFNP SP TS KNFKPQLHLALSHIR ++ LA P PVT
Sbjct: 29 MAFIRTLNSIYFVTLSSEAIVFFNPCSPATSIKNFKPQLHLALSHIRFSSLLASPPSPVT 88
Query: 61 DHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKI 120
+H SHDD +ISLCKK HREALQAFDIF KCS+SPL SITYTHLI+ACSSLR LEHGRKI
Sbjct: 89 EH-SHDDNIISLCKKKLHREALQAFDIFHKCSSSPLNSITYTHLIHACSSLRFLEHGRKI 148
Query: 121 HRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQE 180
H HM TFNYQPD+ILQNHILNMYGKCGSLKEARNIFDAMPLKN VSWTSMISGYSHYG++
Sbjct: 149 HCHMSTFNYQPDLILQNHILNMYGKCGSLKEARNIFDAMPLKNAVSWTSMISGYSHYGED 208
Query: 181 DNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNAL 240
DNAI LYVQMLRSG+IPDHFTFGSVVKSCSGLDD MLARQLHAHVLKSEFG + IAQNAL
Sbjct: 209 DNAITLYVQMLRSGHIPDHFTFGSVVKSCSGLDDLMLARQLHAHVLKSEFGGNPIAQNAL 268
Query: 241 ISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPN 300
ISMYTK SQ+ADA NVFSHII KDLISWGSMIAGFSQLG E+EALCHFREMLSQP+YQPN
Sbjct: 269 ISMYTKFSQIADATNVFSHIITKDLISWGSMIAGFSQLGCEIEALCHFREMLSQPIYQPN 328
Query: 301 EFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFY 360
EFVFGSAFSACS L EPN GRQIHGLCIKFGLGSD FAGCSLCDMYAKCGFLGSARTVF
Sbjct: 329 EFVFGSAFSACSNLSEPNCGRQIHGLCIKFGLGSDRFAGCSLCDMYAKCGFLGSARTVFC 388
Query: 361 HIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 420
IEKPDLVAWN+IIAGFASVGDAKESLSFFS+MRHTGL NDVTVLSLLCACSEP+MLNQ
Sbjct: 389 QIEKPDLVAWNAIIAGFASVGDAKESLSFFSQMRHTGLASNDVTVLSLLCACSEPMMLNQ 448
Query: 421 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 480
GMQVHSYIV+ GFDL++ VCN LLSMYSKCS+LND+L+IFE IGNKAD+VSWNTMLTAC
Sbjct: 449 GMQVHSYIVKTGFDLEVLVCNGLLSMYSKCSDLNDSLKIFEDIGNKADVVSWNTMLTACR 508
Query: 481 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 540
+NQAGEVLRL KLMLASHIK D VTLTNVLVSSG IASYEVGSQVHCFIMKSG NLDTS
Sbjct: 509 LRNQAGEVLRLMKLMLASHIKSDRVTLTNVLVSSGLIASYEVGSQVHCFIMKSGQNLDTS 568
Query: 541 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 600
VSNALINMY KCGSLGCARKMFDSID+PD+ISWSSLIVGYAQAGCGEEAFELFRTMRGLG
Sbjct: 569 VSNALINMYMKCGSLGCARKMFDSIDDPDVISWSSLIVGYAQAGCGEEAFELFRTMRGLG 628
Query: 601 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 660
++PNEITF+GILTAC H+GMVEEGL+LYRTMQE+ DISPTKEHCSC+VDLLARAGCLD A
Sbjct: 629 IRPNEITFLGILTACCHVGMVEEGLRLYRTMQEQDDISPTKEHCSCIVDLLARAGCLDAA 688
Query: 661 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 720
EDFIK MPF+PDIVVW TLLAACKVHGNLEVGKRAAENVL+ DPSNSAALVMLCNIHASS
Sbjct: 689 EDFIKKMPFEPDIVVWMTLLAACKVHGNLEVGKRAAENVLRNDPSNSAALVMLCNIHASS 748
Query: 721 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 780
G WKDFA+ R SMR+M+VSKVPGQSWIEIKD+VHVF AEDSLHPERG+IY MLEELMLQ+
Sbjct: 749 GHWKDFARLRRSMRRMDVSKVPGQSWIEIKDRVHVFFAEDSLHPERGRIYIMLEELMLQL 808
Query: 781 LDDGCDPIKV 791
LDD CDP+++
Sbjct: 809 LDDSCDPLQM 817
BLAST of HG10020254 vs. ExPASy TrEMBL
Match:
A0A6J1F1S7 (pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111441412 PE=4 SV=1)
HSP 1 Score: 1409.0 bits (3646), Expect = 0.0e+00
Identity = 688/791 (86.98%), Postives = 735/791 (92.92%), Query Frame = 0
Query: 1 MAFIRTLNSIYFVTLASDAIVFFNPFSPTTSTKNFKPQLHLALSHIRLNTQLAF-SPFPV 60
MAFIRTL+SIYFVTL+S+AIVFFNP SP TS KNFKPQL LALSHIR ++ LA P PV
Sbjct: 29 MAFIRTLHSIYFVTLSSEAIVFFNPCSPATSIKNFKPQLRLALSHIRFSSLLASPPPSPV 88
Query: 61 TDHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRK 120
T+H S+DD +ISLCKK HREALQAFDIFQKCS+SPL SITYTHLI+ACSSLRSLEHGRK
Sbjct: 89 TEH-SYDDNIISLCKKKLHREALQAFDIFQKCSSSPLNSITYTHLIHACSSLRSLEHGRK 148
Query: 121 IHRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQ 180
IH HM TFNYQPD+ILQNHILNMYGKCGSLKEARNIFDAMPLKN VSWTSMISGYSHYGQ
Sbjct: 149 IHCHMSTFNYQPDLILQNHILNMYGKCGSLKEARNIFDAMPLKNAVSWTSMISGYSHYGQ 208
Query: 181 EDNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNA 240
+DNAI LYVQMLRSG+IPDHFTFGSVVKSCSGLDD MLARQLHAHVLKSEFG + IAQNA
Sbjct: 209 DDNAITLYVQMLRSGHIPDHFTFGSVVKSCSGLDDLMLARQLHAHVLKSEFGGNPIAQNA 268
Query: 241 LISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQP 300
LISMYTK SQ+ADA NVFSHIIIKDLISWGSMIAGFSQLGYE+EALCHFREMLSQ +YQP
Sbjct: 269 LISMYTKFSQIADATNVFSHIIIKDLISWGSMIAGFSQLGYEIEALCHFREMLSQAIYQP 328
Query: 301 NEFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVF 360
NEFVFGSAFSACS L EPN GRQIHGLCIKFGLGSD FAGCSLCDMYAKCGFLGSARTVF
Sbjct: 329 NEFVFGSAFSACSSLSEPNCGRQIHGLCIKFGLGSDRFAGCSLCDMYAKCGFLGSARTVF 388
Query: 361 YHIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLN 420
IEKPDLVAWN+IIAGFASVGDAKESLSFFS+MRHTGL NDVTVLSLLCACS+P+MLN
Sbjct: 389 CQIEKPDLVAWNAIIAGFASVGDAKESLSFFSQMRHTGLASNDVTVLSLLCACSDPMMLN 448
Query: 421 QGMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTAC 480
QGMQVHSYIV+ GFDL++PVCN LLSMYSKCS LND+L+IFE IGNKADIVSWNTMLTAC
Sbjct: 449 QGMQVHSYIVKTGFDLEVPVCNGLLSMYSKCSVLNDSLKIFEDIGNKADIVSWNTMLTAC 508
Query: 481 LQQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDT 540
QNQAGEVLRL KLMLASHIK D VTLTNVLVSSGQIASYEVGSQVHCFIMKSG NLDT
Sbjct: 509 RLQNQAGEVLRLMKLMLASHIKSDRVTLTNVLVSSGQIASYEVGSQVHCFIMKSGQNLDT 568
Query: 541 SVSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGL 600
SVSNALINMY KCGSLGCARKMFDSI++PD+ISWSSLIVGYAQAGCGEEAFELFRTMRGL
Sbjct: 569 SVSNALINMYMKCGSLGCARKMFDSINDPDVISWSSLIVGYAQAGCGEEAFELFRTMRGL 628
Query: 601 GVKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDG 660
G++PNEITF+GILTAC HIGMVEEGL+LYRTMQE+ ISPTKEHCSC+VDLLARAGCLD
Sbjct: 629 GIRPNEITFIGILTACCHIGMVEEGLRLYRTMQEQDGISPTKEHCSCIVDLLARAGCLDA 688
Query: 661 AEDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHAS 720
AEDFIK MPF+PDIVVW TLLAACK+HGNLEVGKRAAENVLK DPSNSAALVMLCNIHAS
Sbjct: 689 AEDFIKKMPFEPDIVVWMTLLAACKLHGNLEVGKRAAENVLKNDPSNSAALVMLCNIHAS 748
Query: 721 SGCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQ 780
SG WKDFA+ R SMR+M+VSKVPGQSWIEIKD+VHVF AEDSLHPERG+IYTMLEELM+Q
Sbjct: 749 SGHWKDFARLRRSMRRMDVSKVPGQSWIEIKDRVHVFFAEDSLHPERGRIYTMLEELMVQ 808
Query: 781 VLDDGCDPIKV 791
+LDD CDP+++
Sbjct: 809 ILDDSCDPLQM 818
BLAST of HG10020254 vs. TAIR 10
Match:
AT3G53360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 896.0 bits (2314), Expect = 2.2e-260
Identity = 426/722 (59.00%), Postives = 552/722 (76.45%), Query Frame = 0
Query: 66 DDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKIHRHML 125
+D + SLCK N +REAL+AFD QK S+ ++ TY LI ACSS RSL GRKIH H+L
Sbjct: 35 NDHINSLCKSNFYREALEAFDFAQKNSSFKIRLRTYISLICACSSSRSLAQGRKIHDHIL 94
Query: 126 TFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQEDNAIA 185
N + D IL NHIL+MYGKCGSL++AR +FD MP +N+VS+TS+I+GYS GQ AI
Sbjct: 95 NSNCKYDTILNNHILSMYGKCGSLRDAREVFDFMPERNLVSYTSVITGYSQNGQGAEAIR 154
Query: 186 LYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNALISMYT 245
LY++ML+ +PD F FGS++K+C+ D L +QLHA V+K E SHLIAQNALI+MY
Sbjct: 155 LYLKMLQEDLVPDQFAFGSIIKACASSSDVGLGKQLHAQVIKLESSSHLIAQNALIAMYV 214
Query: 246 KLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPNEFVFG 305
+ +QM+DA VF I +KDLISW S+IAGFSQLG+E EAL H +EMLS V+ PNE++FG
Sbjct: 215 RFNQMSDASRVFYGIPMKDLISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFG 274
Query: 306 SAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFYHIEKP 365
S+ ACS LL P+YG QIHGLCIK L + AGCSLCDMYA+CGFL SAR VF IE+P
Sbjct: 275 SSLKACSSLLRPDYGSQIHGLCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQIERP 334
Query: 366 DLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQGMQVH 425
D +WN IIAG A+ G A E++S FS+MR +G +P+ +++ SLLCA ++P+ L+QGMQ+H
Sbjct: 335 DTASWNVIIAGLANNGYADEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIH 394
Query: 426 SYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACLQQNQA 485
SYI++ GF D+ VCNSLL+MY+ CS+L +FE N AD VSWNT+LTACLQ Q
Sbjct: 395 SYIIKWGFLADLTVCNSLLTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQP 454
Query: 486 GEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTSVSNAL 545
E+LRL KLML S +PDH+T+ N+L +I+S ++GSQVHC+ +K+GL + + N L
Sbjct: 455 VEMLRLFKLMLVSECEPDHITMGNLLRGCVEISSLKLGSQVHCYSLKTGLAPEQFIKNGL 514
Query: 546 INMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLGVKPNE 605
I+MY KCGSLG AR++FDS+DN D++SWS+LIVGYAQ+G GEEA LF+ M+ G++PN
Sbjct: 515 IDMYAKCGSLGQARRIFDSMDNRDVVSWSTLIVGYAQSGFGEEALILFKEMKSAGIEPNH 574
Query: 606 ITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGAEDFIK 665
+TFVG+LTACSH+G+VEEGLKLY TMQ E+ ISPTKEHCSC+VDLLARAG L+ AE FI
Sbjct: 575 VTFVGVLTACSHVGLVEEGLKLYATMQTEHGISPTKEHCSCVVDLLARAGRLNEAERFID 634
Query: 666 HMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASSGCWKD 725
M +PD+VVWKTLL+ACK GN+ + ++AAEN+LKIDP NS A V+LC++HASSG W++
Sbjct: 635 EMKLEPDVVVWKTLLSACKTQGNVHLAQKAAENILKIDPFNSTAHVLLCSMHASSGNWEN 694
Query: 726 FAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQVLDDGC 785
A RSSM++ +V K+PGQSWIEI+DK+H+F AED HPER IYT+L + Q+LD+ C
Sbjct: 695 AALLRSSMKKHDVKKIPGQSWIEIEDKIHIFFAEDIFHPERDDIYTVLHNIWSQMLDE-C 754
Query: 786 DP 788
+P
Sbjct: 755 NP 755
BLAST of HG10020254 vs. TAIR 10
Match:
AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 471.9 bits (1213), Expect = 1.0e-132
Identity = 237/730 (32.47%), Postives = 401/730 (54.93%), Query Frame = 0
Query: 58 PVTDHYSHDDKLISLCKKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHG 117
P + Y + + + K EAL+ + ++ SP K T+ +I AC+ L E G
Sbjct: 67 PAKNVYLWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDK-YTFPSVIKACAGLFDAEMG 126
Query: 118 RKIHRHMLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHY 177
++ +L ++ D+ + N +++MY + G L AR +FD MP++++VSW S+ISGYS +
Sbjct: 127 DLVYEQILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSH 186
Query: 178 GQEDNAIALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQ 237
G + A+ +Y ++ S +PD FT SV+ + L + LH LKS S ++
Sbjct: 187 GYYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVN 246
Query: 238 NALISMYTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVY 297
N L++MY K + DA+ VF + ++D +S+ +MI G+ +L E++ F E L Q +
Sbjct: 247 NGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLENLDQ--F 306
Query: 298 QPNEFVFGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSART 357
+P+ S AC L + + + I+ +K G + L D+YAKCG + +AR
Sbjct: 307 KPDLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARD 366
Query: 358 VFYHIEKPDLVAWNSIIAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVM 417
VF +E D V+WNSII+G+ GD E++ F M + +T L L+ +
Sbjct: 367 VFNSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLAD 426
Query: 418 LNQGMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLT 477
L G +HS ++ G +D+ V N+L+ MY+KC + D+L+IF +G D V+WNT+++
Sbjct: 427 LKFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGT-GDTVTWNTVIS 486
Query: 478 ACLQQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNL 537
AC++ L++T M S + PD T L +A+ +G ++HC +++ G
Sbjct: 487 ACVRFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYES 546
Query: 538 DTSVSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMR 597
+ + NALI MY+KCG L + ++F+ + D+++W+ +I Y G GE+A E F M
Sbjct: 547 ELQIGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADME 606
Query: 598 GLGVKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCL 657
G+ P+ + F+ I+ ACSH G+V+EGL + M+ Y I P EH +C+VDLL+R+ +
Sbjct: 607 KSGIVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKI 666
Query: 658 DGAEDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIH 717
AE+FI+ MP PD +W ++L AC+ G++E +R + +++++P + ++ N +
Sbjct: 667 SKAEEFIQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAY 726
Query: 718 ASSGCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELM 777
A+ W + R S++ +++K PG SWIE+ VHVF + D P+ IY LE L
Sbjct: 727 AALRKWDKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQSEAIYKSLEILY 786
Query: 778 LQVLDDGCDP 788
+ +G P
Sbjct: 787 SLMAKEGYIP 792
BLAST of HG10020254 vs. TAIR 10
Match:
AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 471.1 bits (1211), Expect = 1.8e-132
Identity = 248/719 (34.49%), Postives = 410/719 (57.02%), Query Frame = 0
Query: 79 REALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKIHRHMLTFNYQPDMILQNH 138
R A+ A D+ + P+ S+T++ L+ +C R G+ +H ++ F+ +PD +L N
Sbjct: 43 RGAVSALDLMARDGIRPMDSVTFSSLLKSCIRARDFRLGKLVHARLIEFDIEPDSVLYNS 102
Query: 139 ILNMYGKCGSLKEARNIFDAM---PLKNVVSWTSMISGYSHYGQEDNAIALYVQMLRSGY 198
++++Y K G +A ++F+ M ++VVSW++M++ Y + G+E +AI ++V+ L G
Sbjct: 103 LISLYSKSGDSAKAEDVFETMRRFGKRDVVSWSAMMACYGNNGRELDAIKVFVEFLELGL 162
Query: 199 IPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKS-EFGSHLIAQNALISMYTK-LSQMADA 258
+P+ + + +V+++CS D + R ++K+ F S + +LI M+ K + +A
Sbjct: 163 VPNDYCYTAVIRACSNSDFVGVGRVTLGFLMKTGHFESDVCVGCSLIDMFVKGENSFENA 222
Query: 259 KNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPNEFVFGSAFSACSK 318
VF + ++++W MI Q+G+ EA+ F +M+ ++ ++F S FSAC++
Sbjct: 223 YKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG-FESDKFTLSSVFSACAE 282
Query: 319 LLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKC---GFLGSARTVFYHIEKPDLVAW 378
L + G+Q+H I+ GL D+ CSL DMYAKC G + R VF +E +++W
Sbjct: 283 LENLSLGKQLHSWAIRSGLVDDV--ECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSW 342
Query: 379 NSIIAGF-ASVGDAKESLSFFSRMRHTGLV-PNDVTVLSLLCACSEPVMLNQGMQVHSYI 438
++I G+ + A E+++ FS M G V PN T S AC G QV
Sbjct: 343 TALITGYMKNCNLATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQA 402
Query: 439 VRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACLQQNQAGEV 498
+ G + V NS++SM+ K + DA + FE + K ++VS+NT L + +
Sbjct: 403 FKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLSEK-NLVSYNTFLDGTCRNLNFEQA 462
Query: 499 LRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTSVSNALINM 558
+L + + T ++L + S G Q+H ++K GL+ + V NALI+M
Sbjct: 463 FKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALISM 522
Query: 559 YTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLGVKPNEITF 618
Y+KCGS+ A ++F+ ++N ++ISW+S+I G+A+ G E F M GVKPNE+T+
Sbjct: 523 YSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVTY 582
Query: 619 VGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGAEDFIKHMP 678
V IL+ACSH+G+V EG + + +M E++ I P EH +CMVDLL RAG L A +FI MP
Sbjct: 583 VAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTMP 642
Query: 679 FDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASSGCWKDFAQ 738
F D++VW+T L AC+VH N E+GK AA +L++DP+ AA + L NI+A +G W++ +
Sbjct: 643 FQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEESTE 702
Query: 739 YRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQVLDDGCDP 788
R M++ N+ K G SWIE+ DK+H F D+ HP +IY L+ L+ ++ G P
Sbjct: 703 MRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYVP 757
BLAST of HG10020254 vs. TAIR 10
Match:
AT3G09040.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 469.2 bits (1206), Expect = 6.7e-132
Identity = 245/712 (34.41%), Postives = 390/712 (54.78%), Query Frame = 0
Query: 74 KKNCHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGRKIHRHMLTFNYQPDM 133
K+ C A++ F +K S +S T +++A + +L+ G +H + ++
Sbjct: 304 KRGCETVAIEYFFNMRKSSVKSTRS-TLGSVLSAIGIVANLDLGLVVHAEAIKLGLASNI 363
Query: 134 ILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQEDNAIALYVQMLRS 193
+ + +++MY KC ++ A +F+A+ KN V W +MI GY+H G+ + L++ M S
Sbjct: 364 YVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMELFMDMKSS 423
Query: 194 GYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNALISMYTKLSQMADA 253
GY D FTF S++ +C+ D + Q H+ ++K + +L NAL+ MY K + DA
Sbjct: 424 GYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAKCGALEDA 483
Query: 254 KNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPNEFVFGSAFSACSK 313
+ +F + +D ++W ++I + Q E EA F+ M + S AC+
Sbjct: 484 RQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCGIVSDGA-CLASTLKACTH 543
Query: 314 LLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFYHIEKPDLVAWNSI 373
+ G+Q+H L +K GL DL G SL DMY+KCG + AR VF + + +V+ N++
Sbjct: 544 VHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMNAL 603
Query: 374 IAGFASVGDAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQGMQVHSYIVRMGF 433
IAG+ S + +E++ F M G+ P+++T +++ AC +P L G Q H I + GF
Sbjct: 604 IAGY-SQNNLEEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRGF 663
Query: 434 DLDIPVCN-SLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACLQQNQAGEVLRLT 493
+ SLL MY + +A +F + + IV W M++ Q E L+
Sbjct: 664 SSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWTGMMSGHSQNGFYEEALKFY 723
Query: 494 KLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTSVSNALINMYTKC 553
K M + PD T VL ++S G +H I +LD SN LI+MY KC
Sbjct: 724 KEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHLAHDLDELTSNTLIDMYAKC 783
Query: 554 GSLGCARKMFDSI-DNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLGVKPNEITFVGI 613
G + + ++FD + +++SW+SLI GYA+ G E+A ++F +MR + P+EITF+G+
Sbjct: 784 GDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKIFDSMRQSHIMPDEITFLGV 843
Query: 614 LTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGAEDFIKHMPFDP 673
LTACSH G V +G K++ M +Y I +H +CMVDLL R G L A+DFI+ P
Sbjct: 844 LTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDLLGRWGYLQEADDFIEAQNLKP 903
Query: 674 DIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASSGCWKDFAQYRS 733
D +W +LL AC++HG+ G+ +AE +++++P NS+A V+L NI+AS GCW+ R
Sbjct: 904 DARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAYVLLSNIYASQGCWEKANALRK 963
Query: 734 SMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQVLDD 784
MR V KVPG SWI+++ + H+F A D H E GKI LE+L + DD
Sbjct: 964 VMRDRGVKKVPGYSWIDVEQRTHIFAAGDKSHSEIGKIEMFLEDLYDLMKDD 1012
BLAST of HG10020254 vs. TAIR 10
Match:
AT4G39530.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 465.7 bits (1197), Expect = 7.5e-131
Identity = 249/720 (34.58%), Postives = 408/720 (56.67%), Query Frame = 0
Query: 69 LISLCKKN-CHREALQAFDIFQKCSNSPLKSITYTHLINACSSLRSLEHGR----KIHRH 128
++S C + + E+L F F + + I ACS L GR ++
Sbjct: 116 MVSACNHHGIYEESLVVFLEFWRTRKDSPNEYILSSFIQACSGLDG--RGRWMVFQLQSF 175
Query: 129 MLTFNYQPDMILQNHILNMYGKCGSLKEARNIFDAMPLKNVVSWTSMISGYSHYGQEDNA 188
++ + D+ + +++ Y K G++ AR +FDA+P K+ V+WT+MISG G+ +
Sbjct: 176 LVKSGFDRDVYVGTLLIDFYLKDGNIDYARLVFDALPEKSTVTWTTMISGCVKMGRSYVS 235
Query: 189 IALYVQMLRSGYIPDHFTFGSVVKSCSGLDDFMLARQLHAHVLKSEFGSHLIAQNALISM 248
+ L+ Q++ +PD + +V+ +CS L +Q+HAH+L+ N LI
Sbjct: 236 LQLFYQLMEDNVVPDGYILSTVLSACSILPFLEGGKQIHAHILRYGLEMDASLMNVLIDS 295
Query: 249 YTKLSQMADAKNVFSHIIIKDLISWGSMIAGFSQLGYELEALCHFREMLSQPVYQPNEFV 308
Y K ++ A +F+ + K++ISW ++++G+ Q EA+ F M S+ +P+ +
Sbjct: 296 YVKCGRVIAAHKLFNGMPNKNIISWTTLLSGYKQNALHKEAMELFTSM-SKFGLKPDMYA 355
Query: 309 FGSAFSACSKLLEPNYGRQIHGLCIKFGLGSDLFAGCSLCDMYAKCGFLGSARTVFYHIE 368
S ++C+ L +G Q+H IK LG+D + SL DMYAKC L AR VF
Sbjct: 356 CSSILTSCASLHALGFGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFA 415
Query: 369 KPDLVAWNSIIAGFASVG---DAKESLSFFSRMRHTGLVPNDVTVLSLLCACSEPVMLNQ 428
D+V +N++I G++ +G + E+L+ F MR + P+ +T +SLL A + L
Sbjct: 416 AADVVLFNAMIEGYSRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGL 475
Query: 429 GMQVHSYIVRMGFDLDIPVCNSLLSMYSKCSNLNDALQIFEGIGNKADIVSWNTMLTACL 488
Q+H + + G +LDI ++L+ +YS C L D+ +F+ + K D+V WN+M +
Sbjct: 476 SKQIHGLMFKYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVK-DLVIWNSMFAGYV 535
Query: 489 QQNQAGEVLRLTKLMLASHIKPDHVTLTNVLVSSGQIASYEVGSQVHCFIMKSGLNLDTS 548
QQ++ E L L + S +PD T N++ ++G +AS ++G + HC ++K GL +
Sbjct: 536 QQSENEEALNLFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRGLECNPY 595
Query: 549 VSNALINMYTKCGSLGCARKMFDSIDNPDIISWSSLIVGYAQAGCGEEAFELFRTMRGLG 608
++NAL++MY KCGS A K FDS + D++ W+S+I YA G G++A ++ M G
Sbjct: 596 ITNALLDMYAKCGSPEDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEKMMSEG 655
Query: 609 VKPNEITFVGILTACSHIGMVEEGLKLYRTMQEEYDISPTKEHCSCMVDLLARAGCLDGA 668
++PN ITFVG+L+ACSH G+VE+GLK + M + I P EH CMV LL RAG L+ A
Sbjct: 656 IEPNYITFVGVLSACSHAGLVEDGLKQFELML-RFGIEPETEHYVCMVSLLGRAGRLNKA 715
Query: 669 EDFIKHMPFDPDIVVWKTLLAACKVHGNLEVGKRAAENVLKIDPSNSAALVMLCNIHASS 728
+ I+ MP P +VW++LL+ C GN+E+ + AAE + DP +S + ML NI+AS
Sbjct: 716 RELIEKMPTKPAAIVWRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSNIYASK 775
Query: 729 GCWKDFAQYRSSMRQMNVSKVPGQSWIEIKDKVHVFLAEDSLHPERGKIYTMLEELMLQV 781
G W + + R M+ V K PG+SWI I +VH+FL++D H + +IY +L++L++Q+
Sbjct: 776 GMWTEAKKVRERMKVEGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDDLLVQI 830
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038903429.1 | 0.0e+00 | 93.43 | pentatricopeptide repeat-containing protein At3g53360, mitochondrial isoform X1 ... | [more] |
XP_004137966.1 | 0.0e+00 | 91.14 | pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucumis sa... | [more] |
XP_008442662.1 | 0.0e+00 | 90.89 | PREDICTED: pentatricopeptide repeat-containing protein At3g53360, mitochondrial ... | [more] |
XP_023526527.1 | 0.0e+00 | 86.96 | pentatricopeptide repeat-containing protein At3g53360, mitochondrial [Cucurbita ... | [more] |
XP_022983903.1 | 0.0e+00 | 86.71 | pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like [Cucur... | [more] |
Match Name | E-value | Identity | Description | |
Q9LFI1 | 3.1e-259 | 59.00 | Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidop... | [more] |
Q9SS60 | 1.5e-131 | 32.47 | Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX... | [more] |
Q5G1T1 | 2.5e-131 | 34.49 | Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop... | [more] |
Q9SS83 | 9.5e-131 | 34.41 | Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop... | [more] |
Q9SVA5 | 1.0e-129 | 34.58 | Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana OX... | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LDF3 | 0.0e+00 | 91.14 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736910 PE=4 SV=1 | [more] |
A0A5A7TPL0 | 0.0e+00 | 90.89 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... | [more] |
A0A1S3B684 | 0.0e+00 | 90.89 | pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Cucumis ... | [more] |
A0A6J1J911 | 0.0e+00 | 86.71 | pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like OS=Cuc... | [more] |
A0A6J1F1S7 | 0.0e+00 | 86.98 | pentatricopeptide repeat-containing protein At3g53360, mitochondrial-like OS=Cuc... | [more] |
Match Name | E-value | Identity | Description | |
AT3G53360.1 | 2.2e-260 | 59.00 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT3G03580.1 | 1.0e-132 | 32.47 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT3G49170.1 | 1.8e-132 | 34.49 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT3G09040.1 | 6.7e-132 | 34.41 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT4G39530.1 | 7.5e-131 | 34.58 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |