Tan0016733 (gene) Snake gourd v1

Overview
Name: Tan0016733
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Tetratricopeptide repeat (TPR)-like superfamily protein
Location: LG09: 2901957 .. 2904429 (+)
RNA-Seq Expression: Tan0016733
Synteny: Tan0016733
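The Location field packs the linkage group, start, end, and strand into one string. A minimal sketch of how it could be parsed is shown here; the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG09: 2901957 .. 2904429 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("LG09: 2901957 .. 2904429 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the feature spans 2473 bases on the plus strand of LG09.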
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, polypeptide, three_prime_UTR
ATGAGCGCCAAGAAACTGACATCTTCCACTCATTTCCAGCCAATCCCATTGATAGTAAGGAACTCTCTTCAATGGATTAACAACTCCACCACTTTACAATCAAACCCACCTTTCAGACCAAAAGGGCCATCCATTTGGGCCACAAATCTCATCAAATCATACTTCGACAAAGGCCTAACCAAAGAAGCTCGTAACCTGTTTGATGAAATGCCTGAACGAGATGTGGTGGCCTGGACTGCTATGATTGTTGGCTTTACTTCTTGCAATCACTATACTCAAGCGTGGGCTGTGTTCTGTGAGATGATGAGGAGTGATATCGAGCCAAATGCCTTCACTATGTCTAGTGTTCTCAAGGCTTGCAAGGGCATGAAGGCTCTTTCATGTGGGACTTTGGCTCATGGTTTGGGGATGAAGCACGGTATTGACGGGTCAATGTACGTCCAAAATGCACTTTTGGACATGTATGCTACTTGCTGTGCTACCATGGATGATGCATTGACTGTGTTTAATGATATACCTCTGAAGACTGCTGTGTCATGGACTACTTTGATTGCAGGGTTCACTCACAGAGGTGATGGCTACAGCGGGCTTCAAGTTTTCAGGCAGATGTTACTGGTAACTATAGGCTTAATGATCGTATAGTTCTTCCAGCAATGCATTCTATATTTAGGAAGTACCATAGTCGATTTCCGAGCACATTACCATGCCACATAAGCATTGAACTCACTTAGTTAATGACAGGAACTTATTTGATAAAAAATTGAAAGTTTTAGAGACTAACTAGATGACAATGCTTGAACTTTTTAATTTTTTTTTTATTGATATTAGAAAATTAAATTTAACCTTAATATTATCATATATGTAATATTGAATATTTTGGCGAACAGGAAGACGTTGAACCGAACTCGTTTAGCTTTTCCATTGCGGTTAGAGCCTGTGCTTCAATCAGCTCATATTCATATGGAAAACAAGTACATGCAGCAGTCACCAAATATGGCCTCCATTCTGATGCTCCAGTAATGAATTCAATACTCGACATGTATTGCAGGTGTAACTGTTTATGTGATGCAAAAAGATGCTTTGGTGAAATGACTCAAAAGAATTTGATTACATGGAACACCTTGATAGCAGGATATGAAAGGTCAGATTCCAGCGAGTCTCTAAGTTTATTTTCACAAATGGGGTTTGAAGGCTATGAACCGAACTGTTTTACATTCACAAGTATTACAGCTGCTTGTGCCAATTTAGCAGTCTTAAGCTGCGGACAACAGGTTCATGGTGGAATTGTTCGTAGAGGATTTGACAAGAGTGTAGCATTGGTAAATGCACTTATTGACATGTACGCGAAGTGTGGAAACGTAAATGATTCACACAAACTTTTCTGTGATATGCCTCAAAGAGACTTGGTGTCCTGGACTACCATGATGATTGGCTATGGAGCACATGGATATGGAAAAGAGGCCATTAAGTTGTTCGATGAAATGGTTCAAAGTGGAATTCGACCTGATCGGATAGTGTTCATGGCAGTCGTAAGTGCTTGCAGCCATGCCGGACTTGTGGACAAGGGACTAAGATACTTCAGATCAATGCTGGAAGATTACAGTCTTAACCCCGATCAAGAGATCTATGGGTGTGTGGTGGACTTGCTTGGCCGTGCTGGGAGAGTTGAGGAGGCTTTTCAGCTAGTCAAGAGCATGCCATTCGAACCCGATGAGTCTGTTTGGGGTGCCCTCTTGGGAGCTTGTAAAGCATATGAACTTCCAAATCTAGGAAAATTGGCAGCTCAGAGAGTATTGGATACGAGGCCGAATATGGCGGGGACTTATCTGCTGTTGTCCAATATATATGCAGCTGAAGGTAAATGGGCCGAGTTCGCCAAAATGAGGAAGCTGATGAAAGGGATGGACAACAAGAAAGAAGTGGGTAAGAGTTGGATTGAAATTAGAAATGAAGTTTATAGTTTTGTGGTTGGAAATAAGATGGGCCCTCACATAGAGTGG
GTGCATAAAGTTCTCGAACTACTGATTTGGCATATGAAGGATGACGGGGATATGACAGATTTGGATTACTTGTAGAATATCTTGAAGGAACCTGATTCAGAACATGAATAACCTAACTGCGAAGACCGATCACATGAAACGAAGTTGGAAGTTTGGAATATCAATATCACCTCCTTCCTTTTCCAAAAGGGCCATTGTCAACGTGCTCGAAACAAGAAGGTGAAAATGATATTAGCAAATGAGGTAAGGATGAGAGAATTCTCCTCAGGGAAGCACTTTTCTATGCAAATTCAGTATGGAGGAGGAGATGAAAACTGTGCCACCAATACTGAAAGAGATACATTATGACTGGGGTGACCGTCATATGAAACGCAAGAGATCCGAGTGAAAGGAAAACTACATCCCCGAGCGGCAGTCAAAATGAGGCATGAATGCAAGATGTTAGGGGAGTGATCGGCCGGACAGAGGCTAGC

mRNA sequence

ATGAGCGCCAAGAAACTGACATCTTCCACTCATTTCCAGCCAATCCCATTGATAGTAAGGAACTCTCTTCAATGGATTAACAACTCCACCACTTTACAATCAAACCCACCTTTCAGACCAAAAGGGCCATCCATTTGGGCCACAAATCTCATCAAATCATACTTCGACAAAGGCCTAACCAAAGAAGCTCGTAACCTGTTTGATGAAATGCCTGAACGAGATGTGGTGGCCTGGACTGCTATGATTGTTGGCTTTACTTCTTGCAATCACTATACTCAAGCGTGGGCTGTGTTCTGTGAGATGATGAGGAGTGATATCGAGCCAAATGCCTTCACTATGTCTAGTGTTCTCAAGGCTTGCAAGGGCATGAAGGCTCTTTCATGTGGGACTTTGGCTCATGGTTTGGGGATGAAGCACGGTATTGACGGGTCAATGTACGTCCAAAATGCACTTTTGGACATGTATGCTACTTGCTGTGCTACCATGGATGATGCATTGACTGTGTTTAATGATATACCTCTGAAGACTGCTGTGTCATGGACTACTTTGATTGCAGGGTTCACTCACAGAGGTGATGGCTACAGCGGGCTTCAAGTTTTCAGGCAGATGTTACTGGAAGACGTTGAACCGAACTCGTTTAGCTTTTCCATTGCGGTTAGAGCCTGTGCTTCAATCAGCTCATATTCATATGGAAAACAAGTACATGCAGCAGTCACCAAATATGGCCTCCATTCTGATGCTCCAGTAATGAATTCAATACTCGACATGTATTGCAGGTGTAACTGTTTATGTGATGCAAAAAGATGCTTTGGTGAAATGACTCAAAAGAATTTGATTACATGGAACACCTTGATAGCAGGATATGAAAGGTCAGATTCCAGCGAGTCTCTAAGTTTATTTTCACAAATGGGGTTTGAAGGCTATGAACCGAACTGTTTTACATTCACAAGTATTACAGCTGCTTGTGCCAATTTAGCAGTCTTAAGCTGCGGACAACAGGTTCATGGTGGAATTGTTCGTAGAGGATTTGACAAGAGTGTAGCATTGGTAAATGCACTTATTGACATGTACGCGAAGTGTGGAAACGTAAATGATTCACACAAACTTTTCTGTGATATGCCTCAAAGAGACTTGGTGTCCTGGACTACCATGATGATTGGCTATGGAGCACATGGATATGGAAAAGAGGCCATTAAGTTGTTCGATGAAATGGTTCAAAGTGGAATTCGACCTGATCGGATAGTGTTCATGGCAGTCGTAAGTGCTTGCAGCCATGCCGGACTTGTGGACAAGGGACTAAGATACTTCAGATCAATGCTGGAAGATTACAGTCTTAACCCCGATCAAGAGATCTATGGGTGTGTGGTGGACTTGCTTGGCCGTGCTGGGAGAGTTGAGGAGGCTTTTCAGCTAGTCAAGAGCATGCCATTCGAACCCGATGAGTCTGTTTGGGGTGCCCTCTTGGGAGCTTGTAAAGCATATGAACTTCCAAATCTAGGAAAATTGGCAGCTCAGAGAGTATTGGATACGAGGCCGAATATGGCGGGGACTTATCTGCTGTTGTCCAATATATATGCAGCTGAAGGTAAATGGGCCGAGTTCGCCAAAATGAGGAAGCTGATGAAAGGGATGGACAACAAGAAAGAAGTGGGTAAGAGTTGGATTGAAATTAGAAATGAAGTTTATAGTTTTGTGGTTGGAAATAAGATGGGCCCTCACATAGAGTGGGTGCATAAAGTTCTCGAACTACTGATTTGGCATATGAAGGATGACGGGGATATGACAGATTTGGATTACTTGTAGAATATCTTGAAGGAACCTGATTCAGAACATGAATAACCTAACTGCGAAGACCGATCACATGAAACGAAGTTGGAAGTTTGGAATATCAATATCACCTCCTTCCTTTTCCAAAAGGGCCATTGTCAACGTGCTCGAAACAAGAAGGTGAAAATGATATTAGCAAATGAGGTAAGGATGAGAGAATTCTCCTCAGGGAA
GCACTTTTCTATGCAAATTCAGTATGGAGGAGGAGATGAAAACTGTGCCACCAATACTGAAAGAGATACATTATGACTGGGGTGACCGTCATATGAAACGCAAGAGATCCGAGTGAAAGGAAAACTACATCCCCGAGCGGCAGTCAAAATGAGGCATGAATGCAAGATGTTAGGGGAGTGATCGGCCGGACAGAGGCTAGC

Coding sequence (CDS)

ATGAGCGCCAAGAAACTGACATCTTCCACTCATTTCCAGCCAATCCCATTGATAGTAAGGAACTCTCTTCAATGGATTAACAACTCCACCACTTTACAATCAAACCCACCTTTCAGACCAAAAGGGCCATCCATTTGGGCCACAAATCTCATCAAATCATACTTCGACAAAGGCCTAACCAAAGAAGCTCGTAACCTGTTTGATGAAATGCCTGAACGAGATGTGGTGGCCTGGACTGCTATGATTGTTGGCTTTACTTCTTGCAATCACTATACTCAAGCGTGGGCTGTGTTCTGTGAGATGATGAGGAGTGATATCGAGCCAAATGCCTTCACTATGTCTAGTGTTCTCAAGGCTTGCAAGGGCATGAAGGCTCTTTCATGTGGGACTTTGGCTCATGGTTTGGGGATGAAGCACGGTATTGACGGGTCAATGTACGTCCAAAATGCACTTTTGGACATGTATGCTACTTGCTGTGCTACCATGGATGATGCATTGACTGTGTTTAATGATATACCTCTGAAGACTGCTGTGTCATGGACTACTTTGATTGCAGGGTTCACTCACAGAGGTGATGGCTACAGCGGGCTTCAAGTTTTCAGGCAGATGTTACTGGAAGACGTTGAACCGAACTCGTTTAGCTTTTCCATTGCGGTTAGAGCCTGTGCTTCAATCAGCTCATATTCATATGGAAAACAAGTACATGCAGCAGTCACCAAATATGGCCTCCATTCTGATGCTCCAGTAATGAATTCAATACTCGACATGTATTGCAGGTGTAACTGTTTATGTGATGCAAAAAGATGCTTTGGTGAAATGACTCAAAAGAATTTGATTACATGGAACACCTTGATAGCAGGATATGAAAGGTCAGATTCCAGCGAGTCTCTAAGTTTATTTTCACAAATGGGGTTTGAAGGCTATGAACCGAACTGTTTTACATTCACAAGTATTACAGCTGCTTGTGCCAATTTAGCAGTCTTAAGCTGCGGACAACAGGTTCATGGTGGAATTGTTCGTAGAGGATTTGACAAGAGTGTAGCATTGGTAAATGCACTTATTGACATGTACGCGAAGTGTGGAAACGTAAATGATTCACACAAACTTTTCTGTGATATGCCTCAAAGAGACTTGGTGTCCTGGACTACCATGATGATTGGCTATGGAGCACATGGATATGGAAAAGAGGCCATTAAGTTGTTCGATGAAATGGTTCAAAGTGGAATTCGACCTGATCGGATAGTGTTCATGGCAGTCGTAAGTGCTTGCAGCCATGCCGGACTTGTGGACAAGGGACTAAGATACTTCAGATCAATGCTGGAAGATTACAGTCTTAACCCCGATCAAGAGATCTATGGGTGTGTGGTGGACTTGCTTGGCCGTGCTGGGAGAGTTGAGGAGGCTTTTCAGCTAGTCAAGAGCATGCCATTCGAACCCGATGAGTCTGTTTGGGGTGCCCTCTTGGGAGCTTGTAAAGCATATGAACTTCCAAATCTAGGAAAATTGGCAGCTCAGAGAGTATTGGATACGAGGCCGAATATGGCGGGGACTTATCTGCTGTTGTCCAATATATATGCAGCTGAAGGTAAATGGGCCGAGTTCGCCAAAATGAGGAAGCTGATGAAAGGGATGGACAACAAGAAAGAAGTGGGTAAGAGTTGGATTGAAATTAGAAATGAAGTTTATAGTTTTGTGGTTGGAAATAAGATGGGCCCTCACATAGAGTGGGTGCATAAAGTTCTCGAACTACTGATTTGGCATATGAAGGATGACGGGGATATGACAGATTTGGATTACTTGTAG

Protein sequence

MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLTKEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKACKGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSWTTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTKYGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLFSQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVVSACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPDESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKLMKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLDYL
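The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates that relationship, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 0.

    Unknown codons become 'X'; a trailing partial codon is ignored.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the CDS listed above.
print(translate("ATGAGCGCCAAGAAACTG"))
```

The first six codons of the CDS translate to MSAKKL, the start of the listed protein, and the final codons GAT TAC TTG TAG give DYL followed by a stop.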
Homology
BLAST of Tan0016733 vs. ExPASy Swiss-Prot
Match: Q9FXA9 (Putative pentatricopeptide repeat-containing protein At1g56570 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E64 PE=3 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 1.3e-209
Identity = 351/603 (58.21%), Positives = 460/603 (76.29%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWIN-NSTTLQSNPPFRPKGPSIWATNLIKSYFDKGL 60
           MS  KL  S  F+PIP  VR+SL+     S+     PP++PK   I ATNLI SYF+KGL
Sbjct: 1   MSITKLARSNAFKPIPNFVRSSLRNAGVESSQNTEYPPYKPKKHHILATNLIVSYFEKGL 60

Query: 61  TKEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKA 120
            +EAR+LFDEMP+RDVVAWTAMI G+ S N+  +AW  F EM++    PN FT+SSVLK+
Sbjct: 61  VEEARSLFDEMPDRDVVAWTAMITGYASSNYNARAWECFHEMVKQGTSPNEFTLSSVLKS 120

Query: 121 CKGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVS 180
           C+ MK L+ G L HG+ +K G++GS+YV NA+++MYATC  TM+ A  +F DI +K  V+
Sbjct: 121 CRNMKVLAYGALVHGVVVKLGMEGSLYVDNAMMNMYATCSVTMEAACLIFRDIKVKNDVT 180

Query: 181 WTTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVT 240
           WTTLI GFTH GDG  GL++++QMLLE+ E   +  +IAVRA ASI S + GKQ+HA+V 
Sbjct: 181 WTTLITGFTHLGDGIGGLKMYKQMLLENAEVTPYCITIAVRASASIDSVTTGKQIHASVI 240

Query: 241 KYGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSL 300
           K G  S+ PVMNSILD+YCRC  L +AK  F EM  K+LITWNTLI+  ERSDSSE+L +
Sbjct: 241 KRGFQSNLPVMNSILDLYCRCGYLSEAKHYFHEMEDKDLITWNTLISELERSDSSEALLM 300

Query: 301 FSQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAK 360
           F +   +G+ PNC+TFTS+ AACAN+A L+CGQQ+HG I RRGF+K+V L NALIDMYAK
Sbjct: 301 FQRFESQGFVPNCYTFTSLVAACANIAALNCGQQLHGRIFRRGFNKNVELANALIDMYAK 360

Query: 361 CGNVNDSHKLFCD-MPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMA 420
           CGN+ DS ++F + + +R+LVSWT+MMIGYG+HGYG EA++LFD+MV SGIRPDRIVFMA
Sbjct: 361 CGNIPDSQRVFGEIVDRRNLVSWTSMMIGYGSHGYGAEAVELFDKMVSSGIRPDRIVFMA 420

Query: 421 VVSACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFE 480
           V+SAC HAGLV+KGL+YF  M  +Y +NPD++IY CVVDLLGRAG++ EA++LV+ MPF+
Sbjct: 421 VLSACRHAGLVEKGLKYFNVMESEYGINPDRDIYNCVVDLLGRAGKIGEAYELVERMPFK 480

Query: 481 PDESVWGALLGACKAYELPNL-GKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKM 540
           PDES WGA+LGACKA++   L  +LAA++V++ +P M GTY++LS IYAAEGKW +FA++
Sbjct: 481 PDESTWGAILGACKAHKHNGLISRLAARKVMELKPKMVGTYVMLSYIYAAEGKWVDFARV 540

Query: 541 RKLMKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDL 600
           RK+M+ M NKKE G SWI + N+V+SF V +KM P+   V+ VL LLI   ++ G + +L
Sbjct: 541 RKMMRMMGNKKEAGMSWILVENQVFSFAVSDKMCPNASSVYSVLGLLIEETREAGYVPEL 600
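The percentages in each hit summary are the matched-position counts divided by the alignment length. A small sketch of recomputing them from a summary line follows; the function name `parse_hsp_stats` is hypothetical, and the regular expression is written to tolerate spelling variants of the Positives label.

```python
import re

# Matches e.g. "Identity = 351/603 (58.21%), Positives = 460/603 (76.29%)".
STATS_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def parse_hsp_stats(line):
    """Return identity and positive percentages recomputed from the counts."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"no identity/positives counts in: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        "alignment_length": ident_len,
    }

stats = parse_hsp_stats(
    "Identity = 351/603 (58.21%), Positives = 460/603 (76.29%), Query Frame = 0"
)
print(stats)
```

For the Q9FXA9 hit, 351 of 603 aligned positions are identical and 460 are conservative matches, which reproduces the reported 58.21% and 76.29%.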

BLAST of Tan0016733 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 412.5 bits (1059), Expect = 8.0e-114
Identity = 213/582 (36.60%), Positives = 332/582 (57.04%), Query Frame = 0

Query: 58  GLTKEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVL 117
           G   EA +LF  MPERD   W +M+ GF   +   +A   F  M +     N ++ +SVL
Sbjct: 100 GFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVL 159

Query: 118 KACKGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTA 177
            AC G+  ++ G   H L  K      +Y+ +AL+DMY+  C  ++DA  VF+++  +  
Sbjct: 160 SACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSK-CGNVNDAQRVFDEMGDRNV 219

Query: 178 VSWTTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAA 237
           VSW +LI  F   G     L VF+ ML   VEP+  + +  + ACAS+S+   G++VH  
Sbjct: 220 VSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGR 279

Query: 238 VTKYG-LHSDAPVMNSILDMYCRCNCLCD------------------------------- 297
           V K   L +D  + N+ +DMY +C+ + +                               
Sbjct: 280 VVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKA 339

Query: 298 AKRCFGEMTQKNLITWNTLIAGY-ERSDSSESLSLFSQMGFEGYEPNCFTFTSITAACAN 357
           A+  F +M ++N+++WN LIAGY +  ++ E+LSLF  +  E   P  ++F +I  ACA+
Sbjct: 340 ARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACAD 399

Query: 358 LAVLSCGQQVHGGIVRRGF------DKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDL 417
           LA L  G Q H  +++ GF      +  + + N+LIDMY KCG V + + +F  M +RD 
Sbjct: 400 LAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDC 459

Query: 418 VSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVVSACSHAGLVDKGLRYFRS 477
           VSW  M+IG+  +GYG EA++LF EM++SG +PD I  + V+SAC HAG V++G  YF S
Sbjct: 460 VSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSS 519

Query: 478 MLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPDESVWGALLGACKAYELPN 537
           M  D+ + P ++ Y C+VDLLGRAG +EEA  +++ MP +PD  +WG+LL ACK +    
Sbjct: 520 MTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNIT 579

Query: 538 LGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKLMKGMDNKKEVGKSWIEIR 597
           LGK  A+++L+  P+ +G Y+LLSN+YA  GKW +   +RK M+     K+ G SWI+I+
Sbjct: 580 LGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQ 639

Query: 598 NEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLDYL 601
              + F+V +K  P  + +H +L++LI  M+ + D T++  L
Sbjct: 640 GHDHVFMVKDKSHPRKKQIHSLLDILIAEMRPEQDHTEIGSL 680

BLAST of Tan0016733 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 4.1e-110
Identity = 195/550 (35.45%), Positives = 324/550 (58.91%), Query Frame = 0

Query: 48  TNLIKSYFDKGLTKEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIE 107
           T L   Y       EAR +FD MPERD+V+W  ++ G++       A  +   M   +++
Sbjct: 174 TGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLK 233

Query: 108 PNAFTMSSVLKACKGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALT 167
           P+  T+ SVL A   ++ +S G   HG  M+ G D  + +  AL+DMYA  C +++ A  
Sbjct: 234 PSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAK-CGSLETARQ 293

Query: 168 VFNDIPLKTAVSWTTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISS 227
           +F+ +  +  VSW ++I  +    +    + +F++ML E V+P   S   A+ ACA +  
Sbjct: 294 LFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGD 353

Query: 228 YSYGKQVHAAVTKYGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAG 287
              G+ +H    + GL  +  V+NS++ MYC+C  +  A   FG++  + L++WN +I G
Sbjct: 354 LERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILG 413

Query: 288 YERSDSS-ESLSLFSQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKS 347
           + ++    ++L+ FSQM     +P+ FT+ S+  A A L++    + +HG ++R   DK+
Sbjct: 414 FAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKN 473

Query: 348 VALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQ 407
           V +  AL+DMYAKCG +  +  +F  M +R + +W  M+ GYG HG+GK A++LF+EM +
Sbjct: 474 VFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQK 533

Query: 408 SGIRPDRIVFMAVVSACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVE 467
             I+P+ + F++V+SACSH+GLV+ GL+ F  M E+YS+    + YG +VDLLGRAGR+ 
Sbjct: 534 GTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLN 593

Query: 468 EAFQLVKSMPFEPDESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYA 527
           EA+  +  MP +P  +V+GA+LGAC+ ++  N  + AA+R+ +  P+  G ++LL+NIY 
Sbjct: 594 EAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYR 653

Query: 528 AEGKWAEFAKMRKLMKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIW 587
           A   W +  ++R  M     +K  G S +EI+NEV+SF  G+   P  + ++  LE LI 
Sbjct: 654 AASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLIC 713

Query: 588 HMKDDGDMTD 597
           H+K+ G + D
Sbjct: 714 HIKEAGYVPD 722

BLAST of Tan0016733 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 1.6e-109
Identity = 199/546 (36.45%), Positives = 320/546 (58.61%), Query Frame = 0

Query: 49  NLIKSYFDKGLTKEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEP 108
           +L+  Y        AR +FDEM ERDV++W ++I G+ S     +  +VF +M+ S IE 
Sbjct: 235 SLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEI 294

Query: 109 NAFTMSSVLKACKGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTV 168
           +  T+ SV   C   + +S G   H +G+K          N LLDMY+  C  +D A  V
Sbjct: 295 DLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSK-CGDLDSAKAV 354

Query: 169 FNDIPLKTAVSWTTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSY 228
           F ++  ++ VS+T++IAG+   G     +++F +M  E + P+ ++ +  +  CA     
Sbjct: 355 FREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLL 414

Query: 229 SYGKQVHAAVTKYGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGY 288
             GK+VH  + +  L  D  V N+++DMY +C  + +A+  F EM  K++I+WNT+I GY
Sbjct: 415 DEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGY 474

Query: 289 ERS-DSSESLSLFSQMGFE-GYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKS 348
            ++  ++E+LSLF+ +  E  + P+  T   +  ACA+L+    G+++HG I+R G+   
Sbjct: 475 SKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSD 534

Query: 349 VALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQ 408
             + N+L+DMYAKCG +  +H LF D+  +DLVSWT M+ GYG HG+GKEAI LF++M Q
Sbjct: 535 RHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQ 594

Query: 409 SGIRPDRIVFMAVVSACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVE 468
           +GI  D I F++++ ACSH+GLVD+G R+F  M  +  + P  E Y C+VD+L R G + 
Sbjct: 595 AGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLI 654

Query: 469 EAFQLVKSMPFEPDESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYA 528
           +A++ +++MP  PD ++WGALL  C+ +    L +  A++V +  P   G Y+L++NIYA
Sbjct: 655 KAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYA 714

Query: 529 AEGKWAEFAKMRKLMKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIW 588
              KW +  ++RK +     +K  G SWIEI+  V  FV G+   P  E +   L  +  
Sbjct: 715 EAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRA 774

Query: 589 HMKDDG 593
            M ++G
Sbjct: 775 RMIEEG 779

BLAST of Tan0016733 vs. ExPASy Swiss-Prot
Match: Q5G1T1 (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 396.0 bits (1016), Expect = 7.7e-109
Identity = 199/543 (36.65%), Positives = 327/543 (60.22%), Query Frame = 0

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           + A  +FD+M E +VV WT MI          +A   F +M+ S  E + FT+SSV  AC
Sbjct: 220 ENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSGFESDKFTLSSVFSAC 279

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCA--TMDDALTVFNDIPLKTAV 180
             ++ LS G   H   ++ G+     V+ +L+DMYA C A  ++DD   VF+ +   + +
Sbjct: 280 AELENLSLGKQLHSWAIRSGLVDD--VECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVM 339

Query: 181 SWTTLIAGFTHRGD-GYSGLQVFRQMLLE-DVEPNSFSFSIAVRACASISSYSYGKQVHA 240
           SWT LI G+    +     + +F +M+ +  VEPN F+FS A +AC ++S    GKQV  
Sbjct: 340 SWTALITGYMKNCNLATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLG 399

Query: 241 AVTKYGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERS-DSSE 300
              K GL S++ V NS++ M+ + + + DA+R F  +++KNL+++NT + G  R+ +  +
Sbjct: 400 QAFKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQ 459

Query: 301 SLSLFSQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALID 360
           +  L S++       + FTF S+ +  AN+  +  G+Q+H  +V+ G   +  + NALI 
Sbjct: 460 AFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALIS 519

Query: 361 MYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIV 420
           MY+KCG+++ + ++F  M  R+++SWT+M+ G+  HG+    ++ F++M++ G++P+ + 
Sbjct: 520 MYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVT 579

Query: 421 FMAVVSACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSM 480
           ++A++SACSH GLV +G R+F SM ED+ + P  E Y C+VDLL RAG + +AF+ + +M
Sbjct: 580 YVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTM 639

Query: 481 PFEPDESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFA 540
           PF+ D  VW   LGAC+ +    LGKLAA+++L+  PN    Y+ LSNIYA  GKW E  
Sbjct: 640 PFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEEST 699

Query: 541 KMRKLMKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMT 599
           +MR+ MK  +  KE G SWIE+ ++++ F VG+   P+   ++  L+ LI  +K  G + 
Sbjct: 700 EMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYVP 759

BLAST of Tan0016733 vs. NCBI nr
Match: XP_022931578.1 (putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita moschata])

HSP 1 Score: 1152.9 bits (2981), Expect = 0.0e+00
Identity = 548/598 (91.64%), Positives = 574/598 (95.99%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MSA KL SST F PIPLIVRNSLQWINNSTTLQS PPF PK PSIWATNLIKSYFDKGL+
Sbjct: 1   MSANKLASSTRFHPIPLIVRNSLQWINNSTTLQSTPPFTPKSPSIWATNLIKSYFDKGLS 60

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           K ARNLFDEMPERDVVAWTAMIVGFTSCN YTQ+WAVFCEM+RSDI PNAFT+SSVLKAC
Sbjct: 61  KHARNLFDEMPERDVVAWTAMIVGFTSCNDYTQSWAVFCEMLRSDIHPNAFTLSSVLKAC 120

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGMKALSCGTLAH L  KHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW
Sbjct: 121 KGMKALSCGTLAHSLATKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIAGFTHRGDGYSGLQVFRQMLL++VEPNSFSFSIAVRACASI SY+YGKQ+HAAVTK
Sbjct: 181 TTLIAGFTHRGDGYSGLQVFRQMLLDNVEPNSFSFSIAVRACASIGSYAYGKQIHAAVTK 240

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLHSD PV+NSILDMYCRCNCLCDAKRCFGEMT++NLITWNTLIAGYERSDSSESLSLF
Sbjct: 241 YGLHSDIPVLNSILDMYCRCNCLCDAKRCFGEMTERNLITWNTLIAGYERSDSSESLSLF 300

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
           SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKC
Sbjct: 301 SQMGSEGYEPNCFTFTSITAACANLAVLGCGQQVHGGIIRRGFDNSVALVNALIDMYAKC 360

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           GN+NDSHKLFCDMP+RDLVSWTTMMIGYG+HGYGKE IKLFDEMV+SGI+PDRIVFMAV+
Sbjct: 361 GNINDSHKLFCDMPRRDLVSWTTMMIGYGSHGYGKEVIKLFDEMVRSGIQPDRIVFMAVL 420

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
           SACSHAGLV+KGL YFRSM+EDY+LNPD EIYGCVVDLLGRAGRVEEAFQLV+SMPFEPD
Sbjct: 421 SACSHAGLVNKGLSYFRSMVEDYNLNPDHEIYGCVVDLLGRAGRVEEAFQLVESMPFEPD 480

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACKAYEL NLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKW EFAKMRKL
Sbjct: 481 ESVWGALLGACKAYELSNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKL 540

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLD 599
           MKGMDNKKEVGKSWIEIRNEVYSFVVG+KMGPHIE VHKVL+LL+WHMKDDGD+TDLD
Sbjct: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGDKMGPHIECVHKVLKLLVWHMKDDGDVTDLD 598

BLAST of Tan0016733 vs. NCBI nr
Match: XP_023520790.1 (putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1151.7 bits (2978), Expect = 0.0e+00
Identity = 549/598 (91.81%), Positives = 573/598 (95.82%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MSA KL SST F PIPLIVRNSLQWINNSTTLQS PPF PK PSIWATNLIKSYFDKGL+
Sbjct: 1   MSANKLASSTRFHPIPLIVRNSLQWINNSTTLQSTPPFTPKSPSIWATNLIKSYFDKGLS 60

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           K ARNLFDEMPERDVVAWTAMIVGFTSCN YTQ+WAVFCEM+RS I PNAFT+SSVLKAC
Sbjct: 61  KHARNLFDEMPERDVVAWTAMIVGFTSCNDYTQSWAVFCEMLRSHIHPNAFTLSSVLKAC 120

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGMKALSCGTLAH L  KHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW
Sbjct: 121 KGMKALSCGTLAHSLATKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIAGFTHRGDGYSGLQVFRQMLL++VEPNSFSFSIAVRACASI SY+YGKQ+HAAVTK
Sbjct: 181 TTLIAGFTHRGDGYSGLQVFRQMLLDNVEPNSFSFSIAVRACASIGSYAYGKQIHAAVTK 240

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLHSD PV+NSILDMYCRCNCLCDAKRCFGEMT++NLITWNTLIAGYERSDSSESLSLF
Sbjct: 241 YGLHSDIPVLNSILDMYCRCNCLCDAKRCFGEMTERNLITWNTLIAGYERSDSSESLSLF 300

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
           SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKC
Sbjct: 301 SQMGSEGYEPNCFTFTSITAACANLAVLGCGQQVHGGIIRRGFDNSVALVNALIDMYAKC 360

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           GN+NDSHKLFCDMP+RDLVSWTTMMIGYG+HGYGKE IKLFDEMV+SGI+PDRIVFMAV+
Sbjct: 361 GNINDSHKLFCDMPRRDLVSWTTMMIGYGSHGYGKEVIKLFDEMVRSGIQPDRIVFMAVL 420

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
           SACSHAGLV+KGL YFRSMLEDY+LNPD EIYGCVVDLLGRAGRVEEAFQLV+SMPFEPD
Sbjct: 421 SACSHAGLVNKGLSYFRSMLEDYNLNPDHEIYGCVVDLLGRAGRVEEAFQLVESMPFEPD 480

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACKAYEL NLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKW EFAKMRKL
Sbjct: 481 ESVWGALLGACKAYELSNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKL 540

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLD 599
           MKGMDNKKEVGKSWIEIRNEVYSFVVG+KMGPHIE VHKVL+LLIWHMKDDGD+TDLD
Sbjct: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGDKMGPHIECVHKVLKLLIWHMKDDGDVTDLD 598

BLAST of Tan0016733 vs. NCBI nr
Match: KAG7022386.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1150.2 bits (2974), Expect = 0.0e+00
Identity = 547/598 (91.47%), Positives = 573/598 (95.82%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MSA KL SST F PIPLIVRNSLQWINNSTTLQS PPF PK PSIWATNLIKSYFDKGL+
Sbjct: 1   MSANKLASSTRFHPIPLIVRNSLQWINNSTTLQSTPPFTPKSPSIWATNLIKSYFDKGLS 60

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           K ARNLFDEMPERDVVAWTAMIVGFTSCN YTQ+WAVFCEM+RS I PNAFT+SSVLKAC
Sbjct: 61  KHARNLFDEMPERDVVAWTAMIVGFTSCNDYTQSWAVFCEMLRSHIHPNAFTLSSVLKAC 120

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGMKALSCGTLAH L  KHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW
Sbjct: 121 KGMKALSCGTLAHSLATKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIAGFTHRGDGYSGLQVFRQMLL++VEPNSFSFSIAVRACASI SY+YGKQ+HAAVTK
Sbjct: 181 TTLIAGFTHRGDGYSGLQVFRQMLLDNVEPNSFSFSIAVRACASIGSYAYGKQIHAAVTK 240

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLHSD PV+NSILDMYCRCNCLCDAKRCFGEMT++NLITWNTLIAGYERSDSSESLSLF
Sbjct: 241 YGLHSDIPVLNSILDMYCRCNCLCDAKRCFGEMTERNLITWNTLIAGYERSDSSESLSLF 300

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
           SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKC
Sbjct: 301 SQMGSEGYEPNCFTFTSITAACANLAVLGCGQQVHGGIIRRGFDNSVALVNALIDMYAKC 360

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           GN+NDSHKLFCDMP+RDLVSWTTMMIGYG+HGYGKE IKLFDEMV+SGI+PDRIVFMAV+
Sbjct: 361 GNINDSHKLFCDMPRRDLVSWTTMMIGYGSHGYGKEVIKLFDEMVRSGIQPDRIVFMAVL 420

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
           SACSHAGLV+KGL YFRSM+EDY+LNPD EIYGCVVDLLGRAGRVEEAFQLV+SMPFEPD
Sbjct: 421 SACSHAGLVNKGLSYFRSMVEDYNLNPDHEIYGCVVDLLGRAGRVEEAFQLVESMPFEPD 480

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACKAYEL NLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKW EFAKMRKL
Sbjct: 481 ESVWGALLGACKAYELSNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKL 540

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLD 599
           MKGMDNKKEVGKSWIEIRNEVYSFVVG+KMGPHIE VHKVL+LL+WHMKDDGD+TDLD
Sbjct: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGDKMGPHIECVHKVLKLLVWHMKDDGDVTDLD 598

BLAST of Tan0016733 vs. NCBI nr
Match: XP_022989366.1 (putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita maxima])

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 548/598 (91.64%), Positives = 573/598 (95.82%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MSA KL SST F PIPLI+RNSLQWINNSTTLQS PPF PK PSIWATNLIKSYFD+GL+
Sbjct: 1   MSANKLASSTRFHPIPLIIRNSLQWINNSTTLQSTPPFTPKPPSIWATNLIKSYFDRGLS 60

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           K ARNLFDEMPERDVVAWTAMIVGFTSCN YTQ+WAVFCEM+RSDI PNAFT+SSVLKAC
Sbjct: 61  KHARNLFDEMPERDVVAWTAMIVGFTSCNDYTQSWAVFCEMLRSDIHPNAFTLSSVLKAC 120

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGMKALSCGTLAH L  KHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW
Sbjct: 121 KGMKALSCGTLAHSLATKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIAGFTHRGDGYSGLQVFRQMLL++VEPNSFSFSIAVRACASI SY+YGKQ+HAAVTK
Sbjct: 181 TTLIAGFTHRGDGYSGLQVFRQMLLDNVEPNSFSFSIAVRACASIGSYAYGKQIHAAVTK 240

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLHSD PV+NSILDMYCRCN LCDAKRCFGEMT++NLITWNTLIAGYERSDSSESLSLF
Sbjct: 241 YGLHSDIPVLNSILDMYCRCNYLCDAKRCFGEMTRRNLITWNTLIAGYERSDSSESLSLF 300

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
           SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKC
Sbjct: 301 SQMGSEGYEPNCFTFTSITAACANLAVLGCGQQVHGGIIRRGFDNSVALVNALIDMYAKC 360

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           GN+NDSHKLFCDMPQRDLVSWTTMMIGYG+HGYGKE IKLFDEMV+SGI+PDRIVFMAV+
Sbjct: 361 GNINDSHKLFCDMPQRDLVSWTTMMIGYGSHGYGKEVIKLFDEMVRSGIQPDRIVFMAVL 420

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
           SACSHAGLV+KGL YFRSMLEDY+LNPD EIYGCVVDLLGRAGRVEEAFQLV+SMPFEPD
Sbjct: 421 SACSHAGLVNKGLSYFRSMLEDYNLNPDHEIYGCVVDLLGRAGRVEEAFQLVESMPFEPD 480

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACKAYEL NLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKW EFAKMRKL
Sbjct: 481 ESVWGALLGACKAYELSNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKL 540

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLD 599
           MKGMDNKKEVGKSWIEIRNEVYSFVVG+KMGPHIE VHKVL+LLIWHMKDDGD+TDLD
Sbjct: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGDKMGPHIECVHKVLKLLIWHMKDDGDVTDLD 598

BLAST of Tan0016733 vs. NCBI nr
Match: XP_022157012.1 (putative pentatricopeptide repeat-containing protein At1g56570 [Momordica charantia])

HSP 1 Score: 1130.9 bits (2924), Expect = 0.0e+00
Identity = 548/600 (91.33%), Positives = 565/600 (94.17%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MSA KL SSTHF PIPL+VRNSLQ +N+STT+Q +PPF+PKGPSIWATNLIKSYFDKGLT
Sbjct: 58  MSANKLASSTHFHPIPLLVRNSLQGVNSSTTIQPHPPFKPKGPSIWATNLIKSYFDKGLT 117

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           KEARNLFDEMPERDVVAWT +IVGFTSCNHY QAWAVFCEMMRS+IEPNAFTMSSVLKA 
Sbjct: 118 KEARNLFDEMPERDVVAWTTLIVGFTSCNHYAQAWAVFCEMMRSEIEPNAFTMSSVLKAS 177

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGM+ALSCG LAHGL  K GID SMYV NALLDMYAT CATMDDALTVFNDIPLKTAVSW
Sbjct: 178 KGMEALSCGALAHGLATKLGIDRSMYVGNALLDMYAT-CATMDDALTVFNDIPLKTAVSW 237

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIA FTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASI S SYGKQ+HAAVTK
Sbjct: 238 TTLIAAFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASIGSSSYGKQIHAAVTK 297

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLHSD PVMNSILDMYCRCNCLCDAKRCFGEMT KNLITWNTLIAGYERSDSSESL LF
Sbjct: 298 YGLHSDVPVMNSILDMYCRCNCLCDAKRCFGEMTVKNLITWNTLIAGYERSDSSESLRLF 357

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
           S MG EGYEPNCFTFTS+TAACANLAVLSCGQQVHGGIVRRGFDKSVAL+NALIDMYAKC
Sbjct: 358 SHMGCEGYEPNCFTFTSVTAACANLAVLSCGQQVHGGIVRRGFDKSVALINALIDMYAKC 417

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           GNVNDSHKLF DM QRDLVSWTTMMIGYGAHGYGKEAIKLFDEMV+S IRPDRIVFMAV+
Sbjct: 418 GNVNDSHKLFSDMTQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVRSRIRPDRIVFMAVL 477

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
           SACSHAGLVDKGL YFRSMLEDY LNPDQEIYGCVVDLLGRAGRVEEAFQL +SMPFEPD
Sbjct: 478 SACSHAGLVDKGLIYFRSMLEDYRLNPDQEIYGCVVDLLGRAGRVEEAFQLAESMPFEPD 537

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACK YEL NLG LAAQRVLD RPNMAGTYLLLSNIYAAEGKW EFAKMRKL
Sbjct: 538 ESVWGALLGACKEYELSNLGNLAAQRVLDMRPNMAGTYLLLSNIYAAEGKWNEFAKMRKL 597

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLDYL 600
           MKGMDNKKEVGKSWIEIRNEVYSFVVG+KMGPHIEWVHKVLELLIWHMKDDGD+TDL YL
Sbjct: 598 MKGMDNKKEVGKSWIEIRNEVYSFVVGDKMGPHIEWVHKVLELLIWHMKDDGDVTDLKYL 656

BLAST of Tan0016733 vs. ExPASy TrEMBL
Match: A0A6J1EZ33 (putative pentatricopeptide repeat-containing protein At1g56570 OS=Cucurbita moschata OX=3662 GN=LOC111437751 PE=4 SV=1)

HSP 1 Score: 1152.9 bits (2981), Expect = 0.0e+00
Identity = 548/598 (91.64%), Positives = 574/598 (95.99%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MSA KL SST F PIPLIVRNSLQWINNSTTLQS PPF PK PSIWATNLIKSYFDKGL+
Sbjct: 1   MSANKLASSTRFHPIPLIVRNSLQWINNSTTLQSTPPFTPKSPSIWATNLIKSYFDKGLS 60

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           K ARNLFDEMPERDVVAWTAMIVGFTSCN YTQ+WAVFCEM+RSDI PNAFT+SSVLKAC
Sbjct: 61  KHARNLFDEMPERDVVAWTAMIVGFTSCNDYTQSWAVFCEMLRSDIHPNAFTLSSVLKAC 120

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGMKALSCGTLAH L  KHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW
Sbjct: 121 KGMKALSCGTLAHSLATKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIAGFTHRGDGYSGLQVFRQMLL++VEPNSFSFSIAVRACASI SY+YGKQ+HAAVTK
Sbjct: 181 TTLIAGFTHRGDGYSGLQVFRQMLLDNVEPNSFSFSIAVRACASIGSYAYGKQIHAAVTK 240

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLHSD PV+NSILDMYCRCNCLCDAKRCFGEMT++NLITWNTLIAGYERSDSSESLSLF
Sbjct: 241 YGLHSDIPVLNSILDMYCRCNCLCDAKRCFGEMTERNLITWNTLIAGYERSDSSESLSLF 300

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
           SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKC
Sbjct: 301 SQMGSEGYEPNCFTFTSITAACANLAVLGCGQQVHGGIIRRGFDNSVALVNALIDMYAKC 360

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           GN+NDSHKLFCDMP+RDLVSWTTMMIGYG+HGYGKE IKLFDEMV+SGI+PDRIVFMAV+
Sbjct: 361 GNINDSHKLFCDMPRRDLVSWTTMMIGYGSHGYGKEVIKLFDEMVRSGIQPDRIVFMAVL 420

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
           SACSHAGLV+KGL YFRSM+EDY+LNPD EIYGCVVDLLGRAGRVEEAFQLV+SMPFEPD
Sbjct: 421 SACSHAGLVNKGLSYFRSMVEDYNLNPDHEIYGCVVDLLGRAGRVEEAFQLVESMPFEPD 480

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACKAYEL NLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKW EFAKMRKL
Sbjct: 481 ESVWGALLGACKAYELSNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKL 540

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLD 599
           MKGMDNKKEVGKSWIEIRNEVYSFVVG+KMGPHIE VHKVL+LL+WHMKDDGD+TDLD
Sbjct: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGDKMGPHIECVHKVLKLLVWHMKDDGDVTDLD 598

BLAST of Tan0016733 vs. ExPASy TrEMBL
Match: A0A6J1JJV4 (putative pentatricopeptide repeat-containing protein At1g56570 OS=Cucurbita maxima OX=3661 GN=LOC111486440 PE=4 SV=1)

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 548/598 (91.64%), Positives = 573/598 (95.82%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MSA KL SST F PIPLI+RNSLQWINNSTTLQS PPF PK PSIWATNLIKSYFD+GL+
Sbjct: 1   MSANKLASSTRFHPIPLIIRNSLQWINNSTTLQSTPPFTPKPPSIWATNLIKSYFDRGLS 60

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           K ARNLFDEMPERDVVAWTAMIVGFTSCN YTQ+WAVFCEM+RSDI PNAFT+SSVLKAC
Sbjct: 61  KHARNLFDEMPERDVVAWTAMIVGFTSCNDYTQSWAVFCEMLRSDIHPNAFTLSSVLKAC 120

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGMKALSCGTLAH L  KHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW
Sbjct: 121 KGMKALSCGTLAHSLATKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIAGFTHRGDGYSGLQVFRQMLL++VEPNSFSFSIAVRACASI SY+YGKQ+HAAVTK
Sbjct: 181 TTLIAGFTHRGDGYSGLQVFRQMLLDNVEPNSFSFSIAVRACASIGSYAYGKQIHAAVTK 240

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLHSD PV+NSILDMYCRCN LCDAKRCFGEMT++NLITWNTLIAGYERSDSSESLSLF
Sbjct: 241 YGLHSDIPVLNSILDMYCRCNYLCDAKRCFGEMTRRNLITWNTLIAGYERSDSSESLSLF 300

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
           SQMG EGYEPNCFTFTSITAACANLAVL CGQQVHGGI+RRGFD SVALVNALIDMYAKC
Sbjct: 301 SQMGSEGYEPNCFTFTSITAACANLAVLGCGQQVHGGIIRRGFDNSVALVNALIDMYAKC 360

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           GN+NDSHKLFCDMPQRDLVSWTTMMIGYG+HGYGKE IKLFDEMV+SGI+PDRIVFMAV+
Sbjct: 361 GNINDSHKLFCDMPQRDLVSWTTMMIGYGSHGYGKEVIKLFDEMVRSGIQPDRIVFMAVL 420

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
           SACSHAGLV+KGL YFRSMLEDY+LNPD EIYGCVVDLLGRAGRVEEAFQLV+SMPFEPD
Sbjct: 421 SACSHAGLVNKGLSYFRSMLEDYNLNPDHEIYGCVVDLLGRAGRVEEAFQLVESMPFEPD 480

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACKAYEL NLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKW EFAKMRKL
Sbjct: 481 ESVWGALLGACKAYELSNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKL 540

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLD 599
           MKGMDNKKEVGKSWIEIRNEVYSFVVG+KMGPHIE VHKVL+LLIWHMKDDGD+TDLD
Sbjct: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGDKMGPHIECVHKVLKLLIWHMKDDGDVTDLD 598

BLAST of Tan0016733 vs. ExPASy TrEMBL
Match: A0A6J1DTG1 (putative pentatricopeptide repeat-containing protein At1g56570 OS=Momordica charantia OX=3673 GN=LOC111023836 PE=4 SV=1)

HSP 1 Score: 1130.9 bits (2924), Expect = 0.0e+00
Identity = 548/600 (91.33%), Positives = 565/600 (94.17%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MSA KL SSTHF PIPL+VRNSLQ +N+STT+Q +PPF+PKGPSIWATNLIKSYFDKGLT
Sbjct: 58  MSANKLASSTHFHPIPLLVRNSLQGVNSSTTIQPHPPFKPKGPSIWATNLIKSYFDKGLT 117

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           KEARNLFDEMPERDVVAWT +IVGFTSCNHY QAWAVFCEMMRS+IEPNAFTMSSVLKA 
Sbjct: 118 KEARNLFDEMPERDVVAWTTLIVGFTSCNHYAQAWAVFCEMMRSEIEPNAFTMSSVLKAS 177

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGM+ALSCG LAHGL  K GID SMYV NALLDMYAT CATMDDALTVFNDIPLKTAVSW
Sbjct: 178 KGMEALSCGALAHGLATKLGIDRSMYVGNALLDMYAT-CATMDDALTVFNDIPLKTAVSW 237

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIA FTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASI S SYGKQ+HAAVTK
Sbjct: 238 TTLIAAFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASIGSSSYGKQIHAAVTK 297

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLHSD PVMNSILDMYCRCNCLCDAKRCFGEMT KNLITWNTLIAGYERSDSSESL LF
Sbjct: 298 YGLHSDVPVMNSILDMYCRCNCLCDAKRCFGEMTVKNLITWNTLIAGYERSDSSESLRLF 357

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
           S MG EGYEPNCFTFTS+TAACANLAVLSCGQQVHGGIVRRGFDKSVAL+NALIDMYAKC
Sbjct: 358 SHMGCEGYEPNCFTFTSVTAACANLAVLSCGQQVHGGIVRRGFDKSVALINALIDMYAKC 417

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           GNVNDSHKLF DM QRDLVSWTTMMIGYGAHGYGKEAIKLFDEMV+S IRPDRIVFMAV+
Sbjct: 418 GNVNDSHKLFSDMTQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVRSRIRPDRIVFMAVL 477

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
           SACSHAGLVDKGL YFRSMLEDY LNPDQEIYGCVVDLLGRAGRVEEAFQL +SMPFEPD
Sbjct: 478 SACSHAGLVDKGLIYFRSMLEDYRLNPDQEIYGCVVDLLGRAGRVEEAFQLAESMPFEPD 537

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACK YEL NLG LAAQRVLD RPNMAGTYLLLSNIYAAEGKW EFAKMRKL
Sbjct: 538 ESVWGALLGACKEYELSNLGNLAAQRVLDMRPNMAGTYLLLSNIYAAEGKWNEFAKMRKL 597

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLDYL 600
           MKGMDNKKEVGKSWIEIRNEVYSFVVG+KMGPHIEWVHKVLELLIWHMKDDGD+TDL YL
Sbjct: 598 MKGMDNKKEVGKSWIEIRNEVYSFVVGDKMGPHIEWVHKVLELLIWHMKDDGDVTDLKYL 656

BLAST of Tan0016733 vs. ExPASy TrEMBL
Match: A0A0A0LW37 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G555630 PE=4 SV=1)

HSP 1 Score: 1098.6 bits (2840), Expect = 0.0e+00
Identity = 522/600 (87.00%), Positives = 560/600 (93.33%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MS  KL SS HF PIPLIVRNSLQWI+NS TLQSNPPF P+GPS+WATNLIKSYFDKGLT
Sbjct: 1   MSVDKLASSPHFHPIPLIVRNSLQWISNS-TLQSNPPFTPEGPSVWATNLIKSYFDKGLT 60

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           +EA NLF+E+PERDVV WTAMIVGFTSCNHY QAW +F EM+RS+++PNAFTMSSVLKAC
Sbjct: 61  REACNLFNEIPERDVVTWTAMIVGFTSCNHYHQAWTMFSEMLRSEVQPNAFTMSSVLKAC 120

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGMKALSCG LAH L  KHGID S+YVQNALLDMYA  CATMDDAL+VFNDIPLKTAVSW
Sbjct: 121 KGMKALSCGALAHSLATKHGIDRSVYVQNALLDMYAASCATMDDALSVFNDIPLKTAVSW 180

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIAGFTHRGDGYSGL  FRQMLLEDV PNSFSFSIA RACASISSYS GKQ+HAAVTK
Sbjct: 181 TTLIAGFTHRGDGYSGLLAFRQMLLEDVGPNSFSFSIAARACASISSYSCGKQIHAAVTK 240

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLH DAPVMNSILDMYCRCN LCDAKRCFGE+T+KNLITWNTLIAGYERSDSSESLSLF
Sbjct: 241 YGLHCDAPVMNSILDMYCRCNYLCDAKRCFGELTEKNLITWNTLIAGYERSDSSESLSLF 300

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
            QMG EGY+PNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDK+VAL+N+LIDMYAKC
Sbjct: 301 FQMGSEGYKPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKNVALINSLIDMYAKC 360

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           G+++DSHKLFCDMP RDLVSWTTMMIGYGAHGYGKEA+KLFDEMVQSGI+PDRIVFM V+
Sbjct: 361 GSISDSHKLFCDMPGRDLVSWTTMMIGYGAHGYGKEAVKLFDEMVQSGIQPDRIVFMGVL 420

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
             CSHAGLVDKGL+YFRSMLEDY++NPDQEIY CVVDLLGRAGRVEEAFQLV++MPFEPD
Sbjct: 421 CGCSHAGLVDKGLKYFRSMLEDYNINPDQEIYRCVVDLLGRAGRVEEAFQLVENMPFEPD 480

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACKAY+L NLG LAAQRVLD RPNMAGTYLLLS IYAAEGKW EFAKMRKL
Sbjct: 481 ESVWGALLGACKAYKLSNLGNLAAQRVLDRRPNMAGTYLLLSKIYAAEGKWGEFAKMRKL 540

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLDYL 600
           MKGM+ KKEVGKSWIEIRNEVYSFVVG KMGPHIEWVHKV+++LIWHMKDDGD+TDLDY+
Sbjct: 541 MKGMNKKKEVGKSWIEIRNEVYSFVVGAKMGPHIEWVHKVIDVLIWHMKDDGDVTDLDYI 599

BLAST of Tan0016733 vs. ExPASy TrEMBL
Match: A0A1S3BQ30 (putative pentatricopeptide repeat-containing protein At1g56570 OS=Cucumis melo OX=3656 GN=LOC103492105 PE=4 SV=1)

HSP 1 Score: 1086.2 bits (2808), Expect = 0.0e+00
Identity = 518/600 (86.33%), Positives = 555/600 (92.50%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWINNSTTLQSNPPFRPKGPSIWATNLIKSYFDKGLT 60
           MS  KL SS HF PIPLIVRNSLQWI+NS TLQSNPPF PKGPS WATNLIKSYFDKGLT
Sbjct: 1   MSVDKLASSPHFHPIPLIVRNSLQWISNS-TLQSNPPFTPKGPSFWATNLIKSYFDKGLT 60

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           +EA NLF+E+PERDVV WTAMIVGFTSCNHY QAW +F EM+RS+++PNAFTMSSVLKAC
Sbjct: 61  REACNLFNEIPERDVVTWTAMIVGFTSCNHYPQAWTMFSEMLRSEVQPNAFTMSSVLKAC 120

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVSW 180
           KGMKALSCG LAH L  K GID S+YVQNALLDMYA  CATMDDAL+VFNDIPLKTAVSW
Sbjct: 121 KGMKALSCGALAHSLATKLGIDRSVYVQNALLDMYAASCATMDDALSVFNDIPLKTAVSW 180

Query: 181 TTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVTK 240
           TTLIAG THRGDGYSGL  FR+MLLEDV PNSFSFSIA RACASISSYS GKQ+HAAVTK
Sbjct: 181 TTLIAGLTHRGDGYSGLLAFRKMLLEDVGPNSFSFSIAARACASISSYSCGKQIHAAVTK 240

Query: 241 YGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSLF 300
           YGLH DAPVMNSILDMYCRCN LCDAKRCF E+T+KNLITWNTLIAGYERSDSSESLSLF
Sbjct: 241 YGLHCDAPVMNSILDMYCRCNYLCDAKRCFDELTEKNLITWNTLIAGYERSDSSESLSLF 300

Query: 301 SQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAKC 360
            QMG EGY+PNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDK+VAL+N+LIDMYAKC
Sbjct: 301 FQMGSEGYKPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKNVALINSLIDMYAKC 360

Query: 361 GNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVV 420
           G++NDSHKLFCDMP RDLVSWTTMMIGYG HGYGKEA+KLFDEMVQSGI+PDRIVFM V+
Sbjct: 361 GSINDSHKLFCDMPGRDLVSWTTMMIGYGTHGYGKEAVKLFDEMVQSGIQPDRIVFMGVL 420

Query: 421 SACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPD 480
           S CSHAGLVD+GL+YFRSMLEDY++NPDQEIY CVVDLLGRAGRVEEAFQLV++MPFEPD
Sbjct: 421 SGCSHAGLVDRGLKYFRSMLEDYNINPDQEIYRCVVDLLGRAGRVEEAFQLVENMPFEPD 480

Query: 481 ESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKL 540
           ESVWGALLGACKA +L NLG LAAQRVL TRPNMAGTYLLLSNIYAAEGKW EFAKMRKL
Sbjct: 481 ESVWGALLGACKACKLSNLGNLAAQRVLGTRPNMAGTYLLLSNIYAAEGKWGEFAKMRKL 540

Query: 541 MKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLDYL 600
           MKGMD KKEVGKSWIEIRNEVYSFVVG KMGPHIEWVHKV+++LIWHMKDDGD+ DL+Y+
Sbjct: 541 MKGMDKKKEVGKSWIEIRNEVYSFVVGAKMGPHIEWVHKVIDVLIWHMKDDGDVADLNYI 599

BLAST of Tan0016733 vs. TAIR 10
Match: AT1G56570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 730.7 bits (1885), Expect = 9.4e-211
Identity = 351/603 (58.21%), Positives = 460/603 (76.29%), Query Frame = 0

Query: 1   MSAKKLTSSTHFQPIPLIVRNSLQWIN-NSTTLQSNPPFRPKGPSIWATNLIKSYFDKGL 60
           MS  KL  S  F+PIP  VR+SL+     S+     PP++PK   I ATNLI SYF+KGL
Sbjct: 1   MSITKLARSNAFKPIPNFVRSSLRNAGVESSQNTEYPPYKPKKHHILATNLIVSYFEKGL 60

Query: 61  TKEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKA 120
            +EAR+LFDEMP+RDVVAWTAMI G+ S N+  +AW  F EM++    PN FT+SSVLK+
Sbjct: 61  VEEARSLFDEMPDRDVVAWTAMITGYASSNYNARAWECFHEMVKQGTSPNEFTLSSVLKS 120

Query: 121 CKGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTAVS 180
           C+ MK L+ G L HG+ +K G++GS+YV NA+++MYATC  TM+ A  +F DI +K  V+
Sbjct: 121 CRNMKVLAYGALVHGVVVKLGMEGSLYVDNAMMNMYATCSVTMEAACLIFRDIKVKNDVT 180

Query: 181 WTTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAAVT 240
           WTTLI GFTH GDG  GL++++QMLLE+ E   +  +IAVRA ASI S + GKQ+HA+V 
Sbjct: 181 WTTLITGFTHLGDGIGGLKMYKQMLLENAEVTPYCITIAVRASASIDSVTTGKQIHASVI 240

Query: 241 KYGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERSDSSESLSL 300
           K G  S+ PVMNSILD+YCRC  L +AK  F EM  K+LITWNTLI+  ERSDSSE+L +
Sbjct: 241 KRGFQSNLPVMNSILDLYCRCGYLSEAKHYFHEMEDKDLITWNTLISELERSDSSEALLM 300

Query: 301 FSQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALIDMYAK 360
           F +   +G+ PNC+TFTS+ AACAN+A L+CGQQ+HG I RRGF+K+V L NALIDMYAK
Sbjct: 301 FQRFESQGFVPNCYTFTSLVAACANIAALNCGQQLHGRIFRRGFNKNVELANALIDMYAK 360

Query: 361 CGNVNDSHKLFCD-MPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMA 420
           CGN+ DS ++F + + +R+LVSWT+MMIGYG+HGYG EA++LFD+MV SGIRPDRIVFMA
Sbjct: 361 CGNIPDSQRVFGEIVDRRNLVSWTSMMIGYGSHGYGAEAVELFDKMVSSGIRPDRIVFMA 420

Query: 421 VVSACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFE 480
           V+SAC HAGLV+KGL+YF  M  +Y +NPD++IY CVVDLLGRAG++ EA++LV+ MPF+
Sbjct: 421 VLSACRHAGLVEKGLKYFNVMESEYGINPDRDIYNCVVDLLGRAGKIGEAYELVERMPFK 480

Query: 481 PDESVWGALLGACKAYELPNL-GKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKM 540
           PDES WGA+LGACKA++   L  +LAA++V++ +P M GTY++LS IYAAEGKW +FA++
Sbjct: 481 PDESTWGAILGACKAHKHNGLISRLAARKVMELKPKMVGTYVMLSYIYAAEGKWVDFARV 540

Query: 541 RKLMKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDL 600
           RK+M+ M NKKE G SWI + N+V+SF V +KM P+   V+ VL LLI   ++ G + +L
Sbjct: 541 RKMMRMMGNKKEAGMSWILVENQVFSFAVSDKMCPNASSVYSVLGLLIEETREAGYVPEL 600

BLAST of Tan0016733 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 412.5 bits (1059), Expect = 5.7e-115
Identity = 213/582 (36.60%), Positives = 332/582 (57.04%), Query Frame = 0

Query: 58  GLTKEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVL 117
           G   EA +LF  MPERD   W +M+ GF   +   +A   F  M +     N ++ +SVL
Sbjct: 100 GFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVL 159

Query: 118 KACKGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTVFNDIPLKTA 177
            AC G+  ++ G   H L  K      +Y+ +AL+DMY+  C  ++DA  VF+++  +  
Sbjct: 160 SACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSK-CGNVNDAQRVFDEMGDRNV 219

Query: 178 VSWTTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSYSYGKQVHAA 237
           VSW +LI  F   G     L VF+ ML   VEP+  + +  + ACAS+S+   G++VH  
Sbjct: 220 VSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGR 279

Query: 238 VTKYG-LHSDAPVMNSILDMYCRCNCLCD------------------------------- 297
           V K   L +D  + N+ +DMY +C+ + +                               
Sbjct: 280 VVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKA 339

Query: 298 AKRCFGEMTQKNLITWNTLIAGY-ERSDSSESLSLFSQMGFEGYEPNCFTFTSITAACAN 357
           A+  F +M ++N+++WN LIAGY +  ++ E+LSLF  +  E   P  ++F +I  ACA+
Sbjct: 340 ARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACAD 399

Query: 358 LAVLSCGQQVHGGIVRRGF------DKSVALVNALIDMYAKCGNVNDSHKLFCDMPQRDL 417
           LA L  G Q H  +++ GF      +  + + N+LIDMY KCG V + + +F  M +RD 
Sbjct: 400 LAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDC 459

Query: 418 VSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIVFMAVVSACSHAGLVDKGLRYFRS 477
           VSW  M+IG+  +GYG EA++LF EM++SG +PD I  + V+SAC HAG V++G  YF S
Sbjct: 460 VSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSS 519

Query: 478 MLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSMPFEPDESVWGALLGACKAYELPN 537
           M  D+ + P ++ Y C+VDLLGRAG +EEA  +++ MP +PD  +WG+LL ACK +    
Sbjct: 520 MTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNIT 579

Query: 538 LGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFAKMRKLMKGMDNKKEVGKSWIEIR 597
           LGK  A+++L+  P+ +G Y+LLSN+YA  GKW +   +RK M+     K+ G SWI+I+
Sbjct: 580 LGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQ 639

Query: 598 NEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMTDLDYL 601
              + F+V +K  P  + +H +L++LI  M+ + D T++  L
Sbjct: 640 GHDHVFMVKDKSHPRKKQIHSLLDILIAEMRPEQDHTEIGSL 680

BLAST of Tan0016733 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 400.2 bits (1027), Expect = 2.9e-111
Identity = 195/550 (35.45%), Positives = 324/550 (58.91%), Query Frame = 0

Query: 48  TNLIKSYFDKGLTKEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIE 107
           T L   Y       EAR +FD MPERD+V+W  ++ G++       A  +   M   +++
Sbjct: 174 TGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLK 233

Query: 108 PNAFTMSSVLKACKGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALT 167
           P+  T+ SVL A   ++ +S G   HG  M+ G D  + +  AL+DMYA  C +++ A  
Sbjct: 234 PSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAK-CGSLETARQ 293

Query: 168 VFNDIPLKTAVSWTTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISS 227
           +F+ +  +  VSW ++I  +    +    + +F++ML E V+P   S   A+ ACA +  
Sbjct: 294 LFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGD 353

Query: 228 YSYGKQVHAAVTKYGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAG 287
              G+ +H    + GL  +  V+NS++ MYC+C  +  A   FG++  + L++WN +I G
Sbjct: 354 LERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILG 413

Query: 288 YERSDSS-ESLSLFSQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKS 347
           + ++    ++L+ FSQM     +P+ FT+ S+  A A L++    + +HG ++R   DK+
Sbjct: 414 FAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKN 473

Query: 348 VALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQ 407
           V +  AL+DMYAKCG +  +  +F  M +R + +W  M+ GYG HG+GK A++LF+EM +
Sbjct: 474 VFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQK 533

Query: 408 SGIRPDRIVFMAVVSACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVE 467
             I+P+ + F++V+SACSH+GLV+ GL+ F  M E+YS+    + YG +VDLLGRAGR+ 
Sbjct: 534 GTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLN 593

Query: 468 EAFQLVKSMPFEPDESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYA 527
           EA+  +  MP +P  +V+GA+LGAC+ ++  N  + AA+R+ +  P+  G ++LL+NIY 
Sbjct: 594 EAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYR 653

Query: 528 AEGKWAEFAKMRKLMKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIW 587
           A   W +  ++R  M     +K  G S +EI+NEV+SF  G+   P  + ++  LE LI 
Sbjct: 654 AASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLIC 713

Query: 588 HMKDDGDMTD 597
           H+K+ G + D
Sbjct: 714 HIKEAGYVPD 722

BLAST of Tan0016733 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 398.3 bits (1022), Expect = 1.1e-110
Identity = 199/546 (36.45%), Positives = 320/546 (58.61%), Query Frame = 0

Query: 49  NLIKSYFDKGLTKEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEP 108
           +L+  Y        AR +FDEM ERDV++W ++I G+ S     +  +VF +M+ S IE 
Sbjct: 235 SLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEI 294

Query: 109 NAFTMSSVLKACKGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCATMDDALTV 168
           +  T+ SV   C   + +S G   H +G+K          N LLDMY+  C  +D A  V
Sbjct: 295 DLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSK-CGDLDSAKAV 354

Query: 169 FNDIPLKTAVSWTTLIAGFTHRGDGYSGLQVFRQMLLEDVEPNSFSFSIAVRACASISSY 228
           F ++  ++ VS+T++IAG+   G     +++F +M  E + P+ ++ +  +  CA     
Sbjct: 355 FREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLL 414

Query: 229 SYGKQVHAAVTKYGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGY 288
             GK+VH  + +  L  D  V N+++DMY +C  + +A+  F EM  K++I+WNT+I GY
Sbjct: 415 DEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGY 474

Query: 289 ERS-DSSESLSLFSQMGFE-GYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKS 348
            ++  ++E+LSLF+ +  E  + P+  T   +  ACA+L+    G+++HG I+R G+   
Sbjct: 475 SKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSD 534

Query: 349 VALVNALIDMYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQ 408
             + N+L+DMYAKCG +  +H LF D+  +DLVSWT M+ GYG HG+GKEAI LF++M Q
Sbjct: 535 RHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQ 594

Query: 409 SGIRPDRIVFMAVVSACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVE 468
           +GI  D I F++++ ACSH+GLVD+G R+F  M  +  + P  E Y C+VD+L R G + 
Sbjct: 595 AGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLI 654

Query: 469 EAFQLVKSMPFEPDESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYA 528
           +A++ +++MP  PD ++WGALL  C+ +    L +  A++V +  P   G Y+L++NIYA
Sbjct: 655 KAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYA 714

Query: 529 AEGKWAEFAKMRKLMKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIW 588
              KW +  ++RK +     +K  G SWIEI+  V  FV G+   P  E +   L  +  
Sbjct: 715 EAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRA 774

Query: 589 HMKDDG 593
            M ++G
Sbjct: 775 RMIEEG 779

BLAST of Tan0016733 vs. TAIR 10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 396.0 bits (1016), Expect = 5.5e-110
Identity = 199/543 (36.65%), Positives = 327/543 (60.22%), Query Frame = 0

Query: 61  KEARNLFDEMPERDVVAWTAMIVGFTSCNHYTQAWAVFCEMMRSDIEPNAFTMSSVLKAC 120
           + A  +FD+M E +VV WT MI          +A   F +M+ S  E + FT+SSV  AC
Sbjct: 220 ENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSGFESDKFTLSSVFSAC 279

Query: 121 KGMKALSCGTLAHGLGMKHGIDGSMYVQNALLDMYATCCA--TMDDALTVFNDIPLKTAV 180
             ++ LS G   H   ++ G+     V+ +L+DMYA C A  ++DD   VF+ +   + +
Sbjct: 280 AELENLSLGKQLHSWAIRSGLVDD--VECSLVDMYAKCSADGSVDDCRKVFDRMEDHSVM 339

Query: 181 SWTTLIAGFTHRGD-GYSGLQVFRQMLLE-DVEPNSFSFSIAVRACASISSYSYGKQVHA 240
           SWT LI G+    +     + +F +M+ +  VEPN F+FS A +AC ++S    GKQV  
Sbjct: 340 SWTALITGYMKNCNLATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLG 399

Query: 241 AVTKYGLHSDAPVMNSILDMYCRCNCLCDAKRCFGEMTQKNLITWNTLIAGYERS-DSSE 300
              K GL S++ V NS++ M+ + + + DA+R F  +++KNL+++NT + G  R+ +  +
Sbjct: 400 QAFKRGLASNSSVANSVISMFVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQ 459

Query: 301 SLSLFSQMGFEGYEPNCFTFTSITAACANLAVLSCGQQVHGGIVRRGFDKSVALVNALID 360
           +  L S++       + FTF S+ +  AN+  +  G+Q+H  +V+ G   +  + NALI 
Sbjct: 460 AFKLLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALIS 519

Query: 361 MYAKCGNVNDSHKLFCDMPQRDLVSWTTMMIGYGAHGYGKEAIKLFDEMVQSGIRPDRIV 420
           MY+KCG+++ + ++F  M  R+++SWT+M+ G+  HG+    ++ F++M++ G++P+ + 
Sbjct: 520 MYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVT 579

Query: 421 FMAVVSACSHAGLVDKGLRYFRSMLEDYSLNPDQEIYGCVVDLLGRAGRVEEAFQLVKSM 480
           ++A++SACSH GLV +G R+F SM ED+ + P  E Y C+VDLL RAG + +AF+ + +M
Sbjct: 580 YVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTM 639

Query: 481 PFEPDESVWGALLGACKAYELPNLGKLAAQRVLDTRPNMAGTYLLLSNIYAAEGKWAEFA 540
           PF+ D  VW   LGAC+ +    LGKLAA+++L+  PN    Y+ LSNIYA  GKW E  
Sbjct: 640 PFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEEST 699

Query: 541 KMRKLMKGMDNKKEVGKSWIEIRNEVYSFVVGNKMGPHIEWVHKVLELLIWHMKDDGDMT 599
           +MR+ MK  +  KE G SWIE+ ++++ F VG+   P+   ++  L+ LI  +K  G + 
Sbjct: 700 EMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYVP 759
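
The percentages in each HSP header follow from the raw counts: identical (or positive-scoring) positions divided by the alignment length. The following is a minimal Python sketch for re-deriving them, assuming the header line has the layout used in this listing. The function name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool.

```python
import re

# Matches e.g. "Identity = 548/598 (91.64%), Positives = 574/598 (95.99%)".
# "Posi?tives" also accepts the "Postives" spelling that some report pages emit.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def parse_hsp_stats(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))

# Header of the AT3G49170.1 hit, copied from the listing above.
line = "Identity = 199/543 (36.65%), Positives = 327/543 (60.22%), Query Frame = 0"
print(parse_hsp_stats(line))
```

Recomputing from the counts rather than trusting the printed percentage is a cheap consistency check when results are copied between reports.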

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9FXA9	1.3e-209	58.21	Putative pentatricopeptide repeat-containing protein At1g56570 OS=Arabidopsis th...
Q9SIT7	8.0e-114	36.60	Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q3E6Q1	4.1e-110	35.45	Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9SN39	1.6e-109	36.45	Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q5G1T1	7.7e-109	36.65	Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop...

Match Name	E-value	Identity	Description
XP_022931578.1	0.0e+00	91.64	putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita moscha...
XP_023520790.1	0.0e+00	91.81	putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita pepo s...
KAG7022386.1	0.0e+00	91.47	putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros...
XP_022989366.1	0.0e+00	91.64	putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita maxima...
XP_022157012.1	0.0e+00	91.33	putative pentatricopeptide repeat-containing protein At1g56570 [Momordica charan...

Match Name	E-value	Identity	Description
A0A6J1EZ33	0.0e+00	91.64	putative pentatricopeptide repeat-containing protein At1g56570 OS=Cucurbita mosc...
A0A6J1JJV4	0.0e+00	91.64	putative pentatricopeptide repeat-containing protein At1g56570 OS=Cucurbita maxi...
A0A6J1DTG1	0.0e+00	91.33	putative pentatricopeptide repeat-containing protein At1g56570 OS=Momordica char...
A0A0A0LW37	0.0e+00	87.00	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G555630 PE=4 SV=1
A0A1S3BQ30	0.0e+00	86.33	putative pentatricopeptide repeat-containing protein At1g56570 OS=Cucumis melo O...

Match Name	E-value	Identity	Description
AT1G56570.1	9.4e-211	58.21	Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G13600.1	5.7e-115	36.60	Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1	2.9e-111	35.45	Pentatricopeptide repeat (PPR) superfamily protein
AT4G18750.1	1.1e-110	36.45	Pentatricopeptide repeat (PPR) superfamily protein
AT3G49170.1	5.5e-110	36.65	Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 228..333, e-value: 1.6E-17, score: 65.4; coord: 131..227, e-value: 5.0E-15, score: 57.3; coord: 40..127, e-value: 8.6E-20, score: 72.8
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 394..570, e-value: 2.5E-30, score: 107.9
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 73..120, e-value: 1.2E-12, score: 47.8; coord: 276..323, e-value: 1.8E-7, score: 31.3; coord: 176..223, e-value: 6.6E-8, score: 32.6; coord: 377..423, e-value: 5.2E-9, score: 36.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 379..412, e-value: 2.4E-8, score: 31.6; coord: 415..448, e-value: 9.4E-4, score: 17.2; coord: 50..76, e-value: 0.0025, score: 15.9; coord: 178..211, e-value: 6.2E-5, score: 20.9; coord: 76..110, e-value: 1.2E-6, score: 26.3; coord: 451..476, e-value: 3.7E-4, score: 18.5; coord: 351..378, e-value: 3.9E-4, score: 18.4
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 451..475, e-value: 0.012, score: 15.8
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 176..210, score: 9.536388
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 377..411, score: 12.550746
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 74..108, score: 10.928473
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 246..280, score: 8.637562
None	No IPR available	PANTHER	PTHR47925	OS01G0913400 PROTEIN-RELATED	coord: 1..599
None	No IPR available	PANTHER	PTHR47925:SF28	BNAA09G15330D PROTEIN	coord: 1..599
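
The `coord: a..b` entries in the InterPro alignment column can be pulled out and ordered along the protein with a short script. The following is a minimal Python sketch, assuming the coordinates are 1-based, inclusive amino-acid positions (an assumption, since the page does not state the convention). The helper name `hit_spans` is hypothetical.

```python
import re

# Captures the start and end of every "coord: a..b" entry in a block of text.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def hit_spans(text):
    """Return sorted (start, end, length) tuples for each 'coord: a..b' in text."""
    spans = []
    for start, end in COORD.findall(text):
        s, e = int(start), int(end)
        # Assumption: both endpoints are included in the hit.
        spans.append((s, e, e - s + 1))
    return sorted(spans)

# The four PROSITE PS51375 (PPR) hits copied from the InterPro table.
prosite = "coord: 176..210 coord: 377..411 coord: 74..108 coord: 246..280"
for start, end, length in hit_spans(prosite):
    print(start, end, length)
```

Under the inclusive-coordinate assumption, each of the four PROSITE hits spans 35 residues.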

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0016733.1	Tan0016733.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0009451	RNA modification
cellular_component	GO:0043231	intracellular membrane-bounded organelle
molecular_function	GO:0005515	protein binding
molecular_function	GO:0003723	RNA binding