Tan0004075 (gene) Snake gourd v1

Overview
NameTan0004075
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG06: 6907860 .. 6909738 (+)
RNA-Seq ExpressionTan0004075
SyntenyTan0004075
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCTAATCTTAAACCAATCTTCATTTTTTTAAGGCCTTTCTGAGTTGTCTCTCAGCCCTGCCCTTCGAAATTACAGTGATTGATTGCTTTCGACTAAATTTTCATTCTTCATCAGTTCATCTCAAGAGCCATTTTTAATCATCAACAATTTAGAGTAATGACACCTTTTGCCGCACAATTCTGGGCTCAATCAAACAGAATTCTCTCTCTTTTCACCGCTTCTCTTTCCCCAATTCACATCCCTCAAATTCAAGCTCAACTCATCGTTCAAAACCTCCATTCAAACCCCACCATAGCCCACCACTTCATAAACACTTGCCACTCTCTTCATCTTCTCAATTCCGCTCTTCTCTTCTTCTTCACCCATTTCCCCAATCCCCACGTCTTCATCTGCAATTCCTTGATCAGAGCCTTCTCTCACTCCAAAATCCCTCATACCCCTCTTTCCATTTACGCCCACATGACCAGAAACTCGATTCTTCCCAACAATTTCACCTTCCCTTTCCTTCTCAAGTCCTTGGCTGACTTCAACGACCTTGTGGGTGGACAATCTGTTCATACCCATGTTGTGAAACTGGGTTATGTTTCTGATGTTTATGTGCAGAACTCGTTGATGGACGTTTATGCGTCGTGTGGGAGAATGGGGTTATGCAAGAAGGTGTTCGACGAAATGCCTCAAAGAGATGTTGTGTCGTGGACTGTTTTGATTATGGGTTATCGAGTTGCTTTGATGTTTGATGATGCTTTGATTGCGTTTGAGCAGATGCAATATGCAGGCGTGGAGCCTAACCGTGTGACGATGGTGAATGCATTAGCTGCTTGTGCGAGCTTTGGAGCCATTGAAATGGGTGTTTGGATACATGAGTTTGTGAAGAAAAAAGGGTGGGAAGTGGATGTGATTTTGGGAACTTCTTTGATTGATATGTATGGGAAATGTGGGAGAATCAAAGAGGGATTGGCTGTTTTCCAAGCCATGAAAGAGAAGAATGTGTATACATGGAATGCACTCATTAAGGGGCTGGCTTTAGCCAAGAGTGGAGAGGAGGCCATTGCTTGGTTTAAGAGAATGGATGAAGAAGGAGTTGAGGCAGATGATGTGACATTAGTGGCAGTGCTTTGTGCTTGTAGCCACTCTGGTTTGGTCGACATGGGGAGGCAGATCTTTCGTTCGTTGATCGATGGGAGGTTCCGGTTTTCTCCAGGAATCAAACATTATTCATGTATGGCAGATCTATTGGCTCGTTGTGGGTGTATTGAAGAGGCTTTTGAGTTGATAAAGAATATGCCTTTTGATGCTACCAAAGCAATGTGGGGGTCTTTGCTAGCTGGTGGCAGAGCTCATGGGAGCTTGGAAGTGAGTGAGTTTGCAGCAAGGAAACTTGTTGAAATGGAACCAGAGAATGGTGCTTATTATGCTGTGTTATCTAATATTTGTGCAGAGATTGGAAAATGGAGTGAGGTTGAGAAAGTCAGAGAGATCATGAAAGAGAGAGGACTGAAGAAAGACTTGGGGTCGAGTTCTGTTGAAGAAGCTGTTTATGAATCGTTTATGCCTACAGAAACTGCCCATTTCTCATGATATCAAAATTCAGAAGAAGGACTCGCATCTCTGCAAACTCACATTTTAAAAGCTTACAAGTTTTGGGAGTTATAATCATCATAGAGGCAATGCATGTAAGTTAGGGACTGATAAATTATTGGAAGTGACAGGACTGTTAGATGCATGGTTAAAGGATATCTTTTAATCTCTTGACTATGCATTCTCGAAGTCACGAGACTGACAGCTACATTGAATAACTTATCGAACCTAGACCAGTTACAGAGATCCATTGTTGGCTTCCTCAGTTTTATTAGTGATGGAGATGCTGCATTCAACAAC

mRNA sequence

CTCTAATCTTAAACCAATCTTCATTTTTTTAAGGCCTTTCTGAGTTGTCTCTCAGCCCTGCCCTTCGAAATTACAGTGATTGATTGCTTTCGACTAAATTTTCATTCTTCATCAGTTCATCTCAAGAGCCATTTTTAATCATCAACAATTTAGAGTAATGACACCTTTTGCCGCACAATTCTGGGCTCAATCAAACAGAATTCTCTCTCTTTTCACCGCTTCTCTTTCCCCAATTCACATCCCTCAAATTCAAGCTCAACTCATCGTTCAAAACCTCCATTCAAACCCCACCATAGCCCACCACTTCATAAACACTTGCCACTCTCTTCATCTTCTCAATTCCGCTCTTCTCTTCTTCTTCACCCATTTCCCCAATCCCCACGTCTTCATCTGCAATTCCTTGATCAGAGCCTTCTCTCACTCCAAAATCCCTCATACCCCTCTTTCCATTTACGCCCACATGACCAGAAACTCGATTCTTCCCAACAATTTCACCTTCCCTTTCCTTCTCAAGTCCTTGGCTGACTTCAACGACCTTGTGGGTGGACAATCTGTTCATACCCATGTTGTGAAACTGGGTTATGTTTCTGATGTTTATGTGCAGAACTCGTTGATGGACGTTTATGCGTCGTGTGGGAGAATGGGGTTATGCAAGAAGGTGTTCGACGAAATGCCTCAAAGAGATGTTGTGTCGTGGACTGTTTTGATTATGGGTTATCGAGTTGCTTTGATGTTTGATGATGCTTTGATTGCGTTTGAGCAGATGCAATATGCAGGCGTGGAGCCTAACCGTGTGACGATGGTGAATGCATTAGCTGCTTGTGCGAGCTTTGGAGCCATTGAAATGGGTGTTTGGATACATGAGTTTGTGAAGAAAAAAGGGTGGGAAGTGGATGTGATTTTGGGAACTTCTTTGATTGATATGTATGGGAAATGTGGGAGAATCAAAGAGGGATTGGCTGTTTTCCAAGCCATGAAAGAGAAGAATGTGTATACATGGAATGCACTCATTAAGGGGCTGGCTTTAGCCAAGAGTGGAGAGGAGGCCATTGCTTGGTTTAAGAGAATGGATGAAGAAGGAGTTGAGGCAGATGATGTGACATTAGTGGCAGTGCTTTGTGCTTGTAGCCACTCTGGTTTGGTCGACATGGGGAGGCAGATCTTTCGTTCGTTGATCGATGGGAGGTTCCGGTTTTCTCCAGGAATCAAACATTATTCATGTATGGCAGATCTATTGGCTCGTTGTGGGTGTATTGAAGAGGCTTTTGAGTTGATAAAGAATATGCCTTTTGATGCTACCAAAGCAATGTGGGGGTCTTTGCTAGCTGGTGGCAGAGCTCATGGGAGCTTGGAAGTGAGTGAGTTTGCAGCAAGGAAACTTGTTGAAATGGAACCAGAGAATGGTGCTTATTATGCTGTGTTATCTAATATTTGTGCAGAGATTGGAAAATGGAGTGAGGTTGAGAAAGTCAGAGAGATCATGAAAGAGAGAGGACTGAAGAAAGACTTGGGGTCGAGTTCTGTTGAAGAAGCTGTTTATGAATCGTTTATGCCTACAGAAACTGCCCATTTCTCATGATATCAAAATTCAGAAGAAGGACTCGCATCTCTGCAAACTCACATTTTAAAAGCTTACAAGTTTTGGGAGTTATAATCATCATAGAGGCAATGCATGTAAGTTAGGGACTGATAAATTATTGGAAGTGACAGGACTGTTAGATGCATGGTTAAAGGATATCTTTTAATCTCTTGACTATGCATTCTCGAAGTCACGAGACTGACAGCTACATTGAATAACTTATCGAACCTAGACCAGTTACAGAGATCCATTGTTGGCTTCCTCAGTTTTATTAGTGATGGAGATGCTGCATTCAACAAC

Coding sequence (CDS)

ATGACACCTTTTGCCGCACAATTCTGGGCTCAATCAAACAGAATTCTCTCTCTTTTCACCGCTTCTCTTTCCCCAATTCACATCCCTCAAATTCAAGCTCAACTCATCGTTCAAAACCTCCATTCAAACCCCACCATAGCCCACCACTTCATAAACACTTGCCACTCTCTTCATCTTCTCAATTCCGCTCTTCTCTTCTTCTTCACCCATTTCCCCAATCCCCACGTCTTCATCTGCAATTCCTTGATCAGAGCCTTCTCTCACTCCAAAATCCCTCATACCCCTCTTTCCATTTACGCCCACATGACCAGAAACTCGATTCTTCCCAACAATTTCACCTTCCCTTTCCTTCTCAAGTCCTTGGCTGACTTCAACGACCTTGTGGGTGGACAATCTGTTCATACCCATGTTGTGAAACTGGGTTATGTTTCTGATGTTTATGTGCAGAACTCGTTGATGGACGTTTATGCGTCGTGTGGGAGAATGGGGTTATGCAAGAAGGTGTTCGACGAAATGCCTCAAAGAGATGTTGTGTCGTGGACTGTTTTGATTATGGGTTATCGAGTTGCTTTGATGTTTGATGATGCTTTGATTGCGTTTGAGCAGATGCAATATGCAGGCGTGGAGCCTAACCGTGTGACGATGGTGAATGCATTAGCTGCTTGTGCGAGCTTTGGAGCCATTGAAATGGGTGTTTGGATACATGAGTTTGTGAAGAAAAAAGGGTGGGAAGTGGATGTGATTTTGGGAACTTCTTTGATTGATATGTATGGGAAATGTGGGAGAATCAAAGAGGGATTGGCTGTTTTCCAAGCCATGAAAGAGAAGAATGTGTATACATGGAATGCACTCATTAAGGGGCTGGCTTTAGCCAAGAGTGGAGAGGAGGCCATTGCTTGGTTTAAGAGAATGGATGAAGAAGGAGTTGAGGCAGATGATGTGACATTAGTGGCAGTGCTTTGTGCTTGTAGCCACTCTGGTTTGGTCGACATGGGGAGGCAGATCTTTCGTTCGTTGATCGATGGGAGGTTCCGGTTTTCTCCAGGAATCAAACATTATTCATGTATGGCAGATCTATTGGCTCGTTGTGGGTGTATTGAAGAGGCTTTTGAGTTGATAAAGAATATGCCTTTTGATGCTACCAAAGCAATGTGGGGGTCTTTGCTAGCTGGTGGCAGAGCTCATGGGAGCTTGGAAGTGAGTGAGTTTGCAGCAAGGAAACTTGTTGAAATGGAACCAGAGAATGGTGCTTATTATGCTGTGTTATCTAATATTTGTGCAGAGATTGGAAAATGGAGTGAGGTTGAGAAAGTCAGAGAGATCATGAAAGAGAGAGGACTGAAGAAAGACTTGGGGTCGAGTTCTGTTGAAGAAGCTGTTTATGAATCGTTTATGCCTACAGAAACTGCCCATTTCTCATGA

Protein sequence

MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKSLADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSWTVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKKKGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADLLARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYYAVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVEEAVYESFMPTETAHFS
Homology
BLAST of Tan0004075 vs. ExPASy Swiss-Prot
Match: Q9FMA1 (Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E13 PE=2 SV=1)

HSP 1 Score: 308.1 bits (788), Expect = 1.7e-82
Identity = 172/466 (36.91%), Postives = 256/466 (54.94%), Query Frame = 0

Query: 28  IPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALLFFFTHFPNPHVFICNSLIRAFS 87
           + Q    +I+  L+ +      FI  C +   L  A    FTH P P+ ++ N++IRA S
Sbjct: 31  LKQSHCYMIITGLNRDNLNVAKFIEACSNAGHLRYA-YSVFTHQPCPNTYLHNTMIRALS 90

Query: 88  HSKIPHT---PLSIYAHMTRNSILPNNFTFPFLLKSLADFNDLVGGQSVHTHVVKLGYVS 147
               P+     +++Y  +      P+ FTFPF+LK     +D+  G+ +H  VV  G+ S
Sbjct: 91  LLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKIAVRVSDVWFGRQIHGQVVVFGFDS 150

Query: 148 DVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDV--------------------------- 207
            V+V   L+ +Y SCG +G  +K+FDEM  +DV                           
Sbjct: 151 SVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGYGKVGEMDEARSLLEMMP 210

Query: 208 ------VSWTVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMG 267
                 VSWT +I GY  +    +A+  F++M    VEP+ VT++  L+ACA  G++E+G
Sbjct: 211 CWVRNEVSWTCVISGYAKSGRASEAIEVFQRMLMENVEPDEVTLLAVLSACADLGSLELG 270

Query: 268 VWIHEFVKKKGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALA 327
             I  +V  +G    V L  ++IDMY K G I + L VF+ + E+NV TW  +I GLA  
Sbjct: 271 ERICSYVDHRGMNRAVSLNNAVIDMYAKSGNITKALDVFECVNERNVVTWTTIIAGLATH 330

Query: 328 KSGEEAIAWFKRMDEEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIK 387
             G EA+A F RM + GV  +DVT +A+L ACSH G VD+G+++F S+   ++   P I+
Sbjct: 331 GHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSM-RSKYGIHPNIE 390

Query: 388 HYSCMADLLARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEM 447
           HY CM DLL R G + EA E+IK+MPF A  A+WGSLLA    H  LE+ E A  +L+++
Sbjct: 391 HYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVHHDLELGERALSELIKL 450

Query: 448 EPENGAYYAVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           EP N   Y +L+N+ + +G+W E   +R +MK  G+KK  G SS+E
Sbjct: 451 EPNNSGNYMLLANLYSNLGRWDESRMMRNMMKGIGVKKMAGESSIE 494

BLAST of Tan0004075 vs. ExPASy Swiss-Prot
Match: Q9SN85 (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 300.4 bits (768), Expect = 3.5e-80
Identity = 164/467 (35.12%), Postives = 260/467 (55.67%), Query Frame = 0

Query: 13  NRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALLF---FFT 72
           + +LSL  +S   +H+ QI A L+  +L  N  + HHF++   +L L+   + +    F+
Sbjct: 12  DHLLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRL-ALSLIPRDINYSCRVFS 71

Query: 73  HFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILP-NNFTFPFLLKSLADFNDLV 132
              NP +  CN++IRAFS S+ P     ++  + RNS LP N  +  F LK      DL+
Sbjct: 72  QRLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLL 131

Query: 133 GGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSWTVLIMGYR 192
           GG  +H  +   G++SD  +  +LMD+Y++C       KVFDE+P+RD VSW VL   Y 
Sbjct: 132 GGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYL 191

Query: 193 VALMFDDALIAFEQMQY---AGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKKKGWEV 252
                 D L+ F++M+      V+P+ VT + AL ACA+ GA++ G  +H+F+ + G   
Sbjct: 192 RNKRTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSG 251

Query: 253 DVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMD 312
            + L  +L+ MY +CG + +   VF  M+E+NV +W ALI GLA+   G+EAI  F  M 
Sbjct: 252 ALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEML 311

Query: 313 EEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADLLARCGC 372
           + G+  ++ TL  +L ACSHSGLV  G   F  +  G F+  P + HY C+ DLL R   
Sbjct: 312 KFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARL 371

Query: 373 IEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYYAVLSNI 432
           +++A+ LIK+M       +W +LL   R HG +E+ E     L+E++ E    Y +L N 
Sbjct: 372 LDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNT 431

Query: 433 CAEIGKWSEVEKVREIMKERGLKKDLGSSSVE-EAVYESFMPTETAH 472
            + +GKW +V ++R +MKE+ +    G S++E +     F+  + +H
Sbjct: 432 YSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSH 477

BLAST of Tan0004075 vs. ExPASy Swiss-Prot
Match: P93011 (Pentatricopeptide repeat-containing protein At2g33760 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H6 PE=3 SV=1)

HSP 1 Score: 292.4 bits (747), Expect = 9.5e-78
Identity = 151/436 (34.63%), Postives = 258/436 (59.17%), Query Frame = 0

Query: 28  IPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALLFFFTHFPNPHVFICNSLIRAFS 87
           + Q+ A LIV     + ++    I    S   +    L F +  P P  F+ NS+I++ S
Sbjct: 25  LQQVHAHLIVTGYGRSRSLLTKLITLACSARAIAYTHLLFLS-VPLPDDFLFNSVIKSTS 84

Query: 88  HSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKSLADFNDLVGGQSVHTHVVKLGYVSDVY 147
             ++P   ++ Y  M  +++ P+N+TF  ++KS AD + L  G+ VH H V  G+  D Y
Sbjct: 85  KLRLPLHCVAYYRRMLSSNVSPSNYTFTSVIKSCADLSALRIGKGVHCHAVVSGFGLDTY 144

Query: 148 VQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSWTVLIMGYRVALMFDDALIAFEQMQYAG 207
           VQ +L+  Y+ CG M   ++VFD MP++ +V+W  L+ G+    + D+A+  F QM+ +G
Sbjct: 145 VQAALVTFYSKCGDMEGARQVFDRMPEKSIVAWNSLVSGFEQNGLADEAIQVFYQMRESG 204

Query: 208 VEPNRVTMVNALAACASFGAIEMGVWIHEFVKKKGWEVDVILGTSLIDMYGKCGRIKEGL 267
            EP+  T V+ L+ACA  GA+ +G W+H+++  +G +++V LGT+LI++Y +CG + +  
Sbjct: 205 FEPDSATFVSLLSACAQTGAVSLGSWVHQYIISEGLDLNVKLGTALINLYSRCGDVGKAR 264

Query: 268 AVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEE-GVEADDVTLVAVLCACSHS 327
            VF  MKE NV  W A+I        G++A+  F +M+++ G   ++VT VAVL AC+H+
Sbjct: 265 EVFDKMKETNVAAWTAMISAYGTHGYGQQAVELFNKMEDDCGPIPNNVTFVAVLSACAHA 324

Query: 328 GLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADLLARCGCIEEAFELIKNMPFDAT----- 387
           GLV+ GR +++ +    +R  PG++H+ CM D+L R G ++EA++ I  +  DAT     
Sbjct: 325 GLVEEGRSVYKRMTKS-YRLIPGVEHHVCMVDMLGRAGFLDEAYKFIHQL--DATGKATA 384

Query: 388 KAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYYAVLSNICAEIGKWSEVEKVREI 447
            A+W ++L   + H + ++    A++L+ +EP+N  ++ +LSNI A  GK  EV  +R+ 
Sbjct: 385 PALWTAMLGACKMHRNYDLGVEIAKRLIALEPDNPGHHVMLSNIYALSGKTDEVSHIRDG 444

Query: 448 MKERGLKKDLGSSSVE 458
           M    L+K +G S +E
Sbjct: 445 MMRNNLRKQVGYSVIE 456

BLAST of Tan0004075 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 291.6 bits (745), Expect = 1.6e-77
Identity = 168/480 (35.00%), Postives = 252/480 (52.50%), Query Frame = 0

Query: 28  IPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALL----FFFTHFPNPHVFICNSLI 87
           + QI A+++   L  +      F++ C  +   +S  L      F  F  P  F+ N +I
Sbjct: 30  LKQIHARMLKTGLMQDSYAITKFLSFC--ISSTSSDFLPYAQIVFDGFDRPDTFLWNLMI 89

Query: 88  RAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKSLADFNDLVGGQSVHTHVVKLGYV 147
           R FS S  P   L +Y  M  +S   N +TFP LLK+ ++ +       +H  + KLGY 
Sbjct: 90  RGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQITKLGYE 149

Query: 148 SDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRD--------------------------- 207
           +DVY  NSL++ YA  G   L   +FD +P+ D                           
Sbjct: 150 NDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALTLFRKM 209

Query: 208 ----VVSWTVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGV 267
                +SWT +I GY  A M  +AL  F +MQ + VEP+ V++ NAL+ACA  GA+E G 
Sbjct: 210 AEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQLGALEQGK 269

Query: 268 WIHEFVKKKGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAK 327
           WIH ++ K    +D +LG  LIDMY KCG ++E L VF+ +K+K+V  W ALI G A   
Sbjct: 270 WIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGYAYHG 329

Query: 328 SGEEAIAWFKRMDEEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKH 387
            G EAI+ F  M + G++ + +T  AVL ACS++GLV+ G+ IF S+ +  +   P I+H
Sbjct: 330 HGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSM-ERDYNLKPTIEH 389

Query: 388 YSCMADLLARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEME 447
           Y C+ DLL R G ++EA   I+ MP      +WG+LL   R H ++E+ E     L+ ++
Sbjct: 390 YGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGEILIAID 449

Query: 448 PENGAYYAVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE-EAVYESFMPTETAH 472
           P +G  Y   +NI A   KW +  + R +MKE+G+ K  G S++  E     F+  + +H
Sbjct: 450 PYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFLAGDRSH 506

BLAST of Tan0004075 vs. ExPASy Swiss-Prot
Match: O80488 (Pentatricopeptide repeat-containing protein At1g09190 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E70 PE=2 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 3.6e-77
Identity = 168/476 (35.29%), Postives = 265/476 (55.67%), Query Frame = 0

Query: 14  RILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALLFFFTHFPN 73
           ++L L     +   +P+I A L+   LH +  +  HFI+ C SL   + A    F+H  N
Sbjct: 6   KLLRLLHGHNTRTRLPEIHAHLLRHFLHGSNLLLAHFISICGSLSNSDYANR-VFSHIQN 65

Query: 74  PHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKSLADFNDLVGGQSV 133
           P+V + N++I+ +S    P   LS ++ M    I  + +T+  LLKS +  +DL  G+ V
Sbjct: 66  PNVLVFNAMIKCYSLVGPPLESLSFFSSMKSRGIWADEYTYAPLLKSCSSLSDLRFGKCV 125

Query: 134 HTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSWTVLIMGY------ 193
           H  +++ G+     ++  ++++Y S GRMG  +KVFDEM +R+VV W ++I G+      
Sbjct: 126 HGELIRTGFHRLGKIRIGVVELYTSGGRMGDAQKVFDEMSERNVVVWNLMIRGFCDSGDV 185

Query: 194 -RVALMFD------------------------DALIAFEQMQYAGVEPNRVTMVNALAAC 253
            R   +F                         +AL  F +M   G +P+  T+V  L   
Sbjct: 186 ERGLHLFKQMSERSIVSWNSMISSLSKCGRDREALELFCEMIDQGFDPDEATVVTVLPIS 245

Query: 254 ASFGAIEMGVWIHEFVKKKGWEVDVI-LGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTW 313
           AS G ++ G WIH   +  G   D I +G +L+D Y K G ++   A+F+ M+ +NV +W
Sbjct: 246 ASLGVLDTGKWIHSTAESSGLFKDFITVGNALVDFYCKSGDLEAATAIFRKMQRRNVVSW 305

Query: 314 NALIKGLALAKSGEEAIAWFKRMDEEG-VEADDVTLVAVLCACSHSGLVDMGRQIFRSLI 373
           N LI G A+   GE  I  F  M EEG V  ++ T + VL  CS++G V+ G ++F  ++
Sbjct: 306 NTLISGSAVNGKGEFGIDLFDAMIEEGKVAPNEATFLGVLACCSYTGQVERGEELFGLMM 365

Query: 374 DGRFRFSPGIKHYSCMADLLARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEV 433
           + RF+     +HY  M DL++R G I EAF+ +KNMP +A  AMWGSLL+  R+HG +++
Sbjct: 366 E-RFKLEARTEHYGAMVDLMSRSGRITEAFKFLKNMPVNANAAMWGSLLSACRSHGDVKL 425

Query: 434 SEFAARKLVEMEPENGAYYAVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSV 457
           +E AA +LV++EP N   Y +LSN+ AE G+W +VEKVR +MK+  L+K  G S++
Sbjct: 426 AEVAAMELVKIEPGNSGNYVLLSNLYAEEGRWQDVEKVRTLMKKNRLRKSTGQSTI 479

BLAST of Tan0004075 vs. NCBI nr
Match: XP_038898992.1 (pentatricopeptide repeat-containing protein At5g56310-like [Benincasa hispida])

HSP 1 Score: 827.8 bits (2137), Expect = 4.8e-236
Identity = 407/457 (89.06%), Postives = 428/457 (93.65%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           MTPF AQF A+SNRILS+FT SLSPIHIPQIQAQLI+QNLHSNPTIAHHFINTCH L LL
Sbjct: 1   MTPFTAQFRAESNRILSVFTTSLSPIHIPQIQAQLILQNLHSNPTIAHHFINTCHHLQLL 60

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SALL FF H P PHVF+CNSLIRAFSHSKIPHTPLSIY HM RNSI PNN+TFPFLLKS
Sbjct: 61  DSALL-FFNHIPKPHVFVCNSLIRAFSHSKIPHTPLSIYTHMNRNSIFPNNYTFPFLLKS 120

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           LADFNDLV GQSVHTHV+KLGYV+DVYVQNSLMDVYASCG+MGLCKKVFDEMPQRDVVSW
Sbjct: 121 LADFNDLVSGQSVHTHVLKLGYVADVYVQNSLMDVYASCGKMGLCKKVFDEMPQRDVVSW 180

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           TVLIMGYRV+LMFDDALIAFEQMQYAGVEPNRVTMVNALAACA+FGAIEMGVWIHEFVK+
Sbjct: 181 TVLIMGYRVSLMFDDALIAFEQMQYAGVEPNRVTMVNALAACANFGAIEMGVWIHEFVKR 240

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           KGWE+DVILGTSLIDMYGKCGRIKEGL VFQAMKEKNVYTWNALIKGLALAKSGEEAIAW
Sbjct: 241 KGWEMDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300

Query: 301 FKRMDEEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADLL 360
           F RMDEEGVE D+VTLVAVLCACSHSGLVDMG+QIF+SL D RF FSPGIKHYSCM DLL
Sbjct: 301 FNRMDEEGVEPDEVTLVAVLCACSHSGLVDMGKQIFQSLTDRRFGFSPGIKHYSCMVDLL 360

Query: 361 ARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYYA 420
           AR GCIE AF LIK+MPF+ATKAMWGSLLAG RAHG LEVSE AA+KLVEMEPENGAYY 
Sbjct: 361 ARYGCIEAAFVLIKDMPFEATKAMWGSLLAGSRAHGGLEVSEIAAKKLVEMEPENGAYYV 420

Query: 421 VLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           VLSNI AE+ KWSEVEKVRE+MKERGLKKDLGSSSVE
Sbjct: 421 VLSNIYAEMEKWSEVEKVREMMKERGLKKDLGSSSVE 456

BLAST of Tan0004075 vs. NCBI nr
Match: XP_023548659.1 (pentatricopeptide repeat-containing protein At5g56310-like [Cucurbita pepo subsp. pepo] >XP_023548660.1 pentatricopeptide repeat-containing protein At5g56310-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 827.0 bits (2135), Expect = 8.2e-236
Identity = 407/458 (88.86%), Postives = 429/458 (93.67%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           M  F  Q WA+SNRILSLF  SLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCH L LL
Sbjct: 1   MAAFGVQMWAESNRILSLFATSLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHDLRLL 60

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SALL FFT  P PHVF+CNSLIRAFSHSKIPHTPLSIYAHM RNSILPNN+TFPFLLKS
Sbjct: 61  DSALL-FFTQIPKPHVFVCNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFPFLLKS 120

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           LADFN LV GQSVH HVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMP RDVVSW
Sbjct: 121 LADFNHLVSGQSVHAHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPLRDVVSW 180

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMG+WIHEFVK+
Sbjct: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGIWIHEFVKR 240

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           +GWE+DVILGTSLIDMYGKCGRI+EGL VFQAMK+KNVYTWNALI GLALAKSGEEAIAW
Sbjct: 241 RGWEMDVILGTSLIDMYGKCGRIEEGLVVFQAMKDKNVYTWNALINGLALAKSGEEAIAW 300

Query: 301 FKRMDEE-GVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADL 360
           FKRMDE+ GV+AD+VTLVAVLCACSHSGLVD+GRQIF S+IDG+F FSPGI+HYSCM DL
Sbjct: 301 FKRMDEDGGVKADEVTLVAVLCACSHSGLVDIGRQIFGSMIDGKFGFSPGIQHYSCMVDL 360

Query: 361 LARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYY 420
           LAR GCIEE+FELIKNMPFDATKAMWGSLLAG RA GSLE+SEFAARKLVEMEPENGAYY
Sbjct: 361 LARSGCIEESFELIKNMPFDATKAMWGSLLAGSRAQGSLEMSEFAARKLVEMEPENGAYY 420

Query: 421 AVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           AVLSNICAE+GKW+EVEKVR+IMK  GLKKDLGSSSVE
Sbjct: 421 AVLSNICAEMGKWNEVEKVRKIMKVEGLKKDLGSSSVE 457

BLAST of Tan0004075 vs. NCBI nr
Match: KAG6575842.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7014378.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 823.9 bits (2127), Expect = 6.9e-235
Identity = 407/458 (88.86%), Postives = 427/458 (93.23%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           M  F  Q WA+SNRILSLF  SLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCH L LL
Sbjct: 1   MAAFGVQMWAESNRILSLFATSLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHDLRLL 60

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SAL  FFT  P PHVF+CNSLIRAFSHSKIP TPLSIYAHM RNSILPNN+TFPFLLKS
Sbjct: 61  DSALR-FFTQIPKPHVFVCNSLIRAFSHSKIPRTPLSIYAHMNRNSILPNNYTFPFLLKS 120

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           LADFN LV GQSVH HVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMP RDVVSW
Sbjct: 121 LADFNHLVSGQSVHAHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPLRDVVSW 180

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVK+
Sbjct: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKR 240

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           +GWE+DVILGTSLIDMYGKCGRI+EGL VFQAMK+KNVYTWNALI GLALAKSGEEAIAW
Sbjct: 241 RGWEMDVILGTSLIDMYGKCGRIEEGLVVFQAMKDKNVYTWNALINGLALAKSGEEAIAW 300

Query: 301 FKRMDEE-GVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADL 360
           FKRM+EE G+EAD+VTLVAVLCACSHSGLVD+GRQIF S+IDG+F FSPGI+HYSCM DL
Sbjct: 301 FKRMNEEGGIEADEVTLVAVLCACSHSGLVDIGRQIFGSMIDGKFGFSPGIQHYSCMVDL 360

Query: 361 LARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYY 420
           LAR GCIEE+FELIKNMPFDATKAMWGSLLAG RA GSLE+SEFAARKLVEMEPENGAYY
Sbjct: 361 LARSGCIEESFELIKNMPFDATKAMWGSLLAGSRAQGSLEMSEFAARKLVEMEPENGAYY 420

Query: 421 AVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           AVLSNICAE+GKW+EVEKVREIMK  GLKKDLGSSSVE
Sbjct: 421 AVLSNICAEMGKWNEVEKVREIMKVEGLKKDLGSSSVE 457

BLAST of Tan0004075 vs. NCBI nr
Match: XP_022991568.1 (pentatricopeptide repeat-containing protein At5g56310-like [Cucurbita maxima])

HSP 1 Score: 820.8 bits (2119), Expect = 5.9e-234
Identity = 406/458 (88.65%), Postives = 424/458 (92.58%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           M  F  Q WA+SNRILSLF  SLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCH L LL
Sbjct: 1   MAAFGVQMWAESNRILSLFATSLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHHLRLL 60

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SAL  FFT  P PHVF+CNSLIRAFSHSKIPHTPLSIYAHM R SILPNN+TFPFLLKS
Sbjct: 61  DSALR-FFTQIPKPHVFVCNSLIRAFSHSKIPHTPLSIYAHMNRTSILPNNYTFPFLLKS 120

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           LADFN LV GQSVH HVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMP RDVVSW
Sbjct: 121 LADFNHLVSGQSVHAHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPLRDVVSW 180

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIH FVK+
Sbjct: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHAFVKR 240

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           +GWE+DVILGTSLIDMYGKCGRI+EGL VFQAMK+KNVYTWNALI GLALAKSGEEAIAW
Sbjct: 241 RGWEMDVILGTSLIDMYGKCGRIEEGLVVFQAMKDKNVYTWNALINGLALAKSGEEAIAW 300

Query: 301 FKRMDE-EGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADL 360
           FKRMDE  GVEAD+VTLV VLCACSHSGLVD+GRQIF S+IDG+F FSPGI+HYSCM DL
Sbjct: 301 FKRMDEGGGVEADEVTLVTVLCACSHSGLVDIGRQIFGSMIDGKFGFSPGIQHYSCMVDL 360

Query: 361 LARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYY 420
           LAR GCIEE+FELIKNMPFDATKAMWGSLLAG RA GSLE+SEFAARKLVEMEPENGAYY
Sbjct: 361 LARSGCIEESFELIKNMPFDATKAMWGSLLAGSRAQGSLEISEFAARKLVEMEPENGAYY 420

Query: 421 AVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           AVLSNICAE+GKW+EVEKVREIMK  GLKKDLGSSSVE
Sbjct: 421 AVLSNICAEMGKWNEVEKVREIMKVEGLKKDLGSSSVE 457

BLAST of Tan0004075 vs. NCBI nr
Match: XP_022953836.1 (pentatricopeptide repeat-containing protein At5g56310-like [Cucurbita moschata])

HSP 1 Score: 816.6 bits (2108), Expect = 1.1e-232
Identity = 405/458 (88.43%), Postives = 425/458 (92.79%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           M  F  Q WA+SNRILSLF  SLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCH L LL
Sbjct: 1   MAAFGVQMWAESNRILSLFATSLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHDLRLL 60

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SAL  FFT  P PHVF+CNSLIRAFSHSKIP TPLSIYAHM RNSILPNN+TFPFLLKS
Sbjct: 61  DSALR-FFTQIPKPHVFVCNSLIRAFSHSKIPRTPLSIYAHMNRNSILPNNYTFPFLLKS 120

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           LADFN LV GQSVH HVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMP RDVVSW
Sbjct: 121 LADFNHLVSGQSVHAHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPLRDVVSW 180

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           TVLIMGYRVALMFDDALIAFEQMQYAGV PNRVTMVNALAACASFGAIEMGVWIHEFVK+
Sbjct: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVVPNRVTMVNALAACASFGAIEMGVWIHEFVKR 240

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           +GWE+DVILGTSLIDMYGKCGRI+EGL VFQAMK+KNVYTWNALI GLALAKSGEEAIAW
Sbjct: 241 RGWEMDVILGTSLIDMYGKCGRIEEGLVVFQAMKDKNVYTWNALINGLALAKSGEEAIAW 300

Query: 301 FKRMDEE-GVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADL 360
           FKRM+EE G+EAD+VTLVAVLCACSHSGLVD+GRQIF S+IDG+F FSPGI+HYSCM DL
Sbjct: 301 FKRMNEEGGIEADEVTLVAVLCACSHSGLVDIGRQIFGSMIDGKFGFSPGIQHYSCMVDL 360

Query: 361 LARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYY 420
           LAR G IEE+FELIKNMPFDATKAMWGSLLAG RA GSLE+SEFAARKLVEMEPENGAYY
Sbjct: 361 LARSGRIEESFELIKNMPFDATKAMWGSLLAGSRAQGSLEMSEFAARKLVEMEPENGAYY 420

Query: 421 AVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           AVLSNICAE+GKW+EVEKVREIMK  GLKKDLGSSSVE
Sbjct: 421 AVLSNICAEMGKWNEVEKVREIMKVEGLKKDLGSSSVE 457

BLAST of Tan0004075 vs. ExPASy TrEMBL
Match: A0A6J1JWL5 (pentatricopeptide repeat-containing protein At5g56310-like OS=Cucurbita maxima OX=3661 GN=LOC111488140 PE=4 SV=1)

HSP 1 Score: 820.8 bits (2119), Expect = 2.8e-234
Identity = 406/458 (88.65%), Postives = 424/458 (92.58%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           M  F  Q WA+SNRILSLF  SLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCH L LL
Sbjct: 1   MAAFGVQMWAESNRILSLFATSLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHHLRLL 60

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SAL  FFT  P PHVF+CNSLIRAFSHSKIPHTPLSIYAHM R SILPNN+TFPFLLKS
Sbjct: 61  DSALR-FFTQIPKPHVFVCNSLIRAFSHSKIPHTPLSIYAHMNRTSILPNNYTFPFLLKS 120

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           LADFN LV GQSVH HVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMP RDVVSW
Sbjct: 121 LADFNHLVSGQSVHAHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPLRDVVSW 180

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIH FVK+
Sbjct: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHAFVKR 240

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           +GWE+DVILGTSLIDMYGKCGRI+EGL VFQAMK+KNVYTWNALI GLALAKSGEEAIAW
Sbjct: 241 RGWEMDVILGTSLIDMYGKCGRIEEGLVVFQAMKDKNVYTWNALINGLALAKSGEEAIAW 300

Query: 301 FKRMDE-EGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADL 360
           FKRMDE  GVEAD+VTLV VLCACSHSGLVD+GRQIF S+IDG+F FSPGI+HYSCM DL
Sbjct: 301 FKRMDEGGGVEADEVTLVTVLCACSHSGLVDIGRQIFGSMIDGKFGFSPGIQHYSCMVDL 360

Query: 361 LARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYY 420
           LAR GCIEE+FELIKNMPFDATKAMWGSLLAG RA GSLE+SEFAARKLVEMEPENGAYY
Sbjct: 361 LARSGCIEESFELIKNMPFDATKAMWGSLLAGSRAQGSLEISEFAARKLVEMEPENGAYY 420

Query: 421 AVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           AVLSNICAE+GKW+EVEKVREIMK  GLKKDLGSSSVE
Sbjct: 421 AVLSNICAEMGKWNEVEKVREIMKVEGLKKDLGSSSVE 457

BLAST of Tan0004075 vs. ExPASy TrEMBL
Match: A0A6J1GP72 (pentatricopeptide repeat-containing protein At5g56310-like OS=Cucurbita moschata OX=3662 GN=LOC111456251 PE=4 SV=1)

HSP 1 Score: 816.6 bits (2108), Expect = 5.3e-233
Identity = 405/458 (88.43%), Postives = 425/458 (92.79%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           M  F  Q WA+SNRILSLF  SLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCH L LL
Sbjct: 1   MAAFGVQMWAESNRILSLFATSLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHDLRLL 60

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SAL  FFT  P PHVF+CNSLIRAFSHSKIP TPLSIYAHM RNSILPNN+TFPFLLKS
Sbjct: 61  DSALR-FFTQIPKPHVFVCNSLIRAFSHSKIPRTPLSIYAHMNRNSILPNNYTFPFLLKS 120

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           LADFN LV GQSVH HVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMP RDVVSW
Sbjct: 121 LADFNHLVSGQSVHAHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPLRDVVSW 180

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           TVLIMGYRVALMFDDALIAFEQMQYAGV PNRVTMVNALAACASFGAIEMGVWIHEFVK+
Sbjct: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVVPNRVTMVNALAACASFGAIEMGVWIHEFVKR 240

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           +GWE+DVILGTSLIDMYGKCGRI+EGL VFQAMK+KNVYTWNALI GLALAKSGEEAIAW
Sbjct: 241 RGWEMDVILGTSLIDMYGKCGRIEEGLVVFQAMKDKNVYTWNALINGLALAKSGEEAIAW 300

Query: 301 FKRMDEE-GVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADL 360
           FKRM+EE G+EAD+VTLVAVLCACSHSGLVD+GRQIF S+IDG+F FSPGI+HYSCM DL
Sbjct: 301 FKRMNEEGGIEADEVTLVAVLCACSHSGLVDIGRQIFGSMIDGKFGFSPGIQHYSCMVDL 360

Query: 361 LARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYY 420
           LAR G IEE+FELIKNMPFDATKAMWGSLLAG RA GSLE+SEFAARKLVEMEPENGAYY
Sbjct: 361 LARSGRIEESFELIKNMPFDATKAMWGSLLAGSRAQGSLEMSEFAARKLVEMEPENGAYY 420

Query: 421 AVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           AVLSNICAE+GKW+EVEKVREIMK  GLKKDLGSSSVE
Sbjct: 421 AVLSNICAEMGKWNEVEKVREIMKVEGLKKDLGSSSVE 457

BLAST of Tan0004075 vs. ExPASy TrEMBL
Match: A0A6J1D9C2 (pentatricopeptide repeat-containing protein At5g56310-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111018217 PE=4 SV=1)

HSP 1 Score: 813.9 bits (2101), Expect = 3.5e-232
Identity = 399/457 (87.31%), Postives = 427/457 (93.44%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           M P AAQ WAQSNRILSL  ASLSPIHIPQIQ+QLI+QNLHS+  IAHHFIN CH L LL
Sbjct: 3   MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLL 62

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SALL FFTHFPNPHVF+ NSLIRAFSHSKIPHTPLSIYAHM RNSILPNN+TFPFLLKS
Sbjct: 63  DSALL-FFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFPFLLKS 122

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           L+DFNDLVGGQSVHTHVVK G+VSDVYVQNSLMDVYASCGRMGLC+KVFDEMPQRDVVSW
Sbjct: 123 LSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSW 182

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           TVLIMGYR  LMFDDAL+AFE MQYAGVEPN VTMVNALAACA +GAIEMGVWIHEFVK+
Sbjct: 183 TVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKR 242

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           +GWEVDVILGTSLIDMYGKCGRIKEGL VFQAMKEKNV+TWNALIKGLALAKSGEEAIAW
Sbjct: 243 RGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAW 302

Query: 301 FKRMDEEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADLL 360
           FKRMDEEGVEAD+VTLVAVLCACSHSGLV+ GR+IFR+L+DG + FSPGIKH+SCM DLL
Sbjct: 303 FKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLL 362

Query: 361 ARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYYA 420
           AR GCIEEAF LIK+MPFDATKAMWGSLLAGGRA+GSLEVSEFAARKLVEMEPENGAYY 
Sbjct: 363 ARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYV 422

Query: 421 VLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           VLSNI AE+G+W EVE+VR+IM+ERGLKKD GSSSVE
Sbjct: 423 VLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE 458

BLAST of Tan0004075 vs. ExPASy TrEMBL
Match: A0A6J1D8G0 (pentatricopeptide repeat-containing protein At1g09190-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018217 PE=4 SV=1)

HSP 1 Score: 813.9 bits (2101), Expect = 3.5e-232
Identity = 399/457 (87.31%), Postives = 427/457 (93.44%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           M P AAQ WAQSNRILSL  ASLSPIHIPQIQ+QLI+QNLHS+  IAHHFIN CH L LL
Sbjct: 87  MKPSAAQLWAQSNRILSLLAASLSPIHIPQIQSQLILQNLHSSTAIAHHFINACHFLRLL 146

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SALL FFTHFPNPHVF+ NSLIRAFSHSKIPHTPLSIYAHM RNSILPNN+TFPFLLKS
Sbjct: 147 DSALL-FFTHFPNPHVFVFNSLIRAFSHSKIPHTPLSIYAHMNRNSILPNNYTFPFLLKS 206

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           L+DFNDLVGGQSVHTHVVK G+VSDVYVQNSLMDVYASCGRMGLC+KVFDEMPQRDVVSW
Sbjct: 207 LSDFNDLVGGQSVHTHVVKWGFVSDVYVQNSLMDVYASCGRMGLCRKVFDEMPQRDVVSW 266

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           TVLIMGYR  LMFDDAL+AFE MQYAGVEPN VTMVNALAACA +GAIEMGVWIHEFVK+
Sbjct: 267 TVLIMGYRGGLMFDDALVAFEHMQYAGVEPNCVTMVNALAACAGYGAIEMGVWIHEFVKR 326

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           +GWEVDVILGTSLIDMYGKCGRIKEGL VFQAMKEKNV+TWNALIKGLALAKSGEEAIAW
Sbjct: 327 RGWEVDVILGTSLIDMYGKCGRIKEGLVVFQAMKEKNVFTWNALIKGLALAKSGEEAIAW 386

Query: 301 FKRMDEEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADLL 360
           FKRMDEEGVEAD+VTLVAVLCACSHSGLV+ GR+IFR+L+DG + FSPGIKH+SCM DLL
Sbjct: 387 FKRMDEEGVEADEVTLVAVLCACSHSGLVNKGREIFRALVDGSYGFSPGIKHFSCMVDLL 446

Query: 361 ARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYYA 420
           AR GCIEEAF LIK+MPFDATKAMWGSLLAGGRA+GSLEVSEFAARKLVEMEPENGAYY 
Sbjct: 447 ARSGCIEEAFVLIKDMPFDATKAMWGSLLAGGRANGSLEVSEFAARKLVEMEPENGAYYV 506

Query: 421 VLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           VLSNI AE+G+W EVE+VR+IM+ERGLKKD GSSSVE
Sbjct: 507 VLSNILAEMGRWGEVEEVRDIMRERGLKKDSGSSSVE 542

BLAST of Tan0004075 vs. ExPASy TrEMBL
Match: A0A1S3BRY1 (pentatricopeptide repeat-containing protein At5g56310-like OS=Cucumis melo OX=3656 GN=LOC103492838 PE=4 SV=1)

HSP 1 Score: 798.1 bits (2060), Expect = 2.0e-227
Identity = 392/457 (85.78%), Postives = 418/457 (91.47%), Query Frame = 0

Query: 1   MTPFAAQFWAQSNRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLL 60
           M PF  QF A+S RILS+FT SLSP+HIPQIQAQLI++NLHS+P IAHHFINTCH LHLL
Sbjct: 1   MRPFGTQFRAESTRILSIFTTSLSPVHIPQIQAQLILRNLHSHPLIAHHFINTCHHLHLL 60

Query: 61  NSALLFFFTHFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKS 120
           +SA L FFTH P PHVFICNSLIRAF+HS IPHTPLSIY HM RNSI PNN+TFPF+LKS
Sbjct: 61  DSAFL-FFTHIPKPHVFICNSLIRAFAHSNIPHTPLSIYTHMNRNSISPNNYTFPFVLKS 120

Query: 121 LADFNDLVGGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSW 180
           LADF DLV GQSVHTHVVKLG+ SD+YVQN+LMDVYASCG+MGLCKKVFDEM QRDVVSW
Sbjct: 121 LADFKDLVSGQSVHTHVVKLGHDSDLYVQNTLMDVYASCGKMGLCKKVFDEMLQRDVVSW 180

Query: 181 TVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKK 240
           T+LIMGYRV+LM DDALI FEQMQYAGVEPNRVT+VNALAACASFGAIEMGVWIHEFVK 
Sbjct: 181 TILIMGYRVSLMLDDALIVFEQMQYAGVEPNRVTIVNALAACASFGAIEMGVWIHEFVKT 240

Query: 241 KGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAW 300
           K WEVDV+LGT+LIDMYGKCGRIKE LAVFQAMKEKNVYTWN LI GLALAKSGEEAIAW
Sbjct: 241 KRWEVDVVLGTALIDMYGKCGRIKEALAVFQAMKEKNVYTWNVLINGLALAKSGEEAIAW 300

Query: 301 FKRMDEEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADLL 360
           FKRMDEEGVEADDVTLVAVLCACSHSGLV+ GRQIFRSLI GRF FSP IKHYSCM D+L
Sbjct: 301 FKRMDEEGVEADDVTLVAVLCACSHSGLVNSGRQIFRSLIHGRFGFSPEIKHYSCMVDIL 360

Query: 361 ARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYYA 420
           AR GCIEEAF +IK+MPF+ATKAMWGSLL G RAHG+LEVSE AARKLVEMEPENGAYY 
Sbjct: 361 ARNGCIEEAFVMIKDMPFEATKAMWGSLLTGSRAHGNLEVSEIAARKLVEMEPENGAYYV 420

Query: 421 VLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           VLSNI AE+GKWSEVEKVREIMKERGLKKDLGSSSVE
Sbjct: 421 VLSNIYAEMGKWSEVEKVREIMKERGLKKDLGSSSVE 456

BLAST of Tan0004075 vs. TAIR 10
Match: AT5G56310.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 308.1 bits (788), Expect = 1.2e-83
Identity = 172/466 (36.91%), Postives = 256/466 (54.94%), Query Frame = 0

Query: 28  IPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALLFFFTHFPNPHVFICNSLIRAFS 87
           + Q    +I+  L+ +      FI  C +   L  A    FTH P P+ ++ N++IRA S
Sbjct: 31  LKQSHCYMIITGLNRDNLNVAKFIEACSNAGHLRYA-YSVFTHQPCPNTYLHNTMIRALS 90

Query: 88  HSKIPHT---PLSIYAHMTRNSILPNNFTFPFLLKSLADFNDLVGGQSVHTHVVKLGYVS 147
               P+     +++Y  +      P+ FTFPF+LK     +D+  G+ +H  VV  G+ S
Sbjct: 91  LLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKIAVRVSDVWFGRQIHGQVVVFGFDS 150

Query: 148 DVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDV--------------------------- 207
            V+V   L+ +Y SCG +G  +K+FDEM  +DV                           
Sbjct: 151 SVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGYGKVGEMDEARSLLEMMP 210

Query: 208 ------VSWTVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMG 267
                 VSWT +I GY  +    +A+  F++M    VEP+ VT++  L+ACA  G++E+G
Sbjct: 211 CWVRNEVSWTCVISGYAKSGRASEAIEVFQRMLMENVEPDEVTLLAVLSACADLGSLELG 270

Query: 268 VWIHEFVKKKGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALA 327
             I  +V  +G    V L  ++IDMY K G I + L VF+ + E+NV TW  +I GLA  
Sbjct: 271 ERICSYVDHRGMNRAVSLNNAVIDMYAKSGNITKALDVFECVNERNVVTWTTIIAGLATH 330

Query: 328 KSGEEAIAWFKRMDEEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIK 387
             G EA+A F RM + GV  +DVT +A+L ACSH G VD+G+++F S+   ++   P I+
Sbjct: 331 GHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSM-RSKYGIHPNIE 390

Query: 388 HYSCMADLLARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEM 447
           HY CM DLL R G + EA E+IK+MPF A  A+WGSLLA    H  LE+ E A  +L+++
Sbjct: 391 HYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVHHDLELGERALSELIKL 450

Query: 448 EPENGAYYAVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE 458
           EP N   Y +L+N+ + +G+W E   +R +MK  G+KK  G SS+E
Sbjct: 451 EPNNSGNYMLLANLYSNLGRWDESRMMRNMMKGIGVKKMAGESSIE 494

BLAST of Tan0004075 vs. TAIR 10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 300.4 bits (768), Expect = 2.5e-81
Identity = 164/467 (35.12%), Postives = 260/467 (55.67%), Query Frame = 0

Query: 13  NRILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALLF---FFT 72
           + +LSL  +S   +H+ QI A L+  +L  N  + HHF++   +L L+   + +    F+
Sbjct: 12  DHLLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRL-ALSLIPRDINYSCRVFS 71

Query: 73  HFPNPHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILP-NNFTFPFLLKSLADFNDLV 132
              NP +  CN++IRAFS S+ P     ++  + RNS LP N  +  F LK      DL+
Sbjct: 72  QRLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLL 131

Query: 133 GGQSVHTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSWTVLIMGYR 192
           GG  +H  +   G++SD  +  +LMD+Y++C       KVFDE+P+RD VSW VL   Y 
Sbjct: 132 GGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYL 191

Query: 193 VALMFDDALIAFEQMQY---AGVEPNRVTMVNALAACASFGAIEMGVWIHEFVKKKGWEV 252
                 D L+ F++M+      V+P+ VT + AL ACA+ GA++ G  +H+F+ + G   
Sbjct: 192 RNKRTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSG 251

Query: 253 DVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMD 312
            + L  +L+ MY +CG + +   VF  M+E+NV +W ALI GLA+   G+EAI  F  M 
Sbjct: 252 ALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEML 311

Query: 313 EEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADLLARCGC 372
           + G+  ++ TL  +L ACSHSGLV  G   F  +  G F+  P + HY C+ DLL R   
Sbjct: 312 KFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARL 371

Query: 373 IEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYYAVLSNI 432
           +++A+ LIK+M       +W +LL   R HG +E+ E     L+E++ E    Y +L N 
Sbjct: 372 LDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNT 431

Query: 433 CAEIGKWSEVEKVREIMKERGLKKDLGSSSVE-EAVYESFMPTETAH 472
            + +GKW +V ++R +MKE+ +    G S++E +     F+  + +H
Sbjct: 432 YSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSH 477

BLAST of Tan0004075 vs. TAIR 10
Match: AT2G33760.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 292.4 bits (747), Expect = 6.7e-79
Identity = 151/436 (34.63%), Postives = 258/436 (59.17%), Query Frame = 0

Query: 28  IPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALLFFFTHFPNPHVFICNSLIRAFS 87
           + Q+ A LIV     + ++    I    S   +    L F +  P P  F+ NS+I++ S
Sbjct: 25  LQQVHAHLIVTGYGRSRSLLTKLITLACSARAIAYTHLLFLS-VPLPDDFLFNSVIKSTS 84

Query: 88  HSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKSLADFNDLVGGQSVHTHVVKLGYVSDVY 147
             ++P   ++ Y  M  +++ P+N+TF  ++KS AD + L  G+ VH H V  G+  D Y
Sbjct: 85  KLRLPLHCVAYYRRMLSSNVSPSNYTFTSVIKSCADLSALRIGKGVHCHAVVSGFGLDTY 144

Query: 148 VQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSWTVLIMGYRVALMFDDALIAFEQMQYAG 207
           VQ +L+  Y+ CG M   ++VFD MP++ +V+W  L+ G+    + D+A+  F QM+ +G
Sbjct: 145 VQAALVTFYSKCGDMEGARQVFDRMPEKSIVAWNSLVSGFEQNGLADEAIQVFYQMRESG 204

Query: 208 VEPNRVTMVNALAACASFGAIEMGVWIHEFVKKKGWEVDVILGTSLIDMYGKCGRIKEGL 267
            EP+  T V+ L+ACA  GA+ +G W+H+++  +G +++V LGT+LI++Y +CG + +  
Sbjct: 205 FEPDSATFVSLLSACAQTGAVSLGSWVHQYIISEGLDLNVKLGTALINLYSRCGDVGKAR 264

Query: 268 AVFQAMKEKNVYTWNALIKGLALAKSGEEAIAWFKRMDEE-GVEADDVTLVAVLCACSHS 327
            VF  MKE NV  W A+I        G++A+  F +M+++ G   ++VT VAVL AC+H+
Sbjct: 265 EVFDKMKETNVAAWTAMISAYGTHGYGQQAVELFNKMEDDCGPIPNNVTFVAVLSACAHA 324

Query: 328 GLVDMGRQIFRSLIDGRFRFSPGIKHYSCMADLLARCGCIEEAFELIKNMPFDAT----- 387
           GLV+ GR +++ +    +R  PG++H+ CM D+L R G ++EA++ I  +  DAT     
Sbjct: 325 GLVEEGRSVYKRMTKS-YRLIPGVEHHVCMVDMLGRAGFLDEAYKFIHQL--DATGKATA 384

Query: 388 KAMWGSLLAGGRAHGSLEVSEFAARKLVEMEPENGAYYAVLSNICAEIGKWSEVEKVREI 447
            A+W ++L   + H + ++    A++L+ +EP+N  ++ +LSNI A  GK  EV  +R+ 
Sbjct: 385 PALWTAMLGACKMHRNYDLGVEIAKRLIALEPDNPGHHVMLSNIYALSGKTDEVSHIRDG 444

Query: 448 MKERGLKKDLGSSSVE 458
           M    L+K +G S +E
Sbjct: 445 MMRNNLRKQVGYSVIE 456

BLAST of Tan0004075 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 291.6 bits (745), Expect = 1.2e-78
Identity = 168/480 (35.00%), Postives = 252/480 (52.50%), Query Frame = 0

Query: 28  IPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALL----FFFTHFPNPHVFICNSLI 87
           + QI A+++   L  +      F++ C  +   +S  L      F  F  P  F+ N +I
Sbjct: 30  LKQIHARMLKTGLMQDSYAITKFLSFC--ISSTSSDFLPYAQIVFDGFDRPDTFLWNLMI 89

Query: 88  RAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKSLADFNDLVGGQSVHTHVVKLGYV 147
           R FS S  P   L +Y  M  +S   N +TFP LLK+ ++ +       +H  + KLGY 
Sbjct: 90  RGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQITKLGYE 149

Query: 148 SDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRD--------------------------- 207
           +DVY  NSL++ YA  G   L   +FD +P+ D                           
Sbjct: 150 NDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALTLFRKM 209

Query: 208 ----VVSWTVLIMGYRVALMFDDALIAFEQMQYAGVEPNRVTMVNALAACASFGAIEMGV 267
                +SWT +I GY  A M  +AL  F +MQ + VEP+ V++ NAL+ACA  GA+E G 
Sbjct: 210 AEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQLGALEQGK 269

Query: 268 WIHEFVKKKGWEVDVILGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTWNALIKGLALAK 327
           WIH ++ K    +D +LG  LIDMY KCG ++E L VF+ +K+K+V  W ALI G A   
Sbjct: 270 WIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGYAYHG 329

Query: 328 SGEEAIAWFKRMDEEGVEADDVTLVAVLCACSHSGLVDMGRQIFRSLIDGRFRFSPGIKH 387
            G EAI+ F  M + G++ + +T  AVL ACS++GLV+ G+ IF S+ +  +   P I+H
Sbjct: 330 HGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSM-ERDYNLKPTIEH 389

Query: 388 YSCMADLLARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEVSEFAARKLVEME 447
           Y C+ DLL R G ++EA   I+ MP      +WG+LL   R H ++E+ E     L+ ++
Sbjct: 390 YGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGEILIAID 449

Query: 448 PENGAYYAVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSVE-EAVYESFMPTETAH 472
           P +G  Y   +NI A   KW +  + R +MKE+G+ K  G S++  E     F+  + +H
Sbjct: 450 PYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFLAGDRSH 506

BLAST of Tan0004075 vs. TAIR 10
Match: AT1G09190.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 290.4 bits (742), Expect = 2.6e-78
Identity = 168/476 (35.29%), Postives = 265/476 (55.67%), Query Frame = 0

Query: 14  RILSLFTASLSPIHIPQIQAQLIVQNLHSNPTIAHHFINTCHSLHLLNSALLFFFTHFPN 73
           ++L L     +   +P+I A L+   LH +  +  HFI+ C SL   + A    F+H  N
Sbjct: 6   KLLRLLHGHNTRTRLPEIHAHLLRHFLHGSNLLLAHFISICGSLSNSDYANR-VFSHIQN 65

Query: 74  PHVFICNSLIRAFSHSKIPHTPLSIYAHMTRNSILPNNFTFPFLLKSLADFNDLVGGQSV 133
           P+V + N++I+ +S    P   LS ++ M    I  + +T+  LLKS +  +DL  G+ V
Sbjct: 66  PNVLVFNAMIKCYSLVGPPLESLSFFSSMKSRGIWADEYTYAPLLKSCSSLSDLRFGKCV 125

Query: 134 HTHVVKLGYVSDVYVQNSLMDVYASCGRMGLCKKVFDEMPQRDVVSWTVLIMGY------ 193
           H  +++ G+     ++  ++++Y S GRMG  +KVFDEM +R+VV W ++I G+      
Sbjct: 126 HGELIRTGFHRLGKIRIGVVELYTSGGRMGDAQKVFDEMSERNVVVWNLMIRGFCDSGDV 185

Query: 194 -RVALMFD------------------------DALIAFEQMQYAGVEPNRVTMVNALAAC 253
            R   +F                         +AL  F +M   G +P+  T+V  L   
Sbjct: 186 ERGLHLFKQMSERSIVSWNSMISSLSKCGRDREALELFCEMIDQGFDPDEATVVTVLPIS 245

Query: 254 ASFGAIEMGVWIHEFVKKKGWEVDVI-LGTSLIDMYGKCGRIKEGLAVFQAMKEKNVYTW 313
           AS G ++ G WIH   +  G   D I +G +L+D Y K G ++   A+F+ M+ +NV +W
Sbjct: 246 ASLGVLDTGKWIHSTAESSGLFKDFITVGNALVDFYCKSGDLEAATAIFRKMQRRNVVSW 305

Query: 314 NALIKGLALAKSGEEAIAWFKRMDEEG-VEADDVTLVAVLCACSHSGLVDMGRQIFRSLI 373
           N LI G A+   GE  I  F  M EEG V  ++ T + VL  CS++G V+ G ++F  ++
Sbjct: 306 NTLISGSAVNGKGEFGIDLFDAMIEEGKVAPNEATFLGVLACCSYTGQVERGEELFGLMM 365

Query: 374 DGRFRFSPGIKHYSCMADLLARCGCIEEAFELIKNMPFDATKAMWGSLLAGGRAHGSLEV 433
           + RF+     +HY  M DL++R G I EAF+ +KNMP +A  AMWGSLL+  R+HG +++
Sbjct: 366 E-RFKLEARTEHYGAMVDLMSRSGRITEAFKFLKNMPVNANAAMWGSLLSACRSHGDVKL 425

Query: 434 SEFAARKLVEMEPENGAYYAVLSNICAEIGKWSEVEKVREIMKERGLKKDLGSSSV 457
           +E AA +LV++EP N   Y +LSN+ AE G+W +VEKVR +MK+  L+K  G S++
Sbjct: 426 AEVAAMELVKIEPGNSGNYVLLSNLYAEEGRWQDVEKVRTLMKKNRLRKSTGQSTI 479

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FMA11.7e-8236.91Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX... [more]
Q9SN853.5e-8035.12Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX... [more]
P930119.5e-7834.63Pentatricopeptide repeat-containing protein At2g33760 OS=Arabidopsis thaliana OX... [more]
Q9FJY71.6e-7735.00Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
O804883.6e-7735.29Pentatricopeptide repeat-containing protein At1g09190 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_038898992.14.8e-23689.06pentatricopeptide repeat-containing protein At5g56310-like [Benincasa hispida][more]
XP_023548659.18.2e-23688.86pentatricopeptide repeat-containing protein At5g56310-like [Cucurbita pepo subsp... [more]
KAG6575842.16.9e-23588.86Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022991568.15.9e-23488.65pentatricopeptide repeat-containing protein At5g56310-like [Cucurbita maxima][more]
XP_022953836.11.1e-23288.43pentatricopeptide repeat-containing protein At5g56310-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1JWL52.8e-23488.65pentatricopeptide repeat-containing protein At5g56310-like OS=Cucurbita maxima O... [more]
A0A6J1GP725.3e-23388.43pentatricopeptide repeat-containing protein At5g56310-like OS=Cucurbita moschata... [more]
A0A6J1D9C23.5e-23287.31pentatricopeptide repeat-containing protein At5g56310-like isoform X2 OS=Momordi... [more]
A0A6J1D8G03.5e-23287.31pentatricopeptide repeat-containing protein At1g09190-like isoform X1 OS=Momordi... [more]
A0A1S3BRY12.0e-22785.78pentatricopeptide repeat-containing protein At5g56310-like OS=Cucumis melo OX=36... [more]
Match NameE-valueIdentityDescription
AT5G56310.11.2e-8336.91Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G47530.12.5e-8135.12Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G33760.16.7e-7934.63Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G66520.11.2e-7835.00Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G09190.12.6e-7835.29Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 344..471
e-value: 6.2E-9
score: 37.7
coord: 229..343
e-value: 1.9E-27
score: 98.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 37..136
e-value: 1.5E-5
score: 26.4
coord: 137..228
e-value: 1.9E-20
score: 74.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 251..278
e-value: 1.4E-4
score: 19.8
coord: 279..312
e-value: 3.0E-7
score: 28.2
coord: 148..177
e-value: 3.0E-4
score: 18.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 352..376
e-value: 0.0059
score: 16.8
coord: 178..208
e-value: 0.0023
score: 18.1
coord: 422..447
e-value: 1.1
score: 9.6
coord: 150..176
e-value: 5.0E-4
score: 20.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 276..323
e-value: 3.1E-8
score: 33.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 277..311
score: 11.005202
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 176..210
score: 9.832344
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 246..276
score: 8.889672
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 6..459
NoneNo IPR availablePANTHERPTHR47924:SF31SUBFAMILY NOT NAMEDcoord: 6..459

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0004075.1Tan0004075.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006749 glutathione metabolic process
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding