Tan0020971 (gene) Snake gourd v1

Overview
Name: Tan0020971
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG11: 16241050 .. 16246763 (+)
RNA-Seq Expression: Tan0020971
Synteny: Tan0020971
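
The Location field packs a linkage group, a start and end coordinate, and a strand into one string. A minimal sketch of reading it follows, assuming the coordinates are 1-based and inclusive (the usual GFF3-style convention; this page does not state it). The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

def parse_location(text):
    """Parse a location string such as 'LG11: 16241050 .. 16246763 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return {"seqid": seqid, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("LG11: 16241050 .. 16246763 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 5714 bp on the plus strand of LG11.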
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGCCCAAGCAAATCTGAGTTAGGCTTCCATAAACAGATCGCCTCATTCCCCTCATTGGGTTTCGTTCTTTTCAGTTCCCATTTCTGTGCTGCTTCATTCAGGTTCAGTCTCCACTCGGTCTGCTCATTTTTCAGAATCTCTTCGCGTTTGAATTTCGGTTCGATCGATTGAGGCACTTGAAACTCCGCAAGCAGTTCGATTCGCAGTGGCGCGGTTGGCCTATGCCCATAAACCCTAAAGCCTTACAGCTCAGGATTTTACCACCCATTAATCTAACATGAGCATCCATTGCTTTAAAGAAGATGAAGCTTTCCGTTGAAGGTCTTGCCTTTGCTTCACTCCTCTCAGCGATGTTACTCTTCTTTCGCAATCTTTTCCACGTTAGTCGCAGAGCCTCTTGCCGAGTAATCTCTCTGTCTTCAAGTTCCTCGCATCCGGGCTGCCTTTCTTTCAAAGTATTTAATGCCTCATCGTCTCTAACATTAATAAATGGCTGTTATATTTCTTGTCCGTTTTTCTGGTTCACTAGCTTTCTTTGTATATTTCGGCTCCCTTTTGTTAGTTACTCGAATACAAATAATTCTTTTGAATGTTTAGACATTGGTTCCCTTCGTAAAATCATACAACAAGACCTCTGGAACGATCCTAAGATTGTTACTTTATTTGATTCAGCACTAGCGCCCATTTGGGTTCCTAGGATTTTAGTTGAATTGAAAAATGATCCAAAATTAGCTCTCAAGTTCTTCAAATGGGCTGAATCCCAGATTGGTTTCCGCCATACCACCGAGTCTTACTGCATTGTAGTTCACATGCTGTTTCGTGCGAGAATGTACACAAATGCCCATGATATTATTAAAGAAGTGATTTTGAAGAACCGTGCTGACTTGATTTTGCCAGTTTGTAATATATTTGATATGTTATGGTCCACTAGGAATATTTGTGTGTCAGGATCAGGAGTTTTTGACGTGTTATTTAGTGTTTTGATAGAGTTGGATCTGCTTGAAGAAGCCAATGAATGTTTCTCGAGAATGAGGAAGTTTAGGACTCTGCCCAAAGCACGTTCTTGCAATTTTCTTTTGCATAGATTATCAAAGTCAGGGAATGGACAGTTGGTGAGAAAGTTTTTCAATGATATGATTGGGGCTGGTATTACACCTTCTGTTTTTACTTACAATGTAATGATAGATCACTTGTGCAAAGAAGGGGATTTGGAAAATGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTTCTCCAGACGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAGAAGAATCTGTGTATTTATTTAATGAAATGAAGGATGCAGGTTGTGTTCCTGATATAATTACCTATAATGCTTTAATCAATCGTTTCTGCAAGTTTGAGAAGATGCCTCGAGCTTTTGAGTATCTCTCTGAGATGAAGAACAATGGGCTAAAGCCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGACGGAATGATGCAAGGTGCCATCAAACTTTTTGTTGATATGAGAAGAGTTGGCCTTTTACCTAATGAATTTACCTACACTTCTCTGATTGATGCCAATTGTAAGGCAGGTAATTTAACAGAAGCATGGAAGTTGTCCAATGATATGTTGCAAGCAGGAGTTAATTTAAATATAGTCACCTATACAGCTCTAATGGATGGCCTTTGTGAAGATGGAAGAATGGTGGAAGCAGAAGAAGTGTTTAGGGCAATGCTGAAAGATGGAATATCTCCCAACCAGCAGGTTTACACTGCTTTGGTTCATGGCTATATTAAGGCGGAGAGATTGGGGGATGCTATGGAGATATTGAAGCAAATGACAGAATGTGACATCAAACCAGATTTAATACTCTATGGCACAATTCTTTGGGGTCTCTGTAATCAAAGCAAACTTGAAGAAACTAAGCTTATTATTAAAGAAATGAAAAGTCGGGGTATTAGTGCAAAT
CCTGTTATATACACAACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCAGATGCATTAAATCTTCTTCAGGAGATGCAGGATGTAGGTGTTGAGGCTACTGTTGTAACCTACTGTGTATTAATTGATGGTTTGTGCAAAACAGGCATGGTCGAACTGGCAGTTGATTATTTTGGTAGAATGTCTGACTTTGGTTTACAACCTAATGTTGCAGTTTACACCGCCCTCATCGATGGTCTTTGTAAAATTAATTGCATTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGGTATGACTCCGGATAAAGCAGCTTTCACTGCTCTAATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTTGAATTTGATTAGCAAAATGACAGAATTAGCTATTGATTTTGATCTACATGCTTATACTTCCTTGGTTTCGGGATTTTCTCAATGTGGTGAGCTGCACCAAGCGAGGAAGTTTTTTGATGAGATGATTGAGAAGGGCATACTTCCCGAGGAAATTTTATGTATATGTCTATTGAGGGAGTATTATAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAATGAAATGCAAAGGAGGGGTCTAATTGCTGAAAAGTGCAGCCATGCAATTCCCAGTCTAAAAATGTGAAGCGGAGGTCCAATCTGGTCATCTTTGGTGTCTGGAAGCTGAGTTTGTATTTAATTGCTAGCACATGCGGTATTGATTTGGGATTCAGAGTTATGAAGACTCGAGAAACCCACCATTTAAATGGGTTCCATGTTTGATGGAAAGAATTTGCAGGCCATGCATGATGTATCTGATGTTGGTTTTGGATGTACTGATGCGAAATTTTCTATCTGATAAGATGCAGCTTCTTGTTTTCAAATGACATTCTGCAGAAAAATCCCATCATGTTATGTGTTTCATTAGTCATCAATCTTCTATGAATGTGAGTGACAATCTTTCTTGTCCTCTTCCTCCTTTTTCCCAAAAAGAAGATTTTTGTTTAGATATGGGATACAAGCATGCTATTTTTAGCAGTAATAGCCATTTATATGAGATTGTGAGTGATAACTTTTTCTTCAACGATTGTGCATGATGCTTCCATCCTGCTTGTCTTCTGTCAACTAATTTTTCATTTGGGTTTACAGGTCCAAAGAGGATTCCTGTGGCACGAGTGATTCCCATGGAATCATTCTAGGAAAGTACCTATCCGTTATTGTTACAGATGGAAGCTTATTCATTCAAGTTTGAAGCTCAGCTCTGCTTATAATGAGCAAGGTGTTAACCTTATTCTCCATGCTTGTTTATAGTTGGATTTCAATACTCAGCTCTATAAGCATATGGTCACCTTTGGAATGGGTAAAGTGGTGAAGGAAGTCTGCTCTCTTCTCATGGCACTGGGGGTTCTCACAGGAACCAATCTTTTTCTGCTGTTCTTTCCTCCATTGAGGAAGTGTGGTAGGTTCTTATGCAAATTATTGACAACATTCAACTTAATAATCCAAGTGACTCTCTGAAGTTTGAAATGCATCCTTGATTCTTTCCAGTTGTTTATGAGCATCTCCTTATTTTCACTTTTCATGAATGGTTATAGATTTTAGCCAACAAATCTTTCTAGGAAGATTTGGGTGGCCAAAATTACTATGAAACTCTGAACATCACTCTGATTTCCTTCTCCCAATTCTATGAATACGCCAAGTCACCAACAACAGGATTTGTAGGAGGAAACCTAACTTATCTTCCTCCTCAAATTTATGGTGATTGTCATGGACAACAATCACCTTTCGTTGCCCGTTTGCTCACAAATGCAGGACCTCTCTCTTCCCTCTATTTAGTCCTGTAGGTTGTTTGCCTAAGAGCTTGAGAGTAAGTGGGTTGGTTTCTGGCCTACAATTCCAAACTGACTTCAAGATTTTATAAGAAAACCAAGCAGACCAGGCCAATGAACCCAAAAAATCCATCTGAAATAGACAAAACCA
ACTGCAGTGAAAAAACCAGCCAAATCAATCAAATCTATGAGAAACCAACCTATTGAGGCATTCACGGTTTTGAAACTATATAGAAGTGAATAAACCAAAACCAGTCTCAAATTGACCAAACTGACGTGATTCAATCATTTTGGTTTTCATGGGCATGCCTATCTAAGAGTGTTTGAGTCTCTCTCCGAGTCCTGAAAGCCTGTGGTGGCCCCCTTTGGGTTGTAGGCTTGTTCTCCAGTGGGTGGAATGTTGTATGGCTTTTCATTGGAACATTTAGTCAGAGAAAAATACGAGGGTCTTTCATGTTCTTTATAGAGGGATATTGGCTACAGTAATCTTCTTGCTGTTGGAGTTGGAGTTCTTTTAAAAGGAATTTTTATAGTCATTCTACTACTACTATTGTTAGAAACTGGTCAAATTTTTGTAATTTCTAATTGTTTGGGGGATTGAGAGATGCCTTATACTTCTCTCCCTTACAAATTTTTCTTTCCTAGAGTAGAACTCTCGGACTTTTTTCTCCTTTGTGAATAAGGAGACACGTCTGTGGTGTTTTTATTTTAAAATTTTCGTATACCTTATTTTGACAATCTATCATAAAGGAGTGTATTTTAATTATCTATCATAAACACTGTAATTACTCTTATGTGGTTGTGTTCTTATTTCTATTGCTAGACATAGTTTGTTACAGCATTATGATGGTGTTGAAAAACTACTTGACTTATCACTTAAGAAGTTTGTAGCATTGGTTAGGATTTCAAAGTTGCATTCCCTTTTTAAATAAATTTAAGTGAACTGATTATCAGTACAATCTTCACTTCTGTCCAGGGATTTTTGGCACATCCATTCATTTAAGTTCAACAGAAGCAACCACTCACCTGAGTTAATGACGACTTGCAAAAACTTCTAAGGGTTGTTTGGGATGTTGAGTGGGTTATAATAGTTTGTGATGTTATATAATTTTTATTTAGGATGAGAGTTATTTAGTCTGTGTTTGGGATGCGGAGTTTTCATAGACTAATATAACATTGGTTATTATTACATGCATCTCAAACAGACTCTTAGAGTTCCGTTCTCATGTCATGTCTGTTTGTTACATCTTCAGGAGCTGTAGTGTACCGAAAAAAATTTCAGTACAGTTGTATAGATGCCGAAGAGCAGATCTTAGGTACCTAGCGGAAGTTTACATTGTTATGATGATTATTTGAATATTTGTTTACTATGGAAGCCACAGGATGAATGATTTTATGAGATGTGTTTGTTTGAATTTCTCAGGTGGTTGGTTTTGGAGGGAGCTTATTTTGGCATATGAATTGTAGAAGCCATCCATGTTGATTGCTGTTGCATATAGGCATGGTTTGAGAGTTTCAAGTGTTGTATGATTGTGACTTGTTTGCCTCCTAATATATTAATGCAAATCTCAATCTCAAAAGCTCTGGAAGGAGGAAGCTGCACAATAATTTGTTGTAGCCTTGCAGAATGGAGAGCAGAAAGGCGAGGTGATAAGATGTACAAATTCTTTATGTTAAAAAGTATGCTTATGATTCTGAGAAATGAAGCAGGATTATAGGAACATAGAGAATAGTGTTTTATTTTATTACTTTTATTTATTTTTCTGACATAAATAGAGAATAATGTGTTATTATTTCTCAATAACTCTAAACCAGAAACATAAAGGATATTAATTTAATAGGGACCAGATTTTTTTTTTCCAGAA

mRNA sequence

GGCCCAAGCAAATCTGAGTTAGGCTTCCATAAACAGATCGCCTCATTCCCCTCATTGGGTTTCGTTCTTTTCAGTTCCCATTTCTGTGCTGCTTCATTCAGGTTCAGTCTCCACTCGGTCTGCTCATTTTTCAGAATCTCTTCGCGTTTGAATTTCGGTTCGATCGATTGAGGCACTTGAAACTCCGCAAGCAGTTCGATTCGCAGTGGCGCGGTTGGCCTATGCCCATAAACCCTAAAGCCTTACAGCTCAGGATTTTACCACCCATTAATCTAACATGAGCATCCATTGCTTTAAAGAAGATGAAGCTTTCCGTTGAAGGTCTTGCCTTTGCTTCACTCCTCTCAGCGATGTTACTCTTCTTTCGCAATCTTTTCCACGTTAGTCGCAGAGCCTCTTGCCGAGTAATCTCTCTGTCTTCAAGTTCCTCGCATCCGGGCTGCCTTTCTTTCAAAGTATTTAATGCCTCATCGTCTCTAACATTAATAAATGGCTGTTATATTTCTTGTCCGTTTTTCTGGTTCACTAGCTTTCTTTGTATATTTCGGCTCCCTTTTGTTAGTTACTCGAATACAAATAATTCTTTTGAATGTTTAGACATTGGTTCCCTTCGTAAAATCATACAACAAGACCTCTGGAACGATCCTAAGATTGTTACTTTATTTGATTCAGCACTAGCGCCCATTTGGGTTCCTAGGATTTTAGTTGAATTGAAAAATGATCCAAAATTAGCTCTCAAGTTCTTCAAATGGGCTGAATCCCAGATTGGTTTCCGCCATACCACCGAGTCTTACTGCATTGTAGTTCACATGCTGTTTCGTGCGAGAATGTACACAAATGCCCATGATATTATTAAAGAAGTGATTTTGAAGAACCGTGCTGACTTGATTTTGCCAGTTTGTAATATATTTGATATGTTATGGTCCACTAGGAATATTTGTGTGTCAGGATCAGGAGTTTTTGACGTGTTATTTAGTGTTTTGATAGAGTTGGATCTGCTTGAAGAAGCCAATGAATGTTTCTCGAGAATGAGGAAGTTTAGGACTCTGCCCAAAGCACGTTCTTGCAATTTTCTTTTGCATAGATTATCAAAGTCAGGGAATGGACAGTTGGTGAGAAAGTTTTTCAATGATATGATTGGGGCTGGTATTACACCTTCTGTTTTTACTTACAATGTAATGATAGATCACTTGTGCAAAGAAGGGGATTTGGAAAATGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTTCTCCAGACGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGTTTGAGAAGATGCCTCGAGCTTTTGAGTATCTCTCTGAGATGAAGAACAATGGGCTAAAGCCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGACGGAATGATGCAAGGTGCCATCAAACTTTTTGTTGATATGAGAAGAGTTGGCCTTTTACCTAATGAATTTACCTACACTTCTCTGATTGATGCCAATTGTAAGGCAGGTAATTTAACAGAAGCATGGAAGTTGTCCAATGATATGTTGCAAGCAGGAGTTAATTTAAATATAGTCACCTATACAGCTCTAATGGATGGCCTTTGTGAAGATGGAAGAATGGTGGAAGCAGAAGAAGTGTTTAGGGCAATGCTGAAAGATGGAATATCTCCCAACCAGCAGGTTTACACTGCTTTGGTTCATGGCTATATTAAGGCGGAGAGATTGGGGGATGCTATGGAGATATTGAAGCAAATGACAGAATGTGACATCAAACCAGATTTAATACTCTATGGCACAATTCTTTGGGGTCTCTGTAATCAAAGCAAACTTGAAGAAACTAAGCTTATTATTAAAGAAATGAAAAGTCGGGGTATTAGTGCAAATCCTGTTATATACACAACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCAGATGCATTAAATCTTCTTCAGGAGATGCAGGATGTAGGTGTTGAGGCTACT
GTTGTAACCTACTGTGTATTAATTGATGGTTTGTGCAAAACAGGCATGGTCGAACTGGCAGTTGATTATTTTGGTAGAATGTCTGACTTTGGTTTACAACCTAATGTTGCAGTTTACACCGCCCTCATCGATGGTCTTTGTAAAATTAATTGCATTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGGTATGACTCCGGATAAAGCAGCTTTCACTGCTCTAATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTTGAATTTGATTAGCAAAATGACAGAATTAGCTATTGATTTTGATCTACATGCTTATACTTCCTTGGTTTCGGGATTTTCTCAATGTGGTGAGCTGCACCAAGCGAGGAAGTTTTTTGATGAGATGATTGAGAAGGGCATACTTCCCGAGGAAATTTTATGTATATGTCTATTGAGGGAGTATTATAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAATGAAATGCAAAGGAGGGGTCTAATTGCTGAAAAGTGCAGCCATGCAATTCCCAGTCTAAAAATGTGAAGCGGAGGTCCAATCTGGTCATCTTTGGTGTCTGGAAGCTGAGTTTGTATTTAATTGCTAGCACATGCGGTATTGATTTGGGATTCAGAGTTATGAAGACTCGAGAAACCCACCATTTAAATGGGTTCCATGTTTGATGGAAAGAATTTGCAGGCCATGCATGATGTATCTGATGTTGGTTTTGGATGTACTGATGCGAAATTTTCTATCTGATAAGATGCAGCTTCTTGTTTTCAAATGACATTCTGCAGAAAAATCCCATCATGTTATGTGTTTCATTAGTCATCAATCTTCTATGAATGTCCAAAGAGGATTCCTGTGGCACGAGTGATTCCCATGGAATCATTCTAGGAAAGTACCTATCCGTTATTGTTACAGATGGAAGCTTATTCATTCAAGTTTGAAGCTCAGCTCTGCTTATAATGAGCAAGGTGTTAACCTTATTCTCCATGCTTGTTTATAGTTGGATTTCAATACTCAGCTCTATAAGCATATGGTCACCTTTGGAATGGGTAAAGTGGTGAAGGAAGTCTGCTCTCTTCTCATGGCACTGGGGGTTCTCACAGGAACCAATCTTTTTCTGCTGTTCTTTCCTCCATTGAGGAAGTGTGGAGCTGTAGTGTACCGAAAAAAATTTCAGTACAGTTGTATAGATGCCGAAGAGCAGATCTTAGGTGGTTGGTTTTGGAGGGAGCTTATTTTGGCATATGAATTGTAGAAGCCATCCATGTTGATTGCTGTTGCATATAGGCATGGTTTGAGAGTTTCAAGTGTTGTATGATTGTGACTTGTTTGCCTCCTAATATATTAATGCAAATCTCAATCTCAAAAGCTCTGGAAGGAGGAAGCTGCACAATAATTTGTTGTAGCCTTGCAGAATGGAGAGCAGAAAGGCGAGGTGATAAGATGTACAAATTCTTTATGTTAAAAAGTATGCTTATGATTCTGAGAAATGAAGCAGGATTATAGGAACATAGAGAATAGTGTTTTATTTTATTACTTTTATTTATTTTTCTGACATAAATAGAGAATAATGTGTTATTATTTCTCAATAACTCTAAACCAGAAACATAAAGGATATTAATTTAATAGGGACCAGATTTTTTTTTTCCAGAA

Coding sequence (CDS)

ATGAAGCTTTCCGTTGAAGGTCTTGCCTTTGCTTCACTCCTCTCAGCGATGTTACTCTTCTTTCGCAATCTTTTCCACGTTAGTCGCAGAGCCTCTTGCCGAGTAATCTCTCTGTCTTCAAGTTCCTCGCATCCGGGCTGCCTTTCTTTCAAAGTATTTAATGCCTCATCGTCTCTAACATTAATAAATGGCTGTTATATTTCTTGTCCGTTTTTCTGGTTCACTAGCTTTCTTTGTATATTTCGGCTCCCTTTTGTTAGTTACTCGAATACAAATAATTCTTTTGAATGTTTAGACATTGGTTCCCTTCGTAAAATCATACAACAAGACCTCTGGAACGATCCTAAGATTGTTACTTTATTTGATTCAGCACTAGCGCCCATTTGGGTTCCTAGGATTTTAGTTGAATTGAAAAATGATCCAAAATTAGCTCTCAAGTTCTTCAAATGGGCTGAATCCCAGATTGGTTTCCGCCATACCACCGAGTCTTACTGCATTGTAGTTCACATGCTGTTTCGTGCGAGAATGTACACAAATGCCCATGATATTATTAAAGAAGTGATTTTGAAGAACCGTGCTGACTTGATTTTGCCAGTTTGTAATATATTTGATATGTTATGGTCCACTAGGAATATTTGTGTGTCAGGATCAGGAGTTTTTGACGTGTTATTTAGTGTTTTGATAGAGTTGGATCTGCTTGAAGAAGCCAATGAATGTTTCTCGAGAATGAGGAAGTTTAGGACTCTGCCCAAAGCACGTTCTTGCAATTTTCTTTTGCATAGATTATCAAAGTCAGGGAATGGACAGTTGGTGAGAAAGTTTTTCAATGATATGATTGGGGCTGGTATTACACCTTCTGTTTTTACTTACAATGTAATGATAGATCACTTGTGCAAAGAAGGGGATTTGGAAAATGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTTCTCCAGACGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGTTTGAGAAGATGCCTCGAGCTTTTGAGTATCTCTCTGAGATGAAGAACAATGGGCTAAAGCCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGACGGAATGATGCAAGGTGCCATCAAACTTTTTGTTGATATGAGAAGAGTTGGCCTTTTACCTAATGAATTTACCTACACTTCTCTGATTGATGCCAATTGTAAGGCAGGTAATTTAACAGAAGCATGGAAGTTGTCCAATGATATGTTGCAAGCAGGAGTTAATTTAAATATAGTCACCTATACAGCTCTAATGGATGGCCTTTGTGAAGATGGAAGAATGGTGGAAGCAGAAGAAGTGTTTAGGGCAATGCTGAAAGATGGAATATCTCCCAACCAGCAGGTTTACACTGCTTTGGTTCATGGCTATATTAAGGCGGAGAGATTGGGGGATGCTATGGAGATATTGAAGCAAATGACAGAATGTGACATCAAACCAGATTTAATACTCTATGGCACAATTCTTTGGGGTCTCTGTAATCAAAGCAAACTTGAAGAAACTAAGCTTATTATTAAAGAAATGAAAAGTCGGGGTATTAGTGCAAATCCTGTTATATACACAACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCAGATGCATTAAATCTTCTTCAGGAGATGCAGGATGTAGGTGTTGAGGCTACTGTTGTAACCTACTGTGTATTAATTGATGGTTTGTGCAAAACAGGCATGGTCGAACTGGCAGTTGATTATTTTGGTAGAATGTCTGACTTTGGTTTACAACCTAATGTTGCAGTTTACACCGCCCTCATCGATGGTCTTTGTAAAATTAATTGCATTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGGTATGACTCCGGATAAAGCAGCTTTCACTGCTCTAATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTTGAATTTGATTAGCAAAATGACAGAATTAGC
TATTGATTTTGATCTACATGCTTATACTTCCTTGGTTTCGGGATTTTCTCAATGTGGTGAGCTGCACCAAGCGAGGAAGTTTTTTGATGAGATGATTGAGAAGGGCATACTTCCCGAGGAAATTTTATGTATATGTCTATTGAGGGAGTATTATAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAATGAAATGCAAAGGAGGGGTCTAATTGCTGAAAAGTGCAGCCATGCAATTCCCAGTCTAAAAATGTGA

Protein sequence

MKLSVEGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLTLINGCYISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTLFDSALAPIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKEGDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEKCSHAIPSLKM
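
The protein sequence above corresponds to the coding sequence read codon by codon. The sketch below shows that relationship, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the helper name `translate` is illustrative and is not a tool used by the database.

```python
BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 bases of the coding sequence listed above.
print(translate("ATGAAGCTTTCCGTTGAAGGTCTTGCC"))  # prints MKLSVEGLA
```

The nine residues printed match the start of the protein sequence, and the final codons of the CDS (`...AAA ATG TGA`) give the closing `KM` followed by a stop.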
Homology
BLAST of Tan0020971 vs. ExPASy Swiss-Prot
Match: P0C894 (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana OX=3702 GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 829.7 bits (2142), Expect = 2.6e-239
Identity = 412/771 (53.44%), Positives = 545/771 (70.69%), Query Frame = 0

Query: 17  MLLFFRNLFHVSRRASCRVISLSSSSSH-PGCLSFKVFNASSSLTLINGCYISCPFFWFT 76
           M    RN  HV+RR    V   SSS S     L F + + S S       +ISCPF WFT
Sbjct: 1   MFCSLRNFLHVNRRFPRHVSPSSSSLSQIQSPLCFPLSSPSPS----QSSFISCPFVWFT 60

Query: 77  SFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTLFDSALAPIWVPRILV 136
           SFLCI R PFV+ S T+   E  D   +RK++  DLW+DP +  LFD  LAPIWVPR+LV
Sbjct: 61  SFLCIIRYPFVTKSGTSTYSEDFDRDWIRKVVHNDLWDDPGLEKLFDLTLAPIWVPRVLV 120

Query: 137 ELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIKEVILKNRADL 196
           ELK DPKLA KFFKW+ ++ GF+H+ ESYCIV H+LF ARMY +A+ ++KE++L ++AD 
Sbjct: 121 ELKEDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVL-SKAD- 180

Query: 197 ILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRKFRTLPKARSC 256
               C++FD+LWSTRN+CV G GVFD LFSVLI+L +LEEA +CFS+M++FR  PK RSC
Sbjct: 181 ----CDVFDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSC 240

Query: 257 NFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKEGDLENARRLFVQMRQ 316
           N LLHR +K G    V++FF DMIGAG  P+VFTYN+MID +CKEGD+E AR LF +M+ 
Sbjct: 241 NGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKF 300

Query: 317 MGFSPDVVTYNSLIDGYG-----------------------------------KFEKMPR 376
            G  PD VTYNS+IDG+G                                   KF K+P 
Sbjct: 301 RGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPI 360

Query: 377 AFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLID 436
             E+  EMK NGLKPNVV+YSTL+DAFCK+GMMQ AIK +VDMRRVGL+PNE+TYTSLID
Sbjct: 361 GLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLID 420

Query: 437 ANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISP 496
           ANCK GNL++A++L N+MLQ GV  N+VTYTAL+DGLC+  RM EAEE+F  M   G+ P
Sbjct: 421 ANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIP 480

Query: 497 NQQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILWGLCNQSKLEETKLII 556
           N   Y AL+HG++KA+ +  A+E+L ++    IKPDL+LYGT +WGLC+  K+E  K+++
Sbjct: 481 NLASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVM 540

Query: 557 KEMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKT 616
            EMK  GI AN +IYTT++DAYFK+G  ++ L+LL EM+++ +E TVVT+CVLIDGLCK 
Sbjct: 541 NEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLCKN 600

Query: 617 GMVELAVDYFGRMS-DFGLQPNVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKAA 676
            +V  AVDYF R+S DFGLQ N A++TA+IDGLCK N +E+A  LF++M  +G+ PD+ A
Sbjct: 601 KLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLCKDNQVEAATTLFEQMVQKGLVPDRTA 660

Query: 677 FTALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQCGELHQARKFFDEMI 736
           +T+L+DGN K GN+ EAL L  KM E+ +  DL AYTSLV G S C +L +AR F +EMI
Sbjct: 661 YTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCNQLQKARSFLEEMI 720

Query: 737 EKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEKCSHAIPSL 751
            +GI P+E+LCI +L+++Y+LG +DEA+EL++ + +  L+     +A+P++
Sbjct: 721 GEGIHPDEVLCISVLKKHYELGCIDEAVELQSYLMKHQLLTSDNDNALPNM 761
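
The percentages in each HSP summary line are ratios over the alignment length, gap columns included: identity counts columns with identical residues, and positives counts identical residues plus substitutions that score above zero in the scoring matrix. A small sketch that recomputes them from the summary line is given below; the function name `hsp_percentages` is hypothetical, and the pattern deliberately accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches the counts in a summary line; 'Posi?tives' tolerates the
# misspelled 'Postives' label found in some report exports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no Identity/Positives fields found")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 412/771 (53.44%), Positives = 545/771 (70.69%), Query Frame = 0"
print(hsp_percentages(line))  # prints (53.44, 70.69)
```

For the At2g02150 hit this reproduces the reported 53.44% identity and 70.69% positives over 771 aligned columns.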

BLAST of Tan0020971 vs. ExPASy Swiss-Prot
Match: Q9ZUA2 (Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX=3702 GN=At2g01740 PE=3 SV=1)

HSP 1 Score: 348.6 bits (893), Expect = 1.8e-94
Identity = 197/546 (36.08%), Positives = 299/546 (54.76%), Query Frame = 0

Query: 232 LLEEANECFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIT------- 291
           ++ EA +  SR+RK   LP   +CN  +H+L  S  G L  KF   ++  G T       
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 292 ----------------------------PSVFTYNVMIDHLCKEGDLENARRLFVQMR-Q 351
                                       P V +YN +ID  C+ GD+ +A  +   +R  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 352 MGF--SPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMM 411
            GF   PD+V++NSL +G+ K + +   F Y+  M      PNVVTYST ID FCK G +
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVML-KCCSPNVVTYSTWIDTFCKSGEL 180

Query: 412 QGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTAL 471
           Q A+K F  M+R  L PN  T+T LID  CKAG+L  A  L  +M +  ++LN+VTYTAL
Sbjct: 181 QLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTAL 240

Query: 472 MDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTECDI 531
           +DG C+ G M  AEE++  M++D + PN  VYT ++ G+ +     +AM+ L +M    +
Sbjct: 241 IDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGM 300

Query: 532 KPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDALN 591
           + D+  YG I+ GLC   KL+E   I+++M+   +  + VI+TT+++AYFK+G+   A+N
Sbjct: 301 RLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVN 360

Query: 592 LLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDGLC 651
           +  ++ + G E  VV    +IDG+ K G +  A+ YF        + N  +YT LID LC
Sbjct: 361 MYHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYFCIE-----KANDVMYTVLIDALC 420

Query: 652 KINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTELAIDFDLH 711
           K       ++LF ++   G+ PDK  +T+ I G  K GNL +A  L ++M +  +  DL 
Sbjct: 421 KEGDFIEVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLL 480

Query: 712 AYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEM 740
           AYT+L+ G +  G + +AR+ FDEM+  GI P+  +   L+R Y K G +  A +L  +M
Sbjct: 481 AYTTLIYGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDM 540

BLAST of Tan0020971 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 340.5 bits (872), Expect = 4.8e-92
Identity = 189/616 (30.68%), Positives = 317/616 (51.46%), Query Frame = 0

Query: 128 IWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIKEV 187
           IWV   L+++K D +L L FF WA S+       ES CIV+H+   ++    A  +I   
Sbjct: 91  IWV---LMKIKCDYRLVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSF 150

Query: 188 ILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRKFR 247
             + + ++       FD+L  T     S   VFDV F VL++  LL EA   F +M  + 
Sbjct: 151 WERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYG 210

Query: 248 TLPKARSCNFLLHRLSKSGNGQLVRKF-FNDMIGAGITPSVFTYNVMIDHLCKEGDLENA 307
            +    SCN  L RLSK           F +    G+  +V +YN++I  +C+ G ++ A
Sbjct: 211 LVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEA 270

Query: 308 RRLFVQMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDA 367
             L + M   G++PDV++Y+++++GY +F ++ + ++ +  MK  GLKPN   Y ++I  
Sbjct: 271 HHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGL 330

Query: 368 FCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLN 427
            C+   +  A + F +M R G+LP+   YT+LID  CK G++  A K   +M    +  +
Sbjct: 331 LCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPD 390

Query: 428 IVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILK 487
           ++TYTA++ G C+ G MVEA ++F  M   G+ P+   +T L++GY KA  + DA  +  
Sbjct: 391 VLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHN 450

Query: 488 QMTECDIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAG 547
            M +    P+++ Y T++ GLC +  L+    ++ EM   G+  N   Y +I++   K+G
Sbjct: 451 HMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSG 510

Query: 548 KSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYT 607
              +A+ L+ E +  G+ A  VTY  L+D  CK+G ++ A +    M   GLQP +  + 
Sbjct: 511 NIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFN 570

Query: 608 ALIDGLCKINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTEL 667
            L++G C    +E  +KL + M  +G+ P+   F +L+       NL+ A  +   M   
Sbjct: 571 VLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSR 630

Query: 668 AIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEA 727
            +  D   Y +LV G  +   + +A   F EM  KG          L++ + K  +  EA
Sbjct: 631 GVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEA 690

Query: 728 IELKNEMQRRGLIAEK 743
            E+ ++M+R GL A+K
Sbjct: 691 REVFDQMRREGLAADK 701

BLAST of Tan0020971 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 332.0 bits (850), Expect = 1.7e-89
Identity = 185/623 (29.70%), Positives = 323/623 (51.85%), Query Frame = 0

Query: 133 ILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIKEVILKNR 192
           +L++ +ND  L LKF  WA     F  T    CI +H+L + ++Y  A  + ++V  K  
Sbjct: 54  LLLKSQNDQALILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTAQILAEDVAAKTL 113

Query: 193 ADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRKFRTLPKA 252
            D    +  +F  L  T ++C S S VFD++      L L+++A       +    +P  
Sbjct: 114 DDEYASL--VFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGV 173

Query: 253 RSCNFLLHRLSKS-GNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKEGDLENARRLFV 312
            S N +L    +S  N       F +M+ + ++P+VFTYN++I   C  G+++ A  LF 
Sbjct: 174 LSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFD 233

Query: 313 QMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDG 372
           +M   G  P+VVTYN+LIDGY K  K+   F+ L  M   GL+PN+++Y+ +I+  C++G
Sbjct: 234 KMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREG 293

Query: 373 MMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYT 432
            M+    +  +M R G   +E TY +LI   CK GN  +A  +  +ML+ G+  +++TYT
Sbjct: 294 RMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYT 353

Query: 433 ALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTEC 492
           +L+  +C+ G M  A E    M   G+ PN++ YT LV G+ +   + +A  +L++M + 
Sbjct: 354 SLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDN 413

Query: 493 DIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDA 552
              P ++ Y  ++ G C   K+E+   ++++MK +G+S + V Y+T++  + ++    +A
Sbjct: 414 GFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEA 473

Query: 553 LNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDG 612
           L + +EM + G++   +TY  LI G C+    + A D +  M   GL P+   YTALI+ 
Sbjct: 474 LRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINA 533

Query: 613 LCKINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISK--------- 672
            C    +E A +L +EM  +G+ PD   ++ LI+G  K    +EA  L+ K         
Sbjct: 534 YCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPS 593

Query: 673 ------MTELAIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLRE 732
                 + E   + +  +  SL+ GF   G + +A + F+ M+ K   P+      ++  
Sbjct: 594 DVTYHTLIENCSNIEFKSVVSLIKGFCMKGMMTEADQVFESMLGKNHKPDGTAYNIMIHG 653

Query: 733 YYKLGQLDEAIELKNEMQRRGLI 740
           + + G + +A  L  EM + G +
Sbjct: 654 HCRAGDIRKAYTLYKEMVKSGFL 672

BLAST of Tan0020971 vs. ExPASy Swiss-Prot
Match: Q9LFC5 (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX=3702 GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 308.9 bits (790), Expect = 1.6e-82
Identity = 186/655 (28.40%), Positives = 320/655 (48.85%), Query Frame = 0

Query: 125 LAPIWVPRILVELKNDPKLALKFFKWAESQIG-----FRHTTESYCIVVHMLFRARMYTN 184
           L P+ V  +L   +ND  L  +F      Q+G     F+HT+ S   ++H+L R+   ++
Sbjct: 76  LNPLAVVEVLYRCRNDLTLGQRFV----DQLGFHFPNFKHTSLSLSAMIHILVRSGRLSD 135

Query: 185 AHDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANEC 244
           A   +  +I ++    +     I + L ST + C S   VFD+L    ++   L EA+E 
Sbjct: 136 AQSCLLRMIRRSGVSRL----EIVNSLDSTFSNCGSNDSVFDLLIRTYVQARKLREAHEA 195

Query: 245 FSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCK 304
           F+ +R         +CN L+  L + G  +L    + ++  +G+  +V+T N+M++ LCK
Sbjct: 196 FTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEISRSGVGINVYTLNIMVNALCK 255

Query: 305 EGDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVT 364
           +G +E       Q+++ G  PD+VTYN+LI  Y     M  AFE ++ M   G  P V T
Sbjct: 256 DGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEEAFELMNAMPGKGFSPGVYT 315

Query: 365 YSTLIDAFCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDML 424
           Y+T+I+  CK G  + A ++F +M R GL P+  TY SL+   CK G++ E  K+ +DM 
Sbjct: 316 YNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRSLLMEACKKGDVVETEKVFSDMR 375

Query: 425 QAGVNLNIVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLG 484
              V  ++V ++++M      G + +A   F ++ + G+ P+  +YT L+ GY +   + 
Sbjct: 376 SRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLIPDNVIYTILIQGYCRKGMIS 435

Query: 485 DAMEILKQMTECDIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTII 544
            AM +  +M +     D++ Y TIL GLC +  L E   +  EM  R +  +    T +I
Sbjct: 436 VAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEADKLFNEMTERALFPDSYTLTILI 495

Query: 545 DAYFKAGKSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQ 604
           D + K G   +A+ L Q+M++  +   VVTY  L+DG  K G ++ A + +  M    + 
Sbjct: 496 DGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSKEIL 555

Query: 605 PNVAVYTALIDGLCKINCIESAKKLFDEM------------------QCR---------- 664
           P    Y+ L++ LC    +  A +++DEM                   CR          
Sbjct: 556 PTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMICNSMIKGYCRSGNASDGESF 615

Query: 665 -------GMTPDKAAFTALIDGNLKLGNLQEALNLISKMTEL--AIDFDLHAYTSLVSGF 724
                  G  PD  ++  LI G ++  N+ +A  L+ KM E    +  D+  Y S++ GF
Sbjct: 616 LEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKMEEEQGGLVPDVFTYNSILHGF 675

Query: 725 SQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRG 738
            +  ++ +A     +MIE+G+ P+     C++  +     L EA  + +EM +RG
Sbjct: 676 CRQNQMKEAEVVLRKMIERGVNPDRSTYTCMINGFVSQDNLTEAFRIHDEMLQRG 722

BLAST of Tan0020971 vs. NCBI nr
Match: XP_022959718.1 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X3 [Cucurbita moschata])

HSP 1 Score: 1381.3 bits (3574), Expect = 0.0e+00
Identity = 679/751 (90.41%), Positives = 718/751 (95.61%), Query Frame = 0

Query: 1   MKLSVEGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLT 60
           MKLSVE LAFASL SAMLLFFR+LFHVSRRAS RVISLS +SSHPGCLSF VFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLT 60

Query: 61  LINGCYISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTL 120
            +NG YISCPFFWFTSFLCIFRLPFVSYS TN+SFE LDIGSLRKIIQQDLWNDPKIV L
Sbjct: 61  SMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVL 120

Query: 121 FDSALAPIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNA 180
           FDSALAPIWV +ILVELK DPKLALKFFKWA + IGFRHTTESYCI+VHMLFRARMYTNA
Sbjct: 121 FDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 181 HDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECF 240
           HDI+KE++LK+R DLILPVCN+FD+LWSTRN CVSG+GVFDVLFSVL+EL LLEEANECF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 241 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKE 300
           S+MRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGI PSVFTYNVMIDHLCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKE 300

Query: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTY 360
           GDLENAR LFVQMR MGFSPDVVTYNSLIDGYGKFEKMP+AFEYLSEMKN GLKPNVVTY
Sbjct: 301 GDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKFEKMPQAFEYLSEMKNIGLKPNVVTY 360

Query: 361 STLIDAFCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQ 420
           STLIDAFCK+GMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQ
Sbjct: 361 STLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQ 420

Query: 421 AGVNLNIVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGD 480
           AGVNLNIVTYTALMDGLCEDGRM+EAEEVFRAMLKDGISPNQQVYTALVHGYIKAE++ D
Sbjct: 421 AGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMED 480

Query: 481 AMEILKQMTECDIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIID 540
           A+EILKQMTEC IKPDL+LYGTI+WGLCNQ+KLEETKLIIKEMKSRGI ANPVIYTTIID
Sbjct: 481 ALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIID 540

Query: 541 AYFKAGKSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQP 600
           AYFKAGKSSDAL+LLQEMQ+VGVEATVVTYCVLIDGLCKTGMVE+AVDYFGRMSDFG+QP
Sbjct: 541 AYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQP 600

Query: 601 NVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLI 660
           NVAVYTALIDGLCKINCIESA+KLF+EMQCRGMTPDK AFTALIDGNLKLGNLQE LNLI
Sbjct: 601 NVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLI 660

Query: 661 SKMTELAIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKL 720
           SKMTEL I+FDLHAYT+LVSGFSQCGELHQARKFF+EMIEKGILP+EILCICLL+EY KL
Sbjct: 661 SKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGILPDEILCICLLKEYNKL 720

Query: 721 GQLDEAIELKNEMQRRGLIAEKCSHAIPSLK 752
           G LDEAI+LKNEMQRRGLI EKCSH +PSLK
Sbjct: 721 GHLDEAIKLKNEMQRRGLITEKCSHEVPSLK 751

BLAST of Tan0020971 vs. NCBI nr
Match: XP_023534850.1 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X3 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1376.3 bits (3561), Expect = 0.0e+00
Identity = 676/751 (90.01%), Positives = 716/751 (95.34%), Query Frame = 0

Query: 1   MKLSVEGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLT 60
           MKLSVE LAFASL SAML FFR+LFHVSRRAS RVISLS +SSHPGCLSF VFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLPFFRSLFHVSRRASYRVISLSLNSSHPGCLSFHVFNGPSSLT 60

Query: 61  LINGCYISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTL 120
            ING +ISCPFFWFTSFLCIFRLPFVSYS TN+SFE LDI SLRKIIQQDLWNDPKIV L
Sbjct: 61  SINGYHISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIDSLRKIIQQDLWNDPKIVVL 120

Query: 121 FDSALAPIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNA 180
           FDS+LAPIWV +ILVELK DP LALKFFKWA + IGFRHTTESYCI+VHMLFRARMYTNA
Sbjct: 121 FDSSLAPIWVSKILVELKEDPNLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 181 HDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECF 240
           HDI+KE++LK+R DLILPVCN+FD+LWSTRN CVSG+GVFDVLFSVL+EL LLEEANECF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 241 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKE 300
           S+MRKFRTLPKARSCNFLLHRLSK+GNGQLVRKFF+DM+GAGI PSVFTYNVMIDHLCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKAGNGQLVRKFFHDMVGAGIAPSVFTYNVMIDHLCKE 300

Query: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTY 360
           GDLENAR LFVQMR MGFSPDVVTYNSLIDGYGKFEKMP+AFEYLSEMKNNGLKPNVVTY
Sbjct: 301 GDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKFEKMPQAFEYLSEMKNNGLKPNVVTY 360

Query: 361 STLIDAFCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQ 420
           STLIDAFCK+GMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQ
Sbjct: 361 STLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQ 420

Query: 421 AGVNLNIVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGD 480
           AGVNLNIVTYTALMDGLCEDGRM+EAEEVFRAMLKDGISPNQQVYTALVHGYIKAE++ D
Sbjct: 421 AGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMED 480

Query: 481 AMEILKQMTECDIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIID 540
           A+EILKQ+T+C IKPDL+LYGTI+WGLCNQ+KLEETKLIIKEMKSRGI ANPVIYTTIID
Sbjct: 481 ALEILKQITKCGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIID 540

Query: 541 AYFKAGKSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQP 600
           AYFKAGK SDAL+LLQEMQ+VGVEATVVTYCVLIDGLCKTGMVE+AVDYFGRMSDFG+QP
Sbjct: 541 AYFKAGKGSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQP 600

Query: 601 NVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLI 660
           NVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDK AFTALIDGNLKLGNLQEALNLI
Sbjct: 601 NVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKTAFTALIDGNLKLGNLQEALNLI 660

Query: 661 SKMTELAIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKL 720
           SKMTEL I+FDLHAYT+LVSGFSQCGELHQARKFF+EMIEKGILP+EILCICLLREY KL
Sbjct: 661 SKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGILPDEILCICLLREYNKL 720

Query: 721 GQLDEAIELKNEMQRRGLIAEKCSHAIPSLK 752
           G LDEAIELKNEMQRRGLI EKCSH +PSLK
Sbjct: 721 GHLDEAIELKNEMQRRGLITEKCSHEVPSLK 751

BLAST of Tan0020971 vs. NCBI nr
Match: KAG6601913.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1369.0 bits (3542), Expect = 0.0e+00
Identity = 682/786 (86.77%), Positives = 718/786 (91.35%), Query Frame = 0

Query: 1   MKLSVEGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLT 60
           MKLSVE LAFASL SAMLLFFR+LFHVSRRAS RVISLS +SSHPGCLSF VFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLT 60

Query: 61  LINGCYISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTL 120
            +NG YISCPFFWFTSFLCIFRLPFVSYS TN+SFE LDIGSLRKIIQQDLWNDPKIV L
Sbjct: 61  SMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVL 120

Query: 121 FDSALAPIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNA 180
           FDSALAPIWV +ILVELK DPKLALKFFKWA + IGFRHTTESYCI+VHMLFRARMYTNA
Sbjct: 121 FDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 181 HDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECF 240
           HDI+KE++LK+R DLILPVCN+FD+LWSTRN CVSG+GVFDVLFSVL+EL LLEEANECF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 241 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKE 300
           S+MRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGI PSVFTYNVMIDHLCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKE 300

Query: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYG--------------------------- 360
           GD+ENAR LFVQMR MGFSPDVVTYNSLIDGYG                           
Sbjct: 301 GDVENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITY 360

Query: 361 --------KFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRR 420
                   KFEKMP+AFEYLSEMKNNGLKPNVVTYSTLIDAFCK+GMMQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPQAFEYLSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 420

Query: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVE 480
           VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRM+E
Sbjct: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMME 480

Query: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILW 540
           AEEVFRAMLKDGISPNQQVYTALVHGYIKAE++ DA+EILKQMTEC IKPDL+LYGTI+W
Sbjct: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIW 540

Query: 541 GLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEA 600
           GLCNQ+KLEETKLIIKEMK RGI ANPVIYTTIIDAYFKAGKSSDAL+LLQEMQ+VGVEA
Sbjct: 541 GLCNQNKLEETKLIIKEMKKRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEA 600

Query: 601 TVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDGLCKINCIESAKKLF 660
           TVVTYCVLIDGLCKTGMVE+AVDYFGRMSDFG+QPNVAVYTALIDGLCKINCIESAKKLF
Sbjct: 601 TVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAKKLF 660

Query: 661 DEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQC 720
           DEMQCRGMTPDK AFTALIDGNLKLGNLQEALNLISKMTEL I+FDLHAYT+LVSGFSQC
Sbjct: 661 DEMQCRGMTPDKTAFTALIDGNLKLGNLQEALNLISKMTELVIEFDLHAYTTLVSGFSQC 720

Query: 721 GELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEKCSH 752
           GELHQARKFF+EMIEKGILP+EILCICLLREY KLG LDEAIELKNEMQRRGLI EKCSH
Sbjct: 721 GELHQARKFFNEMIEKGILPDEILCICLLREYNKLGHLDEAIELKNEMQRRGLITEKCSH 780

BLAST of Tan0020971 vs. NCBI nr
Match: XP_022959692.1 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita moschata] >XP_022959700.1 putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1363.6 bits (3528), Expect = 0.0e+00
Identity = 679/786 (86.39%), Positives = 718/786 (91.35%), Query Frame = 0

Query: 1   MKLSVEGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLT 60
           MKLSVE LAFASL SAMLLFFR+LFHVSRRAS RVISLS +SSHPGCLSF VFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLT 60

Query: 61  LINGCYISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTL 120
            +NG YISCPFFWFTSFLCIFRLPFVSYS TN+SFE LDIGSLRKIIQQDLWNDPKIV L
Sbjct: 61  SMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVL 120

Query: 121 FDSALAPIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNA 180
           FDSALAPIWV +ILVELK DPKLALKFFKWA + IGFRHTTESYCI+VHMLFRARMYTNA
Sbjct: 121 FDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 181 HDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECF 240
           HDI+KE++LK+R DLILPVCN+FD+LWSTRN CVSG+GVFDVLFSVL+EL LLEEANECF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 241 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKE 300
           S+MRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGI PSVFTYNVMIDHLCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKE 300

Query: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYG--------------------------- 360
           GDLENAR LFVQMR MGFSPDVVTYNSLIDGYG                           
Sbjct: 301 GDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITY 360

Query: 361 --------KFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRR 420
                   KFEKMP+AFEYLSEMKN GLKPNVVTYSTLIDAFCK+GMMQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 420

Query: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVE 480
           VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRM+E
Sbjct: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMME 480

Query: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILW 540
           AEEVFRAMLKDGISPNQQVYTALVHGYIKAE++ DA+EILKQMTEC IKPDL+LYGTI+W
Sbjct: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIW 540

Query: 541 GLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEA 600
           GLCNQ+KLEETKLIIKEMKSRGI ANPVIYTTIIDAYFKAGKSSDAL+LLQEMQ+VGVEA
Sbjct: 541 GLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEA 600

Query: 601 TVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDGLCKINCIESAKKLF 660
           TVVTYCVLIDGLCKTGMVE+AVDYFGRMSDFG+QPNVAVYTALIDGLCKINCIESA+KLF
Sbjct: 601 TVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLF 660

Query: 661 DEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQC 720
           +EMQCRGMTPDK AFTALIDGNLKLGNLQE LNLISKMTEL I+FDLHAYT+LVSGFSQC
Sbjct: 661 EEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQC 720

Query: 721 GELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEKCSH 752
           GELHQARKFF+EMIEKGILP+EILCICLL+EY KLG LDEAI+LKNEMQRRGLI EKCSH
Sbjct: 721 GELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSH 780

BLAST of Tan0020971 vs. NCBI nr
Match: XP_023534824.1 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023534833.1 putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1358.6 bits (3515), Expect = 0.0e+00
Identity = 676/786 (86.01%), Positives = 716/786 (91.09%), Query Frame = 0

Query: 1   MKLSVEGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLT 60
           MKLSVE LAFASL SAML FFR+LFHVSRRAS RVISLS +SSHPGCLSF VFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLPFFRSLFHVSRRASYRVISLSLNSSHPGCLSFHVFNGPSSLT 60

Query: 61  LINGCYISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTL 120
            ING +ISCPFFWFTSFLCIFRLPFVSYS TN+SFE LDI SLRKIIQQDLWNDPKIV L
Sbjct: 61  SINGYHISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIDSLRKIIQQDLWNDPKIVVL 120

Query: 121 FDSALAPIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNA 180
           FDS+LAPIWV +ILVELK DP LALKFFKWA + IGFRHTTESYCI+VHMLFRARMYTNA
Sbjct: 121 FDSSLAPIWVSKILVELKEDPNLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 181 HDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECF 240
           HDI+KE++LK+R DLILPVCN+FD+LWSTRN CVSG+GVFDVLFSVL+EL LLEEANECF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 241 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKE 300
           S+MRKFRTLPKARSCNFLLHRLSK+GNGQLVRKFF+DM+GAGI PSVFTYNVMIDHLCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKAGNGQLVRKFFHDMVGAGIAPSVFTYNVMIDHLCKE 300

Query: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYG--------------------------- 360
           GDLENAR LFVQMR MGFSPDVVTYNSLIDGYG                           
Sbjct: 301 GDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITY 360

Query: 361 --------KFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRR 420
                   KFEKMP+AFEYLSEMKNNGLKPNVVTYSTLIDAFCK+GMMQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPQAFEYLSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 420

Query: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVE 480
           VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRM+E
Sbjct: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMME 480

Query: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILW 540
           AEEVFRAMLKDGISPNQQVYTALVHGYIKAE++ DA+EILKQ+T+C IKPDL+LYGTI+W
Sbjct: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQITKCGIKPDLVLYGTIIW 540

Query: 541 GLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEA 600
           GLCNQ+KLEETKLIIKEMKSRGI ANPVIYTTIIDAYFKAGK SDAL+LLQEMQ+VGVEA
Sbjct: 541 GLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKGSDALDLLQEMQEVGVEA 600

Query: 601 TVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDGLCKINCIESAKKLF 660
           TVVTYCVLIDGLCKTGMVE+AVDYFGRMSDFG+QPNVAVYTALIDGLCKINCIESAKKLF
Sbjct: 601 TVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAKKLF 660

Query: 661 DEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQC 720
           DEMQCRGMTPDK AFTALIDGNLKLGNLQEALNLISKMTEL I+FDLHAYT+LVSGFSQC
Sbjct: 661 DEMQCRGMTPDKTAFTALIDGNLKLGNLQEALNLISKMTELVIEFDLHAYTTLVSGFSQC 720

Query: 721 GELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEKCSH 752
           GELHQARKFF+EMIEKGILP+EILCICLLREY KLG LDEAIELKNEMQRRGLI EKCSH
Sbjct: 721 GELHQARKFFNEMIEKGILPDEILCICLLREYNKLGHLDEAIELKNEMQRRGLITEKCSH 780

BLAST of Tan0020971 vs. ExPASy TrEMBL
Match: A0A6J1H6R2 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111460688 PE=4 SV=1)

HSP 1 Score: 1381.3 bits (3574), Expect = 0.0e+00
Identity = 679/751 (90.41%), Positives = 718/751 (95.61%), Query Frame = 0

Query: 1   MKLSVEGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLT 60
           MKLSVE LAFASL SAMLLFFR+LFHVSRRAS RVISLS +SSHPGCLSF VFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLT 60

Query: 61  LINGCYISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTL 120
            +NG YISCPFFWFTSFLCIFRLPFVSYS TN+SFE LDIGSLRKIIQQDLWNDPKIV L
Sbjct: 61  SMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVL 120

Query: 121 FDSALAPIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNA 180
           FDSALAPIWV +ILVELK DPKLALKFFKWA + IGFRHTTESYCI+VHMLFRARMYTNA
Sbjct: 121 FDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 181 HDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECF 240
           HDI+KE++LK+R DLILPVCN+FD+LWSTRN CVSG+GVFDVLFSVL+EL LLEEANECF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 241 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKE 300
           S+MRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGI PSVFTYNVMIDHLCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKE 300

Query: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTY 360
           GDLENAR LFVQMR MGFSPDVVTYNSLIDGYGKFEKMP+AFEYLSEMKN GLKPNVVTY
Sbjct: 301 GDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKFEKMPQAFEYLSEMKNIGLKPNVVTY 360

Query: 361 STLIDAFCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQ 420
           STLIDAFCK+GMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQ
Sbjct: 361 STLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQ 420

Query: 421 AGVNLNIVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGD 480
           AGVNLNIVTYTALMDGLCEDGRM+EAEEVFRAMLKDGISPNQQVYTALVHGYIKAE++ D
Sbjct: 421 AGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMED 480

Query: 481 AMEILKQMTECDIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIID 540
           A+EILKQMTEC IKPDL+LYGTI+WGLCNQ+KLEETKLIIKEMKSRGI ANPVIYTTIID
Sbjct: 481 ALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIID 540

Query: 541 AYFKAGKSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQP 600
           AYFKAGKSSDAL+LLQEMQ+VGVEATVVTYCVLIDGLCKTGMVE+AVDYFGRMSDFG+QP
Sbjct: 541 AYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQP 600

Query: 601 NVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLI 660
           NVAVYTALIDGLCKINCIESA+KLF+EMQCRGMTPDK AFTALIDGNLKLGNLQE LNLI
Sbjct: 601 NVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLI 660

Query: 661 SKMTELAIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKL 720
           SKMTEL I+FDLHAYT+LVSGFSQCGELHQARKFF+EMIEKGILP+EILCICLL+EY KL
Sbjct: 661 SKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGILPDEILCICLLKEYNKL 720

Query: 721 GQLDEAIELKNEMQRRGLIAEKCSHAIPSLK 752
           G LDEAI+LKNEMQRRGLI EKCSH +PSLK
Sbjct: 721 GHLDEAIKLKNEMQRRGLITEKCSHEVPSLK 751

BLAST of Tan0020971 vs. ExPASy TrEMBL
Match: A0A6J1H589 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460688 PE=4 SV=1)

HSP 1 Score: 1363.6 bits (3528), Expect = 0.0e+00
Identity = 679/786 (86.39%), Positives = 718/786 (91.35%), Query Frame = 0

Query: 1   MKLSVEGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLT 60
           MKLSVE LAFASL SAMLLFFR+LFHVSRRAS RVISLS +SSHPGCLSF VFN  SSLT
Sbjct: 1   MKLSVEVLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLT 60

Query: 61  LINGCYISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTL 120
            +NG YISCPFFWFTSFLCIFRLPFVSYS TN+SFE LDIGSLRKIIQQDLWNDPKIV L
Sbjct: 61  SMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVL 120

Query: 121 FDSALAPIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNA 180
           FDSALAPIWV +ILVELK DPKLALKFFKWA + IGFRHTTESYCI+VHMLFRARMYTNA
Sbjct: 121 FDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNA 180

Query: 181 HDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECF 240
           HDI+KE++LK+R DLILPVCN+FD+LWSTRN CVSG+GVFDVLFSVL+EL LLEEANECF
Sbjct: 181 HDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECF 240

Query: 241 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKE 300
           S+MRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGI PSVFTYNVMIDHLCKE
Sbjct: 241 SKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKE 300

Query: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYG--------------------------- 360
           GDLENAR LFVQMR MGFSPDVVTYNSLIDGYG                           
Sbjct: 301 GDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITY 360

Query: 361 --------KFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRR 420
                   KFEKMP+AFEYLSEMKN GLKPNVVTYSTLIDAFCK+GMMQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRR 420

Query: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVE 480
           VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRM+E
Sbjct: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMME 480

Query: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILW 540
           AEEVFRAMLKDGISPNQQVYTALVHGYIKAE++ DA+EILKQMTEC IKPDL+LYGTI+W
Sbjct: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIW 540

Query: 541 GLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEA 600
           GLCNQ+KLEETKLIIKEMKSRGI ANPVIYTTIIDAYFKAGKSSDAL+LLQEMQ+VGVEA
Sbjct: 541 GLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEA 600

Query: 601 TVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDGLCKINCIESAKKLF 660
           TVVTYCVLIDGLCKTGMVE+AVDYFGRMSDFG+QPNVAVYTALIDGLCKINCIESA+KLF
Sbjct: 601 TVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLF 660

Query: 661 DEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQC 720
           +EMQCRGMTPDK AFTALIDGNLKLGNLQE LNLISKMTEL I+FDLHAYT+LVSGFSQC
Sbjct: 661 EEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQC 720

Query: 721 GELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEKCSH 752
           GELHQARKFF+EMIEKGILP+EILCICLL+EY KLG LDEAI+LKNEMQRRGLI EKCSH
Sbjct: 721 GELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSH 780

BLAST of Tan0020971 vs. ExPASy TrEMBL
Match: A0A6J1CX77 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Momordica charantia OX=3673 GN=LOC111015641 PE=4 SV=1)

HSP 1 Score: 1333.9 bits (3451), Expect = 0.0e+00
Identity = 670/785 (85.35%), Positives = 706/785 (89.94%), Query Frame = 0

Query: 1   MKLSVEGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLT 60
           MKL VE LAFAS LSAMLLFFR+LFHVSRRAS RVI+LSS SSHPGCLSF +FNA SS  
Sbjct: 1   MKLCVEVLAFASFLSAMLLFFRSLFHVSRRASHRVIALSSISSHPGCLSFNIFNAPSSK- 60

Query: 61  LINGCYISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTL 120
             NG  IS P FWF SFLCIFRLPFV+YSNTNNSFE LD GSLRKIIQQDLWNDP IV L
Sbjct: 61  --NGYCISFPSFWFASFLCIFRLPFVTYSNTNNSFEFLDFGSLRKIIQQDLWNDPMIVVL 120

Query: 121 FDSALAPIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNA 180
           FDSALAPIWV +ILVE K DPKLA KFFKWA SQ+GFRHTTE+YCIVVH+LFRARMY NA
Sbjct: 121 FDSALAPIWVSKILVEFKEDPKLAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANA 180

Query: 181 HDIIKEVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECF 240
           HDIIKEVILK++ DL+LPVC IFD+LWSTRNI V G+GVFDVLFSVL+EL LLEEANECF
Sbjct: 181 HDIIKEVILKSQNDLVLPVCKIFDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECF 240

Query: 241 SRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKE 300
            RMRKFRTLPKARSCNFLLHRLSKSG GQLVRKFF+DMIGAGI PSVFTYNVMID+LCKE
Sbjct: 241 LRMRKFRTLPKARSCNFLLHRLSKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKE 300

Query: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYG--------------------------- 360
           GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYG                           
Sbjct: 301 GDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITY 360

Query: 361 --------KFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRR 420
                   KFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCK+G+MQGAIKLFVDMRR
Sbjct: 361 NALINCFCKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRR 420

Query: 421 VGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVE 480
           VGL+PNEFTYTSLIDANCKAGNLTEAWKLS+DMLQAGVNLNIVTYTALMDGLCEDGRM E
Sbjct: 421 VGLVPNEFTYTSLIDANCKAGNLTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTE 480

Query: 481 AEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILW 540
           AEEV+RAMLKDGISPNQQVYTALVHGYIKAER+ DAMEILKQMTEC+IKPDLILYGTI+W
Sbjct: 481 AEEVYRAMLKDGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIW 540

Query: 541 GLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEA 600
           GLC+Q+KLEETKLIIKEMKSRGI+ANPVIYTTIIDAYFKAG+SSDA+NLLQEMQD G+EA
Sbjct: 541 GLCSQNKLEETKLIIKEMKSRGINANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEA 600

Query: 601 TVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDGLCKINCIESAKKLF 660
           TVVTYCVLIDGLCKTG VELAVDYFGRMS FGLQPNVAVYTALIDGLCK NC+ESAKKLF
Sbjct: 601 TVVTYCVLIDGLCKTGKVELAVDYFGRMSAFGLQPNVAVYTALIDGLCKTNCVESAKKLF 660

Query: 661 DEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQC 720
           DEMQCRGM PDK AFTALIDGNLKLGNLQEALNL S+MTELAI+FDLHAYTSLVSGFSQC
Sbjct: 661 DEMQCRGMAPDKTAFTALIDGNLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQC 720

Query: 721 GELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEKCSH 751
           GELHQARKFFDEM+EKGILPEEILCICLLREYYKLGQLDEAIELK+EMQRRGLI EKCSH
Sbjct: 721 GELHQARKFFDEMVEKGILPEEILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSH 780

BLAST of Tan0020971 vs. ExPASy TrEMBL
Match: A0A6J1FET4 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita moschata OX=3662 GN=LOC111444847 PE=4 SV=1)

HSP 1 Score: 1333.9 bits (3451), Expect = 0.0e+00
Identity = 668/781 (85.53%), Positives = 707/781 (90.52%), Query Frame = 0

Query: 6   EGLAFASLLSAMLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLTLINGC 65
           E   FASLLSAMLLFFR LF VSRRAS RVISLSS+SSHPGCLSF  FNASSSLT INGC
Sbjct: 1   EFFPFASLLSAMLLFFRGLFQVSRRASYRVISLSSNSSHPGCLSFNAFNASSSLTSINGC 60

Query: 66  YISCPFFWFTSFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTLFDSAL 125
           YISC   WF SFLCIFRLPFVSYSNTN+SFE LDIGSLRKIIQQDLWNDPKIV LFDSAL
Sbjct: 61  YISC--LWFASFLCIFRLPFVSYSNTNSSFESLDIGSLRKIIQQDLWNDPKIVILFDSAL 120

Query: 126 APIWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIK 185
           APIWV +ILVELK DPKLALKFFKWA SQIGF HTTESYCI+ HMLF ARMYTNAHDIIK
Sbjct: 121 APIWVSKILVELKEDPKLALKFFKWAGSQIGFCHTTESYCIIAHMLFCARMYTNAHDIIK 180

Query: 186 EVILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRK 245
           EVILK R D+I PVCNIFDMLWSTRN+CVSG+GVFD+LFSVL+EL LLEEANECFSRMRK
Sbjct: 181 EVILKCRIDMIFPVCNIFDMLWSTRNVCVSGTGVFDILFSVLVELGLLEEANECFSRMRK 240

Query: 246 FRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKEGDLEN 305
           FRTLPKARSCNFLLHRLSKSGNGQLV+ FFNDMIGAGI PSVFTYNVMID+LCKEGDLE+
Sbjct: 241 FRTLPKARSCNFLLHRLSKSGNGQLVKNFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLES 300

Query: 306 ARRLFVQMRQMGFSPDVVTYNSLIDGYG-------------------------------- 365
           ARRLFVQMRQMGFSPDVVTYNSLIDGYG                                
Sbjct: 301 ARRLFVQMRQMGFSPDVVTYNSLIDGYGKVGLLEESVYLFKEMKDVGCVPDVITYNALIN 360

Query: 366 ---KFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRRVGLLP 425
              KFEKMPRAFEYLSEMKN+GLKPNVVTYSTLIDAFCK+GMMQ AIKLFVDMRRVGLLP
Sbjct: 361 CFCKFEKMPRAFEYLSEMKNSGLKPNVVTYSTLIDAFCKEGMMQYAIKLFVDMRRVGLLP 420

Query: 426 NEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVEAEEVF 485
           NEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLN+V+YTALMDGLCEDGRM+EAEEVF
Sbjct: 421 NEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNVVSYTALMDGLCEDGRMMEAEEVF 480

Query: 486 RAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILWGLCNQ 545
           +AMLKDG+SPNQQVYTALVHGYIKAER+ DAMEILKQMTEC+IKPDLILYGTI+WGLC+Q
Sbjct: 481 KAMLKDGLSPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQ 540

Query: 546 SKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEATVVTY 605
           +KLEETKLIIKEMKS+GISANPVIYTTI+DAYFKAGKSSDA+NLL +MQD+GVEATVVTY
Sbjct: 541 NKLEETKLIIKEMKSQGISANPVIYTTIMDAYFKAGKSSDAINLLHKMQDMGVEATVVTY 600

Query: 606 CVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDGLCKINCIESAKKLFDEMQC 665
           CVLIDGLCKTGMVELAVDYFGRMSD GLQPNVAVYTALIDGLCK NCIESAKKLFDEMQ 
Sbjct: 601 CVLIDGLCKTGMVELAVDYFGRMSDLGLQPNVAVYTALIDGLCKTNCIESAKKLFDEMQY 660

Query: 666 RGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQCGELHQ 725
           RGMTPDK AFTALIDGNLKLGNLQEAL+LIS+MT+LAI+FDLHAYTS+VSGFSQCG+LHQ
Sbjct: 661 RGMTPDKTAFTALIDGNLKLGNLQEALDLISRMTDLAIEFDLHAYTSMVSGFSQCGDLHQ 720

Query: 726 ARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEKCSHAIPSL 752
           ARKFF+EMIEKGILPEEILC CLLREYYKLGQLDEAIELKNEM+RRGLI E CS  +PSL
Sbjct: 721 ARKFFNEMIEKGILPEEILCTCLLREYYKLGQLDEAIELKNEMRRRGLITENCSLEVPSL 779

BLAST of Tan0020971 vs. ExPASy TrEMBL
Match: A0A6J1K035 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489432 PE=4 SV=1)

HSP 1 Score: 1295.0 bits (3350), Expect = 0.0e+00
Identity = 646/761 (84.89%), Positives = 688/761 (90.41%), Query Frame = 0

Query: 17  MLLFFRNLFHVSRRASCRVISLSSSSSHPGCLSFKVFNASSSLTLINGCYISCPFFWFTS 76
           MLLFFR LF VSRRAS RVISLSS+SSHPGCLS   FNASSSLT ING YISC  FWFTS
Sbjct: 1   MLLFFRGLFQVSRRASYRVISLSSNSSHPGCLSSNAFNASSSLTSINGYYISC--FWFTS 60

Query: 77  FLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTLFDSALAPIWVPRILVE 136
           F+C+FRLPFVSYSNTN+SFE LDIG LRKIIQQDLWNDPKIV LFDSALAPIWV +ILVE
Sbjct: 61  FVCMFRLPFVSYSNTNSSFELLDIGYLRKIIQQDLWNDPKIVILFDSALAPIWVSKILVE 120

Query: 137 LKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIKEVILKNRADLI 196
           LK DPKLALKFFKWA SQIGF H TESYCI+ HMLF ARMYTNAHDIIKEVILK R D+I
Sbjct: 121 LKEDPKLALKFFKWAGSQIGFCHATESYCIIAHMLFCARMYTNAHDIIKEVILKCRIDMI 180

Query: 197 LPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRKFRTLPKARSCN 256
            PVCNIFDMLWSTRN+CVSG+GVFD+LFSVL+EL LLEEANECFSRMRKFRTLPKARSCN
Sbjct: 181 FPVCNIFDMLWSTRNVCVSGTGVFDILFSVLVELGLLEEANECFSRMRKFRTLPKARSCN 240

Query: 257 FLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKEGDLENARRLFVQMRQM 316
           FLLHRLSKSGNGQLV+KFFNDMIGAGI PSVFTYNVM+D+LCKEGDLENARRLFVQMRQM
Sbjct: 241 FLLHRLSKSGNGQLVKKFFNDMIGAGIAPSVFTYNVMVDYLCKEGDLENARRLFVQMRQM 300

Query: 317 GFSPDVVTYNSLIDGYG-----------------------------------KFEKMPRA 376
           GFSPDVVTYNSLIDGYG                                   KFEKMPRA
Sbjct: 301 GFSPDVVTYNSLIDGYGKVGLLEESVYLFKEMKDVGCVPDVITYNALINCFCKFEKMPRA 360

Query: 377 FEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDA 436
           FEYLSEMKN+GLKPNVVTYSTLIDAFCK GMMQ AIKLFVDMRRVGLLPNEFTYTSLIDA
Sbjct: 361 FEYLSEMKNSGLKPNVVTYSTLIDAFCKGGMMQYAIKLFVDMRRVGLLPNEFTYTSLIDA 420

Query: 437 NCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISPN 496
           NCKAGNLTEAWKLSNDMLQAGVNLN+V+YTALMDGLCEDGRM+EAEEVF+AMLKDG+SPN
Sbjct: 421 NCKAGNLTEAWKLSNDMLQAGVNLNVVSYTALMDGLCEDGRMMEAEEVFKAMLKDGLSPN 480

Query: 497 QQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILWGLCNQSKLEETKLIIK 556
           QQ+YTALVHGYIKAER+ DAMEILKQMTEC+IKPDLILYGT++WGLC+Q+KLEETKLIIK
Sbjct: 481 QQLYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTVIWGLCSQNKLEETKLIIK 540

Query: 557 EMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKTG 616
           EMKS+GISANPVIYTTI+DAYFKAGKSSDA+NLL +MQD+GVEATVVTYCVLIDGLCKTG
Sbjct: 541 EMKSQGISANPVIYTTIMDAYFKAGKSSDAINLLHKMQDMGVEATVVTYCVLIDGLCKTG 600

Query: 617 MVELAVDYFGRMSDFGLQPNVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKAAFT 676
           +VELA DYF RMSD GLQPNVAVYTALIDGLCK NCIESAKKLFDEMQ RGMTPDK AFT
Sbjct: 601 LVELAFDYFSRMSDLGLQPNVAVYTALIDGLCKTNCIESAKKLFDEMQYRGMTPDKTAFT 660

Query: 677 ALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEK 736
           ALIDGNLKLGNLQEAL+LIS+MT+LAI+FDLHAYTS+VSGFSQCG+LHQARKF +EMIEK
Sbjct: 661 ALIDGNLKLGNLQEALDLISRMTDLAIEFDLHAYTSMVSGFSQCGDLHQARKFLNEMIEK 720

Query: 737 GILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEK 743
           GILPEEILC CLLREYYKLGQLDEAIELKNEM+RRGLI E+
Sbjct: 721 GILPEEILCTCLLREYYKLGQLDEAIELKNEMRRRGLITEQ 759

BLAST of Tan0020971 vs. TAIR 10
Match: AT2G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 829.7 bits (2142), Expect = 1.9e-240
Identity = 412/771 (53.44%), Positives = 545/771 (70.69%), Query Frame = 0

Query: 17  MLLFFRNLFHVSRRASCRVISLSSSSSH-PGCLSFKVFNASSSLTLINGCYISCPFFWFT 76
           M    RN  HV+RR    V   SSS S     L F + + S S       +ISCPF WFT
Sbjct: 1   MFCSLRNFLHVNRRFPRHVSPSSSSLSQIQSPLCFPLSSPSPS----QSSFISCPFVWFT 60

Query: 77  SFLCIFRLPFVSYSNTNNSFECLDIGSLRKIIQQDLWNDPKIVTLFDSALAPIWVPRILV 136
           SFLCI R PFV+ S T+   E  D   +RK++  DLW+DP +  LFD  LAPIWVPR+LV
Sbjct: 61  SFLCIIRYPFVTKSGTSTYSEDFDRDWIRKVVHNDLWDDPGLEKLFDLTLAPIWVPRVLV 120

Query: 137 ELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIKEVILKNRADL 196
           ELK DPKLA KFFKW+ ++ GF+H+ ESYCIV H+LF ARMY +A+ ++KE++L ++AD 
Sbjct: 121 ELKEDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVL-SKAD- 180

Query: 197 ILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRKFRTLPKARSC 256
               C++FD+LWSTRN+CV G GVFD LFSVLI+L +LEEA +CFS+M++FR  PK RSC
Sbjct: 181 ----CDVFDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSC 240

Query: 257 NFLLHRLSKSGNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKEGDLENARRLFVQMRQ 316
           N LLHR +K G    V++FF DMIGAG  P+VFTYN+MID +CKEGD+E AR LF +M+ 
Sbjct: 241 NGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKF 300

Query: 317 MGFSPDVVTYNSLIDGYG-----------------------------------KFEKMPR 376
            G  PD VTYNS+IDG+G                                   KF K+P 
Sbjct: 301 RGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPI 360

Query: 377 AFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLID 436
             E+  EMK NGLKPNVV+YSTL+DAFCK+GMMQ AIK +VDMRRVGL+PNE+TYTSLID
Sbjct: 361 GLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLID 420

Query: 437 ANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISP 496
           ANCK GNL++A++L N+MLQ GV  N+VTYTAL+DGLC+  RM EAEE+F  M   G+ P
Sbjct: 421 ANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIP 480

Query: 497 NQQVYTALVHGYIKAERLGDAMEILKQMTECDIKPDLILYGTILWGLCNQSKLEETKLII 556
           N   Y AL+HG++KA+ +  A+E+L ++    IKPDL+LYGT +WGLC+  K+E  K+++
Sbjct: 481 NLASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVM 540

Query: 557 KEMKSRGISANPVIYTTIIDAYFKAGKSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKT 616
            EMK  GI AN +IYTT++DAYFK+G  ++ L+LL EM+++ +E TVVT+CVLIDGLCK 
Sbjct: 541 NEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLCKN 600

Query: 617 GMVELAVDYFGRMS-DFGLQPNVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKAA 676
            +V  AVDYF R+S DFGLQ N A++TA+IDGLCK N +E+A  LF++M  +G+ PD+ A
Sbjct: 601 KLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLCKDNQVEAATTLFEQMVQKGLVPDRTA 660

Query: 677 FTALIDGNLKLGNLQEALNLISKMTELAIDFDLHAYTSLVSGFSQCGELHQARKFFDEMI 736
           +T+L+DGN K GN+ EAL L  KM E+ +  DL AYTSLV G S C +L +AR F +EMI
Sbjct: 661 YTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCNQLQKARSFLEEMI 720

Query: 737 EKGILPEEILCICLLREYYKLGQLDEAIELKNEMQRRGLIAEKCSHAIPSL 751
            +GI P+E+LCI +L+++Y+LG +DEA+EL++ + +  L+     +A+P++
Sbjct: 721 GEGIHPDEVLCISVLKKHYELGCIDEAVELQSYLMKHQLLTSDNDNALPNM 761

BLAST of Tan0020971 vs. TAIR 10
Match: AT2G01740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 348.6 bits (893), Expect = 1.3e-95
Identity = 197/546 (36.08%), Positives = 299/546 (54.76%), Query Frame = 0

Query: 232 LLEEANECFSRMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIT------- 291
           ++ EA +  SR+RK   LP   +CN  +H+L  S  G L  KF   ++  G T       
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 292 ----------------------------PSVFTYNVMIDHLCKEGDLENARRLFVQMR-Q 351
                                       P V +YN +ID  C+ GD+ +A  +   +R  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 352 MGF--SPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDGMM 411
            GF   PD+V++NSL +G+ K + +   F Y+  M      PNVVTYST ID FCK G +
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVML-KCCSPNVVTYSTWIDTFCKSGEL 180

Query: 412 QGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTAL 471
           Q A+K F  M+R  L PN  T+T LID  CKAG+L  A  L  +M +  ++LN+VTYTAL
Sbjct: 181 QLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTAL 240

Query: 472 MDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTECDI 531
           +DG C+ G M  AEE++  M++D + PN  VYT ++ G+ +     +AM+ L +M    +
Sbjct: 241 IDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGM 300

Query: 532 KPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDALN 591
           + D+  YG I+ GLC   KL+E   I+++M+   +  + VI+TT+++AYFK+G+   A+N
Sbjct: 301 RLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVN 360

Query: 592 LLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDGLC 651
           +  ++ + G E  VV    +IDG+ K G +  A+ YF        + N  +YT LID LC
Sbjct: 361 MYHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYFCIE-----KANDVMYTVLIDALC 420

Query: 652 KINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTELAIDFDLH 711
           K       ++LF ++   G+ PDK  +T+ I G  K GNL +A  L ++M +  +  DL 
Sbjct: 421 KEGDFIEVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLL 480

Query: 712 AYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEAIELKNEM 740
           AYT+L+ G +  G + +AR+ FDEM+  GI P+  +   L+R Y K G +  A +L  +M
Sbjct: 481 AYTTLIYGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDM 540

BLAST of Tan0020971 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 340.5 bits (872), Expect = 3.4e-93
Identity = 189/616 (30.68%), Positives = 317/616 (51.46%), Query Frame = 0

Query: 128 IWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIKEV 187
           IWV   L+++K D +L L FF WA S+       ES CIV+H+   ++    A  +I   
Sbjct: 91  IWV---LMKIKCDYRLVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSF 150

Query: 188 ILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRKFR 247
             + + ++       FD+L  T     S   VFDV F VL++  LL EA   F +M  + 
Sbjct: 151 WERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYG 210

Query: 248 TLPKARSCNFLLHRLSKSGNGQLVRKF-FNDMIGAGITPSVFTYNVMIDHLCKEGDLENA 307
            +    SCN  L RLSK           F +    G+  +V +YN++I  +C+ G ++ A
Sbjct: 211 LVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEA 270

Query: 308 RRLFVQMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDA 367
             L + M   G++PDV++Y+++++GY +F ++ + ++ +  MK  GLKPN   Y ++I  
Sbjct: 271 HHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGL 330

Query: 368 FCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLN 427
            C+   +  A + F +M R G+LP+   YT+LID  CK G++  A K   +M    +  +
Sbjct: 331 LCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPD 390

Query: 428 IVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILK 487
           ++TYTA++ G C+ G MVEA ++F  M   G+ P+   +T L++GY KA  + DA  +  
Sbjct: 391 VLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHN 450

Query: 488 QMTECDIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAG 547
            M +    P+++ Y T++ GLC +  L+    ++ EM   G+  N   Y +I++   K+G
Sbjct: 451 HMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSG 510

Query: 548 KSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYT 607
              +A+ L+ E +  G+ A  VTY  L+D  CK+G ++ A +    M   GLQP +  + 
Sbjct: 511 NIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFN 570

Query: 608 ALIDGLCKINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTEL 667
            L++G C    +E  +KL + M  +G+ P+   F +L+       NL+ A  +   M   
Sbjct: 571 VLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSR 630

Query: 668 AIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEA 727
            +  D   Y +LV G  +   + +A   F EM  KG          L++ + K  +  EA
Sbjct: 631 GVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEA 690

Query: 728 IELKNEMQRRGLIAEK 743
            E+ ++M+R GL A+K
Sbjct: 691 REVFDQMRREGLAADK 701

BLAST of Tan0020971 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 340.5 bits (872), Expect = 3.4e-93
Identity = 189/616 (30.68%), Positives = 317/616 (51.46%), Query Frame = 0

Query: 128 IWVPRILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIKEV 187
           IWV   L+++K D +L L FF WA S+       ES CIV+H+   ++    A  +I   
Sbjct: 91  IWV---LMKIKCDYRLVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSF 150

Query: 188 ILKNRADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRKFR 247
             + + ++       FD+L  T     S   VFDV F VL++  LL EA   F +M  + 
Sbjct: 151 WERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYG 210

Query: 248 TLPKARSCNFLLHRLSKSGNGQLVRKF-FNDMIGAGITPSVFTYNVMIDHLCKEGDLENA 307
            +    SCN  L RLSK           F +    G+  +V +YN++I  +C+ G ++ A
Sbjct: 211 LVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEA 270

Query: 308 RRLFVQMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDA 367
             L + M   G++PDV++Y+++++GY +F ++ + ++ +  MK  GLKPN   Y ++I  
Sbjct: 271 HHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGL 330

Query: 368 FCKDGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLN 427
            C+   +  A + F +M R G+LP+   YT+LID  CK G++  A K   +M    +  +
Sbjct: 331 LCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPD 390

Query: 428 IVTYTALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILK 487
           ++TYTA++ G C+ G MVEA ++F  M   G+ P+   +T L++GY KA  + DA  +  
Sbjct: 391 VLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHN 450

Query: 488 QMTECDIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAG 547
            M +    P+++ Y T++ GLC +  L+    ++ EM   G+  N   Y +I++   K+G
Sbjct: 451 HMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSG 510

Query: 548 KSSDALNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYT 607
              +A+ L+ E +  G+ A  VTY  L+D  CK+G ++ A +    M   GLQP +  + 
Sbjct: 511 NIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFN 570

Query: 608 ALIDGLCKINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISKMTEL 667
            L++G C    +E  +KL + M  +G+ P+   F +L+       NL+ A  +   M   
Sbjct: 571 VLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSR 630

Query: 668 AIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLREYYKLGQLDEA 727
            +  D   Y +LV G  +   + +A   F EM  KG          L++ + K  +  EA
Sbjct: 631 GVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEA 690

Query: 728 IELKNEMQRRGLIAEK 743
            E+ ++M+R GL A+K
Sbjct: 691 REVFDQMRREGLAADK 701

BLAST of Tan0020971 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 332.0 bits (850), Expect = 1.2e-90
Identity = 185/623 (29.70%), Positives = 323/623 (51.85%), Query Frame = 0

Query: 133 ILVELKNDPKLALKFFKWAESQIGFRHTTESYCIVVHMLFRARMYTNAHDIIKEVILKNR 192
           +L++ +ND  L LKF  WA     F  T    CI +H+L + ++Y  A  + ++V  K  
Sbjct: 54  LLLKSQNDQALILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTAQILAEDVAAKTL 113

Query: 193 ADLILPVCNIFDMLWSTRNICVSGSGVFDVLFSVLIELDLLEEANECFSRMRKFRTLPKA 252
            D    +  +F  L  T ++C S S VFD++      L L+++A       +    +P  
Sbjct: 114 DDEYASL--VFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGV 173

Query: 253 RSCNFLLHRLSKS-GNGQLVRKFFNDMIGAGITPSVFTYNVMIDHLCKEGDLENARRLFV 312
            S N +L    +S  N       F +M+ + ++P+VFTYN++I   C  G+++ A  LF 
Sbjct: 174 LSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFD 233

Query: 313 QMRQMGFSPDVVTYNSLIDGYGKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKDG 372
           +M   G  P+VVTYN+LIDGY K  K+   F+ L  M   GL+PN+++Y+ +I+  C++G
Sbjct: 234 KMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREG 293

Query: 373 MMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYT 432
            M+    +  +M R G   +E TY +LI   CK GN  +A  +  +ML+ G+  +++TYT
Sbjct: 294 RMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYT 353

Query: 433 ALMDGLCEDGRMVEAEEVFRAMLKDGISPNQQVYTALVHGYIKAERLGDAMEILKQMTEC 492
           +L+  +C+ G M  A E    M   G+ PN++ YT LV G+ +   + +A  +L++M + 
Sbjct: 354 SLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDN 413

Query: 493 DIKPDLILYGTILWGLCNQSKLEETKLIIKEMKSRGISANPVIYTTIIDAYFKAGKSSDA 552
              P ++ Y  ++ G C   K+E+   ++++MK +G+S + V Y+T++  + ++    +A
Sbjct: 414 GFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEA 473

Query: 553 LNLLQEMQDVGVEATVVTYCVLIDGLCKTGMVELAVDYFGRMSDFGLQPNVAVYTALIDG 612
           L + +EM + G++   +TY  LI G C+    + A D +  M   GL P+   YTALI+ 
Sbjct: 474 LRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINA 533

Query: 613 LCKINCIESAKKLFDEMQCRGMTPDKAAFTALIDGNLKLGNLQEALNLISK--------- 672
            C    +E A +L +EM  +G+ PD   ++ LI+G  K    +EA  L+ K         
Sbjct: 534 YCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPS 593

Query: 673 ------MTELAIDFDLHAYTSLVSGFSQCGELHQARKFFDEMIEKGILPEEILCICLLRE 732
                 + E   + +  +  SL+ GF   G + +A + F+ M+ K   P+      ++  
Sbjct: 594 DVTYHTLIENCSNIEFKSVVSLIKGFCMKGMMTEADQVFESMLGKNHKPDGTAYNIMIHG 653

Query: 733 YYKLGQLDEAIELKNEMQRRGLI 740
           + + G + +A  L  EM + G +
Sbjct: 654 HCRAGDIRKAYTLYKEMVKSGFL 672
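
The percentages in each HSP header above are simply the identical (or positive) column count divided by the alignment length. The following is a minimal sketch of that arithmetic, not part of the original record; the names `parse_hsp_stats` and `percent` are hypothetical helpers introduced for illustration, and the regular expression assumes the header layout shown on this page (it also tolerates the "Postives" spelling that some exports of this page use).

```python
import re

# Pattern for the HSP statistics line as it appears on this page.
# "Posi?tives" accepts both "Positives" and the misspelt "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, ident_pct, positive, _, pos_pct = match.groups()
    return {
        "identical": int(identical),
        "positive": int(positive),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    header = "Identity = 185/623 (29.70%), Positives = 323/623 (51.85%), Query Frame = 0"
    stats = parse_hsp_stats(header)
    # Recompute the percentages from the raw counts and compare them
    # with the values reported in the header.
    print(percent(stats["identical"], stats["length"]), stats["identity_pct"])
    print(percent(stats["positive"], stats["length"]), stats["positive_pct"])
```

Recomputing the values this way is a quick consistency check when alignment headers are copied between tools or edited by hand.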

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
P0C894          2.6e-239  53.44     Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th...
Q9ZUA2          1.8e-94   36.08     Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX...
Q0WVK7          4.8e-92   30.68     Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
Q9FIX3          1.7e-89   29.70     Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9LFC5          1.6e-82   28.40     Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
XP_022959718.1  0.0e+00   90.41     putative pentatricopeptide repeat-containing protein At2g02150 isoform X3 [Cucur...
XP_023534850.1  0.0e+00   90.01     putative pentatricopeptide repeat-containing protein At2g02150 isoform X3 [Cucur...
KAG6601913.1    0.0e+00   86.77     putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros...
XP_022959692.1  0.0e+00   86.39     putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucur...
XP_023534824.1  0.0e+00   86.01     putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucur...

Match Name      E-value   Identity  Description
A0A6J1H6R2      0.0e+00   90.41     putative pentatricopeptide repeat-containing protein At2g02150 isoform X3 OS=Cuc...
A0A6J1H589      0.0e+00   86.39     putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cuc...
A0A6J1CX77      0.0e+00   85.35     putative pentatricopeptide repeat-containing protein At2g02150 OS=Momordica char...
A0A6J1FET4      0.0e+00   85.53     putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita mosc...
A0A6J1K035      0.0e+00   84.89     putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cuc...

Match Name      E-value   Identity  Description
AT2G02150.1     1.9e-240  53.44     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G01740.1     1.3e-95   36.08     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G05670.1     3.4e-93   30.68     Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.2     3.4e-93   30.68     Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G39710.1     1.2e-90   29.70     Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment

IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 393..427  e-value: 3.1E-9   score: 34.4
    coord: 709..740  e-value: 4.0E-7   score: 27.8
    coord: 745..773  e-value: 0.0021   score: 16.1
    coord: 288..322  e-value: 2.0E-10  score: 38.2
    coord: 603..637  e-value: 1.3E-9   score: 35.6
    coord: 254..286  e-value: 0.0022   score: 16.1
    coord: 358..392  e-value: 8.0E-7   score: 26.9
    coord: 323..357  e-value: 4.9E-12  score: 43.2
    coord: 639..671  e-value: 3.9E-9   score: 34.1
    coord: 499..531  e-value: 8.8E-7   score: 26.7
    coord: 463..496  e-value: 1.7E-10  score: 38.4
    coord: 428..461  e-value: 1.6E-5   score: 22.8
    coord: 568..601  e-value: 3.8E-7   score: 27.9

IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 422..453  e-value: 4.6E-9   score: 35.9

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 747..773  e-value: 0.0031   score: 17.7
    coord: 534..563  e-value: 0.058    score: 13.7
    coord: 709..738  e-value: 7.8E-7   score: 28.9
    coord: 254..283  e-value: 0.51     score: 10.7

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 355..404  e-value: 1.3E-18  score: 66.9
    coord: 461..506  e-value: 6.2E-12  score: 45.5
    coord: 285..325  e-value: 4.2E-13  score: 49.3
    coord: 566..614  e-value: 4.2E-15  score: 55.7
    coord: 635..681  e-value: 2.3E-13  score: 50.1

IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 251..285  score: 8.769097
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 601..635  score: 12.090371
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 461..495  score: 13.701682
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 706..740  score: 12.945353
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 636..670  score: 12.210946
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 566..600  score: 11.476539
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 496..530  score: 11.081932
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 391..425  score: 13.054966
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 321..355  score: 14.140135
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 741..775  score: 9.152743
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 356..390  score: 12.41921
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 286..320  score: 13.778412
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 426..460  score: 11.32308
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 531..565  score: 9.580234

IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 551..700  e-value: 5.9E-43  score: 149.4

IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 130..278  e-value: 1.8E-15  score: 58.9
    coord: 279..346  e-value: 7.1E-24  score: 86.4
    coord: 701..771  e-value: 1.6E-13  score: 52.6

IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 442..550  e-value: 3.1E-32  score: 113.4
    coord: 347..441  e-value: 4.3E-34  score: 119.4

None  No IPR available  PANTHER      PTHR45613        PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN  coord: 130..778
None  No IPR available  PANTHER      PTHR45613:SF358  OS06G0565000 PROTEIN                         coord: 130..778
None  No IPR available  SUPERFAMILY  81901            HCP-like                                     coord: 500..765
None  No IPR available  SUPERFAMILY  81901            HCP-like                                     coord: 273..536
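
The PROSITE PS51375 coordinates listed above can be collapsed into contiguous blocks to see how much of the protein the PPR profile covers. The sketch below is illustrative only and is not part of the original annotation pipeline; `merge_intervals` and `covered_length` are hypothetical helper names, coordinates are assumed to be 1-based and inclusive, and two matches are treated as adjacent when one starts on the residue right after the other ends.

```python
# PROSITE PS51375 (PPR) match coordinates, copied from the InterPro table
# above in the order they are listed there.
PROSITE_PPR = [
    (251, 285), (601, 635), (461, 495), (706, 740), (636, 670),
    (566, 600), (496, 530), (391, 425), (321, 355), (741, 775),
    (356, 390), (286, 320), (426, 460), (531, 565),
]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extends (or touches) the previous block: grow it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


def covered_length(intervals):
    """Number of residues covered by at least one interval."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


if __name__ == "__main__":
    print(merge_intervals(PROSITE_PPR))   # [(251, 670), (706, 775)]
    print(covered_length(PROSITE_PPR))    # 490
```

With these inputs the fourteen 35-residue matches form two blocks, 251..670 and 706..775, separated by the 671..705 stretch where no PS51375 match is reported.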

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0020971.1   Tan0020971.1   mRNA
Tan0020971.2   Tan0020971.2   mRNA
Tan0020971.3   Tan0020971.3   mRNA
Tan0020971.4   Tan0020971.4   mRNA
Tan0020971.5   Tan0020971.5   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0005515      protein binding