HG10004082 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10004082
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr08: 13482905 .. 13484589 (-)
RNA-Seq ExpressionHG10004082
SyntenyHG10004082
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCTACTTCTACCTTCTGTTACAGAAATGTTCTTCCTTCTCCCAAATCAAGCAACTCCAAGCGAACCTCATCATCAATGGCCATTTCCAATTCTCTCCCTCTCGCACTAAGCTTCTCGAGCTCTGCGCCATCTCCTCCTTCGGCGACCTTTCTTATGCCCTCCATATTTTCCGCCATATCCGGTACCCTTCGACCAACTATTGGAACGCCGTCATTCGCGGCACCGCCCTGAGCTCCGATCCCGCAAATGCCGTTATCTGGTACAGGGAAATGGCTGCGTCAAATGGGCCTCACAGAATTGACGCTCTCACATGCTCTTTTGCCCTCAAAGCCTGTGCCCGCGCCTTGGCTCGTTTTGAAGCGATGCAATTACATTCGCAGCTTTTGCGATTTGGGCTCAATGCTGATGTTCTCCTACAGACTACATTGCTTGATGCGTATGCGAAAGTTGGGGATCTCGATCTTGCCCAGAAACTGTTCGACGAAATGGCAACCAGATATTGCTTCGTGGAACGCGTTGATTGCTGGGTTTGCTCAGGGAAGTCGACCAGGTGATGCTATAATGATGTTTAAGAGAATGAAGGTTGATGGAAATTTGAGACCCAATGAAGTAACTGTTCAAGGTGCTCTGTTGGCGTGTTCACAATTGGGTGCTTTGAAAGAAGGGGAAAATGTTCATAAACATATAGTAGAGGAGAATTTAGATATGAATATGCAGGTTTGTAATGTCGTTATTGATATGTATGCTAAATGTGGATCTGTGGATAAAGCTTATCAGGTGTTTGAGAACATGAGGTGTGACAAAAGTTTGATCACTTGGAATACAATGATAATGGCGTCTGCAATGCATGGTGATGGATACAAAGCGTTGGATCTTTTTGAAGAGTTGGGTCGATCTGGAATGTCCCCCGAGGCTCGTAGAGGATGGGCTGAAGCTGTTCAATTCAATGGCGCAAAGGAGGTTGGAGCCAAATATAAAGCATTATGGAGCCATGGTTGATTTGTTAGGTCGAGCAGGGCGTCTCAAAGAAGCTTATGAAATTGTAAATTCAATGCATTTCCCTAATATGGTACTCTGGCAAACATTGCTTGGTGCTTGCAGGACATATGGGAATGTAGAAATGGCAGAACTGGCATCAAGGAAGCTAGTAGAGATGGGATTTATTAACTGTGGTGATTTTGTTTTGTTATCGAATGTGTATGCCGCCCGCAAGAGATGGGATGATGTTGGGAGAGTTAGGGACGCCATGAGAAGAAGGGATGTGAAGAAGACACCAGGATTTAGTTACATAGAAGTAAAAGGTAAGATGCACAAATTTGTATATGGTGATCAAAGCCACTCGAGTTGCCGTGAGATCTATGCCAAGCTTGATGAGATCAAGTTCAGGATCAAAGCCTATGGATATGTAGCTGAAACTGGCAATGTATTGCATGATATTGGAGATGAGGACAAGGAGAATGCACTGTGTTATCACAGTGAGAAGCTCGCTGTGGCTTTTGGATTGACTTGTACTGAAGAAGGGACCCCAAATCAAGTGATTAAGAATTTAAGGATTTGTGGGGATTGTCATGTTGTTATTAAACTAATATCTAAGATTTATAATCGAGAAATCATTGTAAGAGACAGAACTCGATTTCATCGATTTAATGAAGGTTTGTGTTCTTGCAAAGATTATTGGTGA

mRNA sequence

ATGGCCTACTTCTACCTTCTGTTACAGAAATGTTCTTCCTTCTCCCAAATCAAGCAACTCCAAGCGAACCTCATCATCAATGGCCATTTCCAATTCTCTCCCTCTCGCACTAAGCTTCTCGAGCTCTGCGCCATCTCCTCCTTCGGCGACCTTTCTTATGCCCTCCATATTTTCCGCCATATCCGGTACCCTTCGACCAACTATTGGAACGCCGTCATTCGCGGCACCGCCCTGAGCTCCGATCCCGCAAATGCCGTTATCTGGTACAGGGAAATGGCTGCGTCAAATGGGCCTCACAGAATTGACGCTCTCACATGCTCTTTTGCCCTCAAAGCCTGTGCCCGCGCCTTGGCTCGTTTTGAAGCGATGCAATTACATTCGCAGCTTTTGCGATTTGGGCTCAATGCTGATGTTCTCCTACAGACTACATTGCTTGATGCGTATGCGAAAGTTGGGGATCTCGATCTTGCCCAGAAACTGTTCGACGAAATGGCAACCAGATATTGCTTCGTGGAACGCGTTGATTGCTGGAGTTGGGTCGATCTGGAATGTCCCCCGAGGCTCGTAGAGGATGGGCTGAAGCTGTTCAATTCAATGGCGCAAAGGAGGTTGGAGCCAAATATAAAGCATTATGGAGCCATGGTTGATTTGTTAGGTCGAGCAGGGCGTCTCAAAGAAGCTTATGAAATTGTAAATTCAATGCATTTCCCTAATATGGTACTCTGGCAAACATTGCTTGGTGCTTGCAGGACATATGGGAATGTAGAAATGGCAGAACTGGCATCAAGGAAGCTAGTAGAGATGGGATTTATTAACTGTGGTGATTTTGTTTTGTTATCGAATGTGTATGCCGCCCGCAAGAGATGGGATGATGTTGGGAGAGTTAGGGACGCCATGAGAAGAAGGGATGTGAAGAAGACACCAGGATTTAGTTACATAGAAGTAAAAGGTAAGATGCACAAATTTGTATATGGTGATCAAAGCCACTCGAGTTGCCGTGAGATCTATGCCAAGCTTGATGAGATCAAGTTCAGGATCAAAGCCTATGGATATGTAGCTGAAACTGGCAATGTATTGCATGATATTGGAGATGAGGACAAGGAGAATGCACTGTGTTATCACAGTGAGAAGCTCGCTGTGGCTTTTGGATTGACTTGTACTGAAGAAGGGACCCCAAATCAAGTGATTAAGAATTTAAGGATTTGTGGGGATTGTCATGTTGTTATTAAACTAATATCTAAGATTTATAATCGAGAAATCATTGTAAGAGACAGAACTCGATTTCATCGATTTAATGAAGGTTTGTGTTCTTGCAAAGATTATTGGTGA

Coding sequence (CDS)

ATGGCCTACTTCTACCTTCTGTTACAGAAATGTTCTTCCTTCTCCCAAATCAAGCAACTCCAAGCGAACCTCATCATCAATGGCCATTTCCAATTCTCTCCCTCTCGCACTAAGCTTCTCGAGCTCTGCGCCATCTCCTCCTTCGGCGACCTTTCTTATGCCCTCCATATTTTCCGCCATATCCGGTACCCTTCGACCAACTATTGGAACGCCGTCATTCGCGGCACCGCCCTGAGCTCCGATCCCGCAAATGCCGTTATCTGGTACAGGGAAATGGCTGCGTCAAATGGGCCTCACAGAATTGACGCTCTCACATGCTCTTTTGCCCTCAAAGCCTGTGCCCGCGCCTTGGCTCGTTTTGAAGCGATGCAATTACATTCGCAGCTTTTGCGATTTGGGCTCAATGCTGATGTTCTCCTACAGACTACATTGCTTGATGCGTATGCGAAAGTTGGGGATCTCGATCTTGCCCAGAAACTGTTCGACGAAATGGCAACCAGATATTGCTTCGTGGAACGCGTTGATTGCTGGAGTTGGGTCGATCTGGAATGTCCCCCGAGGCTCGTAGAGGATGGGCTGAAGCTGTTCAATTCAATGGCGCAAAGGAGGTTGGAGCCAAATATAAAGCATTATGGAGCCATGGTTGATTTGTTAGGTCGAGCAGGGCGTCTCAAAGAAGCTTATGAAATTGTAAATTCAATGCATTTCCCTAATATGGTACTCTGGCAAACATTGCTTGGTGCTTGCAGGACATATGGGAATGTAGAAATGGCAGAACTGGCATCAAGGAAGCTAGTAGAGATGGGATTTATTAACTGTGGTGATTTTGTTTTGTTATCGAATGTGTATGCCGCCCGCAAGAGATGGGATGATGTTGGGAGAGTTAGGGACGCCATGAGAAGAAGGGATGTGAAGAAGACACCAGGATTTAGTTACATAGAAGTAAAAGGTAAGATGCACAAATTTGTATATGGTGATCAAAGCCACTCGAGTTGCCGTGAGATCTATGCCAAGCTTGATGAGATCAAGTTCAGGATCAAAGCCTATGGATATGTAGCTGAAACTGGCAATGTATTGCATGATATTGGAGATGAGGACAAGGAGAATGCACTGTGTTATCACAGTGAGAAGCTCGCTGTGGCTTTTGGATTGACTTGTACTGAAGAAGGGACCCCAAATCAAGTGATTAAGAATTTAAGGATTTGTGGGGATTGTCATGTTGTTATTAAACTAATATCTAAGATTTATAATCGAGAAATCATTGTAAGAGACAGAACTCGATTTCATCGATTTAATGAAGGTTTGTGTTCTTGCAAAGATTATTGGTGA

Protein sequence

MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRHIRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEMATRYCFVERVDCWSWVDLECPPRLVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKEAYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYVAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTEEGTPNQVIKNLRICGDCHVVIKLISKIYNREIIVRDRTRFHRFNEGLCSCKDYW
Homology
BLAST of HG10004082 vs. NCBI nr
Match: XP_038885014.1 (pentatricopeptide repeat-containing protein At1g34160 isoform X2 [Benincasa hispida] >XP_038885015.1 pentatricopeptide repeat-containing protein At1g34160 isoform X2 [Benincasa hispida] >XP_038885016.1 pentatricopeptide repeat-containing protein At1g34160 isoform X2 [Benincasa hispida])

HSP 1 Score: 743.4 bits (1918), Expect = 1.1e-210
Identity = 398/576 (69.10%), Postives = 411/576 (71.35%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFSQIKQLQANLIING FQFS SRTKLLELCAISSFGDLSYA+HIFR+
Sbjct: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGDFQFSSSRTKLLELCAISSFGDLSYAIHIFRY 60

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           IRYPSTN WNAVIRGTALSSDPANAV  YR MAASNGPHRIDALTCSFALKACARALAR 
Sbjct: 61  IRYPSTNDWNAVIRGTALSSDPANAVFCYRAMAASNGPHRIDALTCSFALKACARALARS 120

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           EAMQLHSQLLRFG NADVLLQTTLLDAYAKVGDLDLAQKLFDEM                
Sbjct: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMPQPDIASWNALIAGFA 180

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E++D   
Sbjct: 181 QGSRPGDAIMLFKRMKENGNLRPNEVTVQGALLACSQLGALKEGENVHKYIVEEKLDMNV 240

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCEKSLITWNTMIMAFAMHGDGYKASDLFEKMVR 300

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                LVEDGLKLF+SM QR L PNIKHYGAMVDLLGRAGRLKE
Sbjct: 301 SGMSPDAVSYLSVLCACNHAGLVEDGLKLFDSMVQRGLAPNIKHYGAMVDLLGRAGRLKE 360

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AYEIVNSM FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFI+CGDFVLLSNVYA R
Sbjct: 361 AYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYATR 420

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 443
           +RWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSH S REIYAKLDEIKFRI
Sbjct: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHLSHREIYAKLDEIKFRI 480

BLAST of HG10004082 vs. NCBI nr
Match: XP_031743171.1 (pentatricopeptide repeat-containing protein At1g34160 [Cucumis sativus] >KGN46094.1 hypothetical protein Csa_004906 [Cucumis sativus])

HSP 1 Score: 726.1 bits (1873), Expect = 1.8e-205
Identity = 388/576 (67.36%), Postives = 408/576 (70.83%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFSQIKQLQANLIING F FS SRTKLLELCAISSFGDLSYALHIFR+
Sbjct: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGDFHFSSSRTKLLELCAISSFGDLSYALHIFRY 60

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           I YPSTN WNAVIRGTALSSDPANAV WYR MAASNG HRIDALTCSFALKACARALAR 
Sbjct: 61  IPYPSTNDWNAVIRGTALSSDPANAVFWYRAMAASNGLHRIDALTCSFALKACARALARS 120

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           EA+QLHSQLLRFG NADVLLQTTLLDAYAK+GDLDLAQKLFDEM                
Sbjct: 121 EAIQLHSQLLRFGFNADVLLQTTLLDAYAKIGDLDLAQKLFDEMPQPDIASWNALIAGFA 180

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E+++   
Sbjct: 181 QGSRPADAIMTFKRMKVDGNLRPNAVTVQGALLACSQLGALKEGESVHKYIVEEKLNSNV 240

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 241 QVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGHKALDLFEKLGR 300

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                LVEDGLKLFNSM QR LEPNIKHYG+MVDLLGRAGRLKE
Sbjct: 301 SGMSPDAVSYLAVLCACNHAGLVEDGLKLFNSMTQRGLEPNIKHYGSMVDLLGRAGRLKE 360

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AY+IV+S+ FPNMVLWQTLLGACRTYG+VEMAELASRKLVEMGFI+CGDFVLLSNVYAAR
Sbjct: 361 AYDIVSSLPFPNMVLWQTLLGACRTYGDVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 443
           +RWDDVGRVRDAMRRRDVKKTPGFSYIE+KGKM+KFV GDQSHSSCREIYAKLDEI  RI
Sbjct: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEIKGKMYKFVNGDQSHSSCREIYAKLDEINLRI 480

BLAST of HG10004082 vs. NCBI nr
Match: XP_008456696.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g34160 [Cucumis melo])

HSP 1 Score: 724.2 bits (1868), Expect = 7.0e-205
Identity = 385/576 (66.84%), Postives = 406/576 (70.49%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFS IKQLQANLIING F FS SRTKLLELCA+SS GDLSYALHIFR+
Sbjct: 1   MAYFNLLLQKCSSFSHIKQLQANLIINGDFHFSSSRTKLLELCAVSSCGDLSYALHIFRY 60

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           IRYPSTN WNA+IRGTALSSDPANAV+WYR MAASNGPHRIDALTCSFALKACARALA  
Sbjct: 61  IRYPSTNDWNAIIRGTALSSDPANAVVWYRAMAASNGPHRIDALTCSFALKACARALACS 120

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           EA+QLHSQLLRFG NADVLLQTTLLD YAKVGDLDLAQKLFDEM                
Sbjct: 121 EAIQLHSQLLRFGFNADVLLQTTLLDVYAKVGDLDLAQKLFDEMPRPDIASWNALISGFA 180

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E++D   
Sbjct: 181 QGSRPADAIMMFKRMKEGGNLRPNAVTVQGALLACSQLGTLKEGENVHKYIVEEKLDMNV 240

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 241 QVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYEALDLFKKLGR 300

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                LVEDGLKLFN MAQR LEPNIKHYG+MVDLLGRAGRLKE
Sbjct: 301 SGMSPDAVSYLAVLCACNHAGLVEDGLKLFNLMAQRGLEPNIKHYGSMVDLLGRAGRLKE 360

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AY+IVNS+ FPNMVLWQTLLGACRTYG+VEMAELAS KLVEMGFI+CGDFVLLSNVYAAR
Sbjct: 361 AYDIVNSLPFPNMVLWQTLLGACRTYGDVEMAELASGKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 443
           +RWDDVGRVRDAMR RDVKKTPGFSYIE+KGKM++FVYGDQSHSSCREIYAKLDEI  RI
Sbjct: 421 QRWDDVGRVRDAMRIRDVKKTPGFSYIEIKGKMYQFVYGDQSHSSCREIYAKLDEINLRI 480

BLAST of HG10004082 vs. NCBI nr
Match: XP_038885002.1 (pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885003.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885004.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885005.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885006.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885008.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885009.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885010.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885011.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885012.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida] >XP_038885013.1 pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hispida])

HSP 1 Score: 723.4 bits (1866), Expect = 1.2e-204
Identity = 391/571 (68.48%), Postives = 405/571 (70.93%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFSQIKQLQANLIING FQFS SRTKLLELCAISSFGDLSYA+HIFR+
Sbjct: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGDFQFSSSRTKLLELCAISSFGDLSYAIHIFRY 60

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           IRYPSTN WNAVIRGTALSSDPANAV  YR MAASNGPHRIDALTCSFALKACARALAR 
Sbjct: 61  IRYPSTNDWNAVIRGTALSSDPANAVFCYRAMAASNGPHRIDALTCSFALKACARALARS 120

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           EAMQLHSQLLRFG NADVLLQTTLLDAYAKVGDLDLAQKLFDEM                
Sbjct: 121 EAMQLHSQLLRFGFNADVLLQTTLLDAYAKVGDLDLAQKLFDEMPQPDIASWNALIAGFA 180

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E++D   
Sbjct: 181 QGSRPGDAIMLFKRMKENGNLRPNEVTVQGALLACSQLGALKEGENVHKYIVEEKLDMNV 240

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCEKSLITWNTMIMAFAMHGDGYKASDLFEKMVR 300

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                LVEDGLKLF+SM QR L PNIKHYGAMVDLLGRAGRLKE
Sbjct: 301 SGMSPDAVSYLSVLCACNHAGLVEDGLKLFDSMVQRGLAPNIKHYGAMVDLLGRAGRLKE 360

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AYEIVNSM FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFI+CGDFVLLSNVYA R
Sbjct: 361 AYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYATR 420

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 438
           +RWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSH S REIYAKLDEIKFRI
Sbjct: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHLSHREIYAKLDEIKFRI 480

BLAST of HG10004082 vs. NCBI nr
Match: XP_022133715.1 (pentatricopeptide repeat-containing protein At1g34160 [Momordica charantia])

HSP 1 Score: 703.4 bits (1814), Expect = 1.3e-198
Identity = 376/576 (65.28%), Postives = 397/576 (68.92%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFSQIKQLQANLI NG FQ S SRTKLLELCAIS FGDL +A+ IFRH
Sbjct: 18  MAYFDLLLQKCSSFSQIKQLQANLITNGRFQLSSSRTKLLELCAISPFGDLPHAIRIFRH 77

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           IR P TN WNAVIRGTALSSDPANAV+WYR MAAS GPHR+DALTCSF LKACARALAR 
Sbjct: 78  IRAPPTNDWNAVIRGTALSSDPANAVLWYRAMAASIGPHRVDALTCSFTLKACARALARS 137

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           EAMQLHSQLLRFG +AD+LLQTTLLDAYAKVGDLD AQKLFDE+                
Sbjct: 138 EAMQLHSQLLRFGFDADILLQTTLLDAYAKVGDLDRAQKLFDEIPQPDIASWNALIAGFA 197

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E +D   
Sbjct: 198 QGSRPGDAIALFKRMKEDGYLRPNEVTVQGALLACSQLGALKEGEEVHKYIIEENLDMNV 257

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 258 QVCNVVIDMYAKCGSVDKAYWVFQNMRCEKSLISWNTMIMAFAIHGHGYKALDLFEKLGL 317

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                LVEDG+KLFNSM +R L PNIKHYG++VDLLGRAGRLKE
Sbjct: 318 SGISPDAVSYLVVLCACNHGGLVEDGVKLFNSMGKRGLAPNIKHYGSVVDLLGRAGRLKE 377

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AYEIVNSM FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFI+CGDFVLLSNVYAA 
Sbjct: 378 AYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAC 437

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 443
           +RWDDVGR+RDAMRRRDVKKTPGFSY EVKGKMHKF YGDQ+HSSC EIYAKLDEIKFRI
Sbjct: 438 QRWDDVGRIRDAMRRRDVKKTPGFSYTEVKGKMHKFAYGDQNHSSCHEIYAKLDEIKFRI 497

BLAST of HG10004082 vs. ExPASy Swiss-Prot
Match: Q9FX24 (Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H68 PE=2 SV=2)

HSP 1 Score: 469.5 bits (1207), Expect = 4.1e-131
Identity = 277/577 (48.01%), Postives = 335/577 (58.06%), Query Frame = 0

Query: 3   YFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRHIR 62
           Y   ++QKC SFSQIKQLQ++ +  GHFQ S  R++LLE CAIS FGDLS+A+ IFR+I 
Sbjct: 5   YMETMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIP 64

Query: 63  YPSTNYWNAVIRGTALSSDPANAVIWYREM----AASNGPHRIDALTCSFALKACARALA 122
            P TN WNA+IRG A SS P+ A  WYR M    ++S+   R+DALTCSF LKACARAL 
Sbjct: 65  KPLTNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALC 124

Query: 123 RFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEMATR----------- 182
                QLH Q+ R GL+AD LL TTLLDAY+K GDL  A KLFDEM  R           
Sbjct: 125 SSAMDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAG 184

Query: 183 ------------------------------------------------------------ 242
                                                                       
Sbjct: 185 LVSGNRASEAMELYKRMETEGIRRSEVTVVAALGACSHLGDVKEGENIFHGYSNDNVIVS 244

Query: 243 --------YC-FVER-------------VDCWSWV---------------------DLEC 302
                    C FV++             V  W+ +                     D   
Sbjct: 245 NAAIDMYSKCGFVDKAYQVFEQFTGKKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGI 304

Query: 303 PP---------------RLVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKEAYE 362
            P                LVE GL +FN+MA + +E N+KHYG +VDLL RAGRL+EA++
Sbjct: 305 KPDDVSYLAALTACRHAGLVEYGLSVFNNMACKGVERNMKHYGCVVDLLSRAGRLREAHD 364

Query: 363 IVNSMH-FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKR 422
           I+ SM   P+ VLWQ+LLGA   Y +VEMAE+ASR++ EMG  N GDFVLLSNVYAA+ R
Sbjct: 365 IICSMSMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEMGVNNDGDFVLLSNVYAAQGR 424

Query: 423 WDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKA 443
           W DVGRVRD M  + VKK PG SYIE KG +H+F   D+SH   REIY K+DEI+F+I+ 
Sbjct: 425 WKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHEQWREIYEKIDEIRFKIRE 484

BLAST of HG10004082 vs. ExPASy Swiss-Prot
Match: B8YEK4 (Pentatricopeptide repeat-containing protein OGR1, mitochondrial OS=Oryza sativa subsp. japonica OX=39947 GN=OGR1 PE=2 SV=1)

HSP 1 Score: 379.8 bits (974), Expect = 4.2e-104
Identity = 218/474 (45.99%), Postives = 284/474 (59.92%), Query Frame = 0

Query: 10  KCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRHIRYPSTNYW 69
           +CS      QL A ++  G        T LL+  + +  GDL+ A  +F  +       W
Sbjct: 123 RCSDAHTTVQLHALVLRLGVAADVRLLTTLLD--SYAKCGDLASARKVFDEMTVRDVATW 182

Query: 70  NAVIRGTALSSDPANAVIWYREMAAS--NGPHRID--ALTCSFALKACARALARFEAMQL 129
           N+++ G A  ++P  A+  +  +A S    P R +   +T   AL ACA+     + M +
Sbjct: 183 NSLLAGLAQGTEPNLALALFHRLANSFQELPSREEPNEVTIVAALSACAQIGLLKDGMYV 242

Query: 130 HSQLLRFGLNADVLLQTTLLDAYAKVGDLDL----------------------------- 189
           H    RFGL+ +V +  +L+D Y+K G L                               
Sbjct: 243 HEFAKRFGLDRNVRVCNSLIDMYSKCGSLSRALDVFHSIKPEDQTLVSYNAAIQAHSMHG 302

Query: 190 ----AQKLFDEMATRYCFVERVDCWSWVDLEC---PPRLVEDGLKLFNSMAQRRLEPNIK 249
               A +LFDEM TR       D  +++ + C      LV+DGL++FNSM   R+ PN+K
Sbjct: 303 HGGDALRLFDEMPTRI----EPDGVTYLAVLCGCNHSGLVDDGLRVFNSM---RVAPNMK 362

Query: 250 HYGAMVDLLGRAGRLKEAYEIVNSMHFP-NMVLWQTLLGACRTYGNVEMAELASRKLVEM 309
           HYG +VDLLGRAGRL EAY+ V SM FP ++VLWQTLLGA + +G VE+AELA+ KL E+
Sbjct: 363 HYGTIVDLLGRAGRLTEAYDTVISMPFPADIVLWQTLLGAAKMHGVVELAELAANKLAEL 422

Query: 310 GFINCGDFVLLSNVYAARKRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQS 369
           G    GD+VLLSNVYA++ RW DVGRVRD MR  DV+K PGFSY E+ G MHKF+ GD+ 
Sbjct: 423 GSNVDGDYVLLSNVYASKARWMDVGRVRDTMRSNDVRKVPGFSYTEIDGVMHKFINGDKE 482

Query: 370 HSSCREIYAKLDEIKFRIKAYGYVAETGNVLHDIGDEDKENALCYHSEKLAVAFGLTCTE 429
           H   +EIY  L++I  RI   GY  ET NVLHDIG+E+K+ ALCYHSEKLA+AFGL  T 
Sbjct: 483 HPRWQEIYRALEDIVSRISELGYEPETSNVLHDIGEEEKQYALCYHSEKLAIAFGLIATP 542

Query: 430 EGTPNQVIKNLRICGDCHVVIKLISKIYNREIIVRDRTRFHRFNEGLCSCKDYW 443
            G   +VIKNLRICGDCHVV KLISK Y R I++RDR RFHRF +G CSC+DYW
Sbjct: 543 PGETLRVIKNLRICGDCHVVAKLISKAYGRVIVIRDRARFHRFEDGQCSCRDYW 587

BLAST of HG10004082 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 329.3 bits (843), Expect = 6.5e-89
Identity = 175/441 (39.68%), Postives = 263/441 (59.64%), Query Frame = 0

Query: 36  RTKLLELCAISSFGDLSYALHIFRHIRYPSTNYWNAVIRGTALSSDPANAVIWYREMAAS 95
           +  LL L A  + GD++ A  +F  +       WN+VI G A +  P  A+  Y EM + 
Sbjct: 159 QNSLLHLYA--NCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEMNSK 218

Query: 96  NGPHRIDALTCSFALKACARALARFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLD 155
               + D  T    L ACA+  A     ++H  +++ GL  ++     LLD YA+ G ++
Sbjct: 219 G--IKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVE 278

Query: 156 LAQKLFDEMATRYCF---------------VERVDCWSWVD-----LECPPR-------- 215
            A+ LFDEM  +                   E ++ + +++     L C           
Sbjct: 279 EAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYAC 338

Query: 216 ----LVEDGLKLFNSMAQR-RLEPNIKHYGAMVDLLGRAGRLKEAYEIVNSMHF-PNMVL 275
               +V++G + F  M +  ++EP I+H+G MVDLL RAG++K+AYE + SM   PN+V+
Sbjct: 339 SHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVI 398

Query: 276 WQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKRWDDVGRVRDAMRR 335
           W+TLLGAC  +G+ ++AE A  +++++   + GD+VLLSN+YA+ +RW DV ++R  M R
Sbjct: 399 WRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLR 458

Query: 336 RDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYVAETGNVLHD 395
             VKK PG S +EV  ++H+F+ GD+SH     IYAKL E+  R+++ GYV +  NV  D
Sbjct: 459 DGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQISNVYVD 518

Query: 396 IGDEDKENALCYHSEKLAVAFGLTCTEEGTPNQVIKNLRICGDCHVVIKLISKIYNREII 443
           + +E+KENA+ YHSEK+A+AF L  T E +P  V+KNLR+C DCH+ IKL+SK+YNREI+
Sbjct: 519 VEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNREIV 578

BLAST of HG10004082 vs. ExPASy Swiss-Prot
Match: Q9LW32 (Pentatricopeptide repeat-containing protein At3g26782, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H34 PE=2 SV=1)

HSP 1 Score: 296.2 bits (757), Expect = 6.1e-79
Identity = 167/442 (37.78%), Postives = 247/442 (55.88%), Query Frame = 0

Query: 39  LLELCAISSFGDLSYALHIFRHIRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGP 98
           LL+  A    G ++ A  IF  I       +N+++   A S     A   +R +   N  
Sbjct: 224 LLDAYAKGGEGGVAVARKIFDQIVDKDRVSYNSIMSVYAQSGMSNEAFEVFRRL-VKNKV 283

Query: 99  HRIDALTCSFALKACARALARFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQ 158
              +A+T S  L A + + A      +H Q++R GL  DV++ T+++D Y K G ++ A+
Sbjct: 284 VTFNAITLSTVLLAVSHSGALRIGKCIHDQVIRMGLEDDVIVGTSIIDMYCKCGRVETAR 343

Query: 159 KLFDEMATRYCFVERVDCWSWV------------DLECPPRLVEDGL------------- 218
           K FD M  +      V  W+ +             LE  P +++ G+             
Sbjct: 344 KAFDRMKNK-----NVRSWTAMIAGYGMHGHAAKALELFPAMIDSGVRPNYITFVSVLAA 403

Query: 219 -----------KLFNSMAQR-RLEPNIKHYGAMVDLLGRAGRLKEAYEIVNSMHF-PNMV 278
                      + FN+M  R  +EP ++HYG MVDLLGRAG L++AY+++  M   P+ +
Sbjct: 404 CSHAGLHVEGWRWFNAMKGRFGVEPGLEHYGCMVDLLGRAGFLQKAYDLIQRMKMKPDSI 463

Query: 279 LWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKRWDDVGRVRDAMR 338
           +W +LL ACR + NVE+AE++  +L E+   NCG ++LLS++YA   RW DV RVR  M+
Sbjct: 464 IWSSLLAACRIHKNVELAEISVARLFELDSSNCGYYMLLSHIYADAGRWKDVERVRMIMK 523

Query: 339 RRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYVAETGNVLH 398
            R + K PGFS +E+ G++H F+ GD+ H    +IY  L E+  ++   GYV+ T +V H
Sbjct: 524 NRGLVKPPGFSLLELNGEVHVFLIGDEEHPQREKIYEFLAELNRKLLEAGYVSNTSSVCH 583

Query: 399 DIGDEDKENALCYHSEKLAVAFGLTCTEEGTPNQVIKNLRICGDCHVVIKLISKIYNREI 443
           D+ +E+KE  L  HSEKLA+AFG+  T  G+   V+KNLR+C DCH VIKLISKI +RE 
Sbjct: 584 DVDEEEKEMTLRVHSEKLAIAFGIMNTVPGSTVNVVKNLRVCSDCHNVIKLISKIVDREF 643

BLAST of HG10004082 vs. ExPASy Swiss-Prot
Match: Q683I9 (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 1.8e-78
Identity = 162/437 (37.07%), Postives = 245/437 (56.06%), Query Frame = 0

Query: 44  AISSFGDLSYALHIFRHIRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPH---R 103
           A +  G +  A  +F  +   +   W+ +I G  +      A+  +REM          R
Sbjct: 137 AYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVR 196

Query: 104 IDALTCSFALKACARALARFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKL 163
            +  T S  L AC R  A  +   +H+ + ++ +  D++L T L+D YAK G L+ A+++
Sbjct: 197 PNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRV 256

Query: 164 FDEMATR---YCFVERVDCWSWVDL-----------------------------ECPPR- 223
           F+ + ++     +   + C +   L                              C  R 
Sbjct: 257 FNALGSKKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRG 316

Query: 224 LVEDGLKLFNSMAQR-RLEPNIKHYGAMVDLLGRAGRLKEAYEIVNSMHF-PNMVLWQTL 283
           L+ +G   F  M +   + P+I+HYG MVDL GR+G +KEA   + SM   P++++W +L
Sbjct: 317 LINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSL 376

Query: 284 LGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKRWDDVGRVRDAMRRRDVK 343
           L   R  G+++  E A ++L+E+  +N G +VLLSNVYA   RW +V  +R  M  + + 
Sbjct: 377 LSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGIN 436

Query: 344 KTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYVAETGNVLHDIGDE 403
           K PG SY+EV+G +H+FV GD+S      IYA LDEI  R++  GYV +T  VL D+ ++
Sbjct: 437 KVPGCSYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEK 496

Query: 404 DKENALCYHSEKLAVAFGLTCTEEGTPNQVIKNLRICGDCHVVIKLISKIYNREIIVRDR 443
           DKE AL YHSEKLA+AF L  T  GTP ++IKNLRICGDCH+V+K+ISK+++REI+VRD 
Sbjct: 497 DKEIALSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDC 556

BLAST of HG10004082 vs. ExPASy TrEMBL
Match: A0A0A0KB77 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G052690 PE=3 SV=1)

HSP 1 Score: 726.1 bits (1873), Expect = 8.9e-206
Identity = 388/576 (67.36%), Postives = 408/576 (70.83%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFSQIKQLQANLIING F FS SRTKLLELCAISSFGDLSYALHIFR+
Sbjct: 1   MAYFNLLLQKCSSFSQIKQLQANLIINGDFHFSSSRTKLLELCAISSFGDLSYALHIFRY 60

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           I YPSTN WNAVIRGTALSSDPANAV WYR MAASNG HRIDALTCSFALKACARALAR 
Sbjct: 61  IPYPSTNDWNAVIRGTALSSDPANAVFWYRAMAASNGLHRIDALTCSFALKACARALARS 120

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           EA+QLHSQLLRFG NADVLLQTTLLDAYAK+GDLDLAQKLFDEM                
Sbjct: 121 EAIQLHSQLLRFGFNADVLLQTTLLDAYAKIGDLDLAQKLFDEMPQPDIASWNALIAGFA 180

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E+++   
Sbjct: 181 QGSRPADAIMTFKRMKVDGNLRPNAVTVQGALLACSQLGALKEGESVHKYIVEEKLNSNV 240

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 241 QVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGHKALDLFEKLGR 300

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                LVEDGLKLFNSM QR LEPNIKHYG+MVDLLGRAGRLKE
Sbjct: 301 SGMSPDAVSYLAVLCACNHAGLVEDGLKLFNSMTQRGLEPNIKHYGSMVDLLGRAGRLKE 360

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AY+IV+S+ FPNMVLWQTLLGACRTYG+VEMAELASRKLVEMGFI+CGDFVLLSNVYAAR
Sbjct: 361 AYDIVSSLPFPNMVLWQTLLGACRTYGDVEMAELASRKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 443
           +RWDDVGRVRDAMRRRDVKKTPGFSYIE+KGKM+KFV GDQSHSSCREIYAKLDEI  RI
Sbjct: 421 QRWDDVGRVRDAMRRRDVKKTPGFSYIEIKGKMYKFVNGDQSHSSCREIYAKLDEINLRI 480

BLAST of HG10004082 vs. ExPASy TrEMBL
Match: A0A1S3C3U7 (pentatricopeptide repeat-containing protein At1g34160 OS=Cucumis melo OX=3656 GN=LOC103496566 PE=3 SV=1)

HSP 1 Score: 724.2 bits (1868), Expect = 3.4e-205
Identity = 385/576 (66.84%), Postives = 406/576 (70.49%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFS IKQLQANLIING F FS SRTKLLELCA+SS GDLSYALHIFR+
Sbjct: 1   MAYFNLLLQKCSSFSHIKQLQANLIINGDFHFSSSRTKLLELCAVSSCGDLSYALHIFRY 60

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           IRYPSTN WNA+IRGTALSSDPANAV+WYR MAASNGPHRIDALTCSFALKACARALA  
Sbjct: 61  IRYPSTNDWNAIIRGTALSSDPANAVVWYRAMAASNGPHRIDALTCSFALKACARALACS 120

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           EA+QLHSQLLRFG NADVLLQTTLLD YAKVGDLDLAQKLFDEM                
Sbjct: 121 EAIQLHSQLLRFGFNADVLLQTTLLDVYAKVGDLDLAQKLFDEMPRPDIASWNALISGFA 180

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E++D   
Sbjct: 181 QGSRPADAIMMFKRMKEGGNLRPNAVTVQGALLACSQLGTLKEGENVHKYIVEEKLDMNV 240

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 241 QVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLITWNTMIMAFAMHGDGYEALDLFKKLGR 300

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                LVEDGLKLFN MAQR LEPNIKHYG+MVDLLGRAGRLKE
Sbjct: 301 SGMSPDAVSYLAVLCACNHAGLVEDGLKLFNLMAQRGLEPNIKHYGSMVDLLGRAGRLKE 360

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AY+IVNS+ FPNMVLWQTLLGACRTYG+VEMAELAS KLVEMGFI+CGDFVLLSNVYAAR
Sbjct: 361 AYDIVNSLPFPNMVLWQTLLGACRTYGDVEMAELASGKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 443
           +RWDDVGRVRDAMR RDVKKTPGFSYIE+KGKM++FVYGDQSHSSCREIYAKLDEI  RI
Sbjct: 421 QRWDDVGRVRDAMRIRDVKKTPGFSYIEIKGKMYQFVYGDQSHSSCREIYAKLDEINLRI 480

BLAST of HG10004082 vs. ExPASy TrEMBL
Match: A0A6J1BW13 (pentatricopeptide repeat-containing protein At1g34160 OS=Momordica charantia OX=3673 GN=LOC111006231 PE=3 SV=1)

HSP 1 Score: 703.4 bits (1814), Expect = 6.2e-199
Identity = 376/576 (65.28%), Postives = 397/576 (68.92%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFSQIKQLQANLI NG FQ S SRTKLLELCAIS FGDL +A+ IFRH
Sbjct: 18  MAYFDLLLQKCSSFSQIKQLQANLITNGRFQLSSSRTKLLELCAISPFGDLPHAIRIFRH 77

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           IR P TN WNAVIRGTALSSDPANAV+WYR MAAS GPHR+DALTCSF LKACARALAR 
Sbjct: 78  IRAPPTNDWNAVIRGTALSSDPANAVLWYRAMAASIGPHRVDALTCSFTLKACARALARS 137

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           EAMQLHSQLLRFG +AD+LLQTTLLDAYAKVGDLD AQKLFDE+                
Sbjct: 138 EAMQLHSQLLRFGFDADILLQTTLLDAYAKVGDLDRAQKLFDEIPQPDIASWNALIAGFA 197

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E +D   
Sbjct: 198 QGSRPGDAIALFKRMKEDGYLRPNEVTVQGALLACSQLGALKEGEEVHKYIIEENLDMNV 257

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 258 QVCNVVIDMYAKCGSVDKAYWVFQNMRCEKSLISWNTMIMAFAIHGHGYKALDLFEKLGL 317

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                LVEDG+KLFNSM +R L PNIKHYG++VDLLGRAGRLKE
Sbjct: 318 SGISPDAVSYLVVLCACNHGGLVEDGVKLFNSMGKRGLAPNIKHYGSVVDLLGRAGRLKE 377

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AYEIVNSM FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFI+CGDFVLLSNVYAA 
Sbjct: 378 AYEIVNSMPFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFISCGDFVLLSNVYAAC 437

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 443
           +RWDDVGR+RDAMRRRDVKKTPGFSY EVKGKMHKF YGDQ+HSSC EIYAKLDEIKFRI
Sbjct: 438 QRWDDVGRIRDAMRRRDVKKTPGFSYTEVKGKMHKFAYGDQNHSSCHEIYAKLDEIKFRI 497

BLAST of HG10004082 vs. ExPASy TrEMBL
Match: A0A6J1H8R5 (pentatricopeptide repeat-containing protein At1g34160 OS=Cucurbita moschata OX=3662 GN=LOC111461576 PE=3 SV=1)

HSP 1 Score: 701.0 bits (1808), Expect = 3.1e-198
Identity = 370/576 (64.24%), Postives = 397/576 (68.92%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFSQIKQLQANLI NGHF FS SRTKLLELCAIS FGDLS+ALHIFRH
Sbjct: 1   MAYFDLLLQKCSSFSQIKQLQANLITNGHFHFSSSRTKLLELCAISPFGDLSHALHIFRH 60

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           I  PST  WNAVIRGTALSS+P+NA+ WYR M ASNGPHR+DALTCSFALKACARALAR 
Sbjct: 61  IHSPSTKDWNAVIRGTALSSNPSNAIFWYRTMTASNGPHRVDALTCSFALKACARALARS 120

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           E MQLHSQ+LRFG +ADVLLQTTLLDAYAKV DLD AQK+FDEM                
Sbjct: 121 EVMQLHSQVLRFGFDADVLLQTTLLDAYAKVEDLDQAQKVFDEMPEPDIASWNSLIAGFA 180

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E +D   
Sbjct: 181 QGGRPSDAIDLFKRMKEDGNLRPNEVTVQGALSACSQLGTLKEGENVRKYIAEENLDTVV 240

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 241 QVCNVVIDMYAKCGSVDKAYWVFENMRCEKSLITWNTMIMAFAMHGDGHKALDLFEKLGR 300

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                L+EDGLKLFNSM QR + PNIKHYG +VDLLGRAGRLKE
Sbjct: 301 SGIYPDAISYLAVLCACNHAGLIEDGLKLFNSMVQRGVAPNIKHYGVVVDLLGRAGRLKE 360

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AYEIV+SM FPNMVLWQTLLGACRTYG+V+MAE+ASRKLVEMGFI+CGDFVLLSNVYAAR
Sbjct: 361 AYEIVSSMPFPNMVLWQTLLGACRTYGDVKMAEMASRKLVEMGFISCGDFVLLSNVYAAR 420

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 443
           +RWDDVGRVRDAMRRRDVKKTPGFSYIEVKG MHKF+YGD+SHSSCREIYAKLDEI FRI
Sbjct: 421 RRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGNMHKFLYGDRSHSSCREIYAKLDEIMFRI 480

BLAST of HG10004082 vs. ExPASy TrEMBL
Match: A0A6J1JI91 (pentatricopeptide repeat-containing protein At1g34160 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487208 PE=3 SV=1)

HSP 1 Score: 692.2 bits (1785), Expect = 1.4e-195
Identity = 367/576 (63.72%), Postives = 395/576 (68.58%), Query Frame = 0

Query: 1   MAYFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRH 60
           MAYF LLLQKCSSFSQIKQLQANLI NGHF FS SRTKLLELCAIS FGDLS+ALHIFRH
Sbjct: 23  MAYFDLLLQKCSSFSQIKQLQANLITNGHFHFSSSRTKLLELCAISPFGDLSHALHIFRH 82

Query: 61  IRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPHRIDALTCSFALKACARALARF 120
           +  PST  WNAVIRGTALSS+P+NA+ WYR M ASNGPHR+DALTCSFALKACARALAR 
Sbjct: 83  VHSPSTKDWNAVIRGTALSSNPSNALFWYRTMNASNGPHRVDALTCSFALKACARALARS 142

Query: 121 EAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEM---------------- 180
           E MQLHSQ+LRFG +ADVLLQTTLLDAYAKV DLD AQK+FDEM                
Sbjct: 143 EVMQLHSQVLRFGFDADVLLQTTLLDAYAKVEDLDQAQKVFDEMPEPDIASWNSLIAGFA 202

Query: 181 ----------------------------------------------ATRYCFVERVDC-- 240
                                                           +Y   E +D   
Sbjct: 203 QGGRPSDAIDLFKRMKEDGNLRPNEVTVQGALSACSQLGTLKEGENVHKYIVEENLDTIV 262

Query: 241 --------------------WSWVDLECPPR----------------------------- 300
                               W + ++ C                                
Sbjct: 263 QVCNVVIDMYAKCGSVDKAYWVFENMRCEKSLITWNTMIMAFAMHGDGYKALDLFEKLGR 322

Query: 301 ---------------------LVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKE 360
                                L+EDGLKL NSM QR + PNIKHYG +VDLLGRAGRLKE
Sbjct: 323 SGICPDAISYLAVLCACNHAGLIEDGLKLCNSMMQRGVAPNIKHYGVVVDLLGRAGRLKE 382

Query: 361 AYEIVNSMHFPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAAR 420
           AYEIV+SM FPNMVLWQTLLGACRTYG+V+MAE+ASRKLVEMGFI+CGDFVLLSNVYAAR
Sbjct: 383 AYEIVSSMPFPNMVLWQTLLGACRTYGDVKMAEMASRKLVEMGFISCGDFVLLSNVYAAR 442

Query: 421 KRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRI 443
           +RWDDVGRVRDAMRRRDVKKTPGFSYIEVKG MHKF+YGD+SHSSCREIYAKLDEI FRI
Sbjct: 443 RRWDDVGRVRDAMRRRDVKKTPGFSYIEVKGNMHKFLYGDRSHSSCREIYAKLDEIMFRI 502

BLAST of HG10004082 vs. TAIR 10
Match: AT1G34160.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 469.5 bits (1207), Expect = 2.9e-132
Identity = 277/577 (48.01%), Postives = 335/577 (58.06%), Query Frame = 0

Query: 3   YFYLLLQKCSSFSQIKQLQANLIINGHFQFSPSRTKLLELCAISSFGDLSYALHIFRHIR 62
           Y   ++QKC SFSQIKQLQ++ +  GHFQ S  R++LLE CAIS FGDLS+A+ IFR+I 
Sbjct: 5   YMETMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIP 64

Query: 63  YPSTNYWNAVIRGTALSSDPANAVIWYREM----AASNGPHRIDALTCSFALKACARALA 122
            P TN WNA+IRG A SS P+ A  WYR M    ++S+   R+DALTCSF LKACARAL 
Sbjct: 65  KPLTNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALC 124

Query: 123 RFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKLFDEMATR----------- 182
                QLH Q+ R GL+AD LL TTLLDAY+K GDL  A KLFDEM  R           
Sbjct: 125 SSAMDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAG 184

Query: 183 ------------------------------------------------------------ 242
                                                                       
Sbjct: 185 LVSGNRASEAMELYKRMETEGIRRSEVTVVAALGACSHLGDVKEGENIFHGYSNDNVIVS 244

Query: 243 --------YC-FVER-------------VDCWSWV---------------------DLEC 302
                    C FV++             V  W+ +                     D   
Sbjct: 245 NAAIDMYSKCGFVDKAYQVFEQFTGKKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGI 304

Query: 303 PP---------------RLVEDGLKLFNSMAQRRLEPNIKHYGAMVDLLGRAGRLKEAYE 362
            P                LVE GL +FN+MA + +E N+KHYG +VDLL RAGRL+EA++
Sbjct: 305 KPDDVSYLAALTACRHAGLVEYGLSVFNNMACKGVERNMKHYGCVVDLLSRAGRLREAHD 364

Query: 363 IVNSMH-FPNMVLWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKR 422
           I+ SM   P+ VLWQ+LLGA   Y +VEMAE+ASR++ EMG  N GDFVLLSNVYAA+ R
Sbjct: 365 IICSMSMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEMGVNNDGDFVLLSNVYAAQGR 424

Query: 423 WDDVGRVRDAMRRRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKA 443
           W DVGRVRD M  + VKK PG SYIE KG +H+F   D+SH   REIY K+DEI+F+I+ 
Sbjct: 425 WKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHEQWREIYEKIDEIRFKIRE 484

BLAST of HG10004082 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 329.3 bits (843), Expect = 4.7e-90
Identity = 175/441 (39.68%), Postives = 263/441 (59.64%), Query Frame = 0

Query: 36  RTKLLELCAISSFGDLSYALHIFRHIRYPSTNYWNAVIRGTALSSDPANAVIWYREMAAS 95
           +  LL L A  + GD++ A  +F  +       WN+VI G A +  P  A+  Y EM + 
Sbjct: 159 QNSLLHLYA--NCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEMNSK 218

Query: 96  NGPHRIDALTCSFALKACARALARFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLD 155
               + D  T    L ACA+  A     ++H  +++ GL  ++     LLD YA+ G ++
Sbjct: 219 G--IKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVE 278

Query: 156 LAQKLFDEMATRYCF---------------VERVDCWSWVD-----LECPPR-------- 215
            A+ LFDEM  +                   E ++ + +++     L C           
Sbjct: 279 EAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYAC 338

Query: 216 ----LVEDGLKLFNSMAQR-RLEPNIKHYGAMVDLLGRAGRLKEAYEIVNSMHF-PNMVL 275
               +V++G + F  M +  ++EP I+H+G MVDLL RAG++K+AYE + SM   PN+V+
Sbjct: 339 SHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVI 398

Query: 276 WQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKRWDDVGRVRDAMRR 335
           W+TLLGAC  +G+ ++AE A  +++++   + GD+VLLSN+YA+ +RW DV ++R  M R
Sbjct: 399 WRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLR 458

Query: 336 RDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYVAETGNVLHD 395
             VKK PG S +EV  ++H+F+ GD+SH     IYAKL E+  R+++ GYV +  NV  D
Sbjct: 459 DGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQISNVYVD 518

Query: 396 IGDEDKENALCYHSEKLAVAFGLTCTEEGTPNQVIKNLRICGDCHVVIKLISKIYNREII 443
           + +E+KENA+ YHSEK+A+AF L  T E +P  V+KNLR+C DCH+ IKL+SK+YNREI+
Sbjct: 519 VEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNREIV 578

BLAST of HG10004082 vs. TAIR 10
Match: AT4G21065.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 329.3 bits (843), Expect = 4.7e-90
Identity = 175/441 (39.68%), Postives = 263/441 (59.64%), Query Frame = 0

Query: 36  RTKLLELCAISSFGDLSYALHIFRHIRYPSTNYWNAVIRGTALSSDPANAVIWYREMAAS 95
           +  LL L A  + GD++ A  +F  +       WN+VI G A +  P  A+  Y EM + 
Sbjct: 26  QNSLLHLYA--NCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEMNSK 85

Query: 96  NGPHRIDALTCSFALKACARALARFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLD 155
               + D  T    L ACA+  A     ++H  +++ GL  ++     LLD YA+ G ++
Sbjct: 86  G--IKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVE 145

Query: 156 LAQKLFDEMATRYCF---------------VERVDCWSWVD-----LECPPR-------- 215
            A+ LFDEM  +                   E ++ + +++     L C           
Sbjct: 146 EAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYAC 205

Query: 216 ----LVEDGLKLFNSMAQR-RLEPNIKHYGAMVDLLGRAGRLKEAYEIVNSMHF-PNMVL 275
               +V++G + F  M +  ++EP I+H+G MVDLL RAG++K+AYE + SM   PN+V+
Sbjct: 206 SHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVI 265

Query: 276 WQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKRWDDVGRVRDAMRR 335
           W+TLLGAC  +G+ ++AE A  +++++   + GD+VLLSN+YA+ +RW DV ++R  M R
Sbjct: 266 WRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLR 325

Query: 336 RDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYVAETGNVLHD 395
             VKK PG S +EV  ++H+F+ GD+SH     IYAKL E+  R+++ GYV +  NV  D
Sbjct: 326 DGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQISNVYVD 385

Query: 396 IGDEDKENALCYHSEKLAVAFGLTCTEEGTPNQVIKNLRICGDCHVVIKLISKIYNREII 443
           + +E+KENA+ YHSEK+A+AF L  T E +P  V+KNLR+C DCH+ IKL+SK+YNREI+
Sbjct: 386 VEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNREIV 445

BLAST of HG10004082 vs. TAIR 10
Match: AT3G26782.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 296.2 bits (757), Expect = 4.4e-80
Identity = 167/442 (37.78%), Postives = 247/442 (55.88%), Query Frame = 0

Query: 39  LLELCAISSFGDLSYALHIFRHIRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGP 98
           LL+  A    G ++ A  IF  I       +N+++   A S     A   +R +   N  
Sbjct: 224 LLDAYAKGGEGGVAVARKIFDQIVDKDRVSYNSIMSVYAQSGMSNEAFEVFRRL-VKNKV 283

Query: 99  HRIDALTCSFALKACARALARFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQ 158
              +A+T S  L A + + A      +H Q++R GL  DV++ T+++D Y K G ++ A+
Sbjct: 284 VTFNAITLSTVLLAVSHSGALRIGKCIHDQVIRMGLEDDVIVGTSIIDMYCKCGRVETAR 343

Query: 159 KLFDEMATRYCFVERVDCWSWV------------DLECPPRLVEDGL------------- 218
           K FD M  +      V  W+ +             LE  P +++ G+             
Sbjct: 344 KAFDRMKNK-----NVRSWTAMIAGYGMHGHAAKALELFPAMIDSGVRPNYITFVSVLAA 403

Query: 219 -----------KLFNSMAQR-RLEPNIKHYGAMVDLLGRAGRLKEAYEIVNSMHF-PNMV 278
                      + FN+M  R  +EP ++HYG MVDLLGRAG L++AY+++  M   P+ +
Sbjct: 404 CSHAGLHVEGWRWFNAMKGRFGVEPGLEHYGCMVDLLGRAGFLQKAYDLIQRMKMKPDSI 463

Query: 279 LWQTLLGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKRWDDVGRVRDAMR 338
           +W +LL ACR + NVE+AE++  +L E+   NCG ++LLS++YA   RW DV RVR  M+
Sbjct: 464 IWSSLLAACRIHKNVELAEISVARLFELDSSNCGYYMLLSHIYADAGRWKDVERVRMIMK 523

Query: 339 RRDVKKTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYVAETGNVLH 398
            R + K PGFS +E+ G++H F+ GD+ H    +IY  L E+  ++   GYV+ T +V H
Sbjct: 524 NRGLVKPPGFSLLELNGEVHVFLIGDEEHPQREKIYEFLAELNRKLLEAGYVSNTSSVCH 583

Query: 399 DIGDEDKENALCYHSEKLAVAFGLTCTEEGTPNQVIKNLRICGDCHVVIKLISKIYNREI 443
           D+ +E+KE  L  HSEKLA+AFG+  T  G+   V+KNLR+C DCH VIKLISKI +RE 
Sbjct: 584 DVDEEEKEMTLRVHSEKLAIAFGIMNTVPGSTVNVVKNLRVCSDCHNVIKLISKIVDREF 643

BLAST of HG10004082 vs. TAIR 10
Match: AT3G62890.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 294.7 bits (753), Expect = 1.3e-79
Identity = 162/437 (37.07%), Postives = 245/437 (56.06%), Query Frame = 0

Query: 44  AISSFGDLSYALHIFRHIRYPSTNYWNAVIRGTALSSDPANAVIWYREMAASNGPH---R 103
           A +  G +  A  +F  +   +   W+ +I G  +      A+  +REM          R
Sbjct: 137 AYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVR 196

Query: 104 IDALTCSFALKACARALARFEAMQLHSQLLRFGLNADVLLQTTLLDAYAKVGDLDLAQKL 163
            +  T S  L AC R  A  +   +H+ + ++ +  D++L T L+D YAK G L+ A+++
Sbjct: 197 PNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRV 256

Query: 164 FDEMATR---YCFVERVDCWSWVDL-----------------------------ECPPR- 223
           F+ + ++     +   + C +   L                              C  R 
Sbjct: 257 FNALGSKKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRG 316

Query: 224 LVEDGLKLFNSMAQR-RLEPNIKHYGAMVDLLGRAGRLKEAYEIVNSMHF-PNMVLWQTL 283
           L+ +G   F  M +   + P+I+HYG MVDL GR+G +KEA   + SM   P++++W +L
Sbjct: 317 LINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSL 376

Query: 284 LGACRTYGNVEMAELASRKLVEMGFINCGDFVLLSNVYAARKRWDDVGRVRDAMRRRDVK 343
           L   R  G+++  E A ++L+E+  +N G +VLLSNVYA   RW +V  +R  M  + + 
Sbjct: 377 LSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGIN 436

Query: 344 KTPGFSYIEVKGKMHKFVYGDQSHSSCREIYAKLDEIKFRIKAYGYVAETGNVLHDIGDE 403
           K PG SY+EV+G +H+FV GD+S      IYA LDEI  R++  GYV +T  VL D+ ++
Sbjct: 437 KVPGCSYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEK 496

Query: 404 DKENALCYHSEKLAVAFGLTCTEEGTPNQVIKNLRICGDCHVVIKLISKIYNREIIVRDR 443
           DKE AL YHSEKLA+AF L  T  GTP ++IKNLRICGDCH+V+K+ISK+++REI+VRD 
Sbjct: 497 DKEIALSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDC 556

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038885014.11.1e-21069.10pentatricopeptide repeat-containing protein At1g34160 isoform X2 [Benincasa hisp... [more]
XP_031743171.11.8e-20567.36pentatricopeptide repeat-containing protein At1g34160 [Cucumis sativus] >KGN4609... [more]
XP_008456696.17.0e-20566.84PREDICTED: pentatricopeptide repeat-containing protein At1g34160 [Cucumis melo][more]
XP_038885002.11.2e-20468.48pentatricopeptide repeat-containing protein At1g34160 isoform X1 [Benincasa hisp... [more]
XP_022133715.11.3e-19865.28pentatricopeptide repeat-containing protein At1g34160 [Momordica charantia][more]
Match NameE-valueIdentityDescription
Q9FX244.1e-13148.01Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX... [more]
B8YEK44.2e-10445.99Pentatricopeptide repeat-containing protein OGR1, mitochondrial OS=Oryza sativa ... [more]
A8MQA36.5e-8939.68Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q9LW326.1e-7937.78Pentatricopeptide repeat-containing protein At3g26782, mitochondrial OS=Arabidop... [more]
Q683I91.8e-7837.07Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0KB778.9e-20667.36DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G0526... [more]
A0A1S3C3U73.4e-20566.84pentatricopeptide repeat-containing protein At1g34160 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1BW136.2e-19965.28pentatricopeptide repeat-containing protein At1g34160 OS=Momordica charantia OX=... [more]
A0A6J1H8R53.1e-19864.24pentatricopeptide repeat-containing protein At1g34160 OS=Cucurbita moschata OX=3... [more]
A0A6J1JI911.4e-19563.72pentatricopeptide repeat-containing protein At1g34160 isoform X1 OS=Cucurbita ma... [more]
Match NameE-valueIdentityDescription
AT1G34160.12.9e-13248.01Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.14.7e-9039.68Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.24.7e-9039.68Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G26782.14.4e-8037.78Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G62890.11.3e-7937.07Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 7..180
e-value: 2.9E-18
score: 68.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 184..313
e-value: 1.0E-13
score: 53.0
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 308..431
e-value: 1.9E-38
score: 131.2
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 211..234
e-value: 0.0086
score: 16.3
coord: 142..167
e-value: 1.0E-4
score: 22.3
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 211..235
e-value: 0.0024
score: 15.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 207..241
score: 8.604678
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 137..171
score: 9.119859
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 188..435
NoneNo IPR availablePANTHERPTHR47924:SF41BNAC07G48870D PROTEINcoord: 5..167
coord: 188..435
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 5..167

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10004082.1HG10004082.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding