Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGCATCCTCGTCACCGCAATACGGGAAATGGGTTTAGGTCTAGTTCCATGGGAGTGGGATTAGCCTCTTCTCGAATTTCTCCTGAGGGATCGGTTAGGGGCCATGGAGGCGGATATGGAAATGATTATCGCAACTTTAATCACCCTAGTGGCTTTGGGCGTGGTCAGGGATACCCCAAATCATATCAATCATCCCAGTCATTGCCTCCTCCGCGAAGAGGTGGAGGTTCTGTTGACATTTTTATGGAAGCAGGTCGTCTTGCAGCTGAATACCTGGTTTCTCAAGGTGTGCTACCATCTAGTGTTTTGTCTGGTAAATGGCAAGGTGGAAATTTGAGGAGGGAGTCTTCTGAATCTCAGGAATTAGGTCCACTTCGCGAAGAAGGACGAACATCTACTGCTGCCCCTCCTCATTCAGGACATGTTGGACCTGATTCTGGGTCGGGTAGGAGGCATCCAAATGATGAGTATAGTTCAGCAGCATCAAGAAATCATTTGAGAGGACGAAGAAGATCCACTTCTTTCCGAAGTAGTGGTTCTGATTGGAGTGGCCAAGATTACAGTAGGAGCAATAATTACAATGATAGAGCTAGAGCTTCTCCAGATACAGAGGCTTATGATGACACTGATAATCTTTATGTTTATGGTAATCAACAGCAAACTGGTGAAGAAGTTGGTTCTGAATTGCAAGATCTCAAACCTTCCAAGTTAGAACCAAAAGGTGACACGCCTGAGGATTCAGGACCTGAACTGGTCAAGTACCCTTTGCCAGATGATGCAGGCTCAAAGGCAAGTACTTCTGCTGTTGTGAAAGATCTTCCTTCTGAGCCGAAACTTGCTAAAGATTCTGATGATTTAAGTAACATAGATTCAGGGTCTGAGGAGGTGAAAAATAGTACCAATATCAATGAAACTGAGAAACACTGTGTGACGGAGAAACTATCTGTCCAGAATGAAGCTGATGATGGTGATCCCTTGGTGAAGCCAGAGACTGATCTACTAGCATTTTGCAGATTTACTAAGTTCCCCACCAAAACTCGCTCTGCATTGGCATACAAGGTTTCTAAGGCAGATCCTATTACAGCTGTGTCTGAACGACCTTCTGTTATCAACACTAACAGAGGATCTGAAATTTCACTTGACAGTAACCCTTCCAGTTGTGCATTATCTGGTGCTGTATCAGCTAAAAAACACGATGTTGAGAATCTAAACTCTGAAAGATCTAAACCAGAAGCTGTTGAAAAAACTGATACAGTGGAGGAGCTATATCCTAAATTCGGTGAGAAGGCTAGCTCTTTGACGTCTCAATCTTTCCAGCACGGACCATTTTGGAATGAGAGCAAGGAAGAATCTTGTCAAATTCCTGCTGTAATTGGAAGGAGTGATTTGATGTTTGAGGAAAGGGGTCAAAAGCGATCTTTGGATGAGAGTGATGTGGGGGAGGGGAACAAAAAACCAAGAGAATGGATTCCGTTGATGACTTCTAAGGAAGATGAATCCTTTGACCTTCTGAAGTTTGATAAAACGAAACTTAGTTCTGAGGAAAGCAAGCCAGCATGTGATAATGAAGTTATTGTAGCTCCTGACTGTGTGAACTCAGTGGATGGTTTTCATTTCATAAAAGGTGGGGGTGAGCAATGTGTTGATTATGCACAAGAAAAGCAGCTTTTTCCGAATTCATTTAAGATCTGTGACCTTAACCTTATGGAGGCTTCTGATATACATGATAATCATGAAAATAATCTTCTCATCTTCCCATCAATTTCTGAAACGAAAAGAGAAAGAACACCTGTTGATATTGACTTGTCTATAAGCAATGCTACTGAATTTGGTCAAAATAGTGTGGTGGCTGGCGGTAAAGAGATTGAAATTATTGATTTGGAAGATGACTCTGCTGCCGAAGTAGACAAGACCTTCCATAATGCAGAGAGAAAGTAAGTTACCTCATCTTTTAATCAAATGCTATTTCAGTTTTACATTTTCCTAAATATGCTGGTACA
CCTCACCTCACCTACCTTTTGTTAGAATTTTCTCTTTTTGAGAAGTTCTTAAATTGTAAACAAACTTTGTGGTAAACTGTATTTCTGCAATATTCAGGAGCGAGACTGTATTTACCGGATTGGATGGCTTTCCTAACAACGCACAAAATAGTGGAGATATGCCTGATGTTCAAGATGGCTATGGACTTATGATTTCAGAGTTGCTTGGAGCAGAATTCCCCAACTGTGCTTCTGTACAAGGAGACATTAATTCAATGCACAATGAAATGCCTCTTTCCAATGGAGAAGTACTTCCTTGCTCATATCCAACTTCGTCATTTATTGAAGATTCTTCTTTTTGATACTTTGGCTGGCTTCTTAAAGATTATATTGTTGTGAGGTTCTTACTTCGTTAAATGTTAATATTATATATTTTAGGAGTGCCCAAGTCTCAATGCCCATCACAATTTACTTCCATCTCCACCTTCTTCGGGTCTTTAGTGATTTAGAACTATATGAATTCATAGGCATTCCCTCCTGACCTTGATATTGATTTAATGATCTGTCATTTGCTTATTATTTCAGGGAGCTCTGGCGGATGATGATCTAATTTACATGTCCCTGGGGGAAATTCCATTGAGTATGCCAGAATTTTAA
mRNA sequence
ATGCATCCTCGTCACCGCAATACGGGAAATGGGTTTAGGTCTAGTTCCATGGGAGTGGGATTAGCCTCTTCTCGAATTTCTCCTGAGGGATCGGTTAGGGGCCATGGAGGCGGATATGGAAATGATTATCGCAACTTTAATCACCCTAGTGGCTTTGGGCGTGGTCAGGGATACCCCAAATCATATCAATCATCCCAGTCATTGCCTCCTCCGCGAAGAGGTGGAGGTTCTGTTGACATTTTTATGGAAGCAGGTCGTCTTGCAGCTGAATACCTGGTTTCTCAAGGTGTGCTACCATCTAGTGTTTTGTCTGGTAAATGGCAAGGTGGAAATTTGAGGAGGGAGTCTTCTGAATCTCAGGAATTAGGTCCACTTCGCGAAGAAGGACGAACATCTACTGCTGCCCCTCCTCATTCAGGACATGTTGGACCTGATTCTGGGTCGGGTAGGAGGCATCCAAATGATGAGTATAGTTCAGCAGCATCAAGAAATCATTTGAGAGGACGAAGAAGATCCACTTCTTTCCGAAGTAGTGGTTCTGATTGGAGTGGCCAAGATTACAGTAGGAGCAATAATTACAATGATAGAGCTAGAGCTTCTCCAGATACAGAGGCTTATGATGACACTGATAATCTTTATGTTTATGGTAATCAACAGCAAACTGGTGAAGAAGTTGGTTCTGAATTGCAAGATCTCAAACCTTCCAAGTTAGAACCAAAAGGTGACACGCCTGAGGATTCAGGACCTGAACTGGTCAAGTACCCTTTGCCAGATGATGCAGGCTCAAAGGCAAGTACTTCTGCTGTTGTGAAAGATCTTCCTTCTGAGCCGAAACTTGCTAAAGATTCTGATGATTTAAGTAACATAGATTCAGGGTCTGAGGAGGTGAAAAATAGTACCAATATCAATGAAACTGAGAAACACTGTGTGACGGAGAAACTATCTGTCCAGAATGAAGCTGATGATGGTGATCCCTTGGTGAAGCCAGAGACTGATCTACTAGCATTTTGCAGATTTACTAAGTTCCCCACCAAAACTCGCTCTGCATTGGCATACAAGGTTTCTAAGGCAGATCCTATTACAGCTGTGTCTGAACGACCTTCTGTTATCAACACTAACAGAGGATCTGAAATTTCACTTGACAGTAACCCTTCCAGTTGTGCATTATCTGGTGCTGTATCAGCTAAAAAACACGATGTTGAGAATCTAAACTCTGAAAGATCTAAACCAGAAGCTGTTGAAAAAACTGATACAGTGGAGGAGCTATATCCTAAATTCGGTGAGAAGGCTAGCTCTTTGACGTCTCAATCTTTCCAGCACGGACCATTTTGGAATGAGAGCAAGGAAGAATCTTGTCAAATTCCTGCTGTAATTGGAAGGAGTGATTTGATGTTTGAGGAAAGGGGTCAAAAGCGATCTTTGGATGAGAGTGATGTGGGGGAGGGGAACAAAAAACCAAGAGAATGGATTCCGTTGATGACTTCTAAGGAAGATGAATCCTTTGACCTTCTGAAGTTTGATAAAACGAAACTTAGTTCTGAGGAAAGCAAGCCAGCATGTGATAATGAAGTTATTGTAGCTCCTGACTGTGTGAACTCAGTGGATGGTTTTCATTTCATAAAAGGTGGGGGTGAGCAATGTGTTGATTATGCACAAGAAAAGCAGCTTTTTCCGAATTCATTTAAGATCTGTGACCTTAACCTTATGGAGGCTTCTGATATACATGATAATCATGAAAATAATCTTCTCATCTTCCCATCAATTTCTGAAACGAAAAGAGAAAGAACACCTGTTGATATTGACTTGTCTATAAGCAATGCTACTGAATTTGGTCAAAATAGTGTGGTGGCTGGCGGTAAAGAGATTGAAATTATTGATTTGGAAGATGACTCTGCTGCCGAAGTAGACAAGACCTTCCATAATGCAGAGAGAAAGAGCGAGACTGTATTTACCGGATTGGATGGCTTTCCTAACAACGCACAAAATAGTGGAGATATGCC
TGATGTTCAAGATGGCTATGGACTTATGATTTCAGAGTTGCTTGGAGCAGAATTCCCCAACTGTGCTTCTGTACAAGGAGACATTAATTCAATGCACAATGAAATGCCTCTTTCCAATGGAGAAGGAGCTCTGGCGGATGATGATCTAATTTACATGTCCCTGGGGGAAATTCCATTGAGTATGCCAGAATTTTAA
Coding sequence (CDS)
ATGCATCCTCGTCACCGCAATACGGGAAATGGGTTTAGGTCTAGTTCCATGGGAGTGGGATTAGCCTCTTCTCGAATTTCTCCTGAGGGATCGGTTAGGGGCCATGGAGGCGGATATGGAAATGATTATCGCAACTTTAATCACCCTAGTGGCTTTGGGCGTGGTCAGGGATACCCCAAATCATATCAATCATCCCAGTCATTGCCTCCTCCGCGAAGAGGTGGAGGTTCTGTTGACATTTTTATGGAAGCAGGTCGTCTTGCAGCTGAATACCTGGTTTCTCAAGGTGTGCTACCATCTAGTGTTTTGTCTGGTAAATGGCAAGGTGGAAATTTGAGGAGGGAGTCTTCTGAATCTCAGGAATTAGGTCCACTTCGCGAAGAAGGACGAACATCTACTGCTGCCCCTCCTCATTCAGGACATGTTGGACCTGATTCTGGGTCGGGTAGGAGGCATCCAAATGATGAGTATAGTTCAGCAGCATCAAGAAATCATTTGAGAGGACGAAGAAGATCCACTTCTTTCCGAAGTAGTGGTTCTGATTGGAGTGGCCAAGATTACAGTAGGAGCAATAATTACAATGATAGAGCTAGAGCTTCTCCAGATACAGAGGCTTATGATGACACTGATAATCTTTATGTTTATGGTAATCAACAGCAAACTGGTGAAGAAGTTGGTTCTGAATTGCAAGATCTCAAACCTTCCAAGTTAGAACCAAAAGGTGACACGCCTGAGGATTCAGGACCTGAACTGGTCAAGTACCCTTTGCCAGATGATGCAGGCTCAAAGGCAAGTACTTCTGCTGTTGTGAAAGATCTTCCTTCTGAGCCGAAACTTGCTAAAGATTCTGATGATTTAAGTAACATAGATTCAGGGTCTGAGGAGGTGAAAAATAGTACCAATATCAATGAAACTGAGAAACACTGTGTGACGGAGAAACTATCTGTCCAGAATGAAGCTGATGATGGTGATCCCTTGGTGAAGCCAGAGACTGATCTACTAGCATTTTGCAGATTTACTAAGTTCCCCACCAAAACTCGCTCTGCATTGGCATACAAGGTTTCTAAGGCAGATCCTATTACAGCTGTGTCTGAACGACCTTCTGTTATCAACACTAACAGAGGATCTGAAATTTCACTTGACAGTAACCCTTCCAGTTGTGCATTATCTGGTGCTGTATCAGCTAAAAAACACGATGTTGAGAATCTAAACTCTGAAAGATCTAAACCAGAAGCTGTTGAAAAAACTGATACAGTGGAGGAGCTATATCCTAAATTCGGTGAGAAGGCTAGCTCTTTGACGTCTCAATCTTTCCAGCACGGACCATTTTGGAATGAGAGCAAGGAAGAATCTTGTCAAATTCCTGCTGTAATTGGAAGGAGTGATTTGATGTTTGAGGAAAGGGGTCAAAAGCGATCTTTGGATGAGAGTGATGTGGGGGAGGGGAACAAAAAACCAAGAGAATGGATTCCGTTGATGACTTCTAAGGAAGATGAATCCTTTGACCTTCTGAAGTTTGATAAAACGAAACTTAGTTCTGAGGAAAGCAAGCCAGCATGTGATAATGAAGTTATTGTAGCTCCTGACTGTGTGAACTCAGTGGATGGTTTTCATTTCATAAAAGGTGGGGGTGAGCAATGTGTTGATTATGCACAAGAAAAGCAGCTTTTTCCGAATTCATTTAAGATCTGTGACCTTAACCTTATGGAGGCTTCTGATATACATGATAATCATGAAAATAATCTTCTCATCTTCCCATCAATTTCTGAAACGAAAAGAGAAAGAACACCTGTTGATATTGACTTGTCTATAAGCAATGCTACTGAATTTGGTCAAAATAGTGTGGTGGCTGGCGGTAAAGAGATTGAAATTATTGATTTGGAAGATGACTCTGCTGCCGAAGTAGACAAGACCTTCCATAATGCAGAGAGAAAGAGCGAGACTGTATTTACCGGATTGGATGGCTTTCCTAACAACGCACAAAATAGTGGAGATATGCC
TGATGTTCAAGATGGCTATGGACTTATGATTTCAGAGTTGCTTGGAGCAGAATTCCCCAACTGTGCTTCTGTACAAGGAGACATTAATTCAATGCACAATGAAATGCCTCTTTCCAATGGAGAAGGAGCTCTGGCGGATGATGATCTAATTTACATGTCCCTGGGGGAAATTCCATTGAGTATGCCAGAATTTTAA
Protein sequence
MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPKSYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGSDWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPKGDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNSTNINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPITAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVEELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVGEGNKKPREWIPLMTSKEDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHFIKGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNLLIFPSISETKRERTPVDIDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPNNAQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGEGALADDDLIYMSLGEIPLSMPEF
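The protein sequence above should follow from the CDS by translation in frame 0. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the relationship can be checked as follows. The `translate` helper is a hypothetical name introduced here for illustration and is not part of the database page.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the sequence T, C, A, G for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Hypothetical helper; raises ValueError on bases other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon ends the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGCATCCTCGTCACCGCAATACGGGAAAT"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the annotation.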
Homology
BLAST of HG10022340 vs. NCBI nr
Match:
XP_038897810.1 (uncharacterized protein At4g26450 isoform X1 [Benincasa hispida])
HSP 1 Score: 1320.1 bits (3415), Expect = 0.0e+00
Identity = 684/732 (93.44%), Positives = 698/732 (95.36%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPR RNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK
Sbjct: 1 MHPRQRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQS+Q LPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRRESSE Q
Sbjct: 61 SYQSAQPLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRESSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
ELGP REEGRTST APPHSGHVGPDSGSGRRHPNDEYSSA SRNHLRGRRRSTSFRSSGS
Sbjct: 121 ELGPFREEGRTSTTAPPHSGHVGPDSGSGRRHPNDEYSSAPSRNHLRGRRRSTSFRSSGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DWSGQDYSRSNNYNDRARASP TEAYDDTDNLYVYGNQQQTGEE G+ELQDLKPSKLE K
Sbjct: 181 DWSGQDYSRSNNYNDRARASPGTEAYDDTDNLYVYGNQQQTGEEGGTELQDLKPSKLERK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
DTPEDSGPELVKYPLPDDAGSKASTSAV KDLPSE KLAKDSDDLSNIDSGSEEVKNST
Sbjct: 241 VDTPEDSGPELVKYPLPDDAGSKASTSAVGKDLPSESKLAKDSDDLSNIDSGSEEVKNST 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
NINETEKHCV EKLSVQNEADDGDPLVK ETDLLAFCRFTKFPTKTRSALAYKVSKADP+
Sbjct: 301 NINETEKHCVAEKLSVQNEADDGDPLVKHETDLLAFCRFTKFPTKTRSALAYKVSKADPV 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
T VSE+P V+NTN SEISLDSNPSSC LSGAVSAKKHDVEN+NSERSKPEAVEKT T+E
Sbjct: 361 TTVSEQP-VVNTNIESEISLDSNPSSCTLSGAVSAKKHDVENINSERSKPEAVEKTGTME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
ELYP+FGEKASSLTSQSFQHGPFWNESKEESCQ PAVIGRSD MFEER QKRSLDESDV
Sbjct: 421 ELYPRFGEKASSLTSQSFQHGPFWNESKEESCQSPAVIGRSD-MFEERSQKRSLDESDVA 480
Query: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHFI 540
EGNKKPREWIPLMTSKEDESFDLLKFDKTK+SSEESK ACDNEVIVA DCVNSVDGFHFI
Sbjct: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKVSSEESKQACDNEVIVA-DCVNSVDGFHFI 540
Query: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPVD 600
KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIFPSISETKRER P+D
Sbjct: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFPSISETKRERAPLD 600
Query: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPNN 660
IDLSISNATEFG++SVV GGKEIEIIDLEDDS AEVDKTFHN ERK ETVFTGLDGFPNN
Sbjct: 601 IDLSISNATEFGRDSVVTGGKEIEIIDLEDDSTAEVDKTFHNPERKCETVFTGLDGFPNN 660
Query: 661 AQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGEGALADDDLIYM 720
QNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGE ALADDDLIYM
Sbjct: 661 PQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGEEALADDDLIYM 720
Query: 721 SLGEIPLSMPEF 732
SLGEIPLSMPEF
Sbjct: 721 SLGEIPLSMPEF 729
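The percentages in each HSP summary line are the matched columns divided by the alignment length. As a minimal sketch, assuming the summary line keeps the `label = matched/length` layout shown on this page, the values can be recomputed with the snippet below. The `hsp_percentages` helper is a hypothetical name used only for illustration.

```python
import re

# Matches "Identity = 684/732" and the positives field; the label pattern
# is deliberately loose so that spelling variants of "Positives" also match.
HSP_STATS = re.compile(r"(Identity|Pos\w*) = (\d+)/(\d+)")


def hsp_percentages(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP summary line."""
    result = {}
    for label, matched, length in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result


summary = "Identity = 684/732 (93.44%), Positives = 698/732 (95.36%), Query Frame = 0"
print(hsp_percentages(summary))
```

The recomputed values can then be compared with the percentages printed in parentheses on the same line.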
BLAST of HG10022340 vs. NCBI nr
Match:
XP_008447800.1 (PREDICTED: uncharacterized protein At4g26450 [Cucumis melo] >XP_008447801.1 PREDICTED: uncharacterized protein At4g26450 [Cucumis melo])
HSP 1 Score: 1312.7 bits (3396), Expect = 0.0e+00
Identity = 675/732 (92.21%), Positives = 694/732 (94.81%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPR RNTGNGFRS SMGVGLASSRISPEGSVRGH GGYGNDYRNFNHPSGFGRGQGYPK
Sbjct: 1 MHPRQRNTGNGFRSGSMGVGLASSRISPEGSVRGH-GGYGNDYRNFNHPSGFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRR+SSE Q
Sbjct: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRDSSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
ELGPLREEGRTSTAAPPHSG+VGPDSGSGRRHPNDEYSSA SRNHLRGRRRSTSFRS+GS
Sbjct: 121 ELGPLREEGRTSTAAPPHSGYVGPDSGSGRRHPNDEYSSAPSRNHLRGRRRSTSFRSTGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DWSGQDYSRSNN NDR RASPDTEAYDDTDNLY YGNQQQTGEEVG+ELQDLK S LE K
Sbjct: 181 DWSGQDYSRSNNCNDRGRASPDTEAYDDTDNLYAYGNQQQTGEEVGTELQDLKSSNLEQK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
GDTPEDSGPELVKYPLPDDAGSKAS SAV KD PSEPKLAKDSDDLSNID GSEEVK+ST
Sbjct: 241 GDTPEDSGPELVKYPLPDDAGSKASNSAVGKDPPSEPKLAKDSDDLSNIDLGSEEVKHST 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
NINETEKHCV EKLSVQNEA DGD LVK ETDLLAFCRFTKFPTKTRSALAYKVSKADPI
Sbjct: 301 NINETEKHCVAEKLSVQNEAGDGDSLVKQETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
T SE+PSVINTNR SE SLDS+PSSCALSGAVSAKKHDVEN+NS+ SKPEAVEK T+E
Sbjct: 361 TTASEQPSVINTNRDSETSLDSSPSSCALSGAVSAKKHDVENINSKSSKPEAVEKAGTME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
ELYP++ EKA SL+SQSFQHGPFWNESKEESCQ PAV+GRSDLMFEERGQKRSLDESDVG
Sbjct: 421 ELYPRYSEKAGSLSSQSFQHGPFWNESKEESCQSPAVVGRSDLMFEERGQKRSLDESDVG 480
Query: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHFI 540
EGNKKPREWIPLMTSKEDESFDLLKFDKTK+SSEESKPACDNEVIVA DC NSVDGFHFI
Sbjct: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKVSSEESKPACDNEVIVAADCENSVDGFHFI 540
Query: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPVD 600
KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIFPSISETKRE PVD
Sbjct: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFPSISETKREIAPVD 600
Query: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPNN 660
IDLSISNATEFGQNSVVAGGKEIEIIDLEDDS AEVDKTFHNAERKSETVFTGLDGFPNN
Sbjct: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSTAEVDKTFHNAERKSETVFTGLDGFPNN 660
Query: 661 AQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGEGALADDDLIYM 720
AQN+GDMPDVQDGYGLMISELLGAEF NC SVQGDINS+HNEMPLSNGEG LADDDLIYM
Sbjct: 661 AQNNGDMPDVQDGYGLMISELLGAEFSNCGSVQGDINSIHNEMPLSNGEGTLADDDLIYM 720
Query: 721 SLGEIPLSMPEF 732
SLGEIPLSMPEF
Sbjct: 721 SLGEIPLSMPEF 731
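The identity count in an HSP is the number of alignment columns where the query and subject residues are the same, with `-` marking a gap. The sketch below counts identical and gapped columns for one aligned block; `count_columns` is a hypothetical helper, and it assumes the two strings have already been extracted from the `Query:` and `Sbjct:` lines without their coordinates.

```python
def count_columns(query: str, sbjct: str) -> tuple:
    """Return (identical_columns, gapped_columns) for one aligned block."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same number of columns")
    pairs = list(zip(query, sbjct))
    # A column is identical when both rows carry the same residue (not a gap).
    identical = sum(q == s and q != "-" for q, s in pairs)
    # A column is gapped when either row carries the gap character.
    gaps = sum(q == "-" or s == "-" for q, s in pairs)
    return identical, gaps


# First 20 columns of the alignment above (query row, then subject row).
print(count_columns("MHPRHRNTGNGFRSSSMGVG", "MHPRQRNTGNGFRSGSMGVG"))
```

Summing the identical columns over every block of an HSP should reproduce the numerator reported in its `Identity` field.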
BLAST of HG10022340 vs. NCBI nr
Match:
XP_004139761.1 (uncharacterized protein At4g26450 isoform X1 [Cucumis sativus] >XP_031744602.1 uncharacterized protein At4g26450 isoform X1 [Cucumis sativus] >KGN44098.1 hypothetical protein Csa_016694 [Cucumis sativus])
HSP 1 Score: 1300.0 bits (3363), Expect = 0.0e+00
Identity = 670/732 (91.53%), Positives = 689/732 (94.13%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPR RNTGNGFRS SMGVGLASSRISPEGSVRGH GGYGNDYRNFNHPSGFGRGQGYPK
Sbjct: 1 MHPRQRNTGNGFRSGSMGVGLASSRISPEGSVRGH-GGYGNDYRNFNHPSGFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRR+SSE Q
Sbjct: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRDSSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
ELGPLREEGRTSTA PPHSG+VGPDSGSGRRHPNDEYSS SRNHLRGRRRSTSFRS+GS
Sbjct: 121 ELGPLREEGRTSTAPPPHSGYVGPDSGSGRRHPNDEYSSTPSRNHLRGRRRSTSFRSTGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DW GQDYSRSNN NDR RASPDTEAYDDTDNLY YGNQQQTGEEVG+ELQDLK SKLE K
Sbjct: 181 DWIGQDYSRSNNCNDRGRASPDTEAYDDTDNLYAYGNQQQTGEEVGTELQDLKSSKLEQK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
GDTPEDSGPELVKYPLPDDAGSKA+ SAV KD PSEPKLAKDSDDLSN+D GSEEVK+ST
Sbjct: 241 GDTPEDSGPELVKYPLPDDAGSKANNSAVGKDPPSEPKLAKDSDDLSNVDLGSEEVKHST 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
NINETEKHCV EKLS QNEA DGD LVK ETDLLAFCRFTKFPTKTRSALAYKVSKADPI
Sbjct: 301 NINETEKHCVAEKLSGQNEAGDGDSLVKQETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
T VSE PSVINTNR SE S+D +PSSCALSGAVSAKK DVENLNS+RSKPEAVEK T+E
Sbjct: 361 TTVSEHPSVINTNRDSETSIDCSPSSCALSGAVSAKKLDVENLNSKRSKPEAVEKAGTME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
EL P++ EKA SLTSQSFQHGPFWNESKEESCQ PA +GRSDLMFEERGQKRSLDESDVG
Sbjct: 421 ELCPRYSEKAGSLTSQSFQHGPFWNESKEESCQSPAGVGRSDLMFEERGQKRSLDESDVG 480
Query: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHFI 540
EGNKKPREWIPLMTSKEDESFDLLKFDKTK+SSEESKPACDNEV+VA DCVNSVDGFHFI
Sbjct: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKVSSEESKPACDNEVVVAADCVNSVDGFHFI 540
Query: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPVD 600
KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIFPSISETKRE PVD
Sbjct: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFPSISETKREIAPVD 600
Query: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPNN 660
IDLSISNATEFGQNSVVAGGKEIEIIDLEDDS AEVDKTFHNAERKSETVF GLDGFPNN
Sbjct: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSTAEVDKTFHNAERKSETVFNGLDGFPNN 660
Query: 661 AQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGEGALADDDLIYM 720
AQNSGDMPDVQDGYGLMISELLGAEF NCASVQGDINS+HNEMPLSNGEG LADDDLIYM
Sbjct: 661 AQNSGDMPDVQDGYGLMISELLGAEFSNCASVQGDINSIHNEMPLSNGEGTLADDDLIYM 720
Query: 721 SLGEIPLSMPEF 732
SLGEIPLSMPEF
Sbjct: 721 SLGEIPLSMPEF 731
BLAST of HG10022340 vs. NCBI nr
Match:
XP_038897811.1 (uncharacterized protein At4g26450 isoform X2 [Benincasa hispida])
HSP 1 Score: 1277.3 bits (3304), Expect = 0.0e+00
Identity = 662/709 (93.37%), Positives = 676/709 (95.35%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPR RNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK
Sbjct: 1 MHPRQRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQS+Q LPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRRESSE Q
Sbjct: 61 SYQSAQPLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRESSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
ELGP REEGRTST APPHSGHVGPDSGSGRRHPNDEYSSA SRNHLRGRRRSTSFRSSGS
Sbjct: 121 ELGPFREEGRTSTTAPPHSGHVGPDSGSGRRHPNDEYSSAPSRNHLRGRRRSTSFRSSGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DWSGQDYSRSNNYNDRARASP TEAYDDTDNLYVYGNQQQTGEE G+ELQDLKPSKLE K
Sbjct: 181 DWSGQDYSRSNNYNDRARASPGTEAYDDTDNLYVYGNQQQTGEEGGTELQDLKPSKLERK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
DTPEDSGPELVKYPLPDDAGSKASTSAV KDLPSE KLAKDSDDLSNIDSGSEEVKNST
Sbjct: 241 VDTPEDSGPELVKYPLPDDAGSKASTSAVGKDLPSESKLAKDSDDLSNIDSGSEEVKNST 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
NINETEKHCV EKLSVQNEADDGDPLVK ETDLLAFCRFTKFPTKTRSALAYKVSKADP+
Sbjct: 301 NINETEKHCVAEKLSVQNEADDGDPLVKHETDLLAFCRFTKFPTKTRSALAYKVSKADPV 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
T VSE+P V+NTN SEISLDSNPSSC LSGAVSAKKHDVEN+NSERSKPEAVEKT T+E
Sbjct: 361 TTVSEQP-VVNTNIESEISLDSNPSSCTLSGAVSAKKHDVENINSERSKPEAVEKTGTME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
ELYP+FGEKASSLTSQSFQHGPFWNESKEESCQ PAVIGRSD MFEER QKRSLDESDV
Sbjct: 421 ELYPRFGEKASSLTSQSFQHGPFWNESKEESCQSPAVIGRSD-MFEERSQKRSLDESDVA 480
Query: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHFI 540
EGNKKPREWIPLMTSKEDESFDLLKFDKTK+SSEESK ACDNEVIVA DCVNSVDGFHFI
Sbjct: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKVSSEESKQACDNEVIVA-DCVNSVDGFHFI 540
Query: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPVD 600
KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIFPSISETKRER P+D
Sbjct: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFPSISETKRERAPLD 600
Query: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPNN 660
IDLSISNATEFG++SVV GGKEIEIIDLEDDS AEVDKTFHN ERK ETVFTGLDGFPNN
Sbjct: 601 IDLSISNATEFGRDSVVTGGKEIEIIDLEDDSTAEVDKTFHNPERKCETVFTGLDGFPNN 660
Query: 661 AQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGE 709
QNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGE
Sbjct: 661 PQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGE 706
BLAST of HG10022340 vs. NCBI nr
Match:
TYK23238.1 (uncharacterized protein E5676_scaffold142G002830 [Cucumis melo var. makuwa])
HSP 1 Score: 1268.8 bits (3282), Expect = 0.0e+00
Identity = 653/709 (92.10%), Positives = 672/709 (94.78%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPR RNTGNGFRS SMGVGLASSRISPEGSVRGH GGYGNDYRNFNHPSGFGRGQGYPK
Sbjct: 1 MHPRQRNTGNGFRSGSMGVGLASSRISPEGSVRGH-GGYGNDYRNFNHPSGFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRR+SSE Q
Sbjct: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRDSSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
ELGPLREEGRTSTAAPPHSG+VGPDSGSGRRHPNDEYSSA SRNHLRGRRRSTSFRS+GS
Sbjct: 121 ELGPLREEGRTSTAAPPHSGYVGPDSGSGRRHPNDEYSSAPSRNHLRGRRRSTSFRSTGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DWSGQDYSRSNN NDR RASPDTEAYDDTDNLY YGNQQQTGEEVG+ELQDLK S LE K
Sbjct: 181 DWSGQDYSRSNNCNDRGRASPDTEAYDDTDNLYAYGNQQQTGEEVGTELQDLKSSNLEQK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
GDTPEDSGPELVKYPLPDDAGSKAS SAV KD PSEPKLAKDSDDLSNID GSEEVK+ST
Sbjct: 241 GDTPEDSGPELVKYPLPDDAGSKASNSAVGKDPPSEPKLAKDSDDLSNIDLGSEEVKHST 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
NINETEKHCV EKLSVQNEA DGD LVK ETDLLAFCRFTKFPTKTRSALAYKVSKADPI
Sbjct: 301 NINETEKHCVAEKLSVQNEAGDGDSLVKQETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
T SE+PSVINTNR SE SLDS+PSSCALSGAVSAKKHDVEN+NS+ SKPEAVEK T+E
Sbjct: 361 TTASEQPSVINTNRDSETSLDSSPSSCALSGAVSAKKHDVENINSKSSKPEAVEKAGTME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
ELYP++ EKA SL+SQSFQHGPFWNESKEESCQ PAV+GRSDLMFEERGQKRSLDESDVG
Sbjct: 421 ELYPRYSEKAGSLSSQSFQHGPFWNESKEESCQSPAVVGRSDLMFEERGQKRSLDESDVG 480
Query: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHFI 540
EGNKKPREWIPLMTSKEDESFDLLKFDKTK+SSEESKPACDNEVIVA DC NSVDGFHFI
Sbjct: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKVSSEESKPACDNEVIVAADCENSVDGFHFI 540
Query: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPVD 600
KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIFPSISETKRE PVD
Sbjct: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFPSISETKREIAPVD 600
Query: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPNN 660
IDLSISNATEFGQNSVVAGGKEIEIIDLEDDS AEVDKTFHNAERKSETVFTGLDGFPNN
Sbjct: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSTAEVDKTFHNAERKSETVFTGLDGFPNN 660
Query: 661 AQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGE 709
AQN+GDMPDVQDGYGLMISELLGAEF NC SVQGDINS+HNEMPLSNGE
Sbjct: 661 AQNNGDMPDVQDGYGLMISELLGAEFSNCGSVQGDINSIHNEMPLSNGE 708
BLAST of HG10022340 vs. ExPASy Swiss-Prot
Match:
P0CB21 (Uncharacterized protein At4g26450 OS=Arabidopsis thaliana OX=3702 GN=At4g26450 PE=4 SV=1)
HSP 1 Score: 284.6 bits (727), Expect = 3.1e-75
Identity = 261/765 (34.12%), Positives = 383/765 (50.07%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MH R RN GNG+RS S+G+G++ SRISPE +RGH G YG+++++ G+GRG+G K
Sbjct: 1 MHARQRNVGNGYRSGSIGMGMSGSRISPERPMRGH-GFYGSEHQHRGFNRGYGRGRGRSK 60
Query: 61 SY--QSSQSLPPP---RRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRE 120
SY Q LPPP RR G D+FMEAGRLA EYLVSQGVLP +VLS KWQ GN R++
Sbjct: 61 SYHNQLPPPLPPPPVQRRSSGG-DVFMEAGRLATEYLVSQGVLPQTVLSSKWQNGNFRKQ 120
Query: 121 SSESQELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSF 180
+ E Q +EE R +AP + +R D YSSA SRN L+GRR
Sbjct: 121 AGEFQS-SRSQEEARMDVSAP----------AAEKRRYIDGYSSAGSRNSLKGRR----- 180
Query: 181 RSSGSDWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQ--QQTGEEVGSELQDLK 240
S D+ RS ++++R++A +TE DD+ V G+Q Q E++ S +Q
Sbjct: 181 ----SHRYDSDFGRSGSWSERSKAF-ETETGDDS----VSGHQEEQPLAEDIASSVQRSA 240
Query: 241 PSKLEPKGDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGS 300
+ K + DS L KY L D+A SK +S+ KD+ + +++K S+ S++ +GS
Sbjct: 241 SGEFMRKCEGAGDSESVLDKYNLQDEAQSKTGSSSAGKDIVQDCEISKVSEGSSSLSAGS 300
Query: 301 EEVK-----------NSTNINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKF 360
E+K N T I + H E S+ + + K DL C+F K
Sbjct: 301 GEMKGRSGGNGGEDENQTAIEDGSIHQRCEDASIDQQCGADESFTKSGIDLATLCKFEKV 360
Query: 361 PTKTRSALAYKVSKADPITAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVEN 420
PT+TRS+L K P +S + N G E D + C G S K +
Sbjct: 361 PTRTRSSLTAK----GPKLYLSHNIKDTSHNSGLE-EEDQTENRCETRGQSSGKADSTGD 420
Query: 421 LNSERSKPEAVEKTDTVEELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSD 480
N + VE V+ + +++S S + N KE ++P + RS
Sbjct: 421 ENDQ------VEDFALVQYIENSKCHRSNSFPSSILRD----NSEKESGLELPN-LHRSH 480
Query: 481 LMFEERGQKRSLDESDVGEGNKKPREWIPLMTSKEDESFDLLKFDKTKLS-SEESKPACD 540
+ + G+KR + SD+ EG+K+ R+W+ + S+ +E F++ K + EE K +
Sbjct: 481 SV-GKVGEKRPGEGSDLEEGSKRQRDWVAV--SEANERFNMFKTSGNQCDPEEEGKTSSF 540
Query: 541 NEVIVAPDCVNSVDGFHFIKGG-------GEQCVDYAQEKQLFPNSFKICDLNLMEASDI 600
N+ ++ V + G YA+E QLFP SFK+CDLNL ASD+
Sbjct: 541 NKRLIDGAAGKRVSHESLVNNSTYNRTHTGRTGPGYAEEHQLFPASFKMCDLNLGGASDV 600
Query: 601 HDNHENNLLIFPSISETKRERTPVDIDLSISNAT---EFGQNSVVAGGKEIEIIDLEDDS 660
+D K R VD DLSIS+++ EFG ++ ++ GKEIE+I+L+DD
Sbjct: 601 NDG-------------IKESRQAVDFDLSISSSSKSLEFGTSTRMSNGKEIEVINLDDDQ 660
Query: 661 AAEVDKTFHNAERKSETV-FTGLDGFPNNAQNSGDMPDVQDGYGLMISELLGAEFPNCAS 720
EV K+ ++ RK E + G+D D+PD + LM+ E L + F
Sbjct: 661 --EVVKSSNDPGRKQEAAPYMGID----------DVPDYNE--RLMMVEYLDS-FTPINQ 691
Query: 721 VQGDINSMHNEMPLSNGEGAL--------ADDDLIYMSLGEIPLS 728
+ +N + L + EGA+ DDD I+MSLGEIPL+
Sbjct: 721 GTSSVPQNNNTVSLQDREGAIGNDQVPNNTDDDSIFMSLGEIPLT 691
BLAST of HG10022340 vs. ExPASy TrEMBL
Match:
A0A1S3BHN6 (uncharacterized protein At4g26450 OS=Cucumis melo OX=3656 GN=LOC103490188 PE=4 SV=1)
HSP 1 Score: 1312.7 bits (3396), Expect = 0.0e+00
Identity = 675/732 (92.21%), Positives = 694/732 (94.81%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPR RNTGNGFRS SMGVGLASSRISPEGSVRGH GGYGNDYRNFNHPSGFGRGQGYPK
Sbjct: 1 MHPRQRNTGNGFRSGSMGVGLASSRISPEGSVRGH-GGYGNDYRNFNHPSGFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRR+SSE Q
Sbjct: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRDSSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
ELGPLREEGRTSTAAPPHSG+VGPDSGSGRRHPNDEYSSA SRNHLRGRRRSTSFRS+GS
Sbjct: 121 ELGPLREEGRTSTAAPPHSGYVGPDSGSGRRHPNDEYSSAPSRNHLRGRRRSTSFRSTGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DWSGQDYSRSNN NDR RASPDTEAYDDTDNLY YGNQQQTGEEVG+ELQDLK S LE K
Sbjct: 181 DWSGQDYSRSNNCNDRGRASPDTEAYDDTDNLYAYGNQQQTGEEVGTELQDLKSSNLEQK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
GDTPEDSGPELVKYPLPDDAGSKAS SAV KD PSEPKLAKDSDDLSNID GSEEVK+ST
Sbjct: 241 GDTPEDSGPELVKYPLPDDAGSKASNSAVGKDPPSEPKLAKDSDDLSNIDLGSEEVKHST 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
NINETEKHCV EKLSVQNEA DGD LVK ETDLLAFCRFTKFPTKTRSALAYKVSKADPI
Sbjct: 301 NINETEKHCVAEKLSVQNEAGDGDSLVKQETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
T SE+PSVINTNR SE SLDS+PSSCALSGAVSAKKHDVEN+NS+ SKPEAVEK T+E
Sbjct: 361 TTASEQPSVINTNRDSETSLDSSPSSCALSGAVSAKKHDVENINSKSSKPEAVEKAGTME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
ELYP++ EKA SL+SQSFQHGPFWNESKEESCQ PAV+GRSDLMFEERGQKRSLDESDVG
Sbjct: 421 ELYPRYSEKAGSLSSQSFQHGPFWNESKEESCQSPAVVGRSDLMFEERGQKRSLDESDVG 480
Query: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHFI 540
EGNKKPREWIPLMTSKEDESFDLLKFDKTK+SSEESKPACDNEVIVA DC NSVDGFHFI
Sbjct: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKVSSEESKPACDNEVIVAADCENSVDGFHFI 540
Query: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPVD 600
KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIFPSISETKRE PVD
Sbjct: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFPSISETKREIAPVD 600
Query: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPNN 660
IDLSISNATEFGQNSVVAGGKEIEIIDLEDDS AEVDKTFHNAERKSETVFTGLDGFPNN
Sbjct: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSTAEVDKTFHNAERKSETVFTGLDGFPNN 660
Query: 661 AQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGEGALADDDLIYM 720
AQN+GDMPDVQDGYGLMISELLGAEF NC SVQGDINS+HNEMPLSNGEG LADDDLIYM
Sbjct: 661 AQNNGDMPDVQDGYGLMISELLGAEFSNCGSVQGDINSIHNEMPLSNGEGTLADDDLIYM 720
Query: 721 SLGEIPLSMPEF 732
SLGEIPLSMPEF
Sbjct: 721 SLGEIPLSMPEF 731
BLAST of HG10022340 vs. ExPASy TrEMBL
Match:
A0A0A0K3M2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G186170 PE=4 SV=1)
HSP 1 Score: 1300.0 bits (3363), Expect = 0.0e+00
Identity = 670/732 (91.53%), Positives = 689/732 (94.13%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPR RNTGNGFRS SMGVGLASSRISPEGSVRGH GGYGNDYRNFNHPSGFGRGQGYPK
Sbjct: 1 MHPRQRNTGNGFRSGSMGVGLASSRISPEGSVRGH-GGYGNDYRNFNHPSGFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRR+SSE Q
Sbjct: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRDSSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
ELGPLREEGRTSTA PPHSG+VGPDSGSGRRHPNDEYSS SRNHLRGRRRSTSFRS+GS
Sbjct: 121 ELGPLREEGRTSTAPPPHSGYVGPDSGSGRRHPNDEYSSTPSRNHLRGRRRSTSFRSTGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DW GQDYSRSNN NDR RASPDTEAYDDTDNLY YGNQQQTGEEVG+ELQDLK SKLE K
Sbjct: 181 DWIGQDYSRSNNCNDRGRASPDTEAYDDTDNLYAYGNQQQTGEEVGTELQDLKSSKLEQK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
GDTPEDSGPELVKYPLPDDAGSKA+ SAV KD PSEPKLAKDSDDLSN+D GSEEVK+ST
Sbjct: 241 GDTPEDSGPELVKYPLPDDAGSKANNSAVGKDPPSEPKLAKDSDDLSNVDLGSEEVKHST 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
NINETEKHCV EKLS QNEA DGD LVK ETDLLAFCRFTKFPTKTRSALAYKVSKADPI
Sbjct: 301 NINETEKHCVAEKLSGQNEAGDGDSLVKQETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
T VSE PSVINTNR SE S+D +PSSCALSGAVSAKK DVENLNS+RSKPEAVEK T+E
Sbjct: 361 TTVSEHPSVINTNRDSETSIDCSPSSCALSGAVSAKKLDVENLNSKRSKPEAVEKAGTME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
EL P++ EKA SLTSQSFQHGPFWNESKEESCQ PA +GRSDLMFEERGQKRSLDESDVG
Sbjct: 421 ELCPRYSEKAGSLTSQSFQHGPFWNESKEESCQSPAGVGRSDLMFEERGQKRSLDESDVG 480
Query: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHFI 540
EGNKKPREWIPLMTSKEDESFDLLKFDKTK+SSEESKPACDNEV+VA DCVNSVDGFHFI
Sbjct: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKVSSEESKPACDNEVVVAADCVNSVDGFHFI 540
Query: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPVD 600
KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIFPSISETKRE PVD
Sbjct: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFPSISETKREIAPVD 600
Query: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPNN 660
IDLSISNATEFGQNSVVAGGKEIEIIDLEDDS AEVDKTFHNAERKSETVF GLDGFPNN
Sbjct: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSTAEVDKTFHNAERKSETVFNGLDGFPNN 660
Query: 661 AQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGEGALADDDLIYM 720
AQNSGDMPDVQDGYGLMISELLGAEF NCASVQGDINS+HNEMPLSNGEG LADDDLIYM
Sbjct: 661 AQNSGDMPDVQDGYGLMISELLGAEFSNCASVQGDINSIHNEMPLSNGEGTLADDDLIYM 720
Query: 721 SLGEIPLSMPEF 732
SLGEIPLSMPEF
Sbjct: 721 SLGEIPLSMPEF 731
BLAST of HG10022340 vs. ExPASy TrEMBL
Match:
A0A5D3DIE9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold142G002830 PE=4 SV=1)
HSP 1 Score: 1268.8 bits (3282), Expect = 0.0e+00
Identity = 653/709 (92.10%), Positives = 672/709 (94.78%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPR RNTGNGFRS SMGVGLASSRISPEGSVRGH GGYGNDYRNFNHPSGFGRGQGYPK
Sbjct: 1 MHPRQRNTGNGFRSGSMGVGLASSRISPEGSVRGH-GGYGNDYRNFNHPSGFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRR+SSE Q
Sbjct: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRDSSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
ELGPLREEGRTSTAAPPHSG+VGPDSGSGRRHPNDEYSSA SRNHLRGRRRSTSFRS+GS
Sbjct: 121 ELGPLREEGRTSTAAPPHSGYVGPDSGSGRRHPNDEYSSAPSRNHLRGRRRSTSFRSTGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DWSGQDYSRSNN NDR RASPDTEAYDDTDNLY YGNQQQTGEEVG+ELQDLK S LE K
Sbjct: 181 DWSGQDYSRSNNCNDRGRASPDTEAYDDTDNLYAYGNQQQTGEEVGTELQDLKSSNLEQK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
GDTPEDSGPELVKYPLPDDAGSKAS SAV KD PSEPKLAKDSDDLSNID GSEEVK+ST
Sbjct: 241 GDTPEDSGPELVKYPLPDDAGSKASNSAVGKDPPSEPKLAKDSDDLSNIDLGSEEVKHST 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
NINETEKHCV EKLSVQNEA DGD LVK ETDLLAFCRFTKFPTKTRSALAYKVSKADPI
Sbjct: 301 NINETEKHCVAEKLSVQNEAGDGDSLVKQETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
T SE+PSVINTNR SE SLDS+PSSCALSGAVSAKKHDVEN+NS+ SKPEAVEK T+E
Sbjct: 361 TTASEQPSVINTNRDSETSLDSSPSSCALSGAVSAKKHDVENINSKSSKPEAVEKAGTME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
ELYP++ EKA SL+SQSFQHGPFWNESKEESCQ PAV+GRSDLMFEERGQKRSLDESDVG
Sbjct: 421 ELYPRYSEKAGSLSSQSFQHGPFWNESKEESCQSPAVVGRSDLMFEERGQKRSLDESDVG 480
Query: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHFI 540
EGNKKPREWIPLMTSKEDESFDLLKFDKTK+SSEESKPACDNEVIVA DC NSVDGFHFI
Sbjct: 481 EGNKKPREWIPLMTSKEDESFDLLKFDKTKVSSEESKPACDNEVIVAADCENSVDGFHFI 540
Query: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPVD 600
KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIFPSISETKRE PVD
Sbjct: 541 KGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFPSISETKREIAPVD 600
Query: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPNN 660
IDLSISNATEFGQNSVVAGGKEIEIIDLEDDS AEVDKTFHNAERKSETVFTGLDGFPNN
Sbjct: 601 IDLSISNATEFGQNSVVAGGKEIEIIDLEDDSTAEVDKTFHNAERKSETVFTGLDGFPNN 660
Query: 661 AQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGE 709
AQN+GDMPDVQDGYGLMISELLGAEF NC SVQGDINS+HNEMPLSNGE
Sbjct: 661 AQNNGDMPDVQDGYGLMISELLGAEFSNCGSVQGDINSIHNEMPLSNGE 708
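The percentages in an HSP summary line follow directly from its residue counts: identity is identical columns divided by alignment length, and positives additionally include conservative substitutions (the `+` marks in the midline). The sketch below parses such a line and recomputes both percentages so they can be checked against the reported values. It assumes the line layout used in this report; `parse_hsp_summary` and `HSP_RE` are hypothetical names.

```python
import re

# Matches e.g. "Identity = 653/709 (92.10%), Positives = 672/709 (94.78%)".
# "Posi?tives" also accepts the "Postives" spelling seen in some report exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


hsp = parse_hsp_summary(
    "Identity = 653/709 (92.10%), Positives = 672/709 (94.78%), Query Frame = 0"
)
```

Comparing `identity_pct` with `reported_identity_pct` is a quick way to spot a summary line that was damaged during export.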
BLAST of HG10022340 vs. ExPASy TrEMBL
Match:
A0A6J1FDK2 (uncharacterized protein At4g26450-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111443038 PE=4 SV=1)
HSP 1 Score: 1236.1 bits (3197), Expect = 0.0e+00
Identity = 638/733 (87.04%), Positives = 669/733 (91.27%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPRHRNTGNGFRS SMGVGLASSRISPEGSVRGHGG YGNDYRNFNHPS FGRGQGYPK
Sbjct: 1 MHPRHRNTGNGFRSGSMGVGLASSRISPEGSVRGHGGAYGNDYRNFNHPSSFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRRESSE Q
Sbjct: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRESSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
E GP REEGRTS AAPPHSGHVGPD G GRR P+DEYS A SRNHLRGRRRS+SFRSSGS
Sbjct: 121 EFGPFREEGRTSNAAPPHSGHVGPDFGPGRRQPSDEYSLAPSRNHLRGRRRSSSFRSSGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DWSGQ+YSRSNNYNDRARAS DTEAYDD DNL YGN+QQTGEEVG+E Q LKPSKL K
Sbjct: 181 DWSGQEYSRSNNYNDRARASSDTEAYDDADNLSAYGNRQQTGEEVGTEFQVLKPSKLGRK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
GDTPEDSGPELVKYPLPDD GSKAS SAV KDLP+E KLAKDSDDL N+DSG EEVKN+T
Sbjct: 241 GDTPEDSGPELVKYPLPDDVGSKASISAVAKDLPNEQKLAKDSDDLCNVDSGFEEVKNNT 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
N NETEKHCV EKLS+QN+ADDGD +VKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI
Sbjct: 301 NTNETEKHCVAEKLSMQNKADDGDSVVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
VSE+PSVIN R SE+S+DSN S+CALSGAVSAK DVENLNSERS+P+A E+ +E
Sbjct: 361 ATVSEQPSVINPIR-SELSIDSNSSTCALSGAVSAKNLDVENLNSERSEPQATEEAVIME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
E YP+FGEKASSLTS SFQHGPFWNESKEESC+ PAV GRS+ MFEE+GQKRSLDESDVG
Sbjct: 421 EAYPRFGEKASSLTSHSFQHGPFWNESKEESCRTPAVFGRSNSMFEEKGQKRSLDESDVG 480
Query: 481 EGNKKPREWIPLMTSK-EDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHF 540
+GNKKPREWIPLM SK EDE+FDLLKFDK K+SSEESKP CDNEVIVA DCVNSVDGFHF
Sbjct: 481 DGNKKPREWIPLMNSKEEDEAFDLLKFDKAKVSSEESKPECDNEVIVAADCVNSVDGFHF 540
Query: 541 IKGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPV 600
IK GGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIF SISE+KRER PV
Sbjct: 541 IKNGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFQSISESKRERAPV 600
Query: 601 DIDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPN 660
+I LSISNATEFGQNSVVAGGKEIEIIDLEDDS A+VDKTFHN E K E VFTGLDGFPN
Sbjct: 601 EIGLSISNATEFGQNSVVAGGKEIEIIDLEDDSNAQVDKTFHNLETKREPVFTGLDGFPN 660
Query: 661 NAQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGEGALADDDLIY 720
N QNSGDMPDV DGYGLMISELLGAEFPNCASVQGDINSMHNEM LSNGEGALADDDLIY
Sbjct: 661 NEQNSGDMPDVHDGYGLMISELLGAEFPNCASVQGDINSMHNEMSLSNGEGALADDDLIY 720
Query: 721 MSLGEIPLSMPEF 732
MSLGEIPLSMPEF
Sbjct: 721 MSLGEIPLSMPEF 732
BLAST of HG10022340 vs. ExPASy TrEMBL
Match:
A0A6J1IGH6 (uncharacterized protein At4g26450-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476727 PE=4 SV=1)
HSP 1 Score: 1222.2 bits (3161), Expect = 0.0e+00
Identity = 633/733 (86.36%), Positives = 664/733 (90.59%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGG YGNDYRNFNHPS FGRGQGYPK
Sbjct: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGAYGNDYRNFNHPSSFGRGQGYPK 60
Query: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRESSESQ 120
SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQG+LPSSVLSGKWQGG+LRRESSE Q
Sbjct: 61 SYQSSQSLPPPRRGGGSVDIFMEAGRLAAEYLVSQGLLPSSVLSGKWQGGSLRRESSEFQ 120
Query: 121 ELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSFRSSGS 180
ELG REEGRTS AAPPHSGHVGPD GSGRR P+DEYS A SRNHLRGRRRS+SFRSSGS
Sbjct: 121 ELGQFREEGRTSNAAPPHSGHVGPDFGSGRRQPSDEYSLAPSRNHLRGRRRSSSFRSSGS 180
Query: 181 DWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQQQTGEEVGSELQDLKPSKLEPK 240
DWSGQ+YSRSNNYNDRARAS DTEAYDDTDNL YGN+Q TGEEVG+E Q LKPSKL K
Sbjct: 181 DWSGQEYSRSNNYNDRARASSDTEAYDDTDNLSAYGNRQHTGEEVGTEFQVLKPSKLGRK 240
Query: 241 GDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGSEEVKNST 300
GDTPEDSGPELVKYPLPDD GSK S SAV KDLP+E KLAKDSDDL N+DSG EEVKN+T
Sbjct: 241 GDTPEDSGPELVKYPLPDDVGSKTSISAVGKDLPNEQKLAKDSDDLCNVDSGFEEVKNNT 300
Query: 301 NINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
N NETEKHCV EKLS+QN+ADDGD LVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI
Sbjct: 301 NTNETEKHCVAEKLSMQNKADDGDSLVKPETDLLAFCRFTKFPTKTRSALAYKVSKADPI 360
Query: 361 TAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVENLNSERSKPEAVEKTDTVE 420
VS +PSVIN NR SE+S+D N S+ ALSGAVSAK DVENLN ERS+P+A E+ +E
Sbjct: 361 ATVSAQPSVINPNR-SELSIDGNSSTYALSGAVSAKNLDVENLNYERSEPQATEEAVIME 420
Query: 421 ELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSDLMFEERGQKRSLDESDVG 480
E YP+FGEKASSL S SFQHGPFWNE+KEESC+ PAV GRS+ MFEER QKR LDESDVG
Sbjct: 421 EAYPRFGEKASSLMSHSFQHGPFWNENKEESCRTPAVFGRSNSMFEERSQKRPLDESDVG 480
Query: 481 EGNKKPREWIPLMTSK-EDESFDLLKFDKTKLSSEESKPACDNEVIVAPDCVNSVDGFHF 540
+GNKKPREWIPLM SK EDE+FDLLKFDK K+SSEESKP CDNEVIVA DCVNSVDGFHF
Sbjct: 481 DGNKKPREWIPLMNSKEEDEAFDLLKFDKAKVSSEESKPECDNEVIVAADCVNSVDGFHF 540
Query: 541 IKGGGEQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN-LLIFPSISETKRERTPV 600
IK GG+QCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENN LLIF SISE+KRER PV
Sbjct: 541 IKSGGDQCVDYAQEKQLFPNSFKICDLNLMEASDIHDNHENNPLLIFQSISESKRERAPV 600
Query: 601 DIDLSISNATEFGQNSVVAGGKEIEIIDLEDDSAAEVDKTFHNAERKSETVFTGLDGFPN 660
DI LSISNATEFGQNSVVAGGKEIEIIDLEDDS A+VDKTFHN E K E VFTGLDGFPN
Sbjct: 601 DIGLSISNATEFGQNSVVAGGKEIEIIDLEDDSNAQVDKTFHNPETKREPVFTGLDGFPN 660
Query: 661 NAQNSGDMPDVQDGYGLMISELLGAEFPNCASVQGDINSMHNEMPLSNGEGALADDDLIY 720
N QNSGDMPDV DGYGLMISELLGAEFPNCASVQGDINSMHN+M LSNGEGALADDDLIY
Sbjct: 661 NEQNSGDMPDVHDGYGLMISELLGAEFPNCASVQGDINSMHNQMSLSNGEGALADDDLIY 720
Query: 721 MSLGEIPLSMPEF 732
MSLGEIPLSMPEF
Sbjct: 721 MSLGEIPLSMPEF 732
BLAST of HG10022340 vs. TAIR 10
Match:
AT4G26450.1 (unknown protein; Has 614 Blast hits to 492 proteins in 137 species: Archae - 0; Bacteria - 94; Metazoa - 255; Fungi - 76; Plants - 69; Viruses - 0; Other Eukaryotes - 120 (source: NCBI BLink). )
HSP 1 Score: 284.6 bits (727), Expect = 2.2e-76
Identity = 261/765 (34.12%), Positives = 383/765 (50.07%), Query Frame = 0
Query: 1 MHPRHRNTGNGFRSSSMGVGLASSRISPEGSVRGHGGGYGNDYRNFNHPSGFGRGQGYPK 60
MH R RN GNG+RS S+G+G++ SRISPE +RGH G YG+++++ G+GRG+G K
Sbjct: 1 MHARQRNVGNGYRSGSIGMGMSGSRISPERPMRGH-GFYGSEHQHRGFNRGYGRGRGRSK 60
Query: 61 SY--QSSQSLPPP---RRGGGSVDIFMEAGRLAAEYLVSQGVLPSSVLSGKWQGGNLRRE 120
SY Q LPPP RR G D+FMEAGRLA EYLVSQGVLP +VLS KWQ GN R++
Sbjct: 61 SYHNQLPPPLPPPPVQRRSSGG-DVFMEAGRLATEYLVSQGVLPQTVLSSKWQNGNFRKQ 120
Query: 121 SSESQELGPLREEGRTSTAAPPHSGHVGPDSGSGRRHPNDEYSSAASRNHLRGRRRSTSF 180
+ E Q +EE R +AP + +R D YSSA SRN L+GRR
Sbjct: 121 AGEFQS-SRSQEEARMDVSAP----------AAEKRRYIDGYSSAGSRNSLKGRR----- 180
Query: 181 RSSGSDWSGQDYSRSNNYNDRARASPDTEAYDDTDNLYVYGNQ--QQTGEEVGSELQDLK 240
S D+ RS ++++R++A +TE DD+ V G+Q Q E++ S +Q
Sbjct: 181 ----SHRYDSDFGRSGSWSERSKAF-ETETGDDS----VSGHQEEQPLAEDIASSVQRSA 240
Query: 241 PSKLEPKGDTPEDSGPELVKYPLPDDAGSKASTSAVVKDLPSEPKLAKDSDDLSNIDSGS 300
+ K + DS L KY L D+A SK +S+ KD+ + +++K S+ S++ +GS
Sbjct: 241 SGEFMRKCEGAGDSESVLDKYNLQDEAQSKTGSSSAGKDIVQDCEISKVSEGSSSLSAGS 300
Query: 301 EEVK-----------NSTNINETEKHCVTEKLSVQNEADDGDPLVKPETDLLAFCRFTKF 360
E+K N T I + H E S+ + + K DL C+F K
Sbjct: 301 GEMKGRSGGNGGEDENQTAIEDGSIHQRCEDASIDQQCGADESFTKSGIDLATLCKFEKV 360
Query: 361 PTKTRSALAYKVSKADPITAVSERPSVINTNRGSEISLDSNPSSCALSGAVSAKKHDVEN 420
PT+TRS+L K P +S + N G E D + C G S K +
Sbjct: 361 PTRTRSSLTAK----GPKLYLSHNIKDTSHNSGLE-EEDQTENRCETRGQSSGKADSTGD 420
Query: 421 LNSERSKPEAVEKTDTVEELYPKFGEKASSLTSQSFQHGPFWNESKEESCQIPAVIGRSD 480
N + VE V+ + +++S S + N KE ++P + RS
Sbjct: 421 ENDQ------VEDFALVQYIENSKCHRSNSFPSSILRD----NSEKESGLELPN-LHRSH 480
Query: 481 LMFEERGQKRSLDESDVGEGNKKPREWIPLMTSKEDESFDLLKFDKTKLS-SEESKPACD 540
+ + G+KR + SD+ EG+K+ R+W+ + S+ +E F++ K + EE K +
Sbjct: 481 SV-GKVGEKRPGEGSDLEEGSKRQRDWVAV--SEANERFNMFKTSGNQCDPEEEGKTSSF 540
Query: 541 NEVIVAPDCVNSVDGFHFIKGG-------GEQCVDYAQEKQLFPNSFKICDLNLMEASDI 600
N+ ++ V + G YA+E QLFP SFK+CDLNL ASD+
Sbjct: 541 NKRLIDGAAGKRVSHESLVNNSTYNRTHTGRTGPGYAEEHQLFPASFKMCDLNLGGASDV 600
Query: 601 HDNHENNLLIFPSISETKRERTPVDIDLSISNAT---EFGQNSVVAGGKEIEIIDLEDDS 660
+D K R VD DLSIS+++ EFG ++ ++ GKEIE+I+L+DD
Sbjct: 601 NDG-------------IKESRQAVDFDLSISSSSKSLEFGTSTRMSNGKEIEVINLDDDQ 660
Query: 661 AAEVDKTFHNAERKSETV-FTGLDGFPNNAQNSGDMPDVQDGYGLMISELLGAEFPNCAS 720
EV K+ ++ RK E + G+D D+PD + LM+ E L + F
Sbjct: 661 --EVVKSSNDPGRKQEAAPYMGID----------DVPDYNE--RLMMVEYLDS-FTPINQ 691
Query: 721 VQGDINSMHNEMPLSNGEGAL--------ADDDLIYMSLGEIPLS 728
+ +N + L + EGA+ DDD I+MSLGEIPL+
Sbjct: 721 GTSSVPQNNNTVSLQDREGAIGNDQVPNNTDDDSIFMSLGEIPLT 691
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038897810.1 | 0.0e+00 | 93.44 | uncharacterized protein At4g26450 isoform X1 [Benincasa hispida]
XP_008447800.1 | 0.0e+00 | 92.21 | PREDICTED: uncharacterized protein At4g26450 [Cucumis melo] >XP_008447801.1 PRED...
XP_004139761.1 | 0.0e+00 | 91.53 | uncharacterized protein At4g26450 isoform X1 [Cucumis sativus] >XP_031744602.1 u...
XP_038897811.1 | 0.0e+00 | 93.37 | uncharacterized protein At4g26450 isoform X2 [Benincasa hispida]
TYK23238.1 | 0.0e+00 | 92.10 | uncharacterized protein E5676_scaffold142G002830 [Cucumis melo var. makuwa]
Match Name | E-value | Identity (%) | Description
P0CB21 | 3.1e-75 | 34.12 | Uncharacterized protein At4g26450 OS=Arabidopsis thaliana OX=3702 GN=At4g26450 P...
Match Name | E-value | Identity (%) | Description
A0A1S3BHN6 | 0.0e+00 | 92.21 | uncharacterized protein At4g26450 OS=Cucumis melo OX=3656 GN=LOC103490188 PE=4 S...
A0A0A0K3M2 | 0.0e+00 | 91.53 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G186170 PE=4 SV=1
A0A5D3DIE9 | 0.0e+00 | 92.10 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1FDK2 | 0.0e+00 | 87.04 | uncharacterized protein At4g26450-like isoform X1 OS=Cucurbita moschata OX=3662 ...
A0A6J1IGH6 | 0.0e+00 | 86.36 | uncharacterized protein At4g26450-like isoform X1 OS=Cucurbita maxima OX=3661 GN...
Match Name | E-value | Identity (%) | Description
AT4G26450.1 | 2.2e-76 | 34.12 | unknown protein; Has 614 Blast hits to 492 proteins in 137 species: Archae - 0; ...
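Because each summary row is pipe-delimited, the hit tables can be loaded into records for sorting or filtering. The following Python sketch assumes the first four fields are match name, E-value, percent identity and description, and ignores anything after them; `parse_hit_row` and the 90% cutoff are illustrative choices, not part of the report.

```python
def parse_hit_row(row):
    """Split one pipe-delimited summary row into a typed record.

    Only the first four fields are used, so trailing cells are ignored.
    """
    fields = [field.strip() for field in row.split("|")]
    if len(fields) < 4:
        raise ValueError("expected at least four fields: %r" % row)
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # "0.0e+00" parses to 0.0
        "identity": float(identity),  # percent identity
        "description": description,
    }


# Sample rows copied from the TrEMBL and TAIR tables above.
ROWS = [
    "A0A1S3BHN6 | 0.0e+00 | 92.21 | uncharacterized protein At4g26450 OS=Cucumis melo OX=3656 GN=LOC103490188 PE=4 S...",
    "A0A6J1FDK2 | 0.0e+00 | 87.04 | uncharacterized protein At4g26450-like isoform X1 OS=Cucurbita moschata OX=3662 ...",
    "AT4G26450.1 | 2.2e-76 | 34.12 | unknown protein; Has 614 Blast hits to 492 proteins in 137 species: Archae - 0; ...",
]

hits = [parse_hit_row(row) for row in ROWS]
# Keep only hits at or above an arbitrary 90% identity cutoff.
close_hits = [hit["name"] for hit in hits if hit["identity"] >= 90.0]
```

Header lines ("Match Name | ...") are not numeric and would raise a `ValueError` from `float`, so they should be skipped before calling the parser.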