Homology
BLAST of HG10015472 vs. NCBI nr
Match:
XP_038891423.1 (uncharacterized protein LOC120080842 [Benincasa hispida])
HSP 1 Score: 1041.6 bits (2692), Expect = 2.6e-300
Identity = 541/580 (93.28%), Postives = 565/580 (97.41%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSFASTFSLFLTKSPSISRRRNV+SPNSHLFLGHLRP TNSTFRITASITERDL+L
Sbjct: 1 MCSSSSFASTFSLFLTKSPSISRRRNVISPNSHLFLGHLRP-TNSTFRITASITERDLEL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSWFNPDQPN DD YGGWIFLNSPTSVAKTEK+GLPRF+IGVVGT+LVVLFAV+AQISLS
Sbjct: 61 SSWFNPDQPNGDDTYGGWIFLNSPTSVAKTEKQGLPRFLIGVVGTSLVVLFAVIAQISLS 120
Query: 121 RRGFKFQW-RTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAV 180
RRGFKFQW RTPLRSLEGVFSR ENVSD GKTVED+LTNDDLP +SGAESIPDSKIDD+V
Sbjct: 121 RRGFKFQWRRTPLRSLEGVFSRMENVSDEGKTVEDTLTNDDLPIESGAESIPDSKIDDSV 180
Query: 181 TSDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSL 240
TSDSGNKLERVI+TVPVDSAQDEA+SILKKLKV+ED+INAGELCSRREYARWLVRMYSSL
Sbjct: 181 TSDSGNKLERVILTVPVDSAQDEALSILKKLKVMEDDINAGELCSRREYARWLVRMYSSL 240
Query: 241 ERNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDR 300
ERNPKHHIIP+VSLSGSTVAAFDDISFEDPDFESIQALAEAGI+PSKLSPNYGYDGLGDR
Sbjct: 241 ERNPKHHIIPSVSLSGSTVAAFDDISFEDPDFESIQALAEAGIVPSKLSPNYGYDGLGDR 300
Query: 301 ERTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDI 360
E+T FFPERFVSRQTLIDWKAQLDYEF GMLE+ISSTKVDFMDLKEISSEASPQLFMDI
Sbjct: 301 EKTYFFPERFVSRQTLIDWKAQLDYEFASGMLEQISSTKVDFMDLKEISSEASPQLFMDI 360
Query: 361 LAGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEI 420
LAGERSILRKVFGRIKRFQPNKPSTKAQVAVTL SGRMTEAI+AELSRLESESSARKAEI
Sbjct: 361 LAGERSILRKVFGRIKRFQPNKPSTKAQVAVTLASGRMTEAISAELSRLESESSARKAEI 420
Query: 421 EDIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEK 480
EDIKLELVERGDIQRYWDKKLTEE+K L+KVEELYLAAV+DLGEEKIVQEKF SEYLKEK
Sbjct: 421 EDIKLELVERGDIQRYWDKKLTEEKKRLMKVEELYLAAVNDLGEEKIVQEKFFSEYLKEK 480
Query: 481 ASIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLE 540
SIDCQRQLLLSLKEEVDGMT+KLLSERSVCETEQS+LHNMHADLQNQLE MLDTK+VLE
Sbjct: 481 TSIDCQRQLLLSLKEEVDGMTEKLLSERSVCETEQSDLHNMHADLQNQLEGMLDTKAVLE 540
Query: 541 AEKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 580
AEKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA
Sbjct: 541 AEKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 579
BLAST of HG10015472 vs. NCBI nr
Match:
XP_008446541.1 (PREDICTED: uncharacterized protein LOC103489242 [Cucumis melo])
HSP 1 Score: 995.0 bits (2571), Expect = 2.8e-286
Identity = 517/578 (89.45%), Postives = 542/578 (93.77%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSF STFS FLTKSPSISRRR VL PNSHLFL HLRP TNSTFRI ASITE DL+L
Sbjct: 1 MCSSSSFPSTFSSFLTKSPSISRRRTVLFPNSHLFLAHLRP-TNSTFRIAASITEPDLEL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSWFN DQPN DAYGGW+FLNSPTS KTEKRGL R VIGVVGT+LVVLFAV+AQISLS
Sbjct: 61 SSWFNSDQPNTGDAYGGWVFLNSPTSDVKTEKRGLSRSVIGVVGTSLVVLFAVIAQISLS 120
Query: 121 RRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAVT 180
RRGFKFQWR PLRSLEG+FS TENV D GKTVEDSL NDDLPT+S AESI DSKIDD +T
Sbjct: 121 RRGFKFQWRIPLRSLEGIFSHTENVGDRGKTVEDSLPNDDLPTESRAESIADSKIDDTIT 180
Query: 181 SDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLE 240
SDSGNKLERVIIT+PVDS QDEA+SILKKLKVIE++IN GELCSRREYARWLVRMYSSLE
Sbjct: 181 SDSGNKLERVIITIPVDSTQDEALSILKKLKVIEEDINGGELCSRREYARWLVRMYSSLE 240
Query: 241 RNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
RNPKHHIIP+VSLSGSTVAAFDDISFEDPDFESIQALAEAG++PSKLSPNYGYDGLGDRE
Sbjct: 241 RNPKHHIIPSVSLSGSTVAAFDDISFEDPDFESIQALAEAGVVPSKLSPNYGYDGLGDRE 300
Query: 301 RTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
RT FFPERFVSRQ LIDWK QLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL
Sbjct: 301 RTYFFPERFVSRQALIDWKVQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
Query: 361 AGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIE 420
AGERSILRKVFGR+KRFQPNKP+TKAQVAVTL SGRM EAI+AELSRLESESSARKAEIE
Sbjct: 361 AGERSILRKVFGRMKRFQPNKPATKAQVAVTLASGRMAEAISAELSRLESESSARKAEIE 420
Query: 421 DIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKA 480
DIKLELVERGDIQRYWDKKLTEE+K L+ +EELYLAAVS+LGEEK+VQEK SEYLKEKA
Sbjct: 421 DIKLELVERGDIQRYWDKKLTEEKKRLLDMEELYLAAVSNLGEEKMVQEKIFSEYLKEKA 480
Query: 481 SIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEA 540
SIDCQRQLLLSLKEEVDGMT+KLLSERSVCETEQ+ELHNM ADLQNQLE MLDTKSVLEA
Sbjct: 481 SIDCQRQLLLSLKEEVDGMTEKLLSERSVCETEQNELHNMRADLQNQLEGMLDTKSVLEA 540
Query: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQ 579
EKEALRILR+WVEDEARKSQARAKVLEEVGRRWKWDDQ
Sbjct: 541 EKEALRILRTWVEDEARKSQARAKVLEEVGRRWKWDDQ 577
BLAST of HG10015472 vs. NCBI nr
Match:
XP_011655755.1 (uncharacterized protein LOC101214855 [Cucumis sativus] >KGN52039.1 hypothetical protein Csa_009102 [Cucumis sativus])
HSP 1 Score: 988.8 bits (2555), Expect = 2.0e-284
Identity = 512/579 (88.43%), Postives = 542/579 (93.61%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSF STFSLFLTKSPSISRRR +L PNSHLFL HLRP TNSTFRI ASITE DL L
Sbjct: 1 MCSSSSFPSTFSLFLTKSPSISRRRTLLFPNSHLFLPHLRP-TNSTFRIAASITEPDLHL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSW N DQPN D YGGW+FLN+PT+ AK EKRGL RFVIGVVGT+LVVLFAV+AQISLS
Sbjct: 61 SSWSNFDQPNTGDTYGGWVFLNTPTTDAKIEKRGLSRFVIGVVGTSLVVLFAVIAQISLS 120
Query: 121 RRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAVT 180
RRGFKFQWR PLRSLEG+FS TENV D GKTVEDSLTNDDLPT+SGAESI DSKIDDA+T
Sbjct: 121 RRGFKFQWRIPLRSLEGIFSHTENVGDQGKTVEDSLTNDDLPTESGAESITDSKIDDAIT 180
Query: 181 SDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLE 240
SDSGNKL+RVII +PVDS QDEA+SILKKLKVIE++INAGELCSRREYARWLV MYSSLE
Sbjct: 181 SDSGNKLQRVIIAIPVDSTQDEALSILKKLKVIEEDINAGELCSRREYARWLVHMYSSLE 240
Query: 241 RNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
RNPKHHIIP+VSLSGSTVAAFDDISFEDPDFESIQALAEAG++PSKLSPNYGYDGLGD+E
Sbjct: 241 RNPKHHIIPSVSLSGSTVAAFDDISFEDPDFESIQALAEAGVVPSKLSPNYGYDGLGDQE 300
Query: 301 RTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
RT FFPERFVSRQTLIDWK QLDYEFVPGMLERISS KVDFMDLKEISSEASPQLFMDIL
Sbjct: 301 RTYFFPERFVSRQTLIDWKVQLDYEFVPGMLERISSAKVDFMDLKEISSEASPQLFMDIL 360
Query: 361 AGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIE 420
AGERSILRKVFG+IKRFQPNKP+TKAQVAVTL SGRM EAIAAELSRLESESSARKAEIE
Sbjct: 361 AGERSILRKVFGQIKRFQPNKPATKAQVAVTLASGRMAEAIAAELSRLESESSARKAEIE 420
Query: 421 DIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKA 480
DIKLELVERGDIQRYWDKKLTEE+K L+ VEELYLAA+S+LGEEK+VQEK SEYLKEKA
Sbjct: 421 DIKLELVERGDIQRYWDKKLTEEKKRLLDVEELYLAAISNLGEEKMVQEKIFSEYLKEKA 480
Query: 481 SIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEA 540
SIDCQRQLLLSL EEVDG+ +K+LSERSVCETEQ+ELHNMH DLQNQLE MLDTKSVLEA
Sbjct: 481 SIDCQRQLLLSLNEEVDGIAEKILSERSVCETEQNELHNMHTDLQNQLEGMLDTKSVLEA 540
Query: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 580
EKEALRILR+WVEDEARKSQARAKVLEEVGRRWKWDDQA
Sbjct: 541 EKEALRILRTWVEDEARKSQARAKVLEEVGRRWKWDDQA 578
BLAST of HG10015472 vs. NCBI nr
Match:
KAA0034521.1 (protein CHUP1 [Cucumis melo var. makuwa] >TYK09075.1 protein CHUP1 [Cucumis melo var. makuwa])
HSP 1 Score: 981.9 bits (2537), Expect = 2.4e-282
Identity = 512/575 (89.04%), Postives = 538/575 (93.57%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSF STFS FLTKSPSISRRR VL PNSHLFL HLRP TNSTFRI ASITE DL+L
Sbjct: 1 MCSSSSFPSTFSSFLTKSPSISRRRTVLFPNSHLFLAHLRP-TNSTFRIAASITEPDLEL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSWFN DQPN DAYGGW+FLNSPTS KTEKRGL R VIGVVGT+LVVLFAV+AQISLS
Sbjct: 61 SSWFNSDQPNTGDAYGGWVFLNSPTSDVKTEKRGLSRSVIGVVGTSLVVLFAVIAQISLS 120
Query: 121 RRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAVT 180
RRGFKFQWR PLRSLEG+FS TENV D GKTVEDSL NDDLPT+S AESI DSKIDD +T
Sbjct: 121 RRGFKFQWRIPLRSLEGIFSHTENVGDRGKTVEDSLPNDDLPTESRAESIADSKIDDTIT 180
Query: 181 SDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLE 240
SDSGNKLERVIIT+PVDS QDEA+SILKKLKVIE++IN GELCSRREYARWLVRMYSSLE
Sbjct: 181 SDSGNKLERVIITIPVDSTQDEALSILKKLKVIEEDINGGELCSRREYARWLVRMYSSLE 240
Query: 241 RNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
RNPKHHIIP+VSLSGSTVAAFDDISFEDPDFESIQALAEAG++PSKLSPNYGYDGLGDRE
Sbjct: 241 RNPKHHIIPSVSLSGSTVAAFDDISFEDPDFESIQALAEAGVVPSKLSPNYGYDGLGDRE 300
Query: 301 RTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
RT FFPERFVSRQ LIDWK QLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL
Sbjct: 301 RTYFFPERFVSRQALIDWKVQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
Query: 361 AGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIE 420
AGERSILRKVFGR+KRFQPNKP+TKAQVAVTL SGRM EAI+AELSRLESESSARKAEIE
Sbjct: 361 AGERSILRKVFGRMKRFQPNKPATKAQVAVTLASGRMAEAISAELSRLESESSARKAEIE 420
Query: 421 DIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKA 480
DIKLELVERGDIQRYWDKKLTEE+K L+ +EELYLAAVS+LGEEK+VQEK SEYLKEKA
Sbjct: 421 DIKLELVERGDIQRYWDKKLTEEKKRLLDMEELYLAAVSNLGEEKMVQEKIFSEYLKEKA 480
Query: 481 SIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEA 540
SIDCQRQLLLSLKEEVDGMT+KLLSERSVCETEQ+ELHNM ADLQNQLE MLDTKSVLEA
Sbjct: 481 SIDCQRQLLLSLKEEVDGMTEKLLSERSVCETEQNELHNMRADLQNQLEGMLDTKSVLEA 540
Query: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKW 576
EKEALRILR+WVEDEARKSQARAKVLEEVGRR +W
Sbjct: 541 EKEALRILRTWVEDEARKSQARAKVLEEVGRRLQW 574
BLAST of HG10015472 vs. NCBI nr
Match:
XP_022976135.1 (uncharacterized protein LOC111476598 [Cucurbita maxima])
HSP 1 Score: 974.9 bits (2519), Expect = 3.0e-280
Identity = 512/579 (88.43%), Postives = 539/579 (93.09%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSFASTFSLF KSPSISRRRNV+ P SHLFLGHLRP TNS FRI ASITERDLDL
Sbjct: 1 MCSSSSFASTFSLFPAKSPSISRRRNVIPPYSHLFLGHLRP-TNSKFRIAASITERDLDL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSW NPD PNN D YGGWIFLNSPTS AK +RG+PRFVIGVVGT+LVVLFA ++ ISLS
Sbjct: 61 SSWINPDHPNN-DGYGGWIFLNSPTSDAKIGRRGVPRFVIGVVGTSLVVLFAAISHISLS 120
Query: 121 RRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAVT 180
RRGFKFQWRTPLRSLEGVFSR EN SD GKTVEDSLTN DLPT+SGAESIPDSK+ DAVT
Sbjct: 121 RRGFKFQWRTPLRSLEGVFSRMENESDQGKTVEDSLTNHDLPTESGAESIPDSKVYDAVT 180
Query: 181 SDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLE 240
SDSGNK ERVIITVPVDSAQDEA+SILKKLKVIED+I+AGELCSRREYARWLV MYSSLE
Sbjct: 181 SDSGNKPERVIITVPVDSAQDEALSILKKLKVIEDDIDAGELCSRREYARWLVHMYSSLE 240
Query: 241 RNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
RNPKHHIIP+V LSGST+AAFDDIS EDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE
Sbjct: 241 RNPKHHIIPSVLLSGSTIAAFDDISMEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
Query: 301 RTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
+T FFPERFVSRQTLIDWKAQLDYE VPG+LERISSTKV FMDLKEISSEASPQLFMDIL
Sbjct: 301 KTYFFPERFVSRQTLIDWKAQLDYEVVPGILERISSTKVGFMDLKEISSEASPQLFMDIL 360
Query: 361 AGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIE 420
AGE SI RKVFG+IKRFQPNKPSTKAQVAV LVSGRM EAI+ ELSRLESESSARKAEIE
Sbjct: 361 AGESSIHRKVFGQIKRFQPNKPSTKAQVAVALVSGRMAEAISFELSRLESESSARKAEIE 420
Query: 421 DIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKA 480
DIKLEL+ERGDIQRYWDKKLTEE++ LIKVEELYL A+SDLGEEK+VQEKF SEYLKEKA
Sbjct: 421 DIKLELLERGDIQRYWDKKLTEEKERLIKVEELYLTAISDLGEEKMVQEKFFSEYLKEKA 480
Query: 481 SIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEA 540
SI+CQRQLLLSLKEEVDGMT+KLLSE S+CE E+SELHNMHA LQ+QLE MLDTKSVLEA
Sbjct: 481 SINCQRQLLLSLKEEVDGMTEKLLSETSICEAEKSELHNMHARLQSQLEVMLDTKSVLEA 540
Query: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 580
EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA
Sbjct: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 577
BLAST of HG10015472 vs. ExPASy TrEMBL
Match:
A0A1S3BFY7 (uncharacterized protein LOC103489242 OS=Cucumis melo OX=3656 GN=LOC103489242 PE=4 SV=1)
HSP 1 Score: 995.0 bits (2571), Expect = 1.3e-286
Identity = 517/578 (89.45%), Postives = 542/578 (93.77%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSF STFS FLTKSPSISRRR VL PNSHLFL HLRP TNSTFRI ASITE DL+L
Sbjct: 1 MCSSSSFPSTFSSFLTKSPSISRRRTVLFPNSHLFLAHLRP-TNSTFRIAASITEPDLEL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSWFN DQPN DAYGGW+FLNSPTS KTEKRGL R VIGVVGT+LVVLFAV+AQISLS
Sbjct: 61 SSWFNSDQPNTGDAYGGWVFLNSPTSDVKTEKRGLSRSVIGVVGTSLVVLFAVIAQISLS 120
Query: 121 RRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAVT 180
RRGFKFQWR PLRSLEG+FS TENV D GKTVEDSL NDDLPT+S AESI DSKIDD +T
Sbjct: 121 RRGFKFQWRIPLRSLEGIFSHTENVGDRGKTVEDSLPNDDLPTESRAESIADSKIDDTIT 180
Query: 181 SDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLE 240
SDSGNKLERVIIT+PVDS QDEA+SILKKLKVIE++IN GELCSRREYARWLVRMYSSLE
Sbjct: 181 SDSGNKLERVIITIPVDSTQDEALSILKKLKVIEEDINGGELCSRREYARWLVRMYSSLE 240
Query: 241 RNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
RNPKHHIIP+VSLSGSTVAAFDDISFEDPDFESIQALAEAG++PSKLSPNYGYDGLGDRE
Sbjct: 241 RNPKHHIIPSVSLSGSTVAAFDDISFEDPDFESIQALAEAGVVPSKLSPNYGYDGLGDRE 300
Query: 301 RTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
RT FFPERFVSRQ LIDWK QLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL
Sbjct: 301 RTYFFPERFVSRQALIDWKVQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
Query: 361 AGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIE 420
AGERSILRKVFGR+KRFQPNKP+TKAQVAVTL SGRM EAI+AELSRLESESSARKAEIE
Sbjct: 361 AGERSILRKVFGRMKRFQPNKPATKAQVAVTLASGRMAEAISAELSRLESESSARKAEIE 420
Query: 421 DIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKA 480
DIKLELVERGDIQRYWDKKLTEE+K L+ +EELYLAAVS+LGEEK+VQEK SEYLKEKA
Sbjct: 421 DIKLELVERGDIQRYWDKKLTEEKKRLLDMEELYLAAVSNLGEEKMVQEKIFSEYLKEKA 480
Query: 481 SIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEA 540
SIDCQRQLLLSLKEEVDGMT+KLLSERSVCETEQ+ELHNM ADLQNQLE MLDTKSVLEA
Sbjct: 481 SIDCQRQLLLSLKEEVDGMTEKLLSERSVCETEQNELHNMRADLQNQLEGMLDTKSVLEA 540
Query: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQ 579
EKEALRILR+WVEDEARKSQARAKVLEEVGRRWKWDDQ
Sbjct: 541 EKEALRILRTWVEDEARKSQARAKVLEEVGRRWKWDDQ 577
BLAST of HG10015472 vs. ExPASy TrEMBL
Match:
A0A0A0KT43 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G608270 PE=4 SV=1)
HSP 1 Score: 988.8 bits (2555), Expect = 9.6e-285
Identity = 512/579 (88.43%), Postives = 542/579 (93.61%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSF STFSLFLTKSPSISRRR +L PNSHLFL HLRP TNSTFRI ASITE DL L
Sbjct: 1 MCSSSSFPSTFSLFLTKSPSISRRRTLLFPNSHLFLPHLRP-TNSTFRIAASITEPDLHL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSW N DQPN D YGGW+FLN+PT+ AK EKRGL RFVIGVVGT+LVVLFAV+AQISLS
Sbjct: 61 SSWSNFDQPNTGDTYGGWVFLNTPTTDAKIEKRGLSRFVIGVVGTSLVVLFAVIAQISLS 120
Query: 121 RRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAVT 180
RRGFKFQWR PLRSLEG+FS TENV D GKTVEDSLTNDDLPT+SGAESI DSKIDDA+T
Sbjct: 121 RRGFKFQWRIPLRSLEGIFSHTENVGDQGKTVEDSLTNDDLPTESGAESITDSKIDDAIT 180
Query: 181 SDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLE 240
SDSGNKL+RVII +PVDS QDEA+SILKKLKVIE++INAGELCSRREYARWLV MYSSLE
Sbjct: 181 SDSGNKLQRVIIAIPVDSTQDEALSILKKLKVIEEDINAGELCSRREYARWLVHMYSSLE 240
Query: 241 RNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
RNPKHHIIP+VSLSGSTVAAFDDISFEDPDFESIQALAEAG++PSKLSPNYGYDGLGD+E
Sbjct: 241 RNPKHHIIPSVSLSGSTVAAFDDISFEDPDFESIQALAEAGVVPSKLSPNYGYDGLGDQE 300
Query: 301 RTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
RT FFPERFVSRQTLIDWK QLDYEFVPGMLERISS KVDFMDLKEISSEASPQLFMDIL
Sbjct: 301 RTYFFPERFVSRQTLIDWKVQLDYEFVPGMLERISSAKVDFMDLKEISSEASPQLFMDIL 360
Query: 361 AGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIE 420
AGERSILRKVFG+IKRFQPNKP+TKAQVAVTL SGRM EAIAAELSRLESESSARKAEIE
Sbjct: 361 AGERSILRKVFGQIKRFQPNKPATKAQVAVTLASGRMAEAIAAELSRLESESSARKAEIE 420
Query: 421 DIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKA 480
DIKLELVERGDIQRYWDKKLTEE+K L+ VEELYLAA+S+LGEEK+VQEK SEYLKEKA
Sbjct: 421 DIKLELVERGDIQRYWDKKLTEEKKRLLDVEELYLAAISNLGEEKMVQEKIFSEYLKEKA 480
Query: 481 SIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEA 540
SIDCQRQLLLSL EEVDG+ +K+LSERSVCETEQ+ELHNMH DLQNQLE MLDTKSVLEA
Sbjct: 481 SIDCQRQLLLSLNEEVDGIAEKILSERSVCETEQNELHNMHTDLQNQLEGMLDTKSVLEA 540
Query: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 580
EKEALRILR+WVEDEARKSQARAKVLEEVGRRWKWDDQA
Sbjct: 541 EKEALRILRTWVEDEARKSQARAKVLEEVGRRWKWDDQA 578
BLAST of HG10015472 vs. ExPASy TrEMBL
Match:
A0A5A7SYJ4 (Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G00650 PE=4 SV=1)
HSP 1 Score: 981.9 bits (2537), Expect = 1.2e-282
Identity = 512/575 (89.04%), Postives = 538/575 (93.57%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSF STFS FLTKSPSISRRR VL PNSHLFL HLRP TNSTFRI ASITE DL+L
Sbjct: 1 MCSSSSFPSTFSSFLTKSPSISRRRTVLFPNSHLFLAHLRP-TNSTFRIAASITEPDLEL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSWFN DQPN DAYGGW+FLNSPTS KTEKRGL R VIGVVGT+LVVLFAV+AQISLS
Sbjct: 61 SSWFNSDQPNTGDAYGGWVFLNSPTSDVKTEKRGLSRSVIGVVGTSLVVLFAVIAQISLS 120
Query: 121 RRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAVT 180
RRGFKFQWR PLRSLEG+FS TENV D GKTVEDSL NDDLPT+S AESI DSKIDD +T
Sbjct: 121 RRGFKFQWRIPLRSLEGIFSHTENVGDRGKTVEDSLPNDDLPTESRAESIADSKIDDTIT 180
Query: 181 SDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLE 240
SDSGNKLERVIIT+PVDS QDEA+SILKKLKVIE++IN GELCSRREYARWLVRMYSSLE
Sbjct: 181 SDSGNKLERVIITIPVDSTQDEALSILKKLKVIEEDINGGELCSRREYARWLVRMYSSLE 240
Query: 241 RNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
RNPKHHIIP+VSLSGSTVAAFDDISFEDPDFESIQALAEAG++PSKLSPNYGYDGLGDRE
Sbjct: 241 RNPKHHIIPSVSLSGSTVAAFDDISFEDPDFESIQALAEAGVVPSKLSPNYGYDGLGDRE 300
Query: 301 RTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
RT FFPERFVSRQ LIDWK QLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL
Sbjct: 301 RTYFFPERFVSRQALIDWKVQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
Query: 361 AGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIE 420
AGERSILRKVFGR+KRFQPNKP+TKAQVAVTL SGRM EAI+AELSRLESESSARKAEIE
Sbjct: 361 AGERSILRKVFGRMKRFQPNKPATKAQVAVTLASGRMAEAISAELSRLESESSARKAEIE 420
Query: 421 DIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKA 480
DIKLELVERGDIQRYWDKKLTEE+K L+ +EELYLAAVS+LGEEK+VQEK SEYLKEKA
Sbjct: 421 DIKLELVERGDIQRYWDKKLTEEKKRLLDMEELYLAAVSNLGEEKMVQEKIFSEYLKEKA 480
Query: 481 SIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEA 540
SIDCQRQLLLSLKEEVDGMT+KLLSERSVCETEQ+ELHNM ADLQNQLE MLDTKSVLEA
Sbjct: 481 SIDCQRQLLLSLKEEVDGMTEKLLSERSVCETEQNELHNMRADLQNQLEGMLDTKSVLEA 540
Query: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKW 576
EKEALRILR+WVEDEARKSQARAKVLEEVGRR +W
Sbjct: 541 EKEALRILRTWVEDEARKSQARAKVLEEVGRRLQW 574
BLAST of HG10015472 vs. ExPASy TrEMBL
Match:
A0A6J1IIN2 (uncharacterized protein LOC111476598 OS=Cucurbita maxima OX=3661 GN=LOC111476598 PE=4 SV=1)
HSP 1 Score: 974.9 bits (2519), Expect = 1.4e-280
Identity = 512/579 (88.43%), Postives = 539/579 (93.09%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSFASTFSLF KSPSISRRRNV+ P SHLFLGHLRP TNS FRI ASITERDLDL
Sbjct: 1 MCSSSSFASTFSLFPAKSPSISRRRNVIPPYSHLFLGHLRP-TNSKFRIAASITERDLDL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSW NPD PNN D YGGWIFLNSPTS AK +RG+PRFVIGVVGT+LVVLFA ++ ISLS
Sbjct: 61 SSWINPDHPNN-DGYGGWIFLNSPTSDAKIGRRGVPRFVIGVVGTSLVVLFAAISHISLS 120
Query: 121 RRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAVT 180
RRGFKFQWRTPLRSLEGVFSR EN SD GKTVEDSLTN DLPT+SGAESIPDSK+ DAVT
Sbjct: 121 RRGFKFQWRTPLRSLEGVFSRMENESDQGKTVEDSLTNHDLPTESGAESIPDSKVYDAVT 180
Query: 181 SDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLE 240
SDSGNK ERVIITVPVDSAQDEA+SILKKLKVIED+I+AGELCSRREYARWLV MYSSLE
Sbjct: 181 SDSGNKPERVIITVPVDSAQDEALSILKKLKVIEDDIDAGELCSRREYARWLVHMYSSLE 240
Query: 241 RNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
RNPKHHIIP+V LSGST+AAFDDIS EDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE
Sbjct: 241 RNPKHHIIPSVLLSGSTIAAFDDISMEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
Query: 301 RTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
+T FFPERFVSRQTLIDWKAQLDYE VPG+LERISSTKV FMDLKEISSEASPQLFMDIL
Sbjct: 301 KTYFFPERFVSRQTLIDWKAQLDYEVVPGILERISSTKVGFMDLKEISSEASPQLFMDIL 360
Query: 361 AGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIE 420
AGE SI RKVFG+IKRFQPNKPSTKAQVAV LVSGRM EAI+ ELSRLESESSARKAEIE
Sbjct: 361 AGESSIHRKVFGQIKRFQPNKPSTKAQVAVALVSGRMAEAISFELSRLESESSARKAEIE 420
Query: 421 DIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKA 480
DIKLEL+ERGDIQRYWDKKLTEE++ LIKVEELYL A+SDLGEEK+VQEKF SEYLKEKA
Sbjct: 421 DIKLELLERGDIQRYWDKKLTEEKERLIKVEELYLTAISDLGEEKMVQEKFFSEYLKEKA 480
Query: 481 SIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEA 540
SI+CQRQLLLSLKEEVDGMT+KLLSE S+CE E+SELHNMHA LQ+QLE MLDTKSVLEA
Sbjct: 481 SINCQRQLLLSLKEEVDGMTEKLLSETSICEAEKSELHNMHARLQSQLEVMLDTKSVLEA 540
Query: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 580
EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA
Sbjct: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 577
BLAST of HG10015472 vs. ExPASy TrEMBL
Match:
A0A6J1GZ54 (uncharacterized protein LOC111458840 OS=Cucurbita moschata OX=3662 GN=LOC111458840 PE=4 SV=1)
HSP 1 Score: 967.6 bits (2500), Expect = 2.3e-278
Identity = 508/579 (87.74%), Postives = 538/579 (92.92%), Query Frame = 0
Query: 1 MCSSSSFASTFSLFLTKSPSISRRRNVLSPNSHLFLGHLRPSTNSTFRITASITERDLDL 60
MCSSSSFASTFSLF KSPSISRR NV+ P SHLFLGHLRP TNS FRI ASITERDL+L
Sbjct: 1 MCSSSSFASTFSLFPAKSPSISRRHNVIPPYSHLFLGHLRP-TNSKFRIAASITERDLEL 60
Query: 61 SSWFNPDQPNNDDAYGGWIFLNSPTSVAKTEKRGLPRFVIGVVGTALVVLFAVVAQISLS 120
SSW NPD PNN D YGGWIFLNSPTS AKTE+RG+PRFVIGVVGT+LVVLFA ++ ISLS
Sbjct: 61 SSWLNPDHPNN-DGYGGWIFLNSPTSDAKTERRGVPRFVIGVVGTSLVVLFAAISHISLS 120
Query: 121 RRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSLTNDDLPTKSGAESIPDSKIDDAVT 180
RRGFKFQWRTPLRSLEGVFSR EN SD GKTVEDSLTN DLPT+SGAES+ DSK+ DAVT
Sbjct: 121 RRGFKFQWRTPLRSLEGVFSRMENESDQGKTVEDSLTNHDLPTESGAESMLDSKVYDAVT 180
Query: 181 SDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLE 240
SDSGNK ERVIIT PVDSAQDEA+SILKKLKVIED+I+AGELCSRREYARWLV MYSSLE
Sbjct: 181 SDSGNKPERVIITTPVDSAQDEALSILKKLKVIEDDIDAGELCSRREYARWLVHMYSSLE 240
Query: 241 RNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
RNPKHHIIP+V LSGST+AAFDDIS EDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE
Sbjct: 241 RNPKHHIIPSVLLSGSTIAAFDDISMEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRE 300
Query: 301 RTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDIL 360
+T FFPERFVSRQTLIDWKAQLDYE VPG+LERISSTKV FMDLKEISSEASPQLFMDIL
Sbjct: 301 KTYFFPERFVSRQTLIDWKAQLDYEVVPGILERISSTKVGFMDLKEISSEASPQLFMDIL 360
Query: 361 AGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIE 420
AGERSI RKVFG+IKRFQPNKPSTKAQVAV LVSGRM EAI+ ELSRLESE SARKAEIE
Sbjct: 361 AGERSIHRKVFGQIKRFQPNKPSTKAQVAVALVSGRMAEAISFELSRLESERSARKAEIE 420
Query: 421 DIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKA 480
DIKLEL+ERGDIQRYWDKKLTEE++ LIKVEELYL A+SDLGE+K+VQEKF SEYLKEKA
Sbjct: 421 DIKLELLERGDIQRYWDKKLTEEKERLIKVEELYLTAISDLGEQKMVQEKFFSEYLKEKA 480
Query: 481 SIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEA 540
SI+CQRQLLLSLKEEVDGMT+KLLSERSVCE E+SELH+MHA LQ+QLE LDTKSVLEA
Sbjct: 481 SINCQRQLLLSLKEEVDGMTEKLLSERSVCEAEKSELHDMHARLQSQLEVTLDTKSVLEA 540
Query: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 580
EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA
Sbjct: 541 EKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWDDQA 577
BLAST of HG10015472 vs. TAIR 10
Match:
AT3G25680.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G23890.1); Has 2454 Blast hits to 2065 proteins in 355 species: Archae - 39; Bacteria - 284; Metazoa - 1081; Fungi - 166; Plants - 264; Viruses - 45; Other Eukaryotes - 575 (source: NCBI BLink). )
HSP 1 Score: 486.1 bits (1250), Expect = 3.9e-137
Identity = 277/543 (51.01%), Postives = 372/543 (68.51%), Query Frame = 0
Query: 41 PSTNSTFRITASITERDLDLSSWFNPDQPNNDDAYGGWIFLN----SPTSVAKTEKRGLP 100
P FRI AS++ +SW + + D YGGW SP S+ K + R +
Sbjct: 36 PHKPPRFRIVASLSG-----TSWVS---QASQDKYGGWALAEDETPSPHSITKKKWRNV- 95
Query: 101 RFVIGVVGTALVVLFAVVAQISLSRRGFKFQWRTPLRSLEGVFSRTENVSDHGKTVEDSL 160
VI VG++L V+ A +A S+SR+GF+F + L+ + +N ++L
Sbjct: 96 --VITGVGSSLAVVLATIAYFSISRKGFRFSFSNLLQYQNVELDQNDNEE------SETL 155
Query: 161 TNDDLPTKSGAESIPDSKIDDAVTSDSGNKLERVIITVPVDSAQDEAISILKKLKVIEDN 220
ND+ + S A S + D V S S K RV V VD+AQ EAI++LKKLK+ ED+
Sbjct: 156 FNDENNSPSEANSESVDYVSDNVDSTSTGKTHRVATPVAVDAAQQEAIAVLKKLKIYEDD 215
Query: 221 INAGELCSRREYARWLVRMYSSLERNPKHHIIPAVSLSGSTVAAFDDISFEDPDFESIQA 280
I A ELC++REYARWLVR S LERNP H I+PAV+L+GS++ AFDDI+ DPDFE IQA
Sbjct: 216 IVADELCTKREYARWLVRSNSLLERNPMHMIVPAVALAGSSIPAFDDINTSDPDFEYIQA 275
Query: 281 LAEAGIIPSKLSPNYGYDGLGDRERTCFFPERFVSRQTLIDWKAQLDYEFVPGMLERISS 340
LAEAGI SKLS G D D + F PE FVSR L++WKAQL+ F P ++E IS
Sbjct: 276 LAEAGITSSKLS---GEDSRNDLGNSNFNPESFVSRLDLVNWKAQLECGFHPEIMEEISR 335
Query: 341 TKVDFMDLKEISSEASPQLFMDILAGERSILRKVFGRIKRFQPNKPSTKAQVAVTLVSGR 400
TKVD++D K I+ + + F+D L G++S +R VFGRIKRFQPN+P TKAQ AV L SG+
Sbjct: 336 TKVDYIDTKNINPDMALGFFLDFLMGDKSTIRNVFGRIKRFQPNRPVTKAQAAVALTSGK 395
Query: 401 MTEAIAAELSRLESESSARKAEIEDIKLELVERGDIQRYWDKKLTEERKHLIKVEELYLA 460
M +AI AELSRLE+ES ++KAE E+I+ EL+E+G+I+++WD+K+ ER ++EELYL+
Sbjct: 396 MVKAITAELSRLEAESLSQKAETEEIRSELLEKGEIRQFWDEKIQAERSRGFEMEELYLS 455
Query: 461 AVSDLGEEKIVQEKFLSEYLKEKASIDCQRQLLLSLKEEVDGMTQKLLSERSVCETEQSE 520
V+++ EEK QEK+ +E LKEKA+IDCQ+QLL SL EE+D M+Q+L+S++SV TE S+
Sbjct: 456 RVNEVEEEKTTQEKWSAERLKEKAAIDCQKQLLNSLTEEIDEMSQRLISDKSVYLTEHSK 515
Query: 521 LHNMHADLQNQLEEMLDTKSVLEAEKEALRILRSWVEDEARKSQARAKVLEEVGRRWKWD 580
L M +DLQ++LE ++D +S+LEAE EALRILRSW+EDE + SQARAKVLEE GRRWKW+
Sbjct: 516 LQEMLSDLQSKLESLIDKRSILEAEVEALRILRSWIEDEGKASQARAKVLEEAGRRWKWN 558
BLAST of HG10015472 vs. TAIR 10
Match:
AT5G23890.1 (LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid, chloroplast envelope; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G52410.2); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 231.5 bits (589), Expect = 1.7e-60
Identity = 136/387 (35.14%), Postives = 226/387 (58.40%), Query Frame = 0
Query: 189 RVIITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLERNPKHHII 248
++++ V D Q +A + L+ LKVIE + +LC+RREYARWL+ S+L RN +
Sbjct: 426 KILVPVAADQIQCQAFAALQVLKVIETDTQPSDLCTRREYARWLISASSALSRNTTSKVY 485
Query: 249 PAVSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRERTCFF-PE 308
PA+ + T AFDDI+ EDPDF SIQ LAEAG+I SKLS D L D E T F PE
Sbjct: 486 PAMYIENVTELAFDDITPEDPDFSSIQGLAEAGLIASKLS---NRDLLDDVEGTFLFSPE 545
Query: 309 RFVSRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDILAGERSIL 368
+SRQ LI WK L+ +P +++ F+D+ +I+ +A P + D+ GE+ I
Sbjct: 546 SLLSRQDLISWKMALEKRQLPEADKKMLYKLSGFIDIDKINPDAWPSIIADLSTGEQGIA 605
Query: 369 RKVFGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIEDIKLELV 428
FG + FQP+KP TK Q A+ L SG ++ ++ EL+R+E+ES A KA L
Sbjct: 606 ALAFGCTRLFQPHKPVTKGQAAIALSSGEASDIVSEELARIEAESMAEKAVSAHNALVAE 665
Query: 429 ERGDIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKASIDCQRQ 488
D+ ++K+L+ ER+ + VE++ A +L + + +E+ +KE+A+++ + +
Sbjct: 666 VEKDVNASFEKELSMEREKIEAVEKMAELAKVELEQLREKREEENLALVKERAAVESEME 725
Query: 489 LLLSLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEAEKEALRI 548
+L L+ + + + L+S ++ E+ + N+ + + + + + + LE E++AL +
Sbjct: 726 VLSRLRRDAEEKLEDLMSNKAEITFEKERVFNLRKEAEEESQRISKLQYELEVERKALSM 785
Query: 549 LRSWVEDEARKSQARAKVLEEVGRRWK 575
RSW E+EA+K++ + + LEE +RW+
Sbjct: 786 ARSWAEEEAKKAREQGRALEEARKRWE 809
BLAST of HG10015472 vs. TAIR 10
Match:
AT5G52410.2 (INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G23890.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 223.0 bits (567), Expect = 6.2e-58
Identity = 137/384 (35.68%), Postives = 214/384 (55.73%), Query Frame = 0
Query: 191 IITVPVDSAQDEAISILKKLKVIEDNINAGELCSRREYARWLVRMYSSLERNPKHHIIPA 250
I VD Q + + L+ LKVIE + +LC+RRE+ARW+V ++L RN + PA
Sbjct: 240 IFPTVVDPVQSQMFAALQALKVIESDALPYDLCTRREFARWVVSASNTLSRNSASKVYPA 299
Query: 251 VSLSGSTVAAFDDISFEDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRERTCFFPERFV 310
+ + T AFDDI+ EDPDF IQ LAEAG+I SKLS N + R F PE +
Sbjct: 300 MYIENVTELAFDDITPEDPDFPFIQGLAEAGLISSKLSNNNMPS--SESSRVTFSPESPL 359
Query: 311 SRQTLIDWKAQLDYEFVPGMLERISSTKVDFMDLKEISSEASPQLFMDILAGERSILRKV 370
+RQ L+ WK L++ +P + F+D+ +I+ EA P L D+ AGE I
Sbjct: 360 TRQDLLSWKMALEFRQLPEADSKKLYQLSGFLDIDKINPEAWPALIADLSAGEHGITALS 419
Query: 371 FGRIKRFQPNKPSTKAQVAVTLVSGRMTEAIAAELSRLESESSARKAEIEDIKLELVERG 430
FGR + FQP+K TKAQ AV+L G E + EL+R+E+E+ A +L
Sbjct: 420 FGRTRLFQPSKAVTKAQTAVSLAIGDAFEVVGEELARIEAEAMAENVVCAHNELVAQVEK 479
Query: 431 DIQRYWDKKLTEERKHLIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKASIDCQRQLLL 490
DI ++K+L E++ + VE+L A S+L ++ +E+ +E+ SI+ + + L
Sbjct: 480 DINASFEKELLREKEIVDAVEKLAEEAKSELARLRVEKEEETLALERERTSIETEMEALA 539
Query: 491 SLKEEVDGMTQKLLSERSVCETEQSELHNMHADLQNQLEEMLDTKSVLEAEKEALRILRS 550
++ E++ Q L S ++ E+ + ++++ +E+L ++ LE E+ AL I R
Sbjct: 540 RIRNELEEQLQSLASNKAEMSYEKERFDRLQKQVEDENQEILRLQNELEVERNALSIARD 599
Query: 551 WVEDEARKSQARAKVLEEVGRRWK 575
W +DEAR+++ +AKVLEE RW+
Sbjct: 600 WAKDEARRAREQAKVLEEARGRWE 621
BLAST of HG10015472 vs. TAIR 10
Match:
AT5G52410.1 (CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G23890.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 218.0 bits (554), Expect = 2.0e-56
Identity = 133/368 (36.14%), Postives = 208/368 (56.52%), Query Frame = 0
Query: 207 LKKLKVIEDNINAGELCSRREYARWLVRMYSSLERNPKHHIIPAVSLSGSTVAAFDDISF 266
L+ LKVIE + +LC+RRE+ARW+V ++L RN + PA+ + T AFDDI+
Sbjct: 5 LQALKVIESDALPYDLCTRREFARWVVSASNTLSRNSASKVYPAMYIENVTELAFDDITP 64
Query: 267 EDPDFESIQALAEAGIIPSKLSPNYGYDGLGDRERTCFFPERFVSRQTLIDWKAQLDYEF 326
EDPDF IQ LAEAG+I SKLS N + R F PE ++RQ L+ WK L++
Sbjct: 65 EDPDFPFIQGLAEAGLISSKLSNNNMPS--SESSRVTFSPESPLTRQDLLSWKMALEFRQ 124
Query: 327 VPGMLERISSTKVDFMDLKEISSEASPQLFMDILAGERSILRKVFGRIKRFQPNKPSTKA 386
+P + F+D+ +I+ EA P L D+ AGE I FGR + FQP+K TKA
Sbjct: 125 LPEADSKKLYQLSGFLDIDKINPEAWPALIADLSAGEHGITALSFGRTRLFQPSKAVTKA 184
Query: 387 QVAVTLVSGRMTEAIAAELSRLESESSARKAEIEDIKLELVERGDIQRYWDKKLTEERKH 446
Q AV+L G E + EL+R+E+E+ A +L DI ++K+L E++
Sbjct: 185 QTAVSLAIGDAFEVVGEELARIEAEAMAENVVCAHNELVAQVEKDINASFEKELLREKEI 244
Query: 447 LIKVEELYLAAVSDLGEEKIVQEKFLSEYLKEKASIDCQRQLLLSLKEEVDGMTQKLLSE 506
+ VE+L A S+L ++ +E+ +E+ SI+ + + L ++ E++ Q L S
Sbjct: 245 VDAVEKLAEEAKSELARLRVEKEEETLALERERTSIETEMEALARIRNELEEQLQSLASN 304
Query: 507 RSVCETEQSELHNMHADLQNQLEEMLDTKSVLEAEKEALRILRSWVEDEARKSQARAKVL 566
++ E+ + ++++ +E+L ++ LE E+ AL I R W +DEAR+++ +AKVL
Sbjct: 305 KAEMSYEKERFDRLQKQVEDENQEILRLQNELEVERNALSIARDWAKDEARRAREQAKVL 364
Query: 567 EEVGRRWK 575
EE RW+
Sbjct: 365 EEARGRWE 370
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038891423.1 | 2.6e-300 | 93.28 | uncharacterized protein LOC120080842 [Benincasa hispida] | [more] |
XP_008446541.1 | 2.8e-286 | 89.45 | PREDICTED: uncharacterized protein LOC103489242 [Cucumis melo] | [more] |
XP_011655755.1 | 2.0e-284 | 88.43 | uncharacterized protein LOC101214855 [Cucumis sativus] >KGN52039.1 hypothetical ... | [more] |
KAA0034521.1 | 2.4e-282 | 89.04 | protein CHUP1 [Cucumis melo var. makuwa] >TYK09075.1 protein CHUP1 [Cucumis melo... | [more] |
XP_022976135.1 | 3.0e-280 | 88.43 | uncharacterized protein LOC111476598 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A1S3BFY7 | 1.3e-286 | 89.45 | uncharacterized protein LOC103489242 OS=Cucumis melo OX=3656 GN=LOC103489242 PE=... | [more] |
A0A0A0KT43 | 9.6e-285 | 88.43 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G608270 PE=4 SV=1 | [more] |
A0A5A7SYJ4 | 1.2e-282 | 89.04 | Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G00650 ... | [more] |
A0A6J1IIN2 | 1.4e-280 | 88.43 | uncharacterized protein LOC111476598 OS=Cucurbita maxima OX=3661 GN=LOC111476598... | [more] |
A0A6J1GZ54 | 2.3e-278 | 87.74 | uncharacterized protein LOC111458840 OS=Cucurbita moschata OX=3662 GN=LOC1114588... | [more] |
Match Name | E-value | Identity | Description | |
AT3G25680.1 | 3.9e-137 | 51.01 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... | [more] |
AT5G23890.1 | 1.7e-60 | 35.14 | LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid,... | [more] |
AT5G52410.2 | 6.2e-58 | 35.68 | INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: ... | [more] |
AT5G52410.1 | 2.0e-56 | 36.14 | CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119); BEST A... | [more] |