Homology
BLAST of HG10019849 vs. NCBI nr
Match:
XP_038903080.1 (probable mediator of RNA polymerase II transcription subunit 15c isoform X1 [Benincasa hispida])
HSP 1 Score: 1179.5 bits (3050), Expect = 0.0e+00
Identity = 624/804 (77.61%), Positives = 683/804 (84.95%), Query Frame = 0
Query: 1 MEKKVSLATATDWRMEITKATRVQKLRSICMTLKEQWSARCPDGMNVVAITNVAREYEMK 60
M+KK + ATATDWR EIT TR+QK R ICM L EQWSA DGMN+ AI+ +ARE+EM+
Sbjct: 1 MDKKATPATATDWRTEITNETRLQKFRLICMMLNEQWSAHHADGMNMKAISELAREHEME 60
Query: 61 LFNNAKSKDEYLNAGRTRQMSGRENHHGSSSCQAAVPNPQYHQPAEPNSLLRQHIQPTPQ 120
LF+ AKSKD+YLNAG TR+M+ RENHHGSSS Q AVPNPQYHQ AEPNSLLRQHIQPT Q
Sbjct: 61 LFSMAKSKDDYLNAG-TRKMNRRENHHGSSSTQVAVPNPQYHQSAEPNSLLRQHIQPTTQ 120
Query: 121 LHGQNPNVRQTHQQFVLHNQSGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLVN 180
LH QN NV QTHQQF +HNQ SPQNTLNSQ FQRRDFGIH SPEMFTQHPNLVN
Sbjct: 121 LHRQNLNVGQTHQQFGMHNQRHVSPQNTLNSQ-----FQRRDFGIHPSPEMFTQHPNLVN 180
Query: 181 LQPNENLTTHVKKEVNGEGFQASKSSHQHHTAIEQHKQQQSMGASS--EGNPNSEDWLDV 240
LQPNENLTT VK+EVNGEGFQASKSSHQHHTAIEQ+KQQQSMGAS+ E P SEDW DV
Sbjct: 181 LQPNENLTTQVKEEVNGEGFQASKSSHQHHTAIEQYKQQQSMGASAVPEEIPTSEDWHDV 240
Query: 241 AFAEKERVKKMCLPLLEKACEPSVQVAPTEHIQQHKLVPLDSLKKMVKFLQLPRDKIIVT 300
AFAE ER+KK LPLLEKACE S+QV E IQQ K D L+ M+KFLQLPRDKII+T
Sbjct: 241 AFAEMERLKKTYLPLLEKACELSLQVVQAEQIQQQK---HDPLRGMMKFLQLPRDKIILT 300
Query: 301 YNKEKFYRSLQTIEKLGNVFKSNINRANKQQVLHVGQPDLSGSRINPVQQSDNVKLHCQP 360
Y+KEKFYR LQTIEK+G KS N NKQQ LH GQP GSR+NPVQQS++VKLH QP
Sbjct: 301 YDKEKFYRCLQTIEKVGKAIKSKFNLGNKQQPLHGGQPGPGGSRLNPVQQSNSVKLHRQP 360
Query: 361 VIRATTGSSDSSSPIAPREKGSV---TDCIQKNLLQNRQHSENIKQESQSQWIQLRQKST 420
VIRATTG D +SPIA EKGSV TDCIQ NLLQNRQH ENI+ E +SQW+Q +Q +T
Sbjct: 361 VIRATTGFPDGTSPIASPEKGSVRSETDCIQINLLQNRQHFENIESEFRSQWLQPKQNAT 420
Query: 421 GNIPAIHRSGMSLKHHVNSNFSPQIYEAAQLSQIAQRPLPTNPCASSSHGRASPAPSSSI 480
GNIPAI+RSGMSL H+ SN SPQI+EA+QLSQ A+RPLPTNPC SS HGRASPAPSSSI
Sbjct: 421 GNIPAIYRSGMSLNHYC-SNVSPQIHEASQLSQFAERPLPTNPCGSSLHGRASPAPSSSI 480
Query: 481 VGLEKISPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEPTSLG 540
V L+K SPNVSYL SS FQFPQ+CN E LHPKAEI+VQSQKIRSSSAMTSPFA PTSLG
Sbjct: 481 VRLDKSSPNVSYLSSSNFQFPQNCNPLEFLHPKAEIEVQSQKIRSSSAMTSPFATPTSLG 540
Query: 541 SNGKLSTGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDPWCHAKATDLRLLD 600
SNG+L T TQAH+RL+KAV+SLSNEAL +AVSGI SVGY +D + DPWCHAK TD+RL D
Sbjct: 541 SNGQLPTATQAHNRLLKAVESLSNEALTIAVSGISSVGYSDDAMIDPWCHAKVTDVRLQD 600
Query: 601 GCGSSNNMKRKINAMALNDIPSPCSDIAGSEPTVTSSRKKLKKLSDYALLEEMRNINKQF 660
G GSSNNMKRKINA ALN+IPSPCSDI GSEPTVTS RKKLKKLSDY+LLEE+RNINKQF
Sbjct: 601 GSGSSNNMKRKINATALNNIPSPCSDITGSEPTVTSRRKKLKKLSDYSLLEELRNINKQF 660
Query: 661 IETVLELDLDENLNRRLANAGTVLRCSYSAVTDSKNSEPCPVKLPVLSVKLLVPLDYPED 720
+ETVLELDLDE+LNR+LANAGTVLRCSYSA T+ KNSE CPVKLPVLSVKLLVPLDYPED
Sbjct: 661 VETVLELDLDESLNRKLANAGTVLRCSYSAATECKNSEACPVKLPVLSVKLLVPLDYPED 720
Query: 721 YPVFLSKFNTDSSNVDEEFRDLSNEATSMLRAFLRTAPDCLSLEEYARVWDECARSVVSE 780
YPVFLSKFNT+S NVD+EFRDLSNEAT MLRAFLRTAPDCLSL EYARVWDECARSVVSE
Sbjct: 721 YPVFLSKFNTNSGNVDKEFRDLSNEATLMLRAFLRTAPDCLSLLEYARVWDECARSVVSE 780
Query: 781 YAQRVGGGCFSAQYGTWEDSVSAA 800
YAQR GGGCFS QYGTWED+V+ A
Sbjct: 781 YAQRAGGGCFSTQYGTWEDTVAVA 794
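The percentages on each HSP summary line are the match counts divided by the alignment length (for the hit above, 624 identical and 683 similar positions over 804 aligned columns). The following is a minimal sketch, not part of the original report, of how such a line could be parsed and the percentages recomputed. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions; the pattern tolerates the "Postives" spelling variant that appears in some exported reports.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# "Posi?tives" also accepts the misspelled variant "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP line, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": ident_len,
        # Recomputed from the raw counts, rounded like the report.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positive_pct": round(100 * pos / pos_len, 2),
        # Values as printed in the report, for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


line = "Identity = 624/804 (77.61%), Positives = 683/804 (84.95%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity_pct"], stats["positive_pct"])
```

Applied to every summary line in the report, this gives a quick consistency check between the printed percentages and the underlying counts, and a convenient way to rank hits by identity rather than by bit score.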
BLAST of HG10019849 vs. NCBI nr
Match:
XP_038903081.1 (mediator of RNA polymerase II transcription subunit 15a-like isoform X2 [Benincasa hispida])
HSP 1 Score: 1141.3 bits (2951), Expect = 0.0e+00
Identity = 604/774 (78.04%), Positives = 660/774 (85.27%), Query Frame = 0
Query: 31 MTLKEQWSARCPDGMNVVAITNVAREYEMKLFNNAKSKDEYLNAGRTRQMSGRENHHGSS 90
M L EQWSA DGMN+ AI+ +ARE+EM+LF+ AKSKD+YLNAG TR+M+ RENHHGSS
Sbjct: 1 MMLNEQWSAHHADGMNMKAISELAREHEMELFSMAKSKDDYLNAG-TRKMNRRENHHGSS 60
Query: 91 SCQAAVPNPQYHQPAEPNSLLRQHIQPTPQLHGQNPNVRQTHQQFVLHNQSGFSPQNTLN 150
S Q AVPNPQYHQ AEPNSLLRQHIQPT QLH QN NV QTHQQF +HNQ SPQNTLN
Sbjct: 61 STQVAVPNPQYHQSAEPNSLLRQHIQPTTQLHRQNLNVGQTHQQFGMHNQRHVSPQNTLN 120
Query: 151 SQCEPHGFQRRDFGIHLSPEMFTQHPNLVNLQPNENLTTHVKKEVNGEGFQASKSSHQHH 210
SQ FQRRDFGIH SPEMFTQHPNLVNLQPNENLTT VK+EVNGEGFQASKSSHQHH
Sbjct: 121 SQ-----FQRRDFGIHPSPEMFTQHPNLVNLQPNENLTTQVKEEVNGEGFQASKSSHQHH 180
Query: 211 TAIEQHKQQQSMGASS--EGNPNSEDWLDVAFAEKERVKKMCLPLLEKACEPSVQVAPTE 270
TAIEQ+KQQQSMGAS+ E P SEDW DVAFAE ER+KK LPLLEKACE S+QV E
Sbjct: 181 TAIEQYKQQQSMGASAVPEEIPTSEDWHDVAFAEMERLKKTYLPLLEKACELSLQVVQAE 240
Query: 271 HIQQHKLVPLDSLKKMVKFLQLPRDKIIVTYNKEKFYRSLQTIEKLGNVFKSNINRANKQ 330
IQQ K D L+ M+KFLQLPRDKII+TY+KEKFYR LQTIEK+G KS N NKQ
Sbjct: 241 QIQQQK---HDPLRGMMKFLQLPRDKIILTYDKEKFYRCLQTIEKVGKAIKSKFNLGNKQ 300
Query: 331 QVLHVGQPDLSGSRINPVQQSDNVKLHCQPVIRATTGSSDSSSPIAPREKGSV---TDCI 390
Q LH GQP GSR+NPVQQS++VKLH QPVIRATTG D +SPIA EKGSV TDCI
Sbjct: 301 QPLHGGQPGPGGSRLNPVQQSNSVKLHRQPVIRATTGFPDGTSPIASPEKGSVRSETDCI 360
Query: 391 QKNLLQNRQHSENIKQESQSQWIQLRQKSTGNIPAIHRSGMSLKHHVNSNFSPQIYEAAQ 450
Q NLLQNRQH ENI+ E +SQW+Q +Q +TGNIPAI+RSGMSL H+ SN SPQI+EA+Q
Sbjct: 361 QINLLQNRQHFENIESEFRSQWLQPKQNATGNIPAIYRSGMSLNHYC-SNVSPQIHEASQ 420
Query: 451 LSQIAQRPLPTNPCASSSHGRASPAPSSSIVGLEKISPNVSYLPSSIFQFPQHCNARELL 510
LSQ A+RPLPTNPC SS HGRASPAPSSSIV L+K SPNVSYL SS FQFPQ+CN E L
Sbjct: 421 LSQFAERPLPTNPCGSSLHGRASPAPSSSIVRLDKSSPNVSYLSSSNFQFPQNCNPLEFL 480
Query: 511 HPKAEIQVQSQKIRSSSAMTSPFAEPTSLGSNGKLSTGTQAHDRLVKAVKSLSNEALNVA 570
HPKAEI+VQSQKIRSSSAMTSPFA PTSLGSNG+L T TQAH+RL+KAV+SLSNEAL +A
Sbjct: 481 HPKAEIEVQSQKIRSSSAMTSPFATPTSLGSNGQLPTATQAHNRLLKAVESLSNEALTIA 540
Query: 571 VSGICSVGYMEDTITDPWCHAKATDLRLLDGCGSSNNMKRKINAMALNDIPSPCSDIAGS 630
VSGI SVGY +D + DPWCHAK TD+RL DG GSSNNMKRKINA ALN+IPSPCSDI GS
Sbjct: 541 VSGISSVGYSDDAMIDPWCHAKVTDVRLQDGSGSSNNMKRKINATALNNIPSPCSDITGS 600
Query: 631 EPTVTSSRKKLKKLSDYALLEEMRNINKQFIETVLELDLDENLNRRLANAGTVLRCSYSA 690
EPTVTS RKKLKKLSDY+LLEE+RNINKQF+ETVLELDLDE+LNR+LANAGTVLRCSYSA
Sbjct: 601 EPTVTSRRKKLKKLSDYSLLEELRNINKQFVETVLELDLDESLNRKLANAGTVLRCSYSA 660
Query: 691 VTDSKNSEPCPVKLPVLSVKLLVPLDYPEDYPVFLSKFNTDSSNVDEEFRDLSNEATSML 750
T+ KNSE CPVKLPVLSVKLLVPLDYPEDYPVFLSKFNT+S NVD+EFRDLSNEAT ML
Sbjct: 661 ATECKNSEACPVKLPVLSVKLLVPLDYPEDYPVFLSKFNTNSGNVDKEFRDLSNEATLML 720
Query: 751 RAFLRTAPDCLSLEEYARVWDECARSVVSEYAQRVGGGCFSAQYGTWEDSVSAA 800
RAFLRTAPDCLSL EYARVWDECARSVVSEYAQR GGGCFS QYGTWED+V+ A
Sbjct: 721 RAFLRTAPDCLSLLEYARVWDECARSVVSEYAQRAGGGCFSTQYGTWEDTVAVA 764
BLAST of HG10019849 vs. NCBI nr
Match:
XP_008443239.1 (PREDICTED: uncharacterized protein LOC103486878 [Cucumis melo] >XP_008443241.1 PREDICTED: uncharacterized protein LOC103486878 [Cucumis melo] >XP_008443242.1 PREDICTED: uncharacterized protein LOC103486878 [Cucumis melo] >XP_016899629.1 PREDICTED: uncharacterized protein LOC103486878 [Cucumis melo] >ADN33883.1 hypothetical protein [Cucumis melo subsp. melo] >KAA0053897.1 putative tartrate dehydrogenase/decarboxylase ttuC [Cucumis melo var. makuwa] >TYK25509.1 putative tartrate dehydrogenase/decarboxylase ttuC [Cucumis melo var. makuwa])
HSP 1 Score: 931.4 bits (2406), Expect = 5.2e-267
Identity = 530/804 (65.92%), Positives = 605/804 (75.25%), Query Frame = 0
Query: 1 MEKKVSLATA-TDWRMEITKATRVQKLRSICMTLKEQWSARCPDGMNVVAITNVAREYEM 60
M+KK S+ATA TDWR EITK TR +K SI M L+ Q+S + N+ I++ AR++EM
Sbjct: 1 MDKKASMATATTDWRTEITKETRQKKFHSIWMVLERQFSGQ----FNMNVISDHARKHEM 60
Query: 61 KLFNNAKSKDEYLNAGRTRQMSGRENHHGSSSCQAAVPNPQYHQPAEPNSLLRQHIQPTP 120
KLF+ A S DEYLNAG ++S RENH GSSS +AAV PQYH QPTP
Sbjct: 61 KLFSQANSTDEYLNAG-IGKLSKRENHRGSSSSRAAVVYPQYH-------------QPTP 120
Query: 121 QLHGQNPNVRQTHQQFVLHNQSGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLV 180
QL Q+P VRQ HQQF + NQS S QNT NSQ P GFQR+D GIHLS EMFTQHPN V
Sbjct: 121 QLRRQHPKVRQAHQQFAMQNQSCASLQNTSNSQSRPQGFQRQDIGIHLSSEMFTQHPNFV 180
Query: 181 NLQPNENLTTHVKKEVNGEGFQASKSSHQHHTAIEQHKQQQSMGASSEGNPNSEDWLDVA 240
NLTT V+KE N EGF+ASKS HQH EQHKQQ SM AS+E P+SE D A
Sbjct: 181 ------NLTTQVEKEANSEGFKASKSLHQH----EQHKQQHSMRASAERIPSSEVLHDAA 240
Query: 241 FAEKERVKKMCLPLLEKACEPSVQVAPTEHIQQHKLVPLDSLKKMVKFLQLPRDKIIVTY 300
FAE E++KK LPL+ KA EP +V P QHK + +L++++ F P++KII +Y
Sbjct: 241 FAEMEQLKKTFLPLIIKAYEPYRKVHPD---AQHKNRLMKTLERILTFFHSPKEKIIASY 300
Query: 301 NKEKFYRSLQTIEKLGNVFKSNINRANKQQVLHVGQPDLSGSRIN-PVQQSDNVKLHCQP 360
KE+FYR L+ IE+ GN K N N ANKQ LH GQP LSGSRIN P+QQSDNVKL CQ
Sbjct: 301 TKERFYRCLKYIEQFGNTIKCNTNVANKQSSLHGGQPGLSGSRINHPLQQSDNVKLPCQS 360
Query: 361 VIRATTGSSDSSSPIAPREKGSV---TDCIQKNLLQNRQHSENIKQESQSQWIQLRQKST 420
VIRATTGSS SSSPIAP+EKGSV TD IQKNLLQNRQH ++IK + QWI ++
Sbjct: 361 VIRATTGSSGSSSPIAPQEKGSVRSETDYIQKNLLQNRQHYKSIKSKVHPQWIH----AS 420
Query: 421 GNIPAIHRSGMSLKHHVNSNFSPQIYEAAQLSQIAQRPLPTNPCASSSHGRASPAPSSSI 480
GN PA +RSGMSL HH+NSNFS QI++A+QL +A+RP PT PC S +G ASPAPSS I
Sbjct: 421 GNTPATYRSGMSLNHHLNSNFSHQIHDASQLCHVAERPRPTKPCTSPLYGIASPAPSSPI 480
Query: 481 VGLEKISPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEPTSLG 540
VGLEK SPNV+Y FQ PQHCN +LLH K E QV SQKIRSSSAMTSP AEPTS G
Sbjct: 481 VGLEKTSPNVTYHSGLNFQSPQHCNPYQLLHSKTETQVPSQKIRSSSAMTSPVAEPTSPG 540
Query: 541 SNGKLSTGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDPWCHAKATDLRLLD 600
NG+ ST QA+ RL+KAV S S AL AVSGI SVGYMED I DP CHA T+LRLL+
Sbjct: 541 INGQFST-YQAYSRLLKAVGSSSRAALRAAVSGITSVGYMEDAIIDPRCHAMVTNLRLLN 600
Query: 601 GCGSSNNMKRKINAMALNDIPSPCSDIAGSEPTVTSSRKKLKKLSDYALLEEMRNINKQF 660
GCGSSNNMKRKINAMALN+IPSP SDI GSE TVTS KKLKK +D +LLEE+RNINKQF
Sbjct: 601 GCGSSNNMKRKINAMALNNIPSPRSDIPGSEETVTSRTKKLKKHTDSSLLEEIRNINKQF 660
Query: 661 IETVLELDLDENLNRRLANAGTVLRCSYSAVTDSKNSEPCPVKLPVLSVKLLVPLDYPED 720
IETVLELD+DENLNRRLANAGTVLRCSYSAV D NSE PVKLPVL++KLLVPLDYPED
Sbjct: 661 IETVLELDVDENLNRRLANAGTVLRCSYSAVIDGTNSEAYPVKLPVLTMKLLVPLDYPED 720
Query: 721 YPVFLSKFNTDSSNVDEEFRDLSNEATSMLRAFLRTAPDCLSLEEYARVWDECARSVVSE 780
YPVFLSKF++ SSNVDEE +LSNEA SMLRAFLRTAP+C+SLEEYARVWDECARSVVS+
Sbjct: 721 YPVFLSKFDSGSSNVDEECSNLSNEAMSMLRAFLRTAPECVSLEEYARVWDECARSVVSD 768
Query: 781 YAQRVGGGCFSAQYGTWEDSVSAA 800
Y QR GGG FSA+YGTWEDSV+ A
Sbjct: 781 YVQRAGGGSFSARYGTWEDSVATA 768
BLAST of HG10019849 vs. NCBI nr
Match:
XP_011652173.1 (uncharacterized protein LOC105434992 [Cucumis sativus] >KAE8651172.1 hypothetical protein Csa_002482 [Cucumis sativus])
HSP 1 Score: 840.1 bits (2169), Expect = 1.6e-239
Identity = 496/797 (62.23%), Positives = 563/797 (70.64%), Query Frame = 0
Query: 7 LATATDWRMEITKATRVQKLRSICMTLKEQWSARCPDGMNVVAITNVAREYEMKLFNNAK 66
+AT TDWR EIT+ TR Q +RSI M LKEQ P +NV I++ AR++EM LF+ AK
Sbjct: 1 MATPTDWRTEITQETRQQIVRSIYMMLKEQ-----PSELNVKLISDRARKHEMNLFSTAK 60
Query: 67 SKDEYLNAGRTRQMSGRENHHGSSSCQAAVPNPQYHQPAEPNSLLRQHIQPTPQLHGQNP 126
SK+EYL+ G T +M RENH GSSS QA V PQYHQPAE SLL QHIQ TPQLH Q+P
Sbjct: 61 SKEEYLSTG-TGKMIKRENHQGSSSGQAVVVYPQYHQPAEAKSLLLQHIQRTPQLHRQHP 120
Query: 127 NVRQTHQQFVLHNQSGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLVNLQPNEN 186
NVRQ HQQF + NQSG SPQNT NSQC P GF R+D GIHLS EMFTQHPN V N
Sbjct: 121 NVRQAHQQFAMQNQSGASPQNTSNSQCRPQGFPRQDTGIHLSSEMFTQHPNFV------N 180
Query: 187 LTTHVKKEVNGEGFQASKSSHQHHTAIEQHKQQQSMGASS--EGNPNSEDWLDVAFAEKE 246
LTT VKKEV+ EGF ASKSS EQ KQ GAS+ E PNSE W D AFAE E
Sbjct: 181 LTTQVKKEVDSEGFMASKSS-------EQRKQHSMSGASADPERIPNSEVWHDAAFAEME 240
Query: 247 RVKKMCLPLLEKACEPSVQVAPTEHIQQHKLVPLDSLKKMVKFLQLPRDKIIVTYNKEKF 306
++KK LP KA EP +V E + + L +++++++F Q ++KII +Y KEKF
Sbjct: 241 QLKKTFLPYFIKAYEPFRKVVHQEGLHRRGLG--KTIERILRFFQSSKEKIIASYTKEKF 300
Query: 307 YRSLQTIEKLGNVFKSNINRANKQQVLHVGQPDLSGSRIN-PVQQS-DNVKLHCQPVIRA 366
R LQ IE+ GN KSNIN NK LH GQP LSGSRIN PVQQS DNVKLHCQ VIR
Sbjct: 301 IRCLQYIEQSGNTIKSNINVVNKPSSLHDGQPGLSGSRINHPVQQSGDNVKLHCQSVIRT 360
Query: 367 TTGSSDSSSPIAPREKGSVTDCIQKNLLQNRQHSENIKQESQSQWIQLRQKSTGNIPAIH 426
TTGS SS +AP+E GS I+ + QWI +GN P +
Sbjct: 361 TTGSGSSS--VAPQEIGS------------------IRSKLHPQWIH----GSGNTPFTY 420
Query: 427 RSGMSLKHHVNSNFSPQIYEAAQLSQIAQRPLPTNPCASSSHGRASPAPSSSIVGLEKIS 486
RSG+SL H+NSNF S +A+RP PTNPC HGRASP PSSSIVGLEKIS
Sbjct: 421 RSGISLNPHLNSNF----------SHVAERPRPTNPCTYPLHGRASPPPSSSIVGLEKIS 480
Query: 487 PNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEPTSLGSNGKLST 546
PNV+Y SS F F HCN +LLH KAE+ AEPTSLG NG+LST
Sbjct: 481 PNVTYHSSSNFHFRPHCNPYQLLHSKAEM----------------IAEPTSLGINGQLST 540
Query: 547 GTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDPWCHAKATDLRLLDGCGSSNN 606
QAH+RL+KAV S S EAL AVSGI SVGYMED I DP C AK T+LRL+DG GSSNN
Sbjct: 541 -YQAHNRLLKAVGSSSEEALRAAVSGITSVGYMEDAIIDPQCRAKVTNLRLIDGFGSSNN 600
Query: 607 MKRKINAMALNDIPSPCSDIAGSEPTVTSSRKKLKKLSDYALLEEMRNINKQFIETVLEL 666
MKRKINAMALN+IPSP S+I GSE TVTS KKLKKLSD +LLEEMRNINKQFIETVLEL
Sbjct: 601 MKRKINAMALNNIPSPSSEILGSEETVTSRTKKLKKLSDSSLLEEMRNINKQFIETVLEL 660
Query: 667 DLDENLNRRLANAGTVLRCSYSAVTDSKNSEPCPVKLPVLSVKLLVPLDYPEDYPVFLSK 726
DLDENLN+RLANAGTVLR SYSAV+D NS VKLPVL++KLLVPLDYPEDYPVFLSK
Sbjct: 661 DLDENLNQRLANAGTVLRYSYSAVSDGTNS----VKLPVLTMKLLVPLDYPEDYPVFLSK 720
Query: 727 FNTDSSNVDEEFRDLSNEATSMLRAFLRTAPDCLSLEEYARVWDECARSVVSEYAQRVGG 786
F+ SSNVDEE R+LSN A SMLRAFLRTAP+C+SLE+YAR WDECARSV+SEY +R GG
Sbjct: 721 FDLSSSNVDEESRNLSNGAISMLRAFLRTAPECVSLEDYARAWDECARSVLSEYVRRAGG 721
Query: 787 GCFSAQYGTWEDSVSAA 800
G FSA+YG+WEDSV AA
Sbjct: 781 GSFSARYGSWEDSVVAA 721
BLAST of HG10019849 vs. NCBI nr
Match:
XP_022147805.1 (probable mediator of RNA polymerase II transcription subunit 15c isoform X1 [Momordica charantia])
HSP 1 Score: 772.7 bits (1994), Expect = 3.1e-219
Identity = 480/865 (55.49%), Positives = 574/865 (66.36%), Query Frame = 0
Query: 1 MEKKVSLATATDWRMEITKATRVQKLRSICMTLKEQWSARCPDGMNVVAITNVAREYEMK 60
ME+ +L A DWR EI R + + SIC LKEQW A DG+ ITN+A ++E +
Sbjct: 2 MEENGNLVVAPDWRAEINTEARRKIVDSICAKLKEQWPAYGHDGIISEGITNLAMKFEEE 61
Query: 61 LFNNAKSKDEYLNAGRTRQMSGREN-HHGSSSCQ-------------------------- 120
+FN A SKD Y+ +R+MS EN H GSSS Q
Sbjct: 62 MFNKANSKDHYI-YNISRKMSRIENRHEGSSSIQDHPAFSSDKSVVQKKKEAVFVSQPLP 121
Query: 121 ----AAVPNPQYH----QPAEPNSLLRQHI-QPTPQLH----GQNPNVRQTHQQFVLHNQ 180
P P H Q A PN LLRQ+I Q TPQ H QN N RQ HQQF +H+Q
Sbjct: 122 QMTNQPYPRPMVHNQRQQLAAPNPLLRQNIMQLTPQSHEQLQRQNLNARQLHQQFGMHSQ 181
Query: 181 SGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLVNLQPNENLTTHVKKEVNGEGF 240
S PQNTL S C P G Q RD G+++ P+MF H ++NLQPNENL +K+EV GE
Sbjct: 182 SCVRPQNTLISPCRPLGLQSRDSGVYIPPQMFPPHSEVMNLQPNENLRAQIKEEVIGEDV 241
Query: 241 QASKSSHQHH------TAIEQHKQQQSMGASSE--GNPNSEDWLDVAFAEKERVKKMCLP 300
QASK HH T IEQHKQ QSMG S+ NP EDW D A+ E + +K CLP
Sbjct: 242 QASKFVQPHHTPRGPNTMIEQHKQHQSMGVSAVLIENPKGEDWHDQAYNEMQGLKMTCLP 301
Query: 301 LL----EKACEPSVQVAPTEHIQQHKLVPLDSLKKMVKFLQLPRDKIIVTYNKEKFYRSL 360
LL E+A + ++V TE IQ+ K + ++KM FL+LPRDK I + KEKFY+S+
Sbjct: 302 LLKESYERAAKLCLEVGQTEQIQKCK-NHMSLMQKMTNFLELPRDK-ITYFTKEKFYQSM 361
Query: 361 QTIEKLGNVFKS-NINRANKQQVLHVGQPDLSGSRINPVQQSDNVKLHCQPVIRATTGSS 420
++IEK ++ N NKQQ LH GQP +S S INPVQ+SDN H QP+ A TGSS
Sbjct: 362 KSIEKFAKAHENRNTFLVNKQQPLH-GQPGISQSHINPVQRSDNANPHFQPLNLAATGSS 421
Query: 421 DSSSPIAPREKGS---VTDCIQKNLLQNRQHSENIKQESQSQWIQLRQKSTGNIPAIHRS 480
DSSSP P E GS + IQKNLLQ RQ E IKQE QS WIQ QKST +I AI+RS
Sbjct: 422 DSSSPRTPSEIGSSRPEANRIQKNLLQKRQQPEIIKQEFQSSWIQQMQKSTESIQAINRS 481
Query: 481 GMSLKHHVNSNFSPQIY-EAAQLSQIAQRPLPTNPCAS--SSHGRASPAPSSSIVGLEKI 540
G+SL++H+NS SPQ + EA+QLS+IA+R L +PC+S GRASP PSSS VGL K
Sbjct: 482 GVSLQNHLNSKHSPQSHEEASQLSKIAERALSKDPCSSVYGRDGRASPTPSSSTVGLGKS 541
Query: 541 SPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEPTSLGSNGKLS 600
S NVS L S FQ+P+ N+ LL+ K +IQV+S +IRSS TS FAEPTSLGS+ LS
Sbjct: 542 SSNVSCLSSLNFQYPETRNSVNLLNSKTKIQVKSHEIRSSGISTSQFAEPTSLGSSQHLS 601
Query: 601 TGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDP-----WCHAKATDLRLLDG 660
T TQ +RL+KAV S+S++AL A+SGI SVG M DT+T+P CH KA L L DG
Sbjct: 602 TATQPLNRLLKAVDSMSDQALRTAISGISSVGNMCDTVTEPSVFGTRCHQKAAHLSLQDG 661
Query: 661 CGSSNNMKRKINAMALNDIPSPCSDIAGSEPTVTSSRKKLKKLSDYALLEEMRNINKQFI 720
GSSNNMKRKI A+ LND+PSPCSD GSE TVTS KKLKKL+D ALLEEMRNIN++ +
Sbjct: 662 FGSSNNMKRKICAITLNDMPSPCSDNDGSELTVTSRSKKLKKLTDNALLEEMRNINQRLV 721
Query: 721 ETVLELDLDENLNRRLANAGTVLRCSYSAVTDSKN---SEPCPVKLPVLSVKLLVPLDYP 780
ETVLELD +N+NRR ANAGTV+RC+YSAV+D N +KLPVLSVKLLVPLDYP
Sbjct: 722 ETVLELDSAQNVNRRFANAGTVVRCTYSAVSDRNNLMFHFANTLKLPVLSVKLLVPLDYP 781
Query: 781 EDYPVFLSKFNTDSSNVDEEFRDLSNEATSMLRAFLRTAPDCLSLEEYARVWDECARSVV 799
EDYPVFLSKFNTD N DEE RDLS +ATSMLRAFLRTAP+ LSL EYAR WD+CAR VV
Sbjct: 782 EDYPVFLSKFNTDCGNEDEECRDLSRKATSMLRAFLRTAPERLSLGEYARAWDQCARYVV 841
BLAST of HG10019849 vs. ExPASy Swiss-Prot
Match:
F4I171 (Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana OX=3702 GN=MED15A PE=1 SV=1)
HSP 1 Score: 137.1 bits (344), Expect = 8.6e-31
Identity = 126/414 (30.43%), Positives = 197/414 (47.58%), Query Frame = 0
Query: 432 VNSNFSPQIYEAAQLSQIAQRPLPTN-PCASSSHGRASPAPSSSIVGLEKISPNVSYLPS 491
++ + SPQ+ + ++++ P N P S APS V EK P S L
Sbjct: 931 MSQHLSPQVDQKNTVNKMGTPLQPANSPFVVPSPSSTPLAPSPMQVDSEK--PGSSSLSM 990
Query: 492 SIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEP------TSLGSNGKLSTGT 551
Q + ++ + + + I S+S + F P +S ++GK S
Sbjct: 991 GNIARQQATGMQGVVQ---SLAIGTPGI-SASPLLQEFTSPDGNILNSSTITSGKPSATE 1050
Query: 552 QAHDRLVKAVKSLSNEALNVAVSGICSVGYM------------------EDTITDPWCHA 611
+RL++AVKS+S +AL+ AVS I SV M ED + C
Sbjct: 1051 LPIERLIRAVKSISPQALSSAVSDIGSVVSMVDRIAGSAPGNGSRASVGEDLVAMTKCRL 1110
Query: 612 KATDLRLLDGCGSSNNMKRKINAMALN------DIPSPCSDIAGS-----EPTVTSSRKK 671
+A + +G ++ MKR AM L+ + AGS E T TS KK
Sbjct: 1111 QARNFMTQEGMMATKKMKRHTTAMPLSVASLGGSVGDNYKQFAGSETSDLESTATSDGKK 1170
Query: 672 LKKLSDYALLEEMRNINKQFIETVLELDLDENLN-------RRLANAGTVLRCSYSAVTD 731
+ +++ALLEE++ IN++ I+TV+E+ DE+ + GT +R S+ AV+
Sbjct: 1171 ARTETEHALLEEIKEINQRLIDTVVEISDDEDAADPSEVAISSIGCEGTTVRFSFIAVSL 1230
Query: 732 S---KNSEPCPVKLPVLSVKLLVPLDYPEDYPVFLSKFNTDSSNVDEEFRDLSNEATSML 791
S K P+ ++LLVP YP P L K ++S +E DLS++A +
Sbjct: 1231 SPALKAHLSSTQMSPIQPLRLLVPCSYPNGSPSLLDKLPVETSKENE---DLSSKAMARF 1290
Query: 792 RAFLRTAPDCLSLEEYARVWDECARSVVSEYAQRVGGGCFSAQYGTWEDSVSAA 800
LR+ +SL++ A+ WD CAR+V+ EYAQ+ GGG FS++YGTWE V+A+
Sbjct: 1291 NILLRSLSQPMSLKDIAKTWDACARAVICEYAQQFGGGTFSSKYGTWEKYVAAS 1335
BLAST of HG10019849 vs. ExPASy Swiss-Prot
Match:
Q9SHV7 (Probable mediator of RNA polymerase II transcription subunit 15c OS=Arabidopsis thaliana OX=3702 GN=MED15C PE=3 SV=1)
HSP 1 Score: 92.4 bits (228), Expect = 2.4e-17
Identity = 176/784 (22.45%), Postives = 307/784 (39.16%), Query Frame = 0
Query: 130 QTHQQFVLHNQSGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLVNLQPNENLTT 189
Q + LH S Q N Q G Q +S + Q+P + Q N NL
Sbjct: 191 QQNNNVTLHAMS----QQKNNLQSMTRGQQVGQSQPMMSQQYRQQYPMQQDPQ-NRNLQK 250
Query: 190 HVK-KEVNGEGFQASKSSHQHHTAIEQHKQQQSMGASSEG------------------NP 249
H+ + N FQA+ S Q +Q Q Q + ++ N
Sbjct: 251 HLDFVQNNTNQFQAASSLRQTQNITDQQNQPQQLERANPSILIMNIIVASQDSTGKTVNV 310
Query: 250 NSEDWLDVAFAEKERVKKMCLPLL----EKACEP--SVQVAPTEHIQQHKLVPLD----S 309
N+ +W + + + +++K+MCLP+L ++ E + P + +Q + L S
Sbjct: 311 NAGNWQEETYQKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQWIEKLKAGKLS 370
Query: 310 LKKMVKFLQLPRDKI--------------IVTYNKEKFYRSLQTIEKLGNVFKSNINRAN 369
++ ++ FL + R + I+ + K + T ++ G S
Sbjct: 371 MEHLMFFLNVHRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQQQQGQFPPSQTAMQT 430
Query: 370 KQQVLHVGQ---PDLSGSRINPVQQSDNVKL---------HCQPVIRATTGS----SDSS 429
+ +HV Q + SR+ P Q++ L + +I A++G+ S
Sbjct: 431 QSPQVHVSQSLYKEQRRSRLMPSSQNEASSLLQIRPKLDPRDENIIMASSGNVMLPSVKQ 490
Query: 430 SPIAPREKGSVTDCIQKNLLQNRQHSENIKQESQSQWIQLRQKSTGNI------PAIHRS 489
+P A S +QK Q R H ++Q+ Q Q T + ++
Sbjct: 491 NPRAVNTNISSVQSLQK---QKRFHHRQMQQQQPQQGNHQHQMQTNEMNDVRMRERVNIK 550
Query: 490 GMSLKHHVNSN-----------FSPQIYEAAQLSQIAQRPLPT------NPCASSSHGRA 549
L+ V+S+ S QI + + Q LP P SS
Sbjct: 551 ARLLEQQVSSSQRQVPKQESNVSSSQIQNHSSPQLVDQHILPATINKTGTPLNSSGSAFV 610
Query: 550 SPAPSSSIVGLEKISPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSP 609
+PAPS P S +P S+ P + + + + SSS + +
Sbjct: 611 APAPSP--------VPGDSEMPISVES------------PVSGVDEINSTLDSSSKLGT- 670
Query: 610 FAEPTSLGSNGKLSTGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDPW---- 669
E L + DRL+KA ++ S ++L +VS I SV M D I +
Sbjct: 671 -QETPLLFVPPPEPITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSSG 730
Query: 670 ---------CHAKATDLRLLDGCGSSNNMKRKINAMALN-----DIPSPCSDIAGSEPTV 729
+ + + S MKR IN + + D S + +
Sbjct: 731 GSRAGLGEDLSERTRNFTTHEETNLSKRMKRSINIVPPDMSSQIDSYEQLSSLESEVVST 790
Query: 730 TSSRKKLKKLS-DYALLEEMRNINKQFIETVLELDLDENLNRRLANAGTVLRCSYSAVTD 789
TSS K+ ++ YALL+E++ N + +ETV+E+ +++L GT++ C+Y+ V
Sbjct: 791 TSSGLKVNNIAPGYALLQEIKETNGRLVETVVEICDEDSL-------GTIVTCTYAPVAL 850
Query: 790 SKNSE-------------PCPVKLPVLSVKLLVPLDYPEDYPVFLSKFNTDSSNVDEEFR 800
S + C ++ + ++LL P+DYP P+ L + + D+S ++
Sbjct: 851 SATFKDHYKSGKIIFYVSKCLMQAQIQPLRLLFPMDYPYSSPIVLEEISFDTS--VHKYE 910
BLAST of HG10019849 vs. ExPASy TrEMBL
Match:
A0A1S4DUH4 (uncharacterized protein LOC103486878 OS=Cucumis melo OX=3656 GN=LOC103486878 PE=4 SV=1)
HSP 1 Score: 931.4 bits (2406), Expect = 2.5e-267
Identity = 530/804 (65.92%), Positives = 605/804 (75.25%), Query Frame = 0
Query: 1 MEKKVSLATA-TDWRMEITKATRVQKLRSICMTLKEQWSARCPDGMNVVAITNVAREYEM 60
M+KK S+ATA TDWR EITK TR +K SI M L+ Q+S + N+ I++ AR++EM
Sbjct: 1 MDKKASMATATTDWRTEITKETRQKKFHSIWMVLERQFSGQ----FNMNVISDHARKHEM 60
Query: 61 KLFNNAKSKDEYLNAGRTRQMSGRENHHGSSSCQAAVPNPQYHQPAEPNSLLRQHIQPTP 120
KLF+ A S DEYLNAG ++S RENH GSSS +AAV PQYH QPTP
Sbjct: 61 KLFSQANSTDEYLNAG-IGKLSKRENHRGSSSSRAAVVYPQYH-------------QPTP 120
Query: 121 QLHGQNPNVRQTHQQFVLHNQSGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLV 180
QL Q+P VRQ HQQF + NQS S QNT NSQ P GFQR+D GIHLS EMFTQHPN V
Sbjct: 121 QLRRQHPKVRQAHQQFAMQNQSCASLQNTSNSQSRPQGFQRQDIGIHLSSEMFTQHPNFV 180
Query: 181 NLQPNENLTTHVKKEVNGEGFQASKSSHQHHTAIEQHKQQQSMGASSEGNPNSEDWLDVA 240
NLTT V+KE N EGF+ASKS HQH EQHKQQ SM AS+E P+SE D A
Sbjct: 181 ------NLTTQVEKEANSEGFKASKSLHQH----EQHKQQHSMRASAERIPSSEVLHDAA 240
Query: 241 FAEKERVKKMCLPLLEKACEPSVQVAPTEHIQQHKLVPLDSLKKMVKFLQLPRDKIIVTY 300
FAE E++KK LPL+ KA EP +V P QHK + +L++++ F P++KII +Y
Sbjct: 241 FAEMEQLKKTFLPLIIKAYEPYRKVHPD---AQHKNRLMKTLERILTFFHSPKEKIIASY 300
Query: 301 NKEKFYRSLQTIEKLGNVFKSNINRANKQQVLHVGQPDLSGSRIN-PVQQSDNVKLHCQP 360
KE+FYR L+ IE+ GN K N N ANKQ LH GQP LSGSRIN P+QQSDNVKL CQ
Sbjct: 301 TKERFYRCLKYIEQFGNTIKCNTNVANKQSSLHGGQPGLSGSRINHPLQQSDNVKLPCQS 360
Query: 361 VIRATTGSSDSSSPIAPREKGSV---TDCIQKNLLQNRQHSENIKQESQSQWIQLRQKST 420
VIRATTGSS SSSPIAP+EKGSV TD IQKNLLQNRQH ++IK + QWI ++
Sbjct: 361 VIRATTGSSGSSSPIAPQEKGSVRSETDYIQKNLLQNRQHYKSIKSKVHPQWIH----AS 420
Query: 421 GNIPAIHRSGMSLKHHVNSNFSPQIYEAAQLSQIAQRPLPTNPCASSSHGRASPAPSSSI 480
GN PA +RSGMSL HH+NSNFS QI++A+QL +A+RP PT PC S +G ASPAPSS I
Sbjct: 421 GNTPATYRSGMSLNHHLNSNFSHQIHDASQLCHVAERPRPTKPCTSPLYGIASPAPSSPI 480
Query: 481 VGLEKISPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEPTSLG 540
VGLEK SPNV+Y FQ PQHCN +LLH K E QV SQKIRSSSAMTSP AEPTS G
Sbjct: 481 VGLEKTSPNVTYHSGLNFQSPQHCNPYQLLHSKTETQVPSQKIRSSSAMTSPVAEPTSPG 540
Query: 541 SNGKLSTGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDPWCHAKATDLRLLD 600
NG+ ST QA+ RL+KAV S S AL AVSGI SVGYMED I DP CHA T+LRLL+
Sbjct: 541 INGQFST-YQAYSRLLKAVGSSSRAALRAAVSGITSVGYMEDAIIDPRCHAMVTNLRLLN 600
Query: 601 GCGSSNNMKRKINAMALNDIPSPCSDIAGSEPTVTSSRKKLKKLSDYALLEEMRNINKQF 660
GCGSSNNMKRKINAMALN+IPSP SDI GSE TVTS KKLKK +D +LLEE+RNINKQF
Sbjct: 601 GCGSSNNMKRKINAMALNNIPSPRSDIPGSEETVTSRTKKLKKHTDSSLLEEIRNINKQF 660
Query: 661 IETVLELDLDENLNRRLANAGTVLRCSYSAVTDSKNSEPCPVKLPVLSVKLLVPLDYPED 720
IETVLELD+DENLNRRLANAGTVLRCSYSAV D NSE PVKLPVL++KLLVPLDYPED
Sbjct: 661 IETVLELDVDENLNRRLANAGTVLRCSYSAVIDGTNSEAYPVKLPVLTMKLLVPLDYPED 720
Query: 721 YPVFLSKFNTDSSNVDEEFRDLSNEATSMLRAFLRTAPDCLSLEEYARVWDECARSVVSE 780
YPVFLSKF++ SSNVDEE +LSNEA SMLRAFLRTAP+C+SLEEYARVWDECARSVVS+
Sbjct: 721 YPVFLSKFDSGSSNVDEECSNLSNEAMSMLRAFLRTAPECVSLEEYARVWDECARSVVSD 768
Query: 781 YAQRVGGGCFSAQYGTWEDSVSAA 800
Y QR GGG FSA+YGTWEDSV+ A
Sbjct: 781 YVQRAGGGSFSARYGTWEDSVATA 768
BLAST of HG10019849 vs. ExPASy TrEMBL
Match:
A0A5A7UH89 (Putative tartrate dehydrogenase/decarboxylase ttuC OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G006580 PE=4 SV=1)
HSP 1 Score: 931.4 bits (2406), Expect = 2.5e-267
Identity = 530/804 (65.92%), Positives = 605/804 (75.25%), Query Frame = 0
Query: 1 MEKKVSLATA-TDWRMEITKATRVQKLRSICMTLKEQWSARCPDGMNVVAITNVAREYEM 60
M+KK S+ATA TDWR EITK TR +K SI M L+ Q+S + N+ I++ AR++EM
Sbjct: 1 MDKKASMATATTDWRTEITKETRQKKFHSIWMVLERQFSGQ----FNMNVISDHARKHEM 60
Query: 61 KLFNNAKSKDEYLNAGRTRQMSGRENHHGSSSCQAAVPNPQYHQPAEPNSLLRQHIQPTP 120
KLF+ A S DEYLNAG ++S RENH GSSS +AAV PQYH QPTP
Sbjct: 61 KLFSQANSTDEYLNAG-IGKLSKRENHRGSSSSRAAVVYPQYH-------------QPTP 120
Query: 121 QLHGQNPNVRQTHQQFVLHNQSGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLV 180
QL Q+P VRQ HQQF + NQS S QNT NSQ P GFQR+D GIHLS EMFTQHPN V
Sbjct: 121 QLRRQHPKVRQAHQQFAMQNQSCASLQNTSNSQSRPQGFQRQDIGIHLSSEMFTQHPNFV 180
Query: 181 NLQPNENLTTHVKKEVNGEGFQASKSSHQHHTAIEQHKQQQSMGASSEGNPNSEDWLDVA 240
NLTT V+KE N EGF+ASKS HQH EQHKQQ SM AS+E P+SE D A
Sbjct: 181 ------NLTTQVEKEANSEGFKASKSLHQH----EQHKQQHSMRASAERIPSSEVLHDAA 240
Query: 241 FAEKERVKKMCLPLLEKACEPSVQVAPTEHIQQHKLVPLDSLKKMVKFLQLPRDKIIVTY 300
FAE E++KK LPL+ KA EP +V P QHK + +L++++ F P++KII +Y
Sbjct: 241 FAEMEQLKKTFLPLIIKAYEPYRKVHPD---AQHKNRLMKTLERILTFFHSPKEKIIASY 300
Query: 301 NKEKFYRSLQTIEKLGNVFKSNINRANKQQVLHVGQPDLSGSRIN-PVQQSDNVKLHCQP 360
KE+FYR L+ IE+ GN K N N ANKQ LH GQP LSGSRIN P+QQSDNVKL CQ
Sbjct: 301 TKERFYRCLKYIEQFGNTIKCNTNVANKQSSLHGGQPGLSGSRINHPLQQSDNVKLPCQS 360
Query: 361 VIRATTGSSDSSSPIAPREKGSV---TDCIQKNLLQNRQHSENIKQESQSQWIQLRQKST 420
VIRATTGSS SSSPIAP+EKGSV TD IQKNLLQNRQH ++IK + QWI ++
Sbjct: 361 VIRATTGSSGSSSPIAPQEKGSVRSETDYIQKNLLQNRQHYKSIKSKVHPQWIH----AS 420
Query: 421 GNIPAIHRSGMSLKHHVNSNFSPQIYEAAQLSQIAQRPLPTNPCASSSHGRASPAPSSSI 480
GN PA +RSGMSL HH+NSNFS QI++A+QL +A+RP PT PC S +G ASPAPSS I
Sbjct: 421 GNTPATYRSGMSLNHHLNSNFSHQIHDASQLCHVAERPRPTKPCTSPLYGIASPAPSSPI 480
Query: 481 VGLEKISPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEPTSLG 540
VGLEK SPNV+Y FQ PQHCN +LLH K E QV SQKIRSSSAMTSP AEPTS G
Sbjct: 481 VGLEKTSPNVTYHSGLNFQSPQHCNPYQLLHSKTETQVPSQKIRSSSAMTSPVAEPTSPG 540
Query: 541 SNGKLSTGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDPWCHAKATDLRLLD 600
NG+ ST QA+ RL+KAV S S AL AVSGI SVGYMED I DP CHA T+LRLL+
Sbjct: 541 INGQFST-YQAYSRLLKAVGSSSRAALRAAVSGITSVGYMEDAIIDPRCHAMVTNLRLLN 600
Query: 601 GCGSSNNMKRKINAMALNDIPSPCSDIAGSEPTVTSSRKKLKKLSDYALLEEMRNINKQF 660
GCGSSNNMKRKINAMALN+IPSP SDI GSE TVTS KKLKK +D +LLEE+RNINKQF
Sbjct: 601 GCGSSNNMKRKINAMALNNIPSPRSDIPGSEETVTSRTKKLKKHTDSSLLEEIRNINKQF 660
Query: 661 IETVLELDLDENLNRRLANAGTVLRCSYSAVTDSKNSEPCPVKLPVLSVKLLVPLDYPED 720
IETVLELD+DENLNRRLANAGTVLRCSYSAV D NSE PVKLPVL++KLLVPLDYPED
Sbjct: 661 IETVLELDVDENLNRRLANAGTVLRCSYSAVIDGTNSEAYPVKLPVLTMKLLVPLDYPED 720
Query: 721 YPVFLSKFNTDSSNVDEEFRDLSNEATSMLRAFLRTAPDCLSLEEYARVWDECARSVVSE 780
YPVFLSKF++ SSNVDEE +LSNEA SMLRAFLRTAP+C+SLEEYARVWDECARSVVS+
Sbjct: 721 YPVFLSKFDSGSSNVDEECSNLSNEAMSMLRAFLRTAPECVSLEEYARVWDECARSVVSD 768
Query: 781 YAQRVGGGCFSAQYGTWEDSVSAA 800
Y QR GGG FSA+YGTWEDSV+ A
Sbjct: 781 YVQRAGGGSFSARYGTWEDSVATA 768
BLAST of HG10019849 vs. ExPASy TrEMBL
Match:
E5GBP1 (KIX_2 domain-containing protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)
HSP 1 Score: 931.4 bits (2406), Expect = 2.5e-267
Identity = 530/804 (65.92%), Positives = 605/804 (75.25%), Query Frame = 0
Query: 1 MEKKVSLATA-TDWRMEITKATRVQKLRSICMTLKEQWSARCPDGMNVVAITNVAREYEM 60
M+KK S+ATA TDWR EITK TR +K SI M L+ Q+S + N+ I++ AR++EM
Sbjct: 1 MDKKASMATATTDWRTEITKETRQKKFHSIWMVLERQFSGQ----FNMNVISDHARKHEM 60
Query: 61 KLFNNAKSKDEYLNAGRTRQMSGRENHHGSSSCQAAVPNPQYHQPAEPNSLLRQHIQPTP 120
KLF+ A S DEYLNAG ++S RENH GSSS +AAV PQYH QPTP
Sbjct: 61 KLFSQANSTDEYLNAG-IGKLSKRENHRGSSSSRAAVVYPQYH-------------QPTP 120
Query: 121 QLHGQNPNVRQTHQQFVLHNQSGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLV 180
QL Q+P VRQ HQQF + NQS S QNT NSQ P GFQR+D GIHLS EMFTQHPN V
Sbjct: 121 QLRRQHPKVRQAHQQFAMQNQSCASLQNTSNSQSRPQGFQRQDIGIHLSSEMFTQHPNFV 180
Query: 181 NLQPNENLTTHVKKEVNGEGFQASKSSHQHHTAIEQHKQQQSMGASSEGNPNSEDWLDVA 240
NLTT V+KE N EGF+ASKS HQH EQHKQQ SM AS+E P+SE D A
Sbjct: 181 ------NLTTQVEKEANSEGFKASKSLHQH----EQHKQQHSMRASAERIPSSEVLHDAA 240
Query: 241 FAEKERVKKMCLPLLEKACEPSVQVAPTEHIQQHKLVPLDSLKKMVKFLQLPRDKIIVTY 300
FAE E++KK LPL+ KA EP +V P QHK + +L++++ F P++KII +Y
Sbjct: 241 FAEMEQLKKTFLPLIIKAYEPYRKVHPD---AQHKNRLMKTLERILTFFHSPKEKIIASY 300
Query: 301 NKEKFYRSLQTIEKLGNVFKSNINRANKQQVLHVGQPDLSGSRIN-PVQQSDNVKLHCQP 360
KE+FYR L+ IE+ GN K N N ANKQ LH GQP LSGSRIN P+QQSDNVKL CQ
Sbjct: 301 TKERFYRCLKYIEQFGNTIKCNTNVANKQSSLHGGQPGLSGSRINHPLQQSDNVKLPCQS 360
Query: 361 VIRATTGSSDSSSPIAPREKGSV---TDCIQKNLLQNRQHSENIKQESQSQWIQLRQKST 420
VIRATTGSS SSSPIAP+EKGSV TD IQKNLLQNRQH ++IK + QWI ++
Sbjct: 361 VIRATTGSSGSSSPIAPQEKGSVRSETDYIQKNLLQNRQHYKSIKSKVHPQWIH----AS 420
Query: 421 GNIPAIHRSGMSLKHHVNSNFSPQIYEAAQLSQIAQRPLPTNPCASSSHGRASPAPSSSI 480
GN PA +RSGMSL HH+NSNFS QI++A+QL +A+RP PT PC S +G ASPAPSS I
Sbjct: 421 GNTPATYRSGMSLNHHLNSNFSHQIHDASQLCHVAERPRPTKPCTSPLYGIASPAPSSPI 480
Query: 481 VGLEKISPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEPTSLG 540
VGLEK SPNV+Y FQ PQHCN +LLH K E QV SQKIRSSSAMTSP AEPTS G
Sbjct: 481 VGLEKTSPNVTYHSGLNFQSPQHCNPYQLLHSKTETQVPSQKIRSSSAMTSPVAEPTSPG 540
Query: 541 SNGKLSTGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDPWCHAKATDLRLLD 600
NG+ ST QA+ RL+KAV S S AL AVSGI SVGYMED I DP CHA T+LRLL+
Sbjct: 541 INGQFST-YQAYSRLLKAVGSSSRAALRAAVSGITSVGYMEDAIIDPRCHAMVTNLRLLN 600
Query: 601 GCGSSNNMKRKINAMALNDIPSPCSDIAGSEPTVTSSRKKLKKLSDYALLEEMRNINKQF 660
GCGSSNNMKRKINAMALN+IPSP SDI GSE TVTS KKLKK +D +LLEE+RNINKQF
Sbjct: 601 GCGSSNNMKRKINAMALNNIPSPRSDIPGSEETVTSRTKKLKKHTDSSLLEEIRNINKQF 660
Query: 661 IETVLELDLDENLNRRLANAGTVLRCSYSAVTDSKNSEPCPVKLPVLSVKLLVPLDYPED 720
IETVLELD+DENLNRRLANAGTVLRCSYSAV D NSE PVKLPVL++KLLVPLDYPED
Sbjct: 661 IETVLELDVDENLNRRLANAGTVLRCSYSAVIDGTNSEAYPVKLPVLTMKLLVPLDYPED 720
Query: 721 YPVFLSKFNTDSSNVDEEFRDLSNEATSMLRAFLRTAPDCLSLEEYARVWDECARSVVSE 780
YPVFLSKF++ SSNVDEE +LSNEA SMLRAFLRTAP+C+SLEEYARVWDECARSVVS+
Sbjct: 721 YPVFLSKFDSGSSNVDEECSNLSNEAMSMLRAFLRTAPECVSLEEYARVWDECARSVVSD 768
Query: 781 YAQRVGGGCFSAQYGTWEDSVSAA 800
Y QR GGG FSA+YGTWEDSV+ A
Sbjct: 781 YVQRAGGGSFSARYGTWEDSVATA 768
BLAST of HG10019849 vs. ExPASy TrEMBL
Match:
A0A6J1D158 (probable mediator of RNA polymerase II transcription subunit 15c isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016654 PE=4 SV=1)
HSP 1 Score: 772.7 bits (1994), Expect = 1.5e-219
Identity = 480/865 (55.49%), Positives = 574/865 (66.36%), Query Frame = 0
Query: 1 MEKKVSLATATDWRMEITKATRVQKLRSICMTLKEQWSARCPDGMNVVAITNVAREYEMK 60
ME+ +L A DWR EI R + + SIC LKEQW A DG+ ITN+A ++E +
Sbjct: 2 MEENGNLVVAPDWRAEINTEARRKIVDSICAKLKEQWPAYGHDGIISEGITNLAMKFEEE 61
Query: 61 LFNNAKSKDEYLNAGRTRQMSGREN-HHGSSSCQ-------------------------- 120
+FN A SKD Y+ +R+MS EN H GSSS Q
Sbjct: 62 MFNKANSKDHYI-YNISRKMSRIENRHEGSSSIQDHPAFSSDKSVVQKKKEAVFVSQPLP 121
Query: 121 ----AAVPNPQYH----QPAEPNSLLRQHI-QPTPQLH----GQNPNVRQTHQQFVLHNQ 180
P P H Q A PN LLRQ+I Q TPQ H QN N RQ HQQF +H+Q
Sbjct: 122 QMTNQPYPRPMVHNQRQQLAAPNPLLRQNIMQLTPQSHEQLQRQNLNARQLHQQFGMHSQ 181
Query: 181 SGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLVNLQPNENLTTHVKKEVNGEGF 240
S PQNTL S C P G Q RD G+++ P+MF H ++NLQPNENL +K+EV GE
Sbjct: 182 SCVRPQNTLISPCRPLGLQSRDSGVYIPPQMFPPHSEVMNLQPNENLRAQIKEEVIGEDV 241
Query: 241 QASKSSHQHH------TAIEQHKQQQSMGASSE--GNPNSEDWLDVAFAEKERVKKMCLP 300
QASK HH T IEQHKQ QSMG S+ NP EDW D A+ E + +K CLP
Sbjct: 242 QASKFVQPHHTPRGPNTMIEQHKQHQSMGVSAVLIENPKGEDWHDQAYNEMQGLKMTCLP 301
Query: 301 LL----EKACEPSVQVAPTEHIQQHKLVPLDSLKKMVKFLQLPRDKIIVTYNKEKFYRSL 360
LL E+A + ++V TE IQ+ K + ++KM FL+LPRDK I + KEKFY+S+
Sbjct: 302 LLKESYERAAKLCLEVGQTEQIQKCK-NHMSLMQKMTNFLELPRDK-ITYFTKEKFYQSM 361
Query: 361 QTIEKLGNVFKS-NINRANKQQVLHVGQPDLSGSRINPVQQSDNVKLHCQPVIRATTGSS 420
++IEK ++ N NKQQ LH GQP +S S INPVQ+SDN H QP+ A TGSS
Sbjct: 362 KSIEKFAKAHENRNTFLVNKQQPLH-GQPGISQSHINPVQRSDNANPHFQPLNLAATGSS 421
Query: 421 DSSSPIAPREKGS---VTDCIQKNLLQNRQHSENIKQESQSQWIQLRQKSTGNIPAIHRS 480
DSSSP P E GS + IQKNLLQ RQ E IKQE QS WIQ QKST +I AI+RS
Sbjct: 422 DSSSPRTPSEIGSSRPEANRIQKNLLQKRQQPEIIKQEFQSSWIQQMQKSTESIQAINRS 481
Query: 481 GMSLKHHVNSNFSPQIY-EAAQLSQIAQRPLPTNPCAS--SSHGRASPAPSSSIVGLEKI 540
G+SL++H+NS SPQ + EA+QLS+IA+R L +PC+S GRASP PSSS VGL K
Sbjct: 482 GVSLQNHLNSKHSPQSHEEASQLSKIAERALSKDPCSSVYGRDGRASPTPSSSTVGLGKS 541
Query: 541 SPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEPTSLGSNGKLS 600
S NVS L S FQ+P+ N+ LL+ K +IQV+S +IRSS TS FAEPTSLGS+ LS
Sbjct: 542 SSNVSCLSSLNFQYPETRNSVNLLNSKTKIQVKSHEIRSSGISTSQFAEPTSLGSSQHLS 601
Query: 601 TGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDP-----WCHAKATDLRLLDG 660
T TQ +RL+KAV S+S++AL A+SGI SVG M DT+T+P CH KA L L DG
Sbjct: 602 TATQPLNRLLKAVDSMSDQALRTAISGISSVGNMCDTVTEPSVFGTRCHQKAAHLSLQDG 661
Query: 661 CGSSNNMKRKINAMALNDIPSPCSDIAGSEPTVTSSRKKLKKLSDYALLEEMRNINKQFI 720
GSSNNMKRKI A+ LND+PSPCSD GSE TVTS KKLKKL+D ALLEEMRNIN++ +
Sbjct: 662 FGSSNNMKRKICAITLNDMPSPCSDNDGSELTVTSRSKKLKKLTDNALLEEMRNINQRLV 721
Query: 721 ETVLELDLDENLNRRLANAGTVLRCSYSAVTDSKN---SEPCPVKLPVLSVKLLVPLDYP 780
ETVLELD +N+NRR ANAGTV+RC+YSAV+D N +KLPVLSVKLLVPLDYP
Sbjct: 722 ETVLELDSAQNVNRRFANAGTVVRCTYSAVSDRNNLMFHFANTLKLPVLSVKLLVPLDYP 781
Query: 781 EDYPVFLSKFNTDSSNVDEEFRDLSNEATSMLRAFLRTAPDCLSLEEYARVWDECARSVV 799
EDYPVFLSKFNTD N DEE RDLS +ATSMLRAFLRTAP+ LSL EYAR WD+CAR VV
Sbjct: 782 EDYPVFLSKFNTDCGNEDEECRDLSRKATSMLRAFLRTAPERLSLGEYARAWDQCARYVV 841
BLAST of HG10019849 vs. ExPASy TrEMBL
Match:
A0A6J1D3G7 (mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111016654 PE=4 SV=1)
HSP 1 Score: 731.9 bits (1888), Expect = 2.9e-207
Identity = 455/810 (56.17%), Positives = 543/810 (67.04%), Query Frame = 0
Query: 56 EYEMKLFNNAKSKDEYLNAGRTRQMSGREN-HHGSSSCQ--------------------- 115
++E ++FN A SKD Y+ +R+MS EN H GSSS Q
Sbjct: 2 KFEEEMFNKANSKDHYI-YNISRKMSRIENRHEGSSSIQDHPAFSSDKSVVQKKKEAVFV 61
Query: 116 ---------AAVPNPQYH----QPAEPNSLLRQHI-QPTPQLH----GQNPNVRQTHQQF 175
P P H Q A PN LLRQ+I Q TPQ H QN N RQ HQQF
Sbjct: 62 SQPLPQMTNQPYPRPMVHNQRQQLAAPNPLLRQNIMQLTPQSHEQLQRQNLNARQLHQQF 121
Query: 176 VLHNQSGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLVNLQPNENLTTHVKKEV 235
+H+QS PQNTL S C P G Q RD G+++ P+MF H ++NLQPNENL +K+EV
Sbjct: 122 GMHSQSCVRPQNTLISPCRPLGLQSRDSGVYIPPQMFPPHSEVMNLQPNENLRAQIKEEV 181
Query: 236 NGEGFQASKSSHQHH------TAIEQHKQQQSMGASSE--GNPNSEDWLDVAFAEKERVK 295
GE QASK HH T IEQHKQ QSMG S+ NP EDW D A+ E + +K
Sbjct: 182 IGEDVQASKFVQPHHTPRGPNTMIEQHKQHQSMGVSAVLIENPKGEDWHDQAYNEMQGLK 241
Query: 296 KMCLPLL----EKACEPSVQVAPTEHIQQHKLVPLDSLKKMVKFLQLPRDKIIVTYNKEK 355
CLPLL E+A + ++V TE IQ+ K + ++KM FL+LPRDK I + KEK
Sbjct: 242 MTCLPLLKESYERAAKLCLEVGQTEQIQKCK-NHMSLMQKMTNFLELPRDK-ITYFTKEK 301
Query: 356 FYRSLQTIEKLGNVFKS-NINRANKQQVLHVGQPDLSGSRINPVQQSDNVKLHCQPVIRA 415
FY+S+++IEK ++ N NKQQ LH GQP +S S INPVQ+SDN H QP+ A
Sbjct: 302 FYQSMKSIEKFAKAHENRNTFLVNKQQPLH-GQPGISQSHINPVQRSDNANPHFQPLNLA 361
Query: 416 TTGSSDSSSPIAPREKGS---VTDCIQKNLLQNRQHSENIKQESQSQWIQLRQKSTGNIP 475
TGSSDSSSP P E GS + IQKNLLQ RQ E IKQE QS WIQ QKST +I
Sbjct: 362 ATGSSDSSSPRTPSEIGSSRPEANRIQKNLLQKRQQPEIIKQEFQSSWIQQMQKSTESIQ 421
Query: 476 AIHRSGMSLKHHVNSNFSPQIY-EAAQLSQIAQRPLPTNPCAS--SSHGRASPAPSSSIV 535
AI+RSG+SL++H+NS SPQ + EA+QLS+IA+R L +PC+S GRASP PSSS V
Sbjct: 422 AINRSGVSLQNHLNSKHSPQSHEEASQLSKIAERALSKDPCSSVYGRDGRASPTPSSSTV 481
Query: 536 GLEKISPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEPTSLGS 595
GL K S NVS L S FQ+P+ N+ LL+ K +IQV+S +IRSS TS FAEPTSLGS
Sbjct: 482 GLGKSSSNVSCLSSLNFQYPETRNSVNLLNSKTKIQVKSHEIRSSGISTSQFAEPTSLGS 541
Query: 596 NGKLSTGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDP-----WCHAKATDL 655
+ LST TQ +RL+KAV S+S++AL A+SGI SVG M DT+T+P CH KA L
Sbjct: 542 SQHLSTATQPLNRLLKAVDSMSDQALRTAISGISSVGNMCDTVTEPSVFGTRCHQKAAHL 601
Query: 656 RLLDGCGSSNNMKRKINAMALNDIPSPCSDIAGSEPTVTSSRKKLKKLSDYALLEEMRNI 715
L DG GSSNNMKRKI A+ LND+PSPCSD GSE TVTS KKLKKL+D ALLEEMRNI
Sbjct: 602 SLQDGFGSSNNMKRKICAITLNDMPSPCSDNDGSELTVTSRSKKLKKLTDNALLEEMRNI 661
Query: 716 NKQFIETVLELDLDENLNRRLANAGTVLRCSYSAVTDSKN---SEPCPVKLPVLSVKLLV 775
N++ +ETVLELD +N+NRR ANAGTV+RC+YSAV+D N +KLPVLSVKLLV
Sbjct: 662 NQRLVETVLELDSAQNVNRRFANAGTVVRCTYSAVSDRNNLMFHFANTLKLPVLSVKLLV 721
Query: 776 PLDYPEDYPVFLSKFNTDSSNVDEEFRDLSNEATSMLRAFLRTAPDCLSLEEYARVWDEC 799
PLDYPEDYPVFLSKFNTD N DEE RDLS +ATSMLRAFLRTAP+ LSL EYAR WD+C
Sbjct: 722 PLDYPEDYPVFLSKFNTDCGNEDEECRDLSRKATSMLRAFLRTAPERLSLGEYARAWDQC 781
BLAST of HG10019849 vs. TAIR 10
Match:
AT1G15780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G10440.1); Has 103701 Blast hits to 43153 proteins in 1828 species: Archae - 30; Bacteria - 7385; Metazoa - 38639; Fungi - 11531; Plants - 7727; Viruses - 307; Other Eukaryotes - 38082 (source: NCBI BLink). )
HSP 1 Score: 137.1 bits (344), Expect = 6.1e-32
Identity = 126/414 (30.43%), Positives = 197/414 (47.58%), Query Frame = 0
Query: 432 VNSNFSPQIYEAAQLSQIAQRPLPTN-PCASSSHGRASPAPSSSIVGLEKISPNVSYLPS 491
++ + SPQ+ + ++++ P N P S APS V EK P S L
Sbjct: 931 MSQHLSPQVDQKNTVNKMGTPLQPANSPFVVPSPSSTPLAPSPMQVDSEK--PGSSSLSM 990
Query: 492 SIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSPFAEP------TSLGSNGKLSTGT 551
Q + ++ + + + I S+S + F P +S ++GK S
Sbjct: 991 GNIARQQATGMQGVVQ---SLAIGTPGI-SASPLLQEFTSPDGNILNSSTITSGKPSATE 1050
Query: 552 QAHDRLVKAVKSLSNEALNVAVSGICSVGYM------------------EDTITDPWCHA 611
+RL++AVKS+S +AL+ AVS I SV M ED + C
Sbjct: 1051 LPIERLIRAVKSISPQALSSAVSDIGSVVSMVDRIAGSAPGNGSRASVGEDLVAMTKCRL 1110
Query: 612 KATDLRLLDGCGSSNNMKRKINAMALN------DIPSPCSDIAGS-----EPTVTSSRKK 671
+A + +G ++ MKR AM L+ + AGS E T TS KK
Sbjct: 1111 QARNFMTQEGMMATKKMKRHTTAMPLSVASLGGSVGDNYKQFAGSETSDLESTATSDGKK 1170
Query: 672 LKKLSDYALLEEMRNINKQFIETVLELDLDENLN-------RRLANAGTVLRCSYSAVTD 731
+ +++ALLEE++ IN++ I+TV+E+ DE+ + GT +R S+ AV+
Sbjct: 1171 ARTETEHALLEEIKEINQRLIDTVVEISDDEDAADPSEVAISSIGCEGTTVRFSFIAVSL 1230
Query: 732 S---KNSEPCPVKLPVLSVKLLVPLDYPEDYPVFLSKFNTDSSNVDEEFRDLSNEATSML 791
S K P+ ++LLVP YP P L K ++S +E DLS++A +
Sbjct: 1231 SPALKAHLSSTQMSPIQPLRLLVPCSYPNGSPSLLDKLPVETSKENE---DLSSKAMARF 1290
Query: 792 RAFLRTAPDCLSLEEYARVWDECARSVVSEYAQRVGGGCFSAQYGTWEDSVSAA 800
LR+ +SL++ A+ WD CAR+V+ EYAQ+ GGG FS++YGTWE V+A+
Sbjct: 1291 NILLRSLSQPMSLKDIAKTWDACARAVICEYAQQFGGGTFSSKYGTWEKYVAAS 1335
BLAST of HG10019849 vs. TAIR 10
Match:
AT2G10440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 8319 Blast hits to 5104 proteins in 317 species: Archae - 0; Bacteria - 285; Metazoa - 1706; Fungi - 535; Plants - 320; Viruses - 18; Other Eukaryotes - 5455 (source: NCBI BLink). )
HSP 1 Score: 92.4 bits (228), Expect = 1.7e-18
Identity = 176/784 (22.45%), Positives = 307/784 (39.16%), Query Frame = 0
Query: 130 QTHQQFVLHNQSGFSPQNTLNSQCEPHGFQRRDFGIHLSPEMFTQHPNLVNLQPNENLTT 189
Q + LH S Q N Q G Q +S + Q+P + Q N NL
Sbjct: 191 QQNNNVTLHAMS----QQKNNLQSMTRGQQVGQSQPMMSQQYRQQYPMQQDPQ-NRNLQK 250
Query: 190 HVK-KEVNGEGFQASKSSHQHHTAIEQHKQQQSMGASSEG------------------NP 249
H+ + N FQA+ S Q +Q Q Q + ++ N
Sbjct: 251 HLDFVQNNTNQFQAASSLRQTQNITDQQNQPQQLERANPSILIMNIIVASQDSTGKTVNV 310
Query: 250 NSEDWLDVAFAEKERVKKMCLPLL----EKACEP--SVQVAPTEHIQQHKLVPLD----S 309
N+ +W + + + +++K+MCLP+L ++ E + P + +Q + L S
Sbjct: 311 NAGNWQEETYQKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQWIEKLKAGKLS 370
Query: 310 LKKMVKFLQLPRDKI--------------IVTYNKEKFYRSLQTIEKLGNVFKSNINRAN 369
++ ++ FL + R + I+ + K + T ++ G S
Sbjct: 371 MEHLMFFLNVHRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQQQQGQFPPSQTAMQT 430
Query: 370 KQQVLHVGQ---PDLSGSRINPVQQSDNVKL---------HCQPVIRATTGS----SDSS 429
+ +HV Q + SR+ P Q++ L + +I A++G+ S
Sbjct: 431 QSPQVHVSQSLYKEQRRSRLMPSSQNEASSLLQIRPKLDPRDENIIMASSGNVMLPSVKQ 490
Query: 430 SPIAPREKGSVTDCIQKNLLQNRQHSENIKQESQSQWIQLRQKSTGNI------PAIHRS 489
+P A S +QK Q R H ++Q+ Q Q T + ++
Sbjct: 491 NPRAVNTNISSVQSLQK---QKRFHHRQMQQQQPQQGNHQHQMQTNEMNDVRMRERVNIK 550
Query: 490 GMSLKHHVNSN-----------FSPQIYEAAQLSQIAQRPLPT------NPCASSSHGRA 549
L+ V+S+ S QI + + Q LP P SS
Sbjct: 551 ARLLEQQVSSSQRQVPKQESNVSSSQIQNHSSPQLVDQHILPATINKTGTPLNSSGSAFV 610
Query: 550 SPAPSSSIVGLEKISPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMTSP 609
+PAPS P S +P S+ P + + + + SSS + +
Sbjct: 611 APAPSP--------VPGDSEMPISVES------------PVSGVDEINSTLDSSSKLGT- 670
Query: 610 FAEPTSLGSNGKLSTGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDPW---- 669
E L + DRL+KA ++ S ++L +VS I SV M D I +
Sbjct: 671 -QETPLLFVPPPEPITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPSSG 730
Query: 670 ---------CHAKATDLRLLDGCGSSNNMKRKINAMALN-----DIPSPCSDIAGSEPTV 729
+ + + S MKR IN + + D S + +
Sbjct: 731 GSRAGLGEDLSERTRNFTTHEETNLSKRMKRSINIVPPDMSSQIDSYEQLSSLESEVVST 790
Query: 730 TSSRKKLKKLS-DYALLEEMRNINKQFIETVLELDLDENLNRRLANAGTVLRCSYSAVTD 789
TSS K+ ++ YALL+E++ N + +ETV+E+ +++L GT++ C+Y+ V
Sbjct: 791 TSSGLKVNNIAPGYALLQEIKETNGRLVETVVEICDEDSL-------GTIVTCTYAPVAL 850
Query: 790 SKNSE-------------PCPVKLPVLSVKLLVPLDYPEDYPVFLSKFNTDSSNVDEEFR 800
S + C ++ + ++LL P+DYP P+ L + + D+S ++
Sbjct: 851 SATFKDHYKSGKIIFYVSKCLMQAQIQPLRLLFPMDYPYSSPIVLEEISFDTS--VHKYE 910
BLAST of HG10019849 vs. TAIR 10
Match:
AT2G10440.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15780.1); Has 1628 Blast hits to 1350 proteins in 149 species: Archae - 0; Bacteria - 39; Metazoa - 480; Fungi - 159; Plants - 187; Viruses - 2; Other Eukaryotes - 761 (source: NCBI BLink). )
HSP 1 Score: 92.0 bits (227), Expect = 2.3e-18
Identity = 150/656 (22.87%), Positives = 267/656 (40.70%), Query Frame = 0
Query: 229 NPNSEDWLDVAFAEKERVKKMCLPLL----EKACEP--SVQVAPTEHIQQHKLVPLD--- 288
N N+ +W + + + +++K+MCLP+L ++ E + P + +Q + L
Sbjct: 224 NVNAGNWQEETYQKIKKLKEMCLPVLSLMHQRVAEKLRETESLPPQPMQAQWIEKLKAGK 283
Query: 289 -SLKKMVKFLQLPRDKI--------------IVTYNKEKFYRSLQTIEKLGNVFKSNINR 348
S++ ++ FL + R + I+ + K + T ++ G S
Sbjct: 284 LSMEHLMFFLNVHRSSVSEKHRDKFSQYEYHILKFTKSQTMVLRPTQQQQGQFPPSQTAM 343
Query: 349 ANKQQVLHVGQ---PDLSGSRINPVQQSDNVKL---------HCQPVIRATTGS----SD 408
+ +HV Q + SR+ P Q++ L + +I A++G+ S
Sbjct: 344 QTQSPQVHVSQSLYKEQRRSRLMPSSQNEASSLLQIRPKLDPRDENIIMASSGNVMLPSV 403
Query: 409 SSSPIAPREKGSVTDCIQKNLLQNRQHSENIKQESQSQWIQLRQKSTGNI------PAIH 468
+P A S +QK Q R H ++Q+ Q Q T + ++
Sbjct: 404 KQNPRAVNTNISSVQSLQK---QKRFHHRQMQQQQPQQGNHQHQMQTNEMNDVRMRERVN 463
Query: 469 RSGMSLKHHVNSN-----------FSPQIYEAAQLSQIAQRPLPT------NPCASSSHG 528
L+ V+S+ S QI + + Q LP P SS
Sbjct: 464 IKARLLEQQVSSSQRQVPKQESNVSSSQIQNHSSPQLVDQHILPATINKTGTPLNSSGSA 523
Query: 529 RASPAPSSSIVGLEKISPNVSYLPSSIFQFPQHCNARELLHPKAEIQVQSQKIRSSSAMT 588
+PAPS P S +P S+ P + + + + SSS +
Sbjct: 524 FVAPAPSP--------VPGDSEMPISVES------------PVSGVDEINSTLDSSSKLG 583
Query: 589 SPFAEPTSLGSNGKLSTGTQAHDRLVKAVKSLSNEALNVAVSGICSVGYMEDTITDPW-- 648
+ E L + DRL+KA ++ S ++L +VS I SV M D I +
Sbjct: 584 T--QETPLLFVPPPEPITERPIDRLIKAFQAASPKSLAESVSEISSVISMVDMIGGSFPS 643
Query: 649 -----------CHAKATDLRLLDGCGSSNNMKRKINAMALN-----DIPSPCSDIAGSEP 708
+ + + S MKR IN + + D S +
Sbjct: 644 SGGSRAGLGEDLSERTRNFTTHEETNLSKRMKRSINIVPPDMSSQIDSYEQLSSLESEVV 703
Query: 709 TVTSSRKKLKKLS-DYALLEEMRNINKQFIETVLELDLDENLNRRLANAGTVLRCSYSAV 768
+ TSS K+ ++ YALL+E++ N + +ETV+E+ +++L GT++ C+Y+ V
Sbjct: 704 STTSSGLKVNNIAPGYALLQEIKETNGRLVETVVEICDEDSL-------GTIVTCTYAPV 763
Query: 769 TDS---KNSEPCPVKLPVLSVKLLVPLDYPEDYPVFLSKFNTDSSNVDEEFRDLSNEATS 800
S K+ + ++LL P+DYP P+ L + + D+S ++ DLS S
Sbjct: 764 ALSATFKDHYKSGKIAQIQPLRLLFPMDYPYSSPIVLEEISFDTS--VHKYEDLSARTRS 823
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038903080.1 | 0.0e+00 | 77.61 | probable mediator of RNA polymerase II transcription subunit 15c isoform X1 [Ben...
XP_038903081.1 | 0.0e+00 | 78.04 | mediator of RNA polymerase II transcription subunit 15a-like isoform X2 [Beninca...
XP_008443239.1 | 5.2e-267 | 65.92 | PREDICTED: uncharacterized protein LOC103486878 [Cucumis melo] >XP_008443241.1 P...
XP_011652173.1 | 1.6e-239 | 62.23 | uncharacterized protein LOC105434992 [Cucumis sativus] >KAE8651172.1 hypothetica...
XP_022147805.1 | 3.1e-219 | 55.49 | probable mediator of RNA polymerase II transcription subunit 15c isoform X1 [Mom...
Match Name | E-value | Identity (%) | Description
F4I171 | 8.6e-31 | 30.43 | Mediator of RNA polymerase II transcription subunit 15a OS=Arabidopsis thaliana ...
Q9SHV7 | 2.4e-17 | 22.45 | Probable mediator of RNA polymerase II transcription subunit 15c OS=Arabidopsis ...
Match Name | E-value | Identity (%) | Description
A0A1S4DUH4 | 2.5e-267 | 65.92 | uncharacterized protein LOC103486878 OS=Cucumis melo OX=3656 GN=LOC103486878 PE=...
A0A5A7UH89 | 2.5e-267 | 65.92 | Putative tartrate dehydrogenase/decarboxylase ttuC OS=Cucumis melo var. makuwa O...
E5GBP1 | 2.5e-267 | 65.92 | KIX_2 domain-containing protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1
A0A6J1D158 | 1.5e-219 | 55.49 | probable mediator of RNA polymerase II transcription subunit 15c isoform X1 OS=M...
A0A6J1D3G7 | 2.9e-207 | 56.17 | mediator of RNA polymerase II transcription subunit 15a-like isoform X2 OS=Momor...
Match Name | E-value | Identity (%) | Description
AT1G15780.1 | 6.1e-32 | 30.43 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G10440.1 | 1.7e-18 | 22.45 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G10440.2 | 2.3e-18 | 22.87 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...