Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTTTCACATTTGTTCGCCGTCTCCCCTTCAAGACTTCTTCTATACGCTCGCTTCATCTTCTCTCTTTCTCTTTATCTCTCTATCTCTCTCTCCGCCCTTCCGTTTCTCTCTGCCGTTTTACGTTCTCTCCGCCACCGCACGGCGGCTGAAGCTGACAACCGAACCGAACAAACAGACAAAAACCCTATTCGCTATTCCGAACAACCCAAAGCCTCATTTTCTCTCATTCCCAATCCCTTTCCCTCTCTCTCCGCCGCCGCTTTCTTTCTTCACACTCTGAAACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATCTATCTATCTATCTATATTGTTGTTGCCCTCTTCAGACGCATTTCTTTTTGTTTTGGGGATATACAGCAACCACTGGGGGGCAGTTTTTGAAACAGTTGCGCCTTCTGGGGTTCTTCAATCAGTCTTGAGGGTAGGGTTAGTTCTTCCTCTTCATCGGATATCTCACAGGTTATATTATCTATCTCTTGTGTATTTTCTTGCGGGTTTGGACCTCTGCGGAAGTTTTTTTTTTTCTGAGGGTGCATATTTGGTTTAGGGTTTCTGATATTTTTCTTTTGGGCACATTCAGTCGGAGTTGGGATTCTGGTTATTCCAATTTCTTATTTCGTTAGGGTTTCTTTTTATTCTCTGATCGAACTGTTTTGACGGGGACAATCTTCGGTTTTGGAATTGCTCCACCTCTTTATTCTTGAGGTATACCTTCTTATATTTCTGCATCTCAACCGATTCCCTCAAAGATGAGGGTTTTCCGAGCGTGTTTTGCTGACCGAATCTACACTTAATCGAACACGGAGGCTGGGAGAAGATAGTGTGAGGTCTTTCCTCTTTTATCATTCTTTTCAATTGAACCGCTTCATCTTATCCCCGCTATTTCCTCTGTTTTCGTCATCTGAAATCCCACTCCCGTATGAGAATTAGGTTATATTGATTTTGCACATAAGATAAGATAAATCCCGTTCAGAATGCGTTCAGCTTGTAGCCAAATTGGTAGCTACTTGCTTGAATACCGGCGGTCGTGAAGTAGGATAGTTGTTGAAGATAACTTCTTTAGTAGCATTTAGGACATAAATTTTGAGAACCATCCTCTGGGTTCACTACCGTGCGCGAGTTTGCAACGCTGAAGCAATTCACTGCGCATGTCTCTCTTTGCTGGAACCCTGGATATGCCATAAAGTTGATGACACATGGTTTGTTGCCGTGAATTGCATAGTCAGGGATGCTCGTGAAAGTTCATCCAGCTTAAAATAGAGGTTAGGCATTTGAAATAATAAAGTACAGGCTAGGGTTTTATAAATTCCTCTGCACAAGGAGTACGGTTTTTGGAAATTTGAGGCGGTAAAGAAGGCAAGAAGTGTTTTTTTCTTCTGATATTTTGATCATGTTCAGTATTCCTTATGGAAAGAAGTGAACCCACATTAGTTCCAGAATGGTTGAGAAGTACTGGAAGCGTTACTGGTGGCGGCAATTCAAACCACCATTTTCCGTCGTCTTCTTCCCACTCAGGTACTGCTTTTCTTTTTTTTTTTTTCATTGTTGCATGCTTCGAATAAAGATGACTTGCCTCTGTTTTGATATTGTGAACGCTGTTAATGTATATCCTGTGTTGTTGTGTAGATGTCCCCTCTCTATCTCAATTGAGAAATAGAACTTCCAAGACCGCCAGCGATTTTGACACCTCGCGTTCTGCTTTTCTGGATCGGACATCTTCATCAAATTCAAGGAGAAGTTCAAGTAATGGTTCTTCCAAACATGCATATAGTAGCTTTAACAGGGGTCATCGTGATAAGGATCGTGAGAAAGAAAAGGATAGGTTAAACTTTGGGGATAATTGGGACCGTGACTCTCATGACCCTCTTGGAAAGATTCTTTCCAACAGGATTGATAAGGATGCTTTGCGGCGGTCCCATTCAATGGTATCCAGGAAGCAAGGTGAGCGTGAGTTGTTTCATAGAAGAGTTGCAACAGAAATGAAAAGTCACAACAGCAATGGAATTCTTTCTGGAGCTAGTGTCGGTAGTAGCATTCAGAAAGCTGTATTTGAAAAGGATTTCCCATCTCTGGGATCTGAAGACAAGCAGGGAGCATCAGAAATTGGAAGAGTTTCATCTCCTGGTTTGAGCTCGCCAGTTCAAAGCTTGCCTATTGGCAATTCAGCCTTAATTGTTGGTGGAGAGGGATGGACATCTGCTCTTGCTGAGGTGCCCAGTATGATCGGAAGCACCACAGGGTCACCGTCATTTCAACAAATTGTTCCTGCTACATCAGGGGCAGGGCCTCTGAGTATTACAGCCGGACTTAATATGGCCGAAGCGTTGGTGCAGGTTCCATCTCGAGCTCGTGCTGCTCCCCAGGTATCTGAGGTACTATACTAACCTTCTTGTGCTGTACAGTATGTTCTCCTTTTGGTCTAACAGTCTCATCCTTTACCCCCACTTTGGTAGTTATCTGTCAAGACCCAGAGGCTTGAGGAATTGGCTATTAAACAGTCCAGGCAATTAATACCAGTGACGCCTTCTATGCCAAAAGCTATGGTAATTGGAGCTAATGCAATTGTTGAATCTTTGTTGTTAGACCTTTTCCTGCTTCTATATATTTTAATTATTTGGTGTTTGTTGGTTTGAAAATGTAAAGGTTTGTAATTAAAATATATAAGAGAACTGAGAACCATCTGAACCGAACCTAGACAACTTTTGCCAATATCTGATTTCAGCTGGGACTAATACTCTCTTCAAAGCTGCTTTTGTGTTGTAGTATTAGGCACTTAGCATGTATAAACTATGTGTATGTTTTGACATTTTTCTGCTACTACATTTTAGGCTGGTGACTGATGATTTAAATTTTTGGATTTAGCTTGTTAAATAAACTCTGCTACTAAATTTTAGGAACTGTATATTTTTGTGGAAGTTGGTTATTCAAGTGTTCTATCTCTTCAACTTACTCATTGACTAAAGAGCAGTGTTTATGCGTTAGATAGCGGAAGTTAAAACATTTGTTAGAAGTTTCAAAGTTATTTGCGATGTGGAAGATAGGCTAGATCACTTTTCCTCCTTCTATTTGATTAAGAACTCAATTTTTAGAACTCAAGGACACAGTTGTAGGAACAGATCCTATCCAAGTAAGGTTGTATGAGAAACTCAATTTAAGGACGGTTGGAAGAACAAAATGCAAGTGAGCCCCATAACGGCCGCCAGGGATTTAGGAATTGGTTATTTTAACTTCTTCGAAGCGTTTTGTCCTTGCTTTCTTGCCTCAACCCTTTTGGAAATTTGTAAGAACTAGACGATAGGGAGGAGATAACCAGATTTGATTTTGGACCCCATGTCATAGTGTTCATATATGTGCCTTGCTACTTTTGACACTTTGCGAACTGAATTATCTTTGGTTAACCACTTTTTATTCTCTCTCGAGGATGGAATAGGTCAGTGTTAGATTAAATATGCTTTAAAGGAAGTGTGAAGTTTCTATTATTAGCGTTGTTCATTTGAGATTGTTGTGCTGTAGCCAATATGTGACTGATGGCTAGTATCCTTTGTTTGTTAACGTGTTCTATTTGGTAGGCAAGTTTTCTGTTAACTGATTGCCAACCCCTCTTCAGTATTAATAGTTGTAACTTGACTGAGCATTCCTCTTCTGAAATATTCTCATTTGTCTTCTGTTTTGTTTTTTGAAGCTGATGTAAATAATTGTTGAGTTTCAGTATGGTGATGTAATAAACTTGCTTGGCAGTATAGTTCTTAAAATAGTTTTAGAACAAAAAAAATGCAAATATTTTGTTGTGTTTTATTTAATGATTCCAAAATTAATTCATATATTCTTATATATGTTTAAATTCAATAATCTAAGTTTAAATGGATTTAAATATGTATTCTTATATTTACATTCTGAAGAAATAAATGAGAATGGAAAATTATACCATATCTTTGCTAATAATCTTAAGTAGACTTGTCTGTAAAAAAAAATTTGATTAAATACATCCAGGATGTTCTCCACCCTCGTAATATGAATACAGTTAGTTTTTACGTTTTGAATAATCCCCTCTGTACCATGGTGTAAAGTTCTACAACGGAAATCTTCTGTTTGTATCTTCTGTTTATACCATCGTGACGTATCTCTCATTATTATATCTAACTGCAAATCTTCTGTTTCTCCCCTTAGATAATCATTAGCTTTTTTGTGTTTTTTGTTTGACAGGTGCTTAATTCTTCGGATAAATCAAAGCCCAAATTAGCATCAAGAACTGCAGAACTTAGTGTAACCATCAAGGGTGGACAGCCACAGCCGTTGTCAGTTCATGCCAACCAATCTCGTGGAGGACATGTCAAGTCCGATGCTCAAAAAAGTTCTCATGGGAAGTTTCTTGTTTTGAAACCTGTAAGAGAAAATGGTGTCTCCCTTACAGCAAAGGATGTCTCAAGTCCAACTAGTAATGCAAATAGCATGGCAGCAAACAGCCAGTTCGCTCTTGCACCTTCAGTTCCACATCCTCCTTTGAGAAGCCCAAATAATACAAATGTTTCTTCCGTAGAGCGCAAAATTGCTAGCTTAGATCTCAAATCCGGAACAACTTTAGAAAAAAGACCATCCTTATCTCAAGTCCAGAGCCGGAATGATTTCTTTAACCTCATTAAGAAGAAAACTTCAATGAGTTCTTCTGCCGTTCTCTCAGATTCATGCTCTTCTGTGAAATCTCCTTCAATTGGCCAATCTAATGAACTAACAAGGGAAGAAATTGACATTCCCGCAAGTCCTCGTATTATTGAAAATGGTTCTGTGGAGGATAGAAATGGAGATAGTTCTGAAGAGGTTCAAGCATCCTGTGATAGTGGTGAAAAAACTGAGAGCCACGTTGCTGCAGAATCTCTAGATGAAGAGGAGGCTGCCTTTCTTCGTTCTCTTGGCTGGGATGAAAACTGTGGTGAGGACGAAGGTCTTACTGAAGAAGAAATCAATTCTTTCTATCGGGAGGTAATAAATACTTCTATTGCTTTCACTGCAATGCTTTTCAAAATACTGTTGTTTTCTCTGTTATCTGATGAAGTTGTAAATTCAATTCTTAATTGCAGTACATGAACTTGAAGCCATCTCTAAAGATGGGTCGATGCATTCAGCCAAAGATATTTGTGCCATCTGAATCTCACGAGGACAGCAAGGATGGAGCCGGTTCTGAATTGAGCTCATCTGACTCGGAAGCCTGAATTACTTTTTTCTTTTTCACTCATGAGAAGTTGCTAGTTCACAATTTTTAATGGCAGTGGGGGTTAGTTTTTTTTTTTTGTCTTCTTTTTAAATTTTTTTTTTCTCTTTTTGTTTTCATGGTTTTAAAGAAATGCTGATGAGAGTTGGGTTGGAAGAAGAGGTGGATGTTAATTGAACACTGAACAGTGCAAATAGTGCAATAAGTTACAGTTGCCCCAAATCATCCTTGCTGCCTTAGCTTTACAGCGGGAGTTTTTGCGGTTTTTGAAAAGGGCAGTTGATGATGAGGTTTGGCGGGCCTGAAAGATGTATGCAAATTCCATCTTTTGGTTTATTTTGGTTTTTGATTCTTGAAGTTACTAGGAAAAAAGAAAAAAGAAAAAAAAGAAAAAAGAGTTACGCTTCTTCAAAGGAAATTGGGAAATTATGGAAAATGAAAAAAGGTTTCTATCTTCGTTCGGTATAATATTGATTTATTGTTTTGGGATGTTACCTTTCTAAGTTGGTAGTACTAGCCTTGTTTGACATTGAATTGCACTAATTACACATGAATAAGAGGATTAGATTTAGA
mRNA sequence
TTTTTCACATTTGTTCGCCGTCTCCCCTTCAAGACTTCTTCTATACGCTCGCTTCATCTTCTCTCTTTCTCTTTATCTCTCTATCTCTCTCTCCGCCCTTCCGTTTCTCTCTGCCGTTTTACGTTCTCTCCGCCACCGCACGGCGGCTGAAGCTGACAACCGAACCGAACAAACAGACAAAAACCCTATTCGCTATTCCGAACAACCCAAAGCCTCATTTTCTCTCATTCCCAATCCCTTTCCCTCTCTCTCCGCCGCCGCTTTCTTTCTTCACACTCTGAAACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATCTATCTATCTATCTATATTGTTGTTGCCCTCTTCAGACGCATTTCTTTTTGTTTTGGGGATATACAGCAACCACTGGGGGGCAGTTTTTGAAACAGTTGCGCCTTCTGGGGTTCTTCAATCAGTCTTGAGGGTAGGGTTAGTTCTTCCTCTTCATCGGATATCTCACAGGTTATATTATCTATCTCTTGTGTATTTTCTTGCGGGTTTGGACCTCTGCGGAAGTTTTTTTTTTTCTGAGGGTGCATATTTGGTTTAGGGTTTCTGATATTTTTCTTTTGGGCACATTCAGTCGGAGTTGGGATTCTGGTTATTCCAATTTCTTATTTCGTTAGGGTTTCTTTTTATTCTCTGATCGAACTGTTTTGACGGGGACAATCTTCGGTTTTGGAATTGCTCCACCTCTTTATTCTTGAGGTATACCTTCTTATATTTCTGCATCTCAACCGATTCCCTCAAAGATGAGGGTTTTCCGAGCGTGTTTTGCTGACCGAATCTACACTTAATCGAACACGGAGGCTGGGAGAAGATAGTGTGAGGTCTTTCCTCTTTTATCATTCTTTTCAATTGAACCGCTTCATCTTATCCCCGCTATTTCCTCTGTTTTCGTCATCTGAAATCCCACTCCCGTATGAGAATTAGGTTATATTGATTTTGCACATAAGATAAGATAAATCCCGTTCAGAATGCGTTCAGCTTGTAGCCAAATTGGTAGCTACTTGCTTGAATACCGGCGGTCGTGAAGTAGGATAGTTGTTGAAGATAACTTCTTTAGTAGCATTTAGGACATAAATTTTGAGAACCATCCTCTGGGTTCACTACCGTGCGCGAGTTTGCAACGCTGAAGCAATTCACTGCGCATGTCTCTCTTTGCTGGAACCCTGGATATGCCATAAAGTTGATGACACATGGTTTGTTGCCGTGAATTGCATAGTCAGGGATGCTCGTGAAAGTTCATCCAGCTTAAAATAGAGGTTAGGCATTTGAAATAATAAAGTACAGGCTAGGGTTTTATAAATTCCTCTGCACAAGGAGTACGGTTTTTGGAAATTTGAGGCGGTAAAGAAGGCAAGAAGTGTTTTTTTCTTCTGATATTTTGATCATGTTCAGTATTCCTTATGGAAAGAAGTGAACCCACATTAGTTCCAGAATGGTTGAGAAGTACTGGAAGCGTTACTGGTGGCGGCAATTCAAACCACCATTTTCCGTCGTCTTCTTCCCACTCAGATGTCCCCTCTCTATCTCAATTGAGAAATAGAACTTCCAAGACCGCCAGCGATTTTGACACCTCGCGTTCTGCTTTTCTGGATCGGACATCTTCATCAAATTCAAGGAGAAGTTCAAGTAATGGTTCTTCCAAACATGCATATAGTAGCTTTAACAGGGGTCATCGTGATAAGGATCGTGAGAAAGAAAAGGATAGGTTAAACTTTGGGGATAATTGGGACCGTGACTCTCATGACCCTCTTGGAAAGATTCTTTCCAACAGGATTGATAAGGATGCTTTGCGGCGGTCCCATTCAATGGTATCCAGGAAGCAAGGTGAGCGTGAGTTGTTTCATAGAAGAGTTGCAACAGAAATGAAAAGTCACAACAGCAATGGAATTCTTTCTGGAGCTAGTGTCGGTAGTAGCATTCAGAAAGCTGTATTTGAAAAGGATTTCCCATCTCTGGGATCTGAAGACAAGCAGGGAGCATCAGAAATTGGAAGAGTTTCATCTCCTGGTTTGAGCTCGCCAGTTCAAAGCTTGCCTATTGGCAATTCAGCCTTAATTGTTGGTGGAGAGGGATGGACATCTGCTCTTGCTGAGGTGCCCAGTATGATCGGAAGCACCACAGGGTCACCGTCATTTCAACAAATTGTTCCTGCTACATCAGGGGCAGGGCCTCTGAGTATTACAGCCGGACTTAATATGGCCGAAGCGTTGGTGCAGGTTCCATCTCGAGCTCGTGCTGCTCCCCAGTTATCTGTCAAGACCCAGAGGCTTGAGGAATTGGCTATTAAACAGTCCAGGCAATTAATACCAGTGACGCCTTCTATGCCAAAAGCTATGGTAATTGGAGCTAATGCAATTGTGCTTAATTCTTCGGATAAATCAAAGCCCAAATTAGCATCAAGAACTGCAGAACTTAGTGTAACCATCAAGGGTGGACAGCCACAGCCGTTGTCAGTTCATGCCAACCAATCTCGTGGAGGACATGTCAAGTCCGATGCTCAAAAAAGTTCTCATGGGAAGTTTCTTGTTTTGAAACCTGTAAGAGAAAATGGTGTCTCCCTTACAGCAAAGGATGTCTCAAGTCCAACTAGTAATGCAAATAGCATGGCAGCAAACAGCCAGTTCGCTCTTGCACCTTCAGTTCCACATCCTCCTTTGAGAAGCCCAAATAATACAAATGTTTCTTCCGTAGAGCGCAAAATTGCTAGCTTAGATCTCAAATCCGGAACAACTTTAGAAAAAAGACCATCCTTATCTCAAGTCCAGAGCCGGAATGATTTCTTTAACCTCATTAAGAAGAAAACTTCAATGAGTTCTTCTGCCGTTCTCTCAGATTCATGCTCTTCTGTGAAATCTCCTTCAATTGGCCAATCTAATGAACTAACAAGGGAAGAAATTGACATTCCCGCAAGTCCTCGTATTATTGAAAATGGTTCTGTGGAGGATAGAAATGGAGATAGTTCTGAAGAGGTTCAAGCATCCTGTGATAGTGGTGAAAAAACTGAGAGCCACGTTGCTGCAGAATCTCTAGATGAAGAGGAGGCTGCCTTTCTTCGTTCTCTTGGCTGGGATGAAAACTGTGGTGAGGACGAAGGTCTTACTGAAGAAGAAATCAATTCTTTCTATCGGGAGTACATGAACTTGAAGCCATCTCTAAAGATGGGTCGATGCATTCAGCCAAAGATATTTGTGCCATCTGAATCTCACGAGGACAGCAAGGATGGAGCCGGTTCTGAATTGAGCTCATCTGACTCGGAAGCCTGAATTACTTTTTTCTTTTTCACTCATGAGAAGTTGCTAGTTCACAATTTTTAATGGCAGTGGGGGTTAGTTTTTTTTTTTTGTCTTCTTTTTAAATTTTTTTTTTCTCTTTTTGTTTTCATGGTTTTAAAGAAATGCTGATGAGAGTTGGGTTGGAAGAAGAGGTGGATGTTAATTGAACACTGAACAGTGCAAATAGTGCAATAAGTTACAGTTGCCCCAAATCATCCTTGCTGCCTTAGCTTTACAGCGGGAGTTTTTGCGGTTTTTGAAAAGGGCAGTTGATGATGAGGTTTGGCGGGCCTGAAAGATGTATGCAAATTCCATCTTTTGGTTTATTTTGGTTTTTGATTCTTGAAGTTACTAGGAAAAAAGAAAAAAGAAAAAAAAGAAAAAAGAGTTACGCTTCTTCAAAGGAAATTGGGAAATTATGGAAAATGAAAAAAGGTTTCTATCTTCGTTCGGTATAATATTGATTTATTGTTTTGGGATGTTACCTTTCTAAGTTGGTAGTACTAGCCTTGTTTGACATTGAATTGCACTAATTACACATGAATAAGAGGATTAGATTTAGA
Coding sequence (CDS)
ATGGAAAGAAGTGAACCCACATTAGTTCCAGAATGGTTGAGAAGTACTGGAAGCGTTACTGGTGGCGGCAATTCAAACCACCATTTTCCGTCGTCTTCTTCCCACTCAGATGTCCCCTCTCTATCTCAATTGAGAAATAGAACTTCCAAGACCGCCAGCGATTTTGACACCTCGCGTTCTGCTTTTCTGGATCGGACATCTTCATCAAATTCAAGGAGAAGTTCAAGTAATGGTTCTTCCAAACATGCATATAGTAGCTTTAACAGGGGTCATCGTGATAAGGATCGTGAGAAAGAAAAGGATAGGTTAAACTTTGGGGATAATTGGGACCGTGACTCTCATGACCCTCTTGGAAAGATTCTTTCCAACAGGATTGATAAGGATGCTTTGCGGCGGTCCCATTCAATGGTATCCAGGAAGCAAGGTGAGCGTGAGTTGTTTCATAGAAGAGTTGCAACAGAAATGAAAAGTCACAACAGCAATGGAATTCTTTCTGGAGCTAGTGTCGGTAGTAGCATTCAGAAAGCTGTATTTGAAAAGGATTTCCCATCTCTGGGATCTGAAGACAAGCAGGGAGCATCAGAAATTGGAAGAGTTTCATCTCCTGGTTTGAGCTCGCCAGTTCAAAGCTTGCCTATTGGCAATTCAGCCTTAATTGTTGGTGGAGAGGGATGGACATCTGCTCTTGCTGAGGTGCCCAGTATGATCGGAAGCACCACAGGGTCACCGTCATTTCAACAAATTGTTCCTGCTACATCAGGGGCAGGGCCTCTGAGTATTACAGCCGGACTTAATATGGCCGAAGCGTTGGTGCAGGTTCCATCTCGAGCTCGTGCTGCTCCCCAGTTATCTGTCAAGACCCAGAGGCTTGAGGAATTGGCTATTAAACAGTCCAGGCAATTAATACCAGTGACGCCTTCTATGCCAAAAGCTATGGTAATTGGAGCTAATGCAATTGTGCTTAATTCTTCGGATAAATCAAAGCCCAAATTAGCATCAAGAACTGCAGAACTTAGTGTAACCATCAAGGGTGGACAGCCACAGCCGTTGTCAGTTCATGCCAACCAATCTCGTGGAGGACATGTCAAGTCCGATGCTCAAAAAAGTTCTCATGGGAAGTTTCTTGTTTTGAAACCTGTAAGAGAAAATGGTGTCTCCCTTACAGCAAAGGATGTCTCAAGTCCAACTAGTAATGCAAATAGCATGGCAGCAAACAGCCAGTTCGCTCTTGCACCTTCAGTTCCACATCCTCCTTTGAGAAGCCCAAATAATACAAATGTTTCTTCCGTAGAGCGCAAAATTGCTAGCTTAGATCTCAAATCCGGAACAACTTTAGAAAAAAGACCATCCTTATCTCAAGTCCAGAGCCGGAATGATTTCTTTAACCTCATTAAGAAGAAAACTTCAATGAGTTCTTCTGCCGTTCTCTCAGATTCATGCTCTTCTGTGAAATCTCCTTCAATTGGCCAATCTAATGAACTAACAAGGGAAGAAATTGACATTCCCGCAAGTCCTCGTATTATTGAAAATGGTTCTGTGGAGGATAGAAATGGAGATAGTTCTGAAGAGGTTCAAGCATCCTGTGATAGTGGTGAAAAAACTGAGAGCCACGTTGCTGCAGAATCTCTAGATGAAGAGGAGGCTGCCTTTCTTCGTTCTCTTGGCTGGGATGAAAACTGTGGTGAGGACGAAGGTCTTACTGAAGAAGAAATCAATTCTTTCTATCGGGAGTACATGAACTTGAAGCCATCTCTAAAGATGGGTCGATGCATTCAGCCAAAGATATTTGTGCCATCTGAATCTCACGAGGACAGCAAGGATGGAGCCGGTTCTGAATTGAGCTCATCTGACTCGGAAGCCTGA
Protein sequence
MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRSAFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKILSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSHNSNGILSGASVGSSIQKAVFEKDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGSTTGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQLSVKTQRLEELAIKQSRQLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQSRGGHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPHPPLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDSCSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTESHVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPSESHEDSKDGAGSELSSSDSEA
Homology
BLAST of CcUC03G051500 vs. NCBI nr
Match:
XP_008460470.1 (PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X2 [Cucumis melo])
HSP 1 Score: 1039.3 bits (2686), Expect = 1.4e-299
Identity = 572/622 (91.96%), Postives = 587/622 (94.37%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQ RNR SKT DFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSHN-SNGILSGASVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRRV TE+KSHN SNGILSG SVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
KDFPSLGSE+KQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQLSVKTQRLEELAIKQSR 300
GS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSR R APQLSVKTQRLEELAIKQSR
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQLSVKTQRLEELAIKQSR 300
Query: 301 QLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQSRG 360
QLIPVTPSMPKAM VL+SSDKSKPKLASRT EL+ TIKGGQPQP SVHANQSR
Sbjct: 301 QLIPVTPSMPKAM-------VLSSSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRV 360
Query: 361 GHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPHPPL 420
GHVK DAQKSSHGKFLVLKPVRENGVSL AKDVSSPTSNANSMAANSQFALAPSVPH PL
Sbjct: 361 GHVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPL 420
Query: 421 RSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDS 480
RSPNNTNVSSVERKIASLDLK+GTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDS
Sbjct: 421 RSPNNTNVSSVERKIASLDLKTGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDS 480
Query: 481 CSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTESHVA 540
CSSVKSPSIGQSNELT EE+ IPASPR+IENG+VE+RNG+SSEEVQ S DSGEKTESHVA
Sbjct: 481 CSSVKSPSIGQSNELTSEEMGIPASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVA 540
Query: 541 AESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPS 600
AESLDEEEAAFLRSLGWDE+CGEDEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIFVPS
Sbjct: 541 AESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIFVPS 600
Query: 601 ESHEDSK-DGAGSELSSSDSEA 621
ES EDSK DGAGSELSSSDSEA
Sbjct: 601 ESREDSKDDGAGSELSSSDSEA 613
BLAST of CcUC03G051500 vs. NCBI nr
Match:
XP_038907228.1 (mediator of RNA polymerase II transcription subunit 1 isoform X2 [Benincasa hispida])
HSP 1 Score: 1035.4 bits (2676), Expect = 2.0e-298
Identity = 567/621 (91.30%), Postives = 584/621 (94.04%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGG NSNHHFPSSSSHSDVPSLSQ RNR SKT DFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGD+WDRDS DPLGKI
Sbjct: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDSWDRDSPDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSH-NSNGILSGASVGSSIQKAVFE 180
LSN+IDKDALRRSHSMVSRK GERELFHRR ATE+K+H NSNGILSG SV SSIQKAVFE
Sbjct: 121 LSNKIDKDALRRSHSMVSRKLGERELFHRRAATELKNHNNSNGILSGTSVSSSIQKAVFE 180
Query: 181 KDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
KDFPSLGSE+KQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQLSVKTQRLEELAIKQSR 300
TGS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSRARA PQLSVKTQRLEELAIKQSR
Sbjct: 241 TGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARATPQLSVKTQRLEELAIKQSR 300
Query: 301 QLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQSRG 360
QLIPVTPSM KAM VLN SDKSKPKLASRT EL+VTIKGGQ QPLSVHANQSRG
Sbjct: 301 QLIPVTPSMTKAM-------VLNPSDKSKPKLASRTGELNVTIKGGQQQPLSVHANQSRG 360
Query: 361 GHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPHPPL 420
G VKSDAQKS+HGKFLVLKPVRENG+SL AKDVSSPTSNANSMA NSQFALA +V H PL
Sbjct: 361 GLVKSDAQKSAHGKFLVLKPVRENGISLAAKDVSSPTSNANSMAVNSQFALASAVAHAPL 420
Query: 421 RSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDS 480
RSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSM SSAVLSDS
Sbjct: 421 RSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSM-SSAVLSDS 480
Query: 481 CSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTESHVA 540
CSSVKSPSI SNELTREE D+PASPR+IENG+VE+RNGD SEEV+ASCD+GEKTESHVA
Sbjct: 481 CSSVKSPSISHSNELTREETDMPASPRVIENGAVENRNGDGSEEVRASCDTGEKTESHVA 540
Query: 541 AESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPS 600
AESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCI+PKIFVPS
Sbjct: 541 AESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIKPKIFVPS 600
Query: 601 ESHEDSKDGAGSELSSSDSEA 621
ESHEDSKDGAGSELSSSDSEA
Sbjct: 601 ESHEDSKDGAGSELSSSDSEA 613
BLAST of CcUC03G051500 vs. NCBI nr
Match:
XP_008460469.1 (PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis melo] >KAA0067384.1 mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis melo var. makuwa] >TYK26525.1 mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 1033.9 bits (2672), Expect = 5.8e-298
Identity = 572/625 (91.52%), Postives = 587/625 (93.92%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQ RNR SKT DFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSHN-SNGILSGASVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRRV TE+KSHN SNGILSG SVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
KDFPSLGSE+KQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQ---LSVKTQRLEELAIK 300
GS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSR R APQ LSVKTQRLEELAIK
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQ 360
QSRQLIPVTPSMPKAM VL+SSDKSKPKLASRT EL+ TIKGGQPQP SVHANQ
Sbjct: 301 QSRQLIPVTPSMPKAM-------VLSSSDKSKPKLASRTGELNATIKGGQPQPSSVHANQ 360
Query: 361 SRGGHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPH 420
SR GHVK DAQKSSHGKFLVLKPVRENGVSL AKDVSSPTSNANSMAANSQFALAPSVPH
Sbjct: 361 SRVGHVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPH 420
Query: 421 PPLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL 480
PLRSPNNTNVSSVERKIASLDLK+GTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL
Sbjct: 421 APLRSPNNTNVSSVERKIASLDLKTGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL 480
Query: 481 SDSCSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTES 540
SDSCSSVKSPSIGQSNELT EE+ IPASPR+IENG+VE+RNG+SSEEVQ S DSGEKTES
Sbjct: 481 SDSCSSVKSPSIGQSNELTSEEMGIPASPRVIENGAVENRNGNSSEEVQISRDSGEKTES 540
Query: 541 HVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIF 600
HVAAESLDEEEAAFLRSLGWDE+CGEDEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIF
Sbjct: 541 HVAAESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIF 600
Query: 601 VPSESHEDSK-DGAGSELSSSDSEA 621
VPSES EDSK DGAGSELSSSDSEA
Sbjct: 601 VPSESREDSKDDGAGSELSSSDSEA 616
BLAST of CcUC03G051500 vs. NCBI nr
Match:
XP_038907227.1 (mediator of RNA polymerase II transcription subunit 1 isoform X1 [Benincasa hispida])
HSP 1 Score: 1030.0 bits (2662), Expect = 8.3e-297
Identity = 567/624 (90.87%), Postives = 584/624 (93.59%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGG NSNHHFPSSSSHSDVPSLSQ RNR SKT DFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGSNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGD+WDRDS DPLGKI
Sbjct: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDSWDRDSPDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSH-NSNGILSGASVGSSIQKAVFE 180
LSN+IDKDALRRSHSMVSRK GERELFHRR ATE+K+H NSNGILSG SV SSIQKAVFE
Sbjct: 121 LSNKIDKDALRRSHSMVSRKLGERELFHRRAATELKNHNNSNGILSGTSVSSSIQKAVFE 180
Query: 181 KDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
KDFPSLGSE+KQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQ---LSVKTQRLEELAIK 300
TGS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSRARA PQ LSVKTQRLEELAIK
Sbjct: 241 TGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARATPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQ 360
QSRQLIPVTPSM KAM VLN SDKSKPKLASRT EL+VTIKGGQ QPLSVHANQ
Sbjct: 301 QSRQLIPVTPSMTKAM-------VLNPSDKSKPKLASRTGELNVTIKGGQQQPLSVHANQ 360
Query: 361 SRGGHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPH 420
SRGG VKSDAQKS+HGKFLVLKPVRENG+SL AKDVSSPTSNANSMA NSQFALA +V H
Sbjct: 361 SRGGLVKSDAQKSAHGKFLVLKPVRENGISLAAKDVSSPTSNANSMAVNSQFALASAVAH 420
Query: 421 PPLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL 480
PLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSM SSAVL
Sbjct: 421 APLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSM-SSAVL 480
Query: 481 SDSCSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTES 540
SDSCSSVKSPSI SNELTREE D+PASPR+IENG+VE+RNGD SEEV+ASCD+GEKTES
Sbjct: 481 SDSCSSVKSPSISHSNELTREETDMPASPRVIENGAVENRNGDGSEEVRASCDTGEKTES 540
Query: 541 HVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIF 600
HVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCI+PKIF
Sbjct: 541 HVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIKPKIF 600
Query: 601 VPSESHEDSKDGAGSELSSSDSEA 621
VPSESHEDSKDGAGSELSSSDSEA
Sbjct: 601 VPSESHEDSKDGAGSELSSSDSEA 616
BLAST of CcUC03G051500 vs. NCBI nr
Match:
XP_011655200.1 (mediator of RNA polymerase II transcription subunit 1 isoform X2 [Cucumis sativus] >KGN51281.2 hypothetical protein Csa_009411 [Cucumis sativus])
HSP 1 Score: 1019.6 bits (2635), Expect = 1.1e-293
Identity = 564/622 (90.68%), Postives = 583/622 (93.73%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLRSTGSV GGGN NHHFPSSSSHSDVPSLSQ RNR SKT DFD+SRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVAGGGNPNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDSSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSHN-SNGILSGASVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRRV TE+KSHN SNGILSG SVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
KDFPSLGSE+KQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQLSVKTQRLEELAIKQSR 300
TGS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSRARAAPQLSVKTQRLEELAIKQSR
Sbjct: 241 TGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARAAPQLSVKTQRLEELAIKQSR 300
Query: 301 QLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQSRG 360
QLIPVTPSMPKAM VL+SSDKSKPKLASRT EL+ TIKGGQPQPL VHANQSR
Sbjct: 301 QLIPVTPSMPKAM-------VLSSSDKSKPKLASRTGELNATIKGGQPQPLLVHANQSRV 360
Query: 361 GHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPHPPL 420
GHVK DAQKSSHGKFLVLKPVRENGVSL AKDVSSPTSNANSMAANSQFALAPSVPH PL
Sbjct: 361 GHVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPL 420
Query: 421 RSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDS 480
RSPNN NVSS+ERKIASLDLK+GTTLEKRPSLSQVQSRNDFF LIKKKTSM+SSAVLSDS
Sbjct: 421 RSPNNINVSSMERKIASLDLKTGTTLEKRPSLSQVQSRNDFFKLIKKKTSMNSSAVLSDS 480
Query: 481 CSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTESHVA 540
CSSVKSPSIGQSNELT EE+ ASPR+IENG+VE+RNG+SSEEVQ S DSGEKTESHVA
Sbjct: 481 CSSVKSPSIGQSNELTSEEMG-TASPRVIENGAVENRNGNSSEEVQVSRDSGEKTESHVA 540
Query: 541 AESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPS 600
AESLDEEEAAFLRSLGWDE+CGEDEGLTEEEINSFYREY+NLKPSLK+GRCIQPKIFVPS
Sbjct: 541 AESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREYVNLKPSLKIGRCIQPKIFVPS 600
Query: 601 ESHEDSK-DGAGSELSSSDSEA 621
ES DSK DGAGSELSSSDSEA
Sbjct: 601 ESRVDSKDDGAGSELSSSDSEA 612
BLAST of CcUC03G051500 vs. ExPASy TrEMBL
Match:
A0A1S3CC42 (mediator of RNA polymerase II transcription subunit 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499274 PE=4 SV=1)
HSP 1 Score: 1039.3 bits (2686), Expect = 6.6e-300
Identity = 572/622 (91.96%), Postives = 587/622 (94.37%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQ RNR SKT DFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSHN-SNGILSGASVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRRV TE+KSHN SNGILSG SVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
KDFPSLGSE+KQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQLSVKTQRLEELAIKQSR 300
GS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSR R APQLSVKTQRLEELAIKQSR
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQLSVKTQRLEELAIKQSR 300
Query: 301 QLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQSRG 360
QLIPVTPSMPKAM VL+SSDKSKPKLASRT EL+ TIKGGQPQP SVHANQSR
Sbjct: 301 QLIPVTPSMPKAM-------VLSSSDKSKPKLASRTGELNATIKGGQPQPSSVHANQSRV 360
Query: 361 GHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPHPPL 420
GHVK DAQKSSHGKFLVLKPVRENGVSL AKDVSSPTSNANSMAANSQFALAPSVPH PL
Sbjct: 361 GHVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPHAPL 420
Query: 421 RSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDS 480
RSPNNTNVSSVERKIASLDLK+GTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDS
Sbjct: 421 RSPNNTNVSSVERKIASLDLKTGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVLSDS 480
Query: 481 CSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTESHVA 540
CSSVKSPSIGQSNELT EE+ IPASPR+IENG+VE+RNG+SSEEVQ S DSGEKTESHVA
Sbjct: 481 CSSVKSPSIGQSNELTSEEMGIPASPRVIENGAVENRNGNSSEEVQISRDSGEKTESHVA 540
Query: 541 AESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIFVPS 600
AESLDEEEAAFLRSLGWDE+CGEDEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIFVPS
Sbjct: 541 AESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIFVPS 600
Query: 601 ESHEDSK-DGAGSELSSSDSEA 621
ES EDSK DGAGSELSSSDSEA
Sbjct: 601 ESREDSKDDGAGSELSSSDSEA 613
BLAST of CcUC03G051500 vs. ExPASy TrEMBL
Match:
A0A5D3DT29 (Mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold313G001070 PE=4 SV=1)
HSP 1 Score: 1033.9 bits (2672), Expect = 2.8e-298
Identity = 572/625 (91.52%), Postives = 587/625 (93.92%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQ RNR SKT DFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSHN-SNGILSGASVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRRV TE+KSHN SNGILSG SVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
KDFPSLGSE+KQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQ---LSVKTQRLEELAIK 300
GS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSR R APQ LSVKTQRLEELAIK
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQ 360
QSRQLIPVTPSMPKAM VL+SSDKSKPKLASRT EL+ TIKGGQPQP SVHANQ
Sbjct: 301 QSRQLIPVTPSMPKAM-------VLSSSDKSKPKLASRTGELNATIKGGQPQPSSVHANQ 360
Query: 361 SRGGHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPH 420
SR GHVK DAQKSSHGKFLVLKPVRENGVSL AKDVSSPTSNANSMAANSQFALAPSVPH
Sbjct: 361 SRVGHVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPH 420
Query: 421 PPLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL 480
PLRSPNNTNVSSVERKIASLDLK+GTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL
Sbjct: 421 APLRSPNNTNVSSVERKIASLDLKTGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL 480
Query: 481 SDSCSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTES 540
SDSCSSVKSPSIGQSNELT EE+ IPASPR+IENG+VE+RNG+SSEEVQ S DSGEKTES
Sbjct: 481 SDSCSSVKSPSIGQSNELTSEEMGIPASPRVIENGAVENRNGNSSEEVQISRDSGEKTES 540
Query: 541 HVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIF 600
HVAAESLDEEEAAFLRSLGWDE+CGEDEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIF
Sbjct: 541 HVAAESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIF 600
Query: 601 VPSESHEDSK-DGAGSELSSSDSEA 621
VPSES EDSK DGAGSELSSSDSEA
Sbjct: 601 VPSESREDSKDDGAGSELSSSDSEA 616
BLAST of CcUC03G051500 vs. ExPASy TrEMBL
Match:
A0A1S3CDT9 (mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499274 PE=4 SV=1)
HSP 1 Score: 1033.9 bits (2672), Expect = 2.8e-298
Identity = 572/625 (91.52%), Postives = 587/625 (93.92%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQ RNR SKT DFDTSRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDTSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSHN-SNGILSGASVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRRV TE+KSHN SNGILSG SVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
KDFPSLGSE+KQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQ---LSVKTQRLEELAIK 300
GS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSR R APQ LSVKTQRLEELAIK
Sbjct: 241 PGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQSPSRTRTAPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQ 360
QSRQLIPVTPSMPKAM VL+SSDKSKPKLASRT EL+ TIKGGQPQP SVHANQ
Sbjct: 301 QSRQLIPVTPSMPKAM-------VLSSSDKSKPKLASRTGELNATIKGGQPQPSSVHANQ 360
Query: 361 SRGGHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPH 420
SR GHVK DAQKSSHGKFLVLKPVRENGVSL AKDVSSPTSNANSMAANSQFALAPSVPH
Sbjct: 361 SRVGHVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPH 420
Query: 421 PPLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL 480
PLRSPNNTNVSSVERKIASLDLK+GTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL
Sbjct: 421 APLRSPNNTNVSSVERKIASLDLKTGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL 480
Query: 481 SDSCSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTES 540
SDSCSSVKSPSIGQSNELT EE+ IPASPR+IENG+VE+RNG+SSEEVQ S DSGEKTES
Sbjct: 481 SDSCSSVKSPSIGQSNELTSEEMGIPASPRVIENGAVENRNGNSSEEVQISRDSGEKTES 540
Query: 541 HVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIF 600
HVAAESLDEEEAAFLRSLGWDE+CGEDEGLTEEEINSFYREYMNLKPSLK+GRCIQPKIF
Sbjct: 541 HVAAESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREYMNLKPSLKIGRCIQPKIF 600
Query: 601 VPSESHEDSK-DGAGSELSSSDSEA 621
VPSES EDSK DGAGSELSSSDSEA
Sbjct: 601 VPSESREDSKDDGAGSELSSSDSEA 616
BLAST of CcUC03G051500 vs. ExPASy TrEMBL
Match:
A0A0A0KN63 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G387430 PE=4 SV=1)
HSP 1 Score: 1014.2 bits (2621), Expect = 2.3e-292
Identity = 564/625 (90.24%), Postives = 583/625 (93.28%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLRSTGSV GGGN NHHFPSSSSHSDVPSLSQ RNR SKT DFD+SRS
Sbjct: 1 MERSEPTLVPEWLRSTGSVAGGGNPNHHFPSSSSHSDVPSLSQSRNRISKTTGDFDSSRS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
+FLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRD+HDPLGKI
Sbjct: 61 SFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDAHDPLGKI 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKSHN-SNGILSGASVGSSIQKAVFE 180
LSNRIDKDALRRSHSMVSRKQG ELFHRRV TE+KSHN SNGILSG SVGSSIQKAVFE
Sbjct: 121 LSNRIDKDALRRSHSMVSRKQG--ELFHRRVGTELKSHNSSNGILSGTSVGSSIQKAVFE 180
Query: 181 KDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
KDFPSLGSE+KQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST
Sbjct: 181 KDFPSLGSEEKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGST 240
Query: 241 TGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQ---LSVKTQRLEELAIK 300
TGS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSRARAAPQ LSVKTQRLEELAIK
Sbjct: 241 TGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARAAPQVSELSVKTQRLEELAIK 300
Query: 301 QSRQLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHANQ 360
QSRQLIPVTPSMPKAM VL+SSDKSKPKLASRT EL+ TIKGGQPQPL VHANQ
Sbjct: 301 QSRQLIPVTPSMPKAM-------VLSSSDKSKPKLASRTGELNATIKGGQPQPLLVHANQ 360
Query: 361 SRGGHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVPH 420
SR GHVK DAQKSSHGKFLVLKPVRENGVSL AKDVSSPTSNANSMAANSQFALAPSVPH
Sbjct: 361 SRVGHVKPDAQKSSHGKFLVLKPVRENGVSLAAKDVSSPTSNANSMAANSQFALAPSVPH 420
Query: 421 PPLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAVL 480
PLRSPNN NVSS+ERKIASLDLK+GTTLEKRPSLSQVQSRNDFF LIKKKTSM+SSAVL
Sbjct: 421 APLRSPNNINVSSMERKIASLDLKTGTTLEKRPSLSQVQSRNDFFKLIKKKTSMNSSAVL 480
Query: 481 SDSCSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTES 540
SDSCSSVKSPSIGQSNELT EE+ ASPR+IENG+VE+RNG+SSEEVQ S DSGEKTES
Sbjct: 481 SDSCSSVKSPSIGQSNELTSEEMG-TASPRVIENGAVENRNGNSSEEVQVSRDSGEKTES 540
Query: 541 HVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKIF 600
HVAAESLDEEEAAFLRSLGWDE+CGEDEGLTEEEINSFYREY+NLKPSLK+GRCIQPKIF
Sbjct: 541 HVAAESLDEEEAAFLRSLGWDESCGEDEGLTEEEINSFYREYVNLKPSLKIGRCIQPKIF 600
Query: 601 VPSESHEDSK-DGAGSELSSSDSEA 621
VPSES DSK DGAGSELSSSDSEA
Sbjct: 601 VPSESRVDSKDDGAGSELSSSDSEA 615
BLAST of CcUC03G051500 vs. ExPASy TrEMBL
Match:
A0A6J1FM76 (uncharacterized protein LOC111445235 OS=Cucurbita moschata OX=3662 GN=LOC111445235 PE=4 SV=1)
HSP 1 Score: 994.6 bits (2570), Expect = 1.9e-286
Identity = 549/625 (87.84%), Postives = 574/625 (91.84%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
MERSEPTLVPEWLR+TGSVTGGGNSNHHF S S HSDVPS SQ RNRTSKT DFDTSR
Sbjct: 1 MERSEPTLVPEWLRNTGSVTGGGNSNHHFQSPSPHSDVPSQSQPRNRTSKTTGDFDTSRP 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSSFNRGHRDKDREKEKDRLNFGDNWDRDSHDPLGKI 120
AFLDRT+SSNSRRSSSNGS+KHAYSSFNRGHRDKDREKEKDR NFGDNWDRDSHDPLGK+
Sbjct: 61 AFLDRTASSNSRRSSSNGSAKHAYSSFNRGHRDKDREKEKDRFNFGDNWDRDSHDPLGKL 120
Query: 121 LSNRIDKDALRRSHSMVSRKQGERELFHRRVATEMKS--HNSNGILSGASVGSSIQKAVF 180
L NR+DKDALRRSHSMVSRKQ ELFHRRVAT++K+ ++SNG+ G SVGSSIQKAVF
Sbjct: 121 LPNRVDKDALRRSHSMVSRKQD--ELFHRRVATDLKAGVNSSNGMPPGISVGSSIQKAVF 180
Query: 181 EKDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPSMIGS 240
EKDFPSLGSE+KQGASEIGRVSSPGLSS VQSLPIGNSALIVGGEGWTSALAEVPSMIGS
Sbjct: 181 EKDFPSLGSEEKQGASEIGRVSSPGLSSSVQSLPIGNSALIVGGEGWTSALAEVPSMIGS 240
Query: 241 TTGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQ---LSVKTQRLEELAI 300
TTGS SFQQ VPATSGAGPLS+TAGLNMAEALVQ PSRARAAPQ LSVKTQRLEELAI
Sbjct: 241 TTGSSSFQQTVPATSGAGPLSVTAGLNMAEALVQAPSRARAAPQASELSVKTQRLEELAI 300
Query: 301 KQSRQLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSVHAN 360
KQSRQLIPVTPSMPKA LNSSDKSKPKLASRT EL+VT+KGGQP P SVHAN
Sbjct: 301 KQSRQLIPVTPSMPKAS-------ALNSSDKSKPKLASRTGELNVTVKGGQPLPSSVHAN 360
Query: 361 QSRGGHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALAPSVP 420
QSRGGHVKSDAQKSSHGKFLVLKP RENGVS TAKDVSSPTSN ANSQFALAPSVP
Sbjct: 361 QSRGGHVKSDAQKSSHGKFLVLKPARENGVSPTAKDVSSPTSN-----ANSQFALAPSVP 420
Query: 421 HPPLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAV 480
H PLRSPNN+NV+SVERK+ASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAV
Sbjct: 421 HAPLRSPNNSNVASVERKMASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSMSSSAV 480
Query: 481 LSDSCSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDSGEKTE 540
LSDSCSSVKSPSIGQSNELTREEID PASP ++ENG VE+ NGDSSEEV++SCDSGEKTE
Sbjct: 481 LSDSCSSVKSPSIGQSNELTREEIDTPASPHVLENGVVENTNGDSSEEVRSSCDSGEKTE 540
Query: 541 SHVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSLKMGRCIQPKI 600
+HVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYM+LKPSLKMGR IQPKI
Sbjct: 541 THVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMSLKPSLKMGRSIQPKI 600
Query: 601 FVPSESHEDSKDGAGSELSSSDSEA 621
VPSESHEDSKDGAGSELSSSDSEA
Sbjct: 601 SVPSESHEDSKDGAGSELSSSDSEA 611
BLAST of CcUC03G051500 vs. TAIR 10
Match:
AT1G36990.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G08510.1); Has 5029 Blast hits to 1779 proteins in 339 species: Archae - 2; Bacteria - 1372; Metazoa - 990; Fungi - 933; Plants - 111; Viruses - 28; Other Eukaryotes - 1593 (source: NCBI BLink). )
HSP 1 Score: 406.8 bits (1044), Expect = 3.2e-113
Identity = 296/600 (49.33%), Postives = 369/600 (61.50%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLS-QLRNRTSKTASDFDTSR 60
M++ E +L PEWLRS+G +GGG+SNH SSSSHSD SL RNR S++ SD D+
Sbjct: 1 MDKGEHSLAPEWLRSSGHASGGGSSNHLLVSSSSHSDSASLQYNSRNRNSRSKSDVDSIH 60
Query: 61 SAFLDRTSSSNSRRSSSNGSSKHAYSS--FNRGHRDKDREKEKDRLNFGDNWDRDSHDPL 120
S FLDR+SS+NSRR SSNGS+KHAYSS FNR RDKDR ++KDR+++ D WD D+ PL
Sbjct: 61 SPFLDRSSSTNSRRGSSNGSAKHAYSSFNFNRSQRDKDRSRDKDRVSYVDPWDLDTSIPL 120
Query: 121 GKILSNRIDKDALRRSHSMVSRKQGERELFHRRVAT----EMKSHNSNGILSGASVGSSI 180
IL+ R D D LRRSHSMV+RKQGE V S+N NG+LSG S+G+S
Sbjct: 121 RTILTGR-DPDPLRRSHSMVTRKQGEHLSRGLTVGLNNGGSSNSYNGNGLLSGPSIGNSF 180
Query: 181 QKAVFEKDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVP 240
Q+ F+KDFPSLG+E+KQ ++ RVSSPG+SS VQ+LP+GNSALI GGEGWTSALAEVP
Sbjct: 181 QRTGFDKDFPSLGAEEKQNGQDVVRVSSPGISSVVQNLPVGNSALI-GGEGWTSALAEVP 240
Query: 241 SMI-----GSTTGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQLSVKTQ 300
++I GS T SP + AG L+ +GLNMAEALVQ P+R PQ SVKTQ
Sbjct: 241 NVIEKACTGSLT-SPKANAV-----SAGTLTGPSGLNMAEALVQAPARTHTPPQGSVKTQ 300
Query: 301 RLEELAIKQSRQLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGG--- 360
RLE+LAIKQSRQLIPV PS PK + LNSSDKSK K RT E +
Sbjct: 301 RLEDLAIKQSRQLIPVVPSAPK-------GLSLNSSDKSKTKQVVRTGETCLAPSRNALQ 360
Query: 361 QPQPLSVHANQSRGGHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAAN 420
QP L + G +K + K LVLKP RENGVS K+ SP++N N+ AA+
Sbjct: 361 QPAVLLGSFQSNPSGQIKPEK------KLLVLKPARENGVS-AVKESGSPSANTNTRAAS 420
Query: 421 SQ-FALAPSVPHPPLRSPNNTNVSSVERKIAS-LDLKSGTTLEKRPSLSQVQSRNDFFNL 480
SQ + S P+RS N S E K AS + SG T+EK+PS +Q QSR+ F++
Sbjct: 421 SQLMSNTQSTQSAPVRSTN----SPKELKGASAFSMISGQTIEKKPSAAQAQSRSAFYSA 480
Query: 481 IKKKTSMSSSAVLSDSCSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEE 540
+K+K + S+S + +D SS S S +L + I + P S E
Sbjct: 481 LKQKQTASTS-ITTDPVSSSTSASSSVEVKLNSSKDLIASDP--------SSSQATSGVE 540
Query: 541 VQASCDSGEKTESHVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKP 584
V S T A ++ DEEEA FLRSLGW EN GE E LTEEEI+SF +Y L+P
Sbjct: 541 VTDSVQVASHTSGFEATDTPDEEEAQFLRSLGWVENNGE-EYLTEEEIDSFLEQYKELRP 564
BLAST of CcUC03G051500 vs. TAIR 10
Match:
AT4G08510.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36990.1); Has 888 Blast hits to 321 proteins in 121 species: Archae - 0; Bacteria - 120; Metazoa - 86; Fungi - 24; Plants - 79; Viruses - 0; Other Eukaryotes - 579 (source: NCBI BLink). )
HSP 1 Score: 302.0 bits (772), Expect = 1.1e-81
Identity = 244/595 (41.01%), Postives = 331/595 (55.63%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSTGSVTGGGNSNHHFPSSSSHSDVPSLSQLRNRTSKTASDFDTSRS 60
ME+ EP+LVPEWLRS+G +G G+SN S SD SL +NR +++ SD D+ S
Sbjct: 1 MEKREPSLVPEWLRSSGHGSGVGSSN-------SLSD--SLRNSKNRNARSRSDADSVGS 60
Query: 61 AFLDRTSSSNSRRSSSNGSSKHAYSS--FNRGHRDKDREKEKDRLNFGDNWDRDSHDPLG 120
FLDR+SS+N+RR SSNGS+KHAYSS FNR +RDKDR +EKDR+++ D WD DS P G
Sbjct: 61 PFLDRSSSTNTRRGSSNGSTKHAYSSFNFNRSNRDKDRSREKDRMSYMDPWDNDSSMPFG 120
Query: 121 KILSNRIDKDALRRSHSMVSRKQGERELFHRRVATE----MKSHNSNGILSGASVGSSIQ 180
L R ++ LRRSHSM +RKQG V + + + N +GIL G S S +
Sbjct: 121 TFLIGR-GEEPLRRSHSMTTRKQGNHLAQGFTVGYKNGGNINTFNGHGILPGTSPVKSSK 180
Query: 181 KAVFEKDFPSLGSEDKQGASEIGRVSSPGLSSPVQSLPIGNSALIVGGEGWTSALAEVPS 240
+ F KDFP L E++ G ++ R+SSPG S QSL + N ALI+ GEGWTSALAEVP+
Sbjct: 181 RMGFNKDFPLLRGEERNGGPDVVRISSPGRSPTAQSLSVANPALII-GEGWTSALAEVPN 240
Query: 241 MIGSTTGSPSFQQIVPATSGAGPLSITAGLNMAEALVQVPSRARAAPQLSVKTQRLEELA 300
+I + G+ S + + + +GP A NMAEALVQ P R PQ Q LE+ A
Sbjct: 241 VIEKSGGAESHANVGNSATLSGP----ACRNMAEALVQAPGRTGTPPQ----AQTLEDRA 300
Query: 301 IKQSRQLIPVTPSMPKAMVIGANAIVLNSSDKSKPKLASRTAELSVTIKGGQPQPLSV-- 360
I+QSRQLIPV PS PK V NSSDKSK K R+ E + Q SV
Sbjct: 301 IRQSRQLIPVVPSAPKGS-------VHNSSDKSKTKPMFRSGETGLASSRNTQQQSSVML 360
Query: 361 -HANQSRGGHVKSDAQKSSHGKFLVLKPVRENGVSLTAKDVSSPTSNANSMAANSQFALA 420
+ + G +K D K K ++LKP RENG V + S NS A SQ A
Sbjct: 361 GNMQSNPGSQIKPDTTK----KLVILKPARENG-------VVAGGSPPNSRVAASQPTTA 420
Query: 421 PSVPH-PPLRSPNNTNVSSVERKIASLDLKSGTTLEKRPSLSQVQSRNDFFNLIKKKTSM 480
PS +RS N + + AS+++ +G EK+ SL+Q QSR+ F++ +K+KT
Sbjct: 421 PSTQFTASVRSTNGPR----DLRGASVNMLAGKAAEKKLSLAQTQSRHAFYSALKQKTCT 480
Query: 481 SSSAVLSDSCSSVKSPSIGQSNELTREEIDIPASPRIIENGSVEDRNGDSSEEVQASCDS 540
+ S S + S + S Q+N P+SP+ E + E V+ +
Sbjct: 481 NISTDPSKTSSCILSSVEEQANSSKELVASDPSSPQAAERDEI-------MESVEKVSNV 540
Query: 541 GEKTESHVAAESLDEEEAAFLRSLGWDENCGEDEGLTEEEINSFYREYMNLKPSL 586
E+ +A D +EAAFL+SLGWDEN ++ T EE+ + +++ KPSL
Sbjct: 541 AERISRFESAVRPDPKEAAFLKSLGWDENDSDEYTHTMEEMREWCKKF---KPSL 544
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_008460470.1 | 1.4e-299 | 91.96 | PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X2 [Cuc... | [more] |
XP_038907228.1 | 2.0e-298 | 91.30 | mediator of RNA polymerase II transcription subunit 1 isoform X2 [Benincasa hisp... | [more] |
XP_008460469.1 | 5.8e-298 | 91.52 | PREDICTED: mediator of RNA polymerase II transcription subunit 1 isoform X1 [Cuc... | [more] |
XP_038907227.1 | 8.3e-297 | 90.87 | mediator of RNA polymerase II transcription subunit 1 isoform X1 [Benincasa hisp... | [more] |
XP_011655200.1 | 1.1e-293 | 90.68 | mediator of RNA polymerase II transcription subunit 1 isoform X2 [Cucumis sativu... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A1S3CC42 | 6.6e-300 | 91.96 | mediator of RNA polymerase II transcription subunit 1 isoform X2 OS=Cucumis melo... | [more] |
A0A5D3DT29 | 2.8e-298 | 91.52 | Mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo... | [more] |
A0A1S3CDT9 | 2.8e-298 | 91.52 | mediator of RNA polymerase II transcription subunit 1 isoform X1 OS=Cucumis melo... | [more] |
A0A0A0KN63 | 2.3e-292 | 90.24 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G387430 PE=4 SV=1 | [more] |
A0A6J1FM76 | 1.9e-286 | 87.84 | uncharacterized protein LOC111445235 OS=Cucurbita moschata OX=3662 GN=LOC1114452... | [more] |
Match Name | E-value | Identity | Description | |
AT1G36990.1 | 3.2e-113 | 49.33 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXP... | [more] |
AT4G08510.1 | 1.1e-81 | 41.01 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc... | [more] |