Cla97C01G006460 (gene) Watermelon (97103) v2.5

Overview
NameCla97C01G006460
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionMediator of RNA polymerase II transcription subunit
LocationCla97Chr01: 6421413 .. 6421877 (+)
RNA-Seq ExpressionCla97C01G006460
SyntenyCla97C01G006460
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAATCCATTAATTCCATCGCCTTCTCAATCTCCTCCACCAATTCCGCTGCCTTCCTCCTCCCTCCCACTCCCTCCGGCCACCGCCTCCGCCGCCTACCAGTGGCTGTAACCATCAGAGCATCGTCTCGATCGTCAGCCGACAACAACAATAATTACTACGCGAGCGGGAAAGTGGTGGACGAGAGCATGATCGTTCTCCGGAAACGAATCCACGAGATCAAGATGGCGGAGCAAAGCCACGATCCGCCGGCCGATTGGTTTGATTGGGAAAAACGGTATTATTCCAATTACGATTCTCATATCTGTGAAGCTTTAGGCTATCTTCAATCTCATTTGATGAATACTCGACCTAGCGTGGCTTTGGGAATGCTCCTTATTATAATCCTCAGCGTTCCGCTTTCCTCCGCCCTCCTTCTCCACCGCTTCTTCCAAATCGCCGCCGCGCTTCTCGCCGCCGCTTAA

mRNA sequence

ATGAAATCCATTAATTCCATCGCCTTCTCAATCTCCTCCACCAATTCCGCTGCCTTCCTCCTCCCTCCCACTCCCTCCGGCCACCGCCTCCGCCGCCTACCAGTGGCTGTAACCATCAGAGCATCGTCTCGATCGTCAGCCGACAACAACAATAATTACTACGCGAGCGGGAAAGTGGTGGACGAGAGCATGATCGTTCTCCGGAAACGAATCCACGAGATCAAGATGGCGGAGCAAAGCCACGATCCGCCGGCCGATTGGTTTGATTGGGAAAAACGGTATTATTCCAATTACGATTCTCATATCTGTGAAGCTTTAGGCTATCTTCAATCTCATTTGATGAATACTCGACCTAGCGTGGCTTTGGGAATGCTCCTTATTATAATCCTCAGCGTTCCGCTTTCCTCCGCCCTCCTTCTCCACCGCTTCTTCCAAATCGCCGCCGCGCTTCTCGCCGCCGCTTAA

Coding sequence (CDS)

ATGAAATCCATTAATTCCATCGCCTTCTCAATCTCCTCCACCAATTCCGCTGCCTTCCTCCTCCCTCCCACTCCCTCCGGCCACCGCCTCCGCCGCCTACCAGTGGCTGTAACCATCAGAGCATCGTCTCGATCGTCAGCCGACAACAACAATAATTACTACGCGAGCGGGAAAGTGGTGGACGAGAGCATGATCGTTCTCCGGAAACGAATCCACGAGATCAAGATGGCGGAGCAAAGCCACGATCCGCCGGCCGATTGGTTTGATTGGGAAAAACGGTATTATTCCAATTACGATTCTCATATCTGTGAAGCTTTAGGCTATCTTCAATCTCATTTGATGAATACTCGACCTAGCGTGGCTTTGGGAATGCTCCTTATTATAATCCTCAGCGTTCCGCTTTCCTCCGCCCTCCTTCTCCACCGCTTCTTCCAAATCGCCGCCGCGCTTCTCGCCGCCGCTTAA

Protein sequence

MKSINSIAFSISSTNSAAFLLPPTPSGHRLRRLPVAVTIRASSRSSADNNNNYYASGKVVDESMIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHLMNTRPSVALGMLLIIILSVPLSSALLLHRFFQIAAALLAAA
Homology
BLAST of Cla97C01G006460 vs. NCBI nr
Match: XP_038875502.1 (uncharacterized protein LOC120067929 [Benincasa hispida])

HSP 1 Score: 245.0 bits (624), Expect = 4.3e-61
Identity = 132/153 (86.27%), Postives = 140/153 (91.50%), Query Frame = 0

Query: 1   MKSINSIAFSISSTNSAAFLLPPTPSGHRLRRLPVAVTIRASSRSSAD-NNNNYYASGKV 60
           MKSINS+  SISSTNSA FL P T S HR RRLP+AVT+ ASSRSSAD NNNNYYASGKV
Sbjct: 1   MKSINSLDRSISSTNSAPFLFPST-SAHRHRRLPLAVTVTASSRSSADKNNNNYYASGKV 60

Query: 61  VDESMIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHLMNTRPS 120
           VDESMIVLRKRIHEIKMAEQSH+PPADW DWEKRYYS+YDSHICEALGYLQ+HLMNTRPS
Sbjct: 61  VDESMIVLRKRIHEIKMAEQSHEPPADWLDWEKRYYSDYDSHICEALGYLQTHLMNTRPS 120

Query: 121 VALGMLLIIILSVPLSSALLLHRFFQIAAALLA 153
           VALGMLL+I LSVP SSA+LLHRFFQIAAALLA
Sbjct: 121 VALGMLLLITLSVPFSSAILLHRFFQIAAALLA 152

BLAST of Cla97C01G006460 vs. NCBI nr
Match: KAA0046315.1 (uncharacterized protein E6C27_scaffold149G00240 [Cucumis melo var. makuwa])

HSP 1 Score: 209.5 bits (532), Expect = 2.0e-50
Identity = 116/159 (72.96%), Postives = 125/159 (78.62%), Query Frame = 0

Query: 1   MKSINSIAFSISSTNSAAFLLPPTPSGHRLRRLPVAVTIRASSRSSADNNNN-------Y 60
           MKSINS++FS  S      L  P P  H  R+   AVT+RAS   +ADNNNN       Y
Sbjct: 1   MKSINSLSFSTPSP-----LFSPAPHPHGRRKPLPAVTVRASREQAADNNNNNNNNYNDY 60

Query: 61  YASGKVVDESMIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHL 120
           YA GKVVDESMIVLRKRIHEIKMAEQ  +PPADW DWEKRYYS+YDSHICEALGYLQSHL
Sbjct: 61  YAGGKVVDESMIVLRKRIHEIKMAEQRQEPPADWLDWEKRYYSDYDSHICEALGYLQSHL 120

Query: 121 MNTRPSVALGMLLIIILSVPLSSALLLHRFFQIAAALLA 153
           MNTRPSVALGMLL+II+SVPLSSALLLHRFF IA ALLA
Sbjct: 121 MNTRPSVALGMLLLIIVSVPLSSALLLHRFFHIAVALLA 154

BLAST of Cla97C01G006460 vs. NCBI nr
Match: XP_022993164.1 (uncharacterized protein LOC111489262 [Cucurbita maxima])

HSP 1 Score: 203.4 bits (516), Expect = 1.4e-48
Identity = 111/149 (74.50%), Postives = 124/149 (83.22%), Query Frame = 0

Query: 10  SISSTNSAAFL--LPPTPSGHRLRRLPVAVTIRAS----SRSSADNNNNYYASGKVVDES 69
           SISS NSA FL   P +PS  R RRLP+   I +S    S SS +NNNNYYASGK+VDES
Sbjct: 3   SISSANSAPFLHFHPISPSSRRRRRLPLFTVIASSRGADSSSSNNNNNNYYASGKLVDES 62

Query: 70  MIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHLMNTRPSVALG 129
           MIVLRKRIHEIKM EQSH+PP+DW DWEKR YS+YDSHICEALGYLQSHLMNTRPSV+LG
Sbjct: 63  MIVLRKRIHEIKMVEQSHEPPSDWLDWEKRCYSDYDSHICEALGYLQSHLMNTRPSVSLG 122

Query: 130 MLLIIILSVPLSSALLLHRFFQIAAALLA 153
           +LL+I LSVPLSSA++LHRF  IAAALLA
Sbjct: 123 ILLLITLSVPLSSAVILHRFIDIAAALLA 151

BLAST of Cla97C01G006460 vs. NCBI nr
Match: XP_022939420.1 (uncharacterized protein LOC111445338 [Cucurbita moschata])

HSP 1 Score: 202.2 bits (513), Expect = 3.2e-48
Identity = 110/147 (74.83%), Postives = 122/147 (82.99%), Query Frame = 0

Query: 10  SISSTNSAAFL-LPPTPSGHRLRRLPVAVTIRASSR---SSADNNNNYYASGKVVDESMI 69
           SISS NSA FL   P  S  R RR P   T+ ASSR   S++ NNNNYYASGK+VDESMI
Sbjct: 3   SISSANSAPFLHFHPISSSSRRRRRPPLFTVMASSRGADSNSSNNNNYYASGKLVDESMI 62

Query: 70  VLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHLMNTRPSVALGML 129
           VLRKRIHEIKM E+SH+PP+DW DWEKR YS+YDSHICEALGYLQSHLMNTRPSVALG+L
Sbjct: 63  VLRKRIHEIKMVERSHEPPSDWLDWEKRCYSDYDSHICEALGYLQSHLMNTRPSVALGIL 122

Query: 130 LIIILSVPLSSALLLHRFFQIAAALLA 153
           L+I LSVPLSSA++LHRF  IAAALLA
Sbjct: 123 LLITLSVPLSSAVILHRFIDIAAALLA 149

BLAST of Cla97C01G006460 vs. NCBI nr
Match: XP_022989988.1 (uncharacterized protein LOC111487016 [Cucurbita maxima])

HSP 1 Score: 201.8 bits (512), Expect = 4.2e-48
Identity = 110/154 (71.43%), Postives = 126/154 (81.82%), Query Frame = 0

Query: 1   MKSINSIAFSISSTNSAAFLLPP-TPSGHRLRRLPVAVTIRASSRSSAD-NNNNYYASGK 60
           M SINS+  SI ST+SA  L PP +PS HR RR P A T+ ASSR     ++N+YYA GK
Sbjct: 1   MNSINSL-LSIPSTHSAPLLFPPISPSSHRRRRPPPAFTVTASSRPKEQADSNSYYADGK 60

Query: 61  VVDESMIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHLMNTRP 120
           +VDESMIVLRKRIHEIK AEQ+HDPP DWFDWEKR YSNYDS+ICEALGYLQSHLMNTRP
Sbjct: 61  LVDESMIVLRKRIHEIKTAEQTHDPPRDWFDWEKRCYSNYDSNICEALGYLQSHLMNTRP 120

Query: 121 SVALGMLLIIILSVPLSSALLLHRFFQIAAALLA 153
           SVALGML ++ LSVP+SSA++LHRF +IA  LLA
Sbjct: 121 SVALGMLALLTLSVPVSSAVVLHRFIEIAVGLLA 153

BLAST of Cla97C01G006460 vs. ExPASy TrEMBL
Match: A0A5A7TY46 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold149G00240 PE=4 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 9.7e-51
Identity = 116/159 (72.96%), Postives = 125/159 (78.62%), Query Frame = 0

Query: 1   MKSINSIAFSISSTNSAAFLLPPTPSGHRLRRLPVAVTIRASSRSSADNNNN-------Y 60
           MKSINS++FS  S      L  P P  H  R+   AVT+RAS   +ADNNNN       Y
Sbjct: 1   MKSINSLSFSTPSP-----LFSPAPHPHGRRKPLPAVTVRASREQAADNNNNNNNNYNDY 60

Query: 61  YASGKVVDESMIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHL 120
           YA GKVVDESMIVLRKRIHEIKMAEQ  +PPADW DWEKRYYS+YDSHICEALGYLQSHL
Sbjct: 61  YAGGKVVDESMIVLRKRIHEIKMAEQRQEPPADWLDWEKRYYSDYDSHICEALGYLQSHL 120

Query: 121 MNTRPSVALGMLLIIILSVPLSSALLLHRFFQIAAALLA 153
           MNTRPSVALGMLL+II+SVPLSSALLLHRFF IA ALLA
Sbjct: 121 MNTRPSVALGMLLLIIVSVPLSSALLLHRFFHIAVALLA 154

BLAST of Cla97C01G006460 vs. ExPASy TrEMBL
Match: A0A6J1JZG1 (uncharacterized protein LOC111489262 OS=Cucurbita maxima OX=3661 GN=LOC111489262 PE=4 SV=1)

HSP 1 Score: 203.4 bits (516), Expect = 7.0e-49
Identity = 111/149 (74.50%), Postives = 124/149 (83.22%), Query Frame = 0

Query: 10  SISSTNSAAFL--LPPTPSGHRLRRLPVAVTIRAS----SRSSADNNNNYYASGKVVDES 69
           SISS NSA FL   P +PS  R RRLP+   I +S    S SS +NNNNYYASGK+VDES
Sbjct: 3   SISSANSAPFLHFHPISPSSRRRRRLPLFTVIASSRGADSSSSNNNNNNYYASGKLVDES 62

Query: 70  MIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHLMNTRPSVALG 129
           MIVLRKRIHEIKM EQSH+PP+DW DWEKR YS+YDSHICEALGYLQSHLMNTRPSV+LG
Sbjct: 63  MIVLRKRIHEIKMVEQSHEPPSDWLDWEKRCYSDYDSHICEALGYLQSHLMNTRPSVSLG 122

Query: 130 MLLIIILSVPLSSALLLHRFFQIAAALLA 153
           +LL+I LSVPLSSA++LHRF  IAAALLA
Sbjct: 123 ILLLITLSVPLSSAVILHRFIDIAAALLA 151

BLAST of Cla97C01G006460 vs. ExPASy TrEMBL
Match: A0A6J1FMN3 (uncharacterized protein LOC111445338 OS=Cucurbita moschata OX=3662 GN=LOC111445338 PE=4 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 1.6e-48
Identity = 110/147 (74.83%), Postives = 122/147 (82.99%), Query Frame = 0

Query: 10  SISSTNSAAFL-LPPTPSGHRLRRLPVAVTIRASSR---SSADNNNNYYASGKVVDESMI 69
           SISS NSA FL   P  S  R RR P   T+ ASSR   S++ NNNNYYASGK+VDESMI
Sbjct: 3   SISSANSAPFLHFHPISSSSRRRRRPPLFTVMASSRGADSNSSNNNNYYASGKLVDESMI 62

Query: 70  VLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHLMNTRPSVALGML 129
           VLRKRIHEIKM E+SH+PP+DW DWEKR YS+YDSHICEALGYLQSHLMNTRPSVALG+L
Sbjct: 63  VLRKRIHEIKMVERSHEPPSDWLDWEKRCYSDYDSHICEALGYLQSHLMNTRPSVALGIL 122

Query: 130 LIIILSVPLSSALLLHRFFQIAAALLA 153
           L+I LSVPLSSA++LHRF  IAAALLA
Sbjct: 123 LLITLSVPLSSAVILHRFIDIAAALLA 149

BLAST of Cla97C01G006460 vs. ExPASy TrEMBL
Match: A0A6J1JRY7 (uncharacterized protein LOC111487016 OS=Cucurbita maxima OX=3661 GN=LOC111487016 PE=4 SV=1)

HSP 1 Score: 201.8 bits (512), Expect = 2.0e-48
Identity = 110/154 (71.43%), Postives = 126/154 (81.82%), Query Frame = 0

Query: 1   MKSINSIAFSISSTNSAAFLLPP-TPSGHRLRRLPVAVTIRASSRSSAD-NNNNYYASGK 60
           M SINS+  SI ST+SA  L PP +PS HR RR P A T+ ASSR     ++N+YYA GK
Sbjct: 1   MNSINSL-LSIPSTHSAPLLFPPISPSSHRRRRPPPAFTVTASSRPKEQADSNSYYADGK 60

Query: 61  VVDESMIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHLMNTRP 120
           +VDESMIVLRKRIHEIK AEQ+HDPP DWFDWEKR YSNYDS+ICEALGYLQSHLMNTRP
Sbjct: 61  LVDESMIVLRKRIHEIKTAEQTHDPPRDWFDWEKRCYSNYDSNICEALGYLQSHLMNTRP 120

Query: 121 SVALGMLLIIILSVPLSSALLLHRFFQIAAALLA 153
           SVALGML ++ LSVP+SSA++LHRF +IA  LLA
Sbjct: 121 SVALGMLALLTLSVPVSSAVVLHRFIEIAVGLLA 153

BLAST of Cla97C01G006460 vs. ExPASy TrEMBL
Match: A0A0A0KPI5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G571480 PE=4 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 4.5e-48
Identity = 112/159 (70.44%), Postives = 123/159 (77.36%), Query Frame = 0

Query: 1   MKSINSIAFSISSTNSAAFLLPPTPSGHRLRRLPVAVTIRASSRSSAD-------NNNNY 60
           M+SINS++FS  S      L  P    H  R+   AVT+RAS   +AD       NNNNY
Sbjct: 1   MESINSLSFSTPS------LFSPAAHPHVRRKPLPAVTVRASREQAADSNNNNKNNNNNY 60

Query: 61  YASGKVVDESMIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYSNYDSHICEALGYLQSHL 120
           YA GKVVDESMIVLRKRIHEIKMAEQ  +PPADW DWEKR+YS+YDSHICEALGYLQSHL
Sbjct: 61  YAGGKVVDESMIVLRKRIHEIKMAEQRQEPPADWLDWEKRWYSDYDSHICEALGYLQSHL 120

Query: 121 MNTRPSVALGMLLIIILSVPLSSALLLHRFFQIAAALLA 153
           MNTRPSVALGMLL+II+SVPLSSALLLHRF  IA ALLA
Sbjct: 121 MNTRPSVALGMLLLIIISVPLSSALLLHRFLHIAVALLA 153

BLAST of Cla97C01G006460 vs. TAIR 10
Match: AT2G01300.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15010.1); Has 73 Blast hits to 73 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 73; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 124.8 bits (312), Expect = 6.1e-29
Identity = 61/118 (51.69%), Postives = 93/118 (78.81%), Query Frame = 0

Query: 37  VTIRASSRSSADNNNNYYASGKVVDESMIVLRKRIHEIKMAEQSHDPPADWFDWEKRYYS 96
           +T+R S+ +S+   ++YY  G++VDE+MIVLRKRIHE+KM E++++PP+ W DWEKR+Y+
Sbjct: 34  MTMRVSAAASS-GKDHYYGGGRLVDENMIVLRKRIHEMKMVERNYEPPSHWMDWEKRFYN 93

Query: 97  NYDSHICEALGYLQSHLMNTRPSVALGMLLIIILSVPLSSALLLHRFFQIAAALLAAA 155
           +YDS IC+++G LQS LMN+RP+VA+  LL +++SVP+SS ++  R   +   LLAAA
Sbjct: 94  SYDSVICDSVGLLQSFLMNSRPTVAIATLLFLLVSVPVSSTVIAFRLIDLLHWLLAAA 150

BLAST of Cla97C01G006460 vs. TAIR 10
Match: AT1G15010.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G01300.1); Has 71 Blast hits to 71 proteins in 13 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 115.9 bits (289), Expect = 2.8e-26
Identity = 60/123 (48.78%), Postives = 83/123 (67.48%), Query Frame = 0

Query: 39  IRASSRSSADNNN--------NYYASGKVVDESMIVLRKRIHEIKMAEQSHDPPADWFDW 98
           +R S R +   N         +YY  G+ VDE+M+VLRKRIHE+KM E++ +PP+ W  W
Sbjct: 16  LRTSQRIATGTNRRRTTTVCADYYRGGRTVDENMVVLRKRIHEMKMVERNFEPPSHWMQW 75

Query: 99  EKRYYSNYDSHICEALGYLQSHLMNTRPSVALGMLLIIILSVPLSSALLLHRFFQIAAAL 154
           EKR+Y NYD+ IC+AL  LQ+ LMN+RPSVA G  L++++SVP+SSA+   R   +A  L
Sbjct: 76  EKRFYCNYDATICDALTLLQTFLMNSRPSVAFGTCLLLLVSVPVSSAVFAFRILDLALWL 135

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038875502.14.3e-6186.27uncharacterized protein LOC120067929 [Benincasa hispida][more]
KAA0046315.12.0e-5072.96uncharacterized protein E6C27_scaffold149G00240 [Cucumis melo var. makuwa][more]
XP_022993164.11.4e-4874.50uncharacterized protein LOC111489262 [Cucurbita maxima][more]
XP_022939420.13.2e-4874.83uncharacterized protein LOC111445338 [Cucurbita moschata][more]
XP_022989988.14.2e-4871.43uncharacterized protein LOC111487016 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5A7TY469.7e-5172.96Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A6J1JZG17.0e-4974.50uncharacterized protein LOC111489262 OS=Cucurbita maxima OX=3661 GN=LOC111489262... [more]
A0A6J1FMN31.6e-4874.83uncharacterized protein LOC111445338 OS=Cucurbita moschata OX=3662 GN=LOC1114453... [more]
A0A6J1JRY72.0e-4871.43uncharacterized protein LOC111487016 OS=Cucurbita maxima OX=3661 GN=LOC111487016... [more]
A0A0A0KPI54.5e-4870.44Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G571480 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G01300.16.1e-2951.69unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G15010.12.8e-2648.78unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33782:SF13MEDIATOR OF RNA POLYMERASE II TRANSCRIPTION SUBUNITcoord: 34..152
NoneNo IPR availablePANTHERPTHR33782OS01G0121600 PROTEINcoord: 34..152

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G006460.1Cla97C01G006460.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane