ClCG01G004970 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G004970
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: General transcription factor IIH subunit
Location: CG_Chr01: 5263727 .. 5267400 (+)
RNA-Seq Expression: ClCG01G004970
Synteny: ClCG01G004970
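
The location field above packs the chromosome, start, end and strand into one string. A minimal sketch of how it could be split and turned into a span length follows. The helper name parse_location is hypothetical, and the length assumes the coordinates are 1-based and inclusive (as in GFF-style annotations), which this page does not state.

```python
import re

def parse_location(text):
    """Split a string like 'CG_Chr01: 5263727 .. 5267400 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("CG_Chr01: 5263727 .. 5267400 (+)")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 3674 bp on the plus strand of CG_Chr01.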
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTGCTCTTACTCTTTAGGGCTGAAATCATGAAATCATGGAAAGCTTAAAGTTGAAAAGAGGGTGAAAGCGCAGCAGCCTTAACAAGCCGTGGGTCACTGGGTGTTTTCCATGTCTTGGTTTTGAGGCTGAATCGCGCCGCCGCGGGTCTTGCTGTAGAGGACTCTAACACGCCGTCGTTCGTATCAGGTCAGTGGCGGTCGGGGCGCGCTGGTCTCGGTTTTCAAGGGCAGTTTCCGGCGGTTTATCTCCGCGCCGGTGTGAGTCTCGGGTTAGAGACTGTGAAGCACGCAGTGGGTTTCCCATTTTAGCAATCAAACGGTAACCTTGAGGTTGATATAAAGATTGTGGGTATATTTCAAGGTTGTTGATTTCTGTTGTGCTGGTGTTTTGGGCGTTTGAATCCGTTTTACTTCTTCATTGTCGATTCCTCATCACACCTACCTACTGTAACTCTTAGGTCCCCAAGTTGTGATTATTCTCTGTGTAGAGCTTGTTTTAGGTATTGGGGTTTCATCAATTTGATCTTGTTTCAAAACCTTCGAGCGTAGCTTCTGAAAAAGTCCTTAAATATGAACAATGGCGAAAATATGAGATTGAATGGGGAAGCCGATGAAGAAGACGATGATGACGATGCCAATGGTGGACTTGCTGCGTGGGAAAGGACTTACGCAGATGATAGGTCGTGGGAAGCCTTGCAAGAAGACGAGTCTGGACTCCTTCGCCCGATCGACAATAAGGCAATTTACCATGCTCAGTATCGAAGGCGCCTTCGCTCCCTTTCTTCCTTAGCAACCACTGCTCGAATTCAGAAGGGTCTTATTCGTTATCTCTATATCATCATTGACTTCTCTAGGGTACTCTTTTTCCTTCCATGACTCTATTACTACATTTGTTAATGTTCCGTCTGTTTGATTTATTAGGTTCTGTGTCTTTGCAATTGCTTCTAATCGTCTTTTTGAGAAAAGGGATTTGTTGTAATGTTCATCAGGTTAAATGCATAGTATCTCTGCTGAAATATTTCATAACCATCAAATTGGATTGGTAAATTTATAGTTTTGCTGTTCTTAGCTTACCCAATAGCTGTAATGTGGTCTAATATCGATGTCTTCCTACTTGTTTGTTCTCTGATTATGCTTGAAAGGGTCTGGAGAAGTATCTCCACAGTTATTCCATAGAGGATTGGTCCTCATTTAATCCAACCACTATGTAGTGAATGAAAGATGTGCTAACTTGAGTTTTGTTTAGAGCAAGTATGATTTCTGTTCATAGAAACTGTGAATGATAGTGGAGATGAAGAATTTTGAATGGATGGTGATGGAATAGAGGTTTTGCTTTTTTGATTTCTCTTATTTTGCTTGTCCCACCATGTGTAGGCAGCTACAGAAATGGATTTTCGACCAAGTCGAATGGCTGTTGTGGCAAAACATGTAGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCACTCAGTCAGATTGGTTTGGTAACTATGAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGGTGGAAGTCCTGAATCCCATGTTAAAGCATTAATGGGCAAACTGGAATGCTCAGGTGATGCATCCTTGCAGAACGGTCTGGAACTTGTCCACGGCTATCTGAATCAAATTCCATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTTAATTCTTGTGATCCTGGGGATATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCAGTAATTGGTCTTACTGCAGAAATTTTTATATGCAGACATCTCTGCCAAGAAACTGGTGGCTCATACTCTGTCGCATTGGATGAGGTGAGTTTAACAGAATGTGCGAATTTCCCAAAATAGAGAGAAATTGAAATGGCTATTTTTTTAGTTGGCCAGCTAATTGAACTGAGTACAGATTACAAAGGAAGCTTAACAAAGTACACAGTTATTGAAACAGAAGTTAGTAAAATTACTATAGCAAGATTTT
ATTTGATGTTTCTATATATGGAAGTTCTGTTTCCTAAATCTTTATGTAATACTTGCCTTTTATATACAGTCCCACTTCAAAGAGTTGCTATTGGAGCATGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTAATCAAGATGGGCTTCCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTCACAAGGAAGCTAAAGCTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTCCCCACAGAGTGTCGAATTTGTGGATTGACACTGATCTCCTCGCCCCATTTGGCTAGGTCATATCACCATCTCTTTCCAATTATACCATTTGATGAAGTCACTGATAAAGTACTTCATGATCCACGACATCAATCTCCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTCATGAATCCTGGTAAGAACTCAAATAAACATGATATTTGGAAGCTTTCAAATGAAACTTCTTTTTATGGCATTTGTTGTCTTCTAATAATATAATGCACATGTCTAATTAGTCAGAATTTTGTGTCAGAGGATATCTTAATATCGGATGAAGTGAAAATTAAGAAAAGATTTATCGTCTTTCCTACTGGGAGTTTGTTAATGCCCTGTGATGTTCAACTTCTGAATCAAAGAATGTTGTGTATAAGAAAACTTCCATGTTTCCTGTTCGTGCTTTGGTCATTTTTCAGTACTAATTGTGTACACCACCAGTACTGCAATCTGCTGAAACAGTGTTATCAATGGGAATCGATTTTGCCTTTTGTGTTGGGTGATTTCTTTTTATTATTCATTGTGCATATGTGATTCTGTTTAAGGCAATAGGAACTACTTGTACCTGTGACATGCAATTACGAATAAATATATGGATGATCTTTTGGCCCTTTGTAATTTGCTGATTTGAAATTTTGACCCTACAATTACTGTGTACTCATTCAAATGTGCTTTTCTAGGCACAGGTAATAGCCCAGGCATTCGTGTTTCTTGCCCAAAGTGCAAACAACACTTTTGTCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGTCCTGGCTGTGAGAGTTTTAGGCGTCCAAAATTGTTGACTTCTGACGAATGAACGTCTACTTTCGGATGCAACCATGTGGCCCCAACCGAATCCAAATTCAATCTTCAATCTGACCGTGACTCTGCTCTGTCTGTAATGAACTTTGTACCACTATTTATAAAGTTTCAAGCATGTCTACCAATGATGGTGAATTCAAGGCTCTCATTGTTACCGCGGAGTCGTCTTTCGTGTTTCTCGACAGCTCCTAAACCAACCTCAAAGAGATCCAAAGCTTGTGGCTTTCTGACACTTGATGGCATCGTTGAAATTGGCTAGAAGATTAATTTTTCAAGATGAAAGGCTGAAACTTACTTTCCTTCTGGGCTGCAGATGGCTGATATTACTTGGAGTGTTACCCAAAATTGACCCCCCAATGGACAATTTCCATTTTCATGAGATTTTGTAGGAACTTGATATTAATTCTTGTCTGACATTTCAGATGTATTTGACATTCATATTCTTTTAGTTAAGTTATTAAAAGAGCTTGTGCCAGAAACTTCAGTGTGCCTACTATTAACAATAAACAATTATATGTACCAATGTTGTAAATG

mRNA sequence

CTTGCTCTTACTCTTTAGGGCTGAAATCATGAAATCATGGAAAGCTTAAAGTTGAAAAGAGGGTGAAAGCGCAGCAGCCTTAACAAGCCGTGGGTCACTGGGTGTTTTCCATGTCTTGGTTTTGAGGCTGAATCGCGCCGCCGCGGGTCTTGCTGTAGAGGACTCTAACACGCCGTCGTTCGTATCAGGTCAGTGGCGGTCGGGGCGCGCTGGTCTCGGTTTTCAAGGGCAGTTTCCGGCGGTTTATCTCCGCGCCGGTGTGAGTCTCGGGTTAGAGACTGTGAAGCACGCAGTGGGTTTCCCATTTTAGCAATCAAACGGTAACCTTGAGGTTGATATAAAGATTGTGGGTATATTTCAAGGTTGTTGATTTCTGTTGTGCTGGTGTTTTGGGCGTTTGAATCCGTTTTACTTCTTCATTGTCGATTCCTCATCACACCTACCTACTGTAACTCTTAGGTCCCCAAGTTGTGATTATTCTCTGTGTAGAGCTTGTTTTAGGTATTGGGGTTTCATCAATTTGATCTTGTTTCAAAACCTTCGAGCGTAGCTTCTGAAAAAGTCCTTAAATATGAACAATGGCGAAAATATGAGATTGAATGGGGAAGCCGATGAAGAAGACGATGATGACGATGCCAATGGTGGACTTGCTGCGTGGGAAAGGACTTACGCAGATGATAGGTCGTGGGAAGCCTTGCAAGAAGACGAGTCTGGACTCCTTCGCCCGATCGACAATAAGGCAATTTACCATGCTCAGTATCGAAGGCGCCTTCGCTCCCTTTCTTCCTTAGCAACCACTGCTCGAATTCAGAAGGGTCTTATTCGTTATCTCTATATCATCATTGACTTCTCTAGGGCAGCTACAGAAATGGATTTTCGACCAAGTCGAATGGCTGTTGTGGCAAAACATGTAGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCACTCAGTCAGATTGGTTTGGTAACTATGAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGGTGGAAGTCCTGAATCCCATGTTAAAGCATTAATGGGCAAACTGGAATGCTCAGGTGATGCATCCTTGCAGAACGGTCTGGAACTTGTCCACGGCTATCTGAATCAAATTCCATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTTAATTCTTGTGATCCTGGGGATATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCAGTAATTGGTCTTACTGCAGAAATTTTTATATGCAGACATCTCTGCCAAGAAACTGGTGGCTCATACTCTGTCGCATTGGATGAGTCCCACTTCAAAGAGTTGCTATTGGAGCATGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTAATCAAGATGGGCTTCCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTCACAAGGAAGCTAAAGCTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTCCCCACAGAGTGTCGAATTTGTGGATTGACACTGATCTCCTCGCCCCATTTGGCTAGGTCATATCACCATCTCTTTCCAATTATACCATTTGATGAAGTCACTGATAAAGTACTTCATGATCCACGACATCAATCTCCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTCATGAATCCTGGCACAGGTAATAGCCCAGGCATTCGTGTTTCTTGCCCAAAGTGCAAACAACACTTTTGTCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGTCCTGGCTGTGAGAGTTTTAGGCGTCCAAAATTGTTGACTTCTGACGAATGAACGTCTACTTTCGGATGCAACCATGTGGCCCCAACCGAATCCAAATTCAATCTTCAATCTGACCGTGACTCTGCTCTGTCTGTAATGAACTTTGTACCACTATTTATAAAGTTTCAAGCATGTCTACCAATGATGGTGAATTCAAGGCTCTCAT
TGTTACCGCGGAGTCGTCTTTCGTGTTTCTCGACAGCTCCTAAACCAACCTCAAAGAGATCCAAAGCTTGTGGCTTTCTGACACTTGATGGCATCGTTGAAATTGGCTAGAAGATTAATTTTTCAAGATGAAAGGCTGAAACTTACTTTCCTTCTGGGCTGCAGATGGCTGATATTACTTGGAGTGTTACCCAAAATTGACCCCCCAATGGACAATTTCCATTTTCATGAGATTTTGTAGGAACTTGATATTAATTCTTGTCTGACATTTCAGATGTATTTGACATTCATATTCTTTTAGTTAAGTTATTAAAAGAGCTTGTGCCAGAAACTTCAGTGTGCCTACTATTAACAATAAACAATTATATGTACCAATGTTGTAAATG

Coding sequence (CDS)

ATGAACAATGGCGAAAATATGAGATTGAATGGGGAAGCCGATGAAGAAGACGATGATGACGATGCCAATGGTGGACTTGCTGCGTGGGAAAGGACTTACGCAGATGATAGGTCGTGGGAAGCCTTGCAAGAAGACGAGTCTGGACTCCTTCGCCCGATCGACAATAAGGCAATTTACCATGCTCAGTATCGAAGGCGCCTTCGCTCCCTTTCTTCCTTAGCAACCACTGCTCGAATTCAGAAGGGTCTTATTCGTTATCTCTATATCATCATTGACTTCTCTAGGGCAGCTACAGAAATGGATTTTCGACCAAGTCGAATGGCTGTTGTGGCAAAACATGTAGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCACTCAGTCAGATTGGTTTGGTAACTATGAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGGTGGAAGTCCTGAATCCCATGTTAAAGCATTAATGGGCAAACTGGAATGCTCAGGTGATGCATCCTTGCAGAACGGTCTGGAACTTGTCCACGGCTATCTGAATCAAATTCCATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTTAATTCTTGTGATCCTGGGGATATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCAGTAATTGGTCTTACTGCAGAAATTTTTATATGCAGACATCTCTGCCAAGAAACTGGTGGCTCATACTCTGTCGCATTGGATGAGTCCCACTTCAAAGAGTTGCTATTGGAGCATGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTAATCAAGATGGGCTTCCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTCACAAGGAAGCTAAAGCTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTCCCCACAGAGTGTCGAATTTGTGGATTGACACTGATCTCCTCGCCCCATTTGGCTAGGTCATATCACCATCTCTTTCCAATTATACCATTTGATGAAGTCACTGATAAAGTACTTCATGATCCACGACATCAATCTCCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTCATGAATCCTGGCACAGGTAATAGCCCAGGCATTCGTGTTTCTTGCCCAAAGTGCAAACAACACTTTTGTCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGTCCTGGCTGTGAGAGTTTTAGGCGTCCAAAATTGTTGACTTCTGACGAATGA

Protein sequence

MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLLTSDE
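
The protein sequence above is the conceptual translation of the CDS. As an illustrative sketch, the snippet below translates the first and last codons of the CDS with the standard genetic code, assuming that is the table the annotation used. The function name translate is hypothetical, and only short excerpts of the CDS are used to keep the example small.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 13 codons and last 5 codons of the CDS listed above.
cds_start = "ATGAACAATGGCGAAAATATGAGATTGAATGGGGAAGCC"
cds_end = "ACTTCTGACGAATGA"

print(translate(cds_start))  # MNNGENMRLNGEA
print(translate(cds_end))    # TSDE*
```

Both excerpts agree with the start (MNNGENMRLNGEA) and end (TSDE) of the protein sequence shown above; the final TGA codon is the stop.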
Homology
BLAST of ClCG01G004970 vs. NCBI nr
Match: XP_038874496.1 (general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida])

HSP 1 Score: 863.2 bits (2229), Expect = 9.2e-247
Identity = 413/424 (97.41%), Positives = 419/424 (98.82%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGEN RLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH
Sbjct: 1   MNNGENRRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLRSLSSLATTARIQKGLIRYLYI+IDFSRAATEMDFRPSRMAVVAKHVEAFVRE
Sbjct: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHG+L
Sbjct: 121 FFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGFL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG
Sbjct: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Sbjct: 241 GSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKVLHDPR+Q 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRNQL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPK  
Sbjct: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSA 420

Query: 421 TSDE 425
           TSDE
Sbjct: 421 TSDE 424
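
Each HSP summary reports identity and positive counts as a fraction of the alignment length. A minimal sketch of reading those figures from a summary line and recomputing the percentages is given here. The helper names parse_hsp_stats and percent are hypothetical, and the regular expression matches only the "n/m (p%)" pattern, so it does not depend on how the field labels are spelled.

```python
import re

# Matches fractions such as "413/424 (97.41%)".
FRACTION = re.compile(r"(\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")

def parse_hsp_stats(line):
    """Return (count, alignment_length, reported_percent) for each fraction."""
    return [
        (int(count), int(length), float(pct))
        for count, length, pct in FRACTION.findall(line)
    ]

def percent(count, length):
    """Recompute a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)

line = "Identity = 413/424 (97.41%), Positives = 419/424 (98.82%), Query Frame = 0"
identity, positives = parse_hsp_stats(line)

print(percent(identity[0], identity[1]))    # 97.41
print(percent(positives[0], positives[1]))  # 98.82
```

The recomputed values match the percentages reported for the Benincasa hispida hit, which is a quick consistency check when such reports are copied or reformatted.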

BLAST of ClCG01G004970 vs. NCBI nr
Match: KAA0055121.1 (general transcription factor IIH subunit 2 [Cucumis melo var. makuwa] >TYK20738.1 general transcription factor IIH subunit 2 [Cucumis melo var. makuwa])

HSP 1 Score: 849.7 bits (2194), Expect = 1.1e-242
Identity = 408/424 (96.23%), Positives = 414/424 (97.64%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYH
Sbjct: 1   MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEMDFRPSRMAVVAKHVEAFVRE
Sbjct: 61  AQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YL
Sbjct: 121 FFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVL+LYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG
Sbjct: 181 NQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Sbjct: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFR PKL 
Sbjct: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRHPKLA 420

Query: 421 TSDE 425
           T DE
Sbjct: 421 TFDE 423

BLAST of ClCG01G004970 vs. NCBI nr
Match: XP_004143721.1 (general transcription factor IIH subunit 2 [Cucumis sativus] >XP_031741116.1 general transcription factor IIH subunit 2 [Cucumis sativus] >KGN50372.1 hypothetical protein Csa_000481 [Cucumis sativus])

HSP 1 Score: 846.7 bits (2186), Expect = 8.9e-242
Identity = 406/424 (95.75%), Positives = 414/424 (97.64%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH
Sbjct: 1   MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEMDFRPSRMAVVAKHV+AFVRE
Sbjct: 61  AQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YL
Sbjct: 121 FFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVL+LYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG
Sbjct: 181 NQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Sbjct: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKL 
Sbjct: 361 PKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLA 420

Query: 421 TSDE 425
           TSDE
Sbjct: 421 TSDE 423

BLAST of ClCG01G004970 vs. NCBI nr
Match: XP_008467294.1 (PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo])

HSP 1 Score: 846.7 bits (2186), Expect = 8.9e-242
Identity = 407/424 (95.99%), Positives = 413/424 (97.41%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYH
Sbjct: 1   MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEMDFRPSRMAVVAKHVEAFVRE
Sbjct: 61  AQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YL
Sbjct: 121 FFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVL+LYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG
Sbjct: 181 NQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Sbjct: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQESLMNPGT NSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFR PKL 
Sbjct: 361 PKVCFGCQESLMNPGTRNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRHPKLA 420

Query: 421 TSDE 425
           T DE
Sbjct: 421 TFDE 423

BLAST of ClCG01G004970 vs. NCBI nr
Match: XP_022949453.1 (general transcription factor IIH subunit 2 [Cucurbita moschata] >XP_023525764.1 general transcription factor IIH subunit 2 [Cucurbita pepo subsp. pepo] >KAG6607180.1 General transcription factor IIH subunit 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 839.7 bits (2168), Expect = 1.1e-239
Identity = 399/424 (94.10%), Positives = 412/424 (97.17%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGEN RLNGEA+EEDDDDDANGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+H
Sbjct: 1   MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLRSLSSLATTARIQKGLIRYLY++IDFSRAA EMDFRPSRMAVVAKHVEAFVRE
Sbjct: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV GYL
Sbjct: 121 FFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETG
Sbjct: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAK G
Sbjct: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DK+ HDPRHQ 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPK  
Sbjct: 361 PKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSA 420

Query: 421 TSDE 425
           TSD+
Sbjct: 421 TSDD 424

BLAST of ClCG01G004970 vs. ExPASy Swiss-Prot
Match: Q9ZVN9 (General transcription factor IIH subunit 2 OS=Arabidopsis thaliana OX=3702 GN=GTF2H2 PE=1 SV=1)

HSP 1 Score: 662.5 bits (1708), Expect = 3.1e-189
Identity = 318/417 (76.26%), Positives = 355/417 (85.13%), Query Frame = 0

Query: 3   NGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQ 62
           + +  R N E +EEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQ
Sbjct: 2   SNQRKRSNDEREEEDDEDAE--GIGEWERAYVDDRSWEELQEDESGLLRPIDNSAIYHAQ 61

Query: 63  YRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFF 122
           YRRRLR LS+ A   RIQKGLIRYLYI+IDFSRAA EMDFRPSRMA++AKHVEAF+REFF
Sbjct: 62  YRRRLRMLSAAAAGTRIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAIMAKHVEAFIREFF 121

Query: 123 DQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQ 182
           DQNPLSQIGLV++K+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ
Sbjct: 122 DQNPLSQIGLVSIKNGVAHTLTDLGGSPETHIKALMGKLEALGDSSLQNALELVHEHLNQ 181

Query: 183 IPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGS 242
           +PSYGHREVLILYSAL +CDPGDIMET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG 
Sbjct: 182 VPSYGHREVLILYSALCTCDPGDIMETIQKCKKSKLRCSVIGLSAEMFICKHLCQETGGL 241

Query: 243 YSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAGGG 302
           YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K G G
Sbjct: 242 YSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGFPQRAAEGSMAICSCHKEVKIGAG 301

Query: 303 YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTD-KVLHDPRHQSP 362
           Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV     L+D R +  
Sbjct: 302 YMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHLFPIAPFDEVPALSSLNDNRRKLG 361

Query: 363 KVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPK 419
           K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYIHESLHNCPGCES  RPK
Sbjct: 362 KSCFGCQQSLI--GAGNKPVPCVTCRKCKHYFCLDCDIYIHESLHNCPGCESIHRPK 414

BLAST of ClCG01G004970 vs. ExPASy Swiss-Prot
Match: Q2TBV5 (General transcription factor IIH subunit 2 OS=Bos taurus OX=9913 GN=GTF2H2 PE=2 SV=1)

HSP 1 Score: 333.2 bits (853), Expect = 4.3e-90
Identity = 164/396 (41.41%), Positives = 242/396 (61.11%), Query Frame = 0

Query: 29  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLY 88
           WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY
Sbjct: 11  WEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------HGQVRLGMMRHLY 70

Query: 89  IIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGG 148
           +++D SR   + D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G
Sbjct: 71  VVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSG 130

Query: 149 SPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDI 208
           +P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I
Sbjct: 131 NPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNI 190

Query: 209 METVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAI 268
            + ++  K +KIR S+IGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA 
Sbjct: 191 YDLIKSLKAAKIRVSIIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPAS 250

Query: 269 ADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKAG---GGYTCPRCKARVCEL 328
           ++S   +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Sbjct: 251 SNSEC-SLIRMGFPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCEL 310

Query: 329 PTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPG 388
           P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L +  
Sbjct: 311 PVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLE-----EHNGERFCYACQGELKDQH 370

Query: 389 TGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC 412
                     C  C+  FC+DCD+++H+SLH CPGC
Sbjct: 371 V-------YVCSVCQNVFCVDCDVFVHDSLHCCPGC 385

BLAST of ClCG01G004970 vs. ExPASy Swiss-Prot
Match: Q13888 (General transcription factor IIH subunit 2 OS=Homo sapiens OX=9606 GN=GTF2H2 PE=1 SV=1)

HSP 1 Score: 331.3 bits (848), Expect = 1.7e-89
Identity = 165/396 (41.67%), Positives = 243/396 (61.36%), Query Frame = 0

Query: 29  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLY 88
           WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY
Sbjct: 11  WEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------HGQVRLGMMRHLY 70

Query: 89  IIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGG 148
           +++D SR   + D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G
Sbjct: 71  VVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSG 130

Query: 149 SPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDI 208
           +P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I
Sbjct: 131 NPRKHITSLKKAVDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLIIFSSLTTCDPSNI 190

Query: 209 METVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAI 268
            + ++  K +KIR SVIGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA 
Sbjct: 191 YDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPA- 250

Query: 269 ADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKAG---GGYTCPRCKARVCEL 328
           + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Sbjct: 251 SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCEL 310

Query: 329 PTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPG 388
           P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      +   + C+GCQ  L +  
Sbjct: 311 PVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLE-----EYNGERFCYGCQGELKDQH 370

Query: 389 TGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC 412
                     C  C+  FC+DCD+++H+SLH CPGC
Sbjct: 371 V-------YVCAVCQNVFCVDCDVFVHDSLHCCPGC 385

BLAST of ClCG01G004970 vs. ExPASy Swiss-Prot
Match: Q6P1K8 (General transcription factor IIH subunit 2-like protein OS=Homo sapiens OX=9606 GN=GTF2H2C PE=1 SV=1)

HSP 1 Score: 330.9 bits (847), Expect = 2.2e-89
Identity = 165/396 (41.67%), Positives = 243/396 (61.36%), Query Frame = 0

Query: 29  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLY 88
           WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY
Sbjct: 11  WEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------HGQVRLGMMRHLY 70

Query: 89  IIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGG 148
           +++D SR   + D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G
Sbjct: 71  VVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSG 130

Query: 149 SPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDI 208
           +P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I
Sbjct: 131 NPRKHITSLKEAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNI 190

Query: 209 METVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAI 268
            + ++  K +KIR SVIGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA 
Sbjct: 191 YDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHLSPPPA- 250

Query: 269 ADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKAG---GGYTCPRCKARVCEL 328
           + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Sbjct: 251 SSSSECSLIRMGFPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCEL 310

Query: 329 PTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPG 388
           P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      +   + C+GCQ  L +  
Sbjct: 311 PVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLE-----EYNGERFCYGCQGELKDQH 370

Query: 389 TGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC 412
                     C  C+  FC+DCD+++H+SLH CPGC
Sbjct: 371 V-------YVCAVCQNVFCVDCDVFVHDSLHCCPGC 385

BLAST of ClCG01G004970 vs. ExPASy Swiss-Prot
Match: A0JN27 (General transcription factor IIH subunit 2 OS=Rattus norvegicus OX=10116 GN=Gtf2h2 PE=1 SV=1)

HSP 1 Score: 329.7 bits (844), Expect = 4.8e-89
Identity = 165/397 (41.56%), Positives = 245/397 (61.71%), Query Frame = 0

Query: 29  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLY 88
           WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY
Sbjct: 11  WEGGY--ERTWEILKEDESGSLKATIEDILFKAKRKRVFEH------HGQVRLGMMRHLY 70

Query: 89  IIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGG 148
           +++D SR   + D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G
Sbjct: 71  VVVDGSRTMEDQDLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSG 130

Query: 149 SPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDI 208
           +P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I
Sbjct: 131 NPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNI 190

Query: 209 METVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAI 268
            + ++  KT+KIR SVIGL+AE+ +C  L +ETGG+Y V LDE+H+KELL  H  PPPA 
Sbjct: 191 YDLIKTLKTAKIRVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLARHVSPPPAS 250

Query: 269 ADSAMPNLIKMGFPQRA--------AESSIAICSC-HKEAKAG---GGYTCPRCKARVCE 328
           + S   +LI+MGFPQ          A+ S ++    +   + G   GGY CP+C+A+ CE
Sbjct: 251 SGSEC-SLIRMGFPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLGGYFCPQCRAKYCE 310

Query: 329 LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNP 388
           LP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      ++  + C+GCQ  L + 
Sbjct: 311 LPVECKICGLTLVSAPHLARSYHHLFPLDAFQEIPLE-----EYKGERFCYGCQGELKDQ 370

Query: 389 GTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC 412
                      C  C+  FC+DCD+++H+SLH CPGC
Sbjct: 371 HV-------YVCTVCRNVFCVDCDVFVHDSLHCCPGC 386

BLAST of ClCG01G004970 vs. ExPASy TrEMBL
Match: A0A5A7UNG8 (General transcription factor IIH subunit OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold480G001400 PE=3 SV=1)

HSP 1 Score: 849.7 bits (2194), Expect = 5.1e-243
Identity = 408/424 (96.23%), Positives = 414/424 (97.64%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYH
Sbjct: 1   MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEMDFRPSRMAVVAKHVEAFVRE
Sbjct: 61  AQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YL
Sbjct: 121 FFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVL+LYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG
Sbjct: 181 NQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Sbjct: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFR PKL 
Sbjct: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRHPKLA 420

Query: 421 TSDE 425
           T DE
Sbjct: 421 TFDE 423

BLAST of ClCG01G004970 vs. ExPASy TrEMBL
Match: A0A0A0KPM4 (General transcription factor IIH subunit OS=Cucumis sativus OX=3659 GN=Csa_5G169080 PE=3 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 4.3e-242
Identity = 406/424 (95.75%), Positives = 414/424 (97.64%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH
Sbjct: 1   MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEMDFRPSRMAVVAKHV+AFVRE
Sbjct: 61  AQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YL
Sbjct: 121 FFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVL+LYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG
Sbjct: 181 NQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Sbjct: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKL 
Sbjct: 361 PKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLA 420

Query: 421 TSDE 425
           TSDE
Sbjct: 421 TSDE 423

BLAST of ClCG01G004970 vs. ExPASy TrEMBL
Match: A0A1S3CUH8 (General transcription factor IIH subunit OS=Cucumis melo OX=3656 GN=LOC103504674 PE=3 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 4.3e-242
Identity = 407/424 (95.99%), Positives = 413/424 (97.41%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYH
Sbjct: 1   MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEMDFRPSRMAVVAKHVEAFVRE
Sbjct: 61  AQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YL
Sbjct: 121 FFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVL+LYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG
Sbjct: 181 NQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Sbjct: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQESLMNPGT NSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFR PKL 
Sbjct: 361 PKVCFGCQESLMNPGTRNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRHPKLA 420

Query: 421 TSDE 425
           T DE
Sbjct: 421 TFDE 423
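
The percentages in an HSP summary line follow directly from the match counts divided by the alignment length. The following is a minimal Python sketch of that arithmetic, assuming the summary line has the layout shown in the alignments above. The function name `parse_hsp_counts` is hypothetical and is not part of any BLAST tool.

```python
import re

# Matches fields such as "Identity = 407/424"; both the label spelling
# "Postives" (as some BLAST report pages print it) and "Positives" are accepted.
_FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp_counts(line):
    """Return {label: (matches, alignment_length, percent)} for one summary line.

    Hypothetical helper for illustration only.
    """
    out = {}
    for label, num, den in _FIELD.findall(line):
        key = "identity" if label == "Identity" else "positives"
        n, d = int(num), int(den)
        # Percent is recomputed from the counts, rounded to two decimals.
        out[key] = (n, d, round(100.0 * n / d, 2))
    return out


# Summary line of the Cucumis melo (A0A1S3CUH8) hit, counts copied from the report.
line = "Identity = 407/424 (95.99%), Positives = 413/424 (97.41%), Query Frame = 0"
stats = parse_hsp_counts(line)
print(stats["identity"])
print(stats["positives"])
```

Recomputing the percentage from the counts is a quick consistency check on a copied or re-typed report line.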

BLAST of ClCG01G004970 vs. ExPASy TrEMBL
Match: A0A6J1GC53 (General transcription factor IIH subunit OS=Cucurbita moschata OX=3662 GN=LOC111452792 PE=3 SV=1)

HSP 1 Score: 839.7 bits (2168), Expect = 5.3e-240
Identity = 399/424 (94.10%), Positives = 412/424 (97.17%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGEN RLNGEA+EEDDDDDANGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+H
Sbjct: 1   MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLRSLSSLATTARIQKGLIRYLY++IDFSRAA EMDFRPSRMAVVAKHVEAFVRE
Sbjct: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV GYL
Sbjct: 121 FFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETG
Sbjct: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAK G
Sbjct: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DK+ HDPRHQ 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPK  
Sbjct: 361 PKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSA 420

Query: 421 TSDE 425
           TSD+
Sbjct: 421 TSDD 424

BLAST of ClCG01G004970 vs. ExPASy TrEMBL
Match: A0A6J1DUE2 (General transcription factor IIH subunit OS=Momordica charantia OX=3673 GN=LOC111024539 PE=3 SV=1)

HSP 1 Score: 839.3 bits (2167), Expect = 6.9e-240
Identity = 401/424 (94.58%), Positives = 412/424 (97.17%), Query Frame = 0

Query: 1   MNNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60
           MNNGE  RLNGEADEEDDDDD NGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH
Sbjct: 1   MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYH 60

Query: 61  AQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVRE 120
           AQYRRRLRSLSS+ATTARIQKGLIRYLYI+IDFSRAA EMDFRPSRMAVVAKHVEAFVRE
Sbjct: 61  AQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVRE 120

Query: 121 FFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYL 180
           FFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV GYL
Sbjct: 121 FFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYL 180

Query: 181 NQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240
           NQIPSYGHREVLILYSALNSCDPGDIMET+QKCKTSKIRCSVIGLTAEIFICRHLCQETG
Sbjct: 181 NQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETG 240

Query: 241 GSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG 300
           GSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Sbjct: 241 GSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG 300

Query: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQS 360
           GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKVL+DPRH+ 
Sbjct: 301 GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRL 360

Query: 361 PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLL 420
           PKVCFGCQESLMN GTGNS GIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPK  
Sbjct: 361 PKVCFGCQESLMNSGTGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSA 420

Query: 421 TSDE 425
            S+E
Sbjct: 421 ASNE 424

BLAST of ClCG01G004970 vs. TAIR 10
Match: AT1G05055.1 (general transcription factor II H2 )

HSP 1 Score: 662.5 bits (1708), Expect = 2.2e-190
Identity = 318/417 (76.26%), Positives = 355/417 (85.13%), Query Frame = 0

Query: 3   NGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQ 62
           + +  R N E +EEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQ
Sbjct: 2   SNQRKRSNDEREEEDDEDAE--GIGEWERAYVDDRSWEELQEDESGLLRPIDNSAIYHAQ 61

Query: 63  YRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFF 122
           YRRRLR LS+ A   RIQKGLIRYLYI+IDFSRAA EMDFRPSRMA++AKHVEAF+REFF
Sbjct: 62  YRRRLRMLSAAAAGTRIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAIMAKHVEAFIREFF 121

Query: 123 DQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQ 182
           DQNPLSQIGLV++K+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ
Sbjct: 122 DQNPLSQIGLVSIKNGVAHTLTDLGGSPETHIKALMGKLEALGDSSLQNALELVHEHLNQ 181

Query: 183 IPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGS 242
           +PSYGHREVLILYSAL +CDPGDIMET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG 
Sbjct: 182 VPSYGHREVLILYSALCTCDPGDIMETIQKCKKSKLRCSVIGLSAEMFICKHLCQETGGL 241

Query: 243 YSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAGGG 302
           YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K G G
Sbjct: 242 YSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGFPQRAAEGSMAICSCHKEVKIGAG 301

Query: 303 YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTD-KVLHDPRHQSP 362
           Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV     L+D R +  
Sbjct: 302 YMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHLFPIAPFDEVPALSSLNDNRRKLG 361

Query: 363 KVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPK 419
           K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYIHESLHNCPGCES  RPK
Sbjct: 362 KSCFGCQQSLI--GAGNKPVPCVTCRKCKHYFCLDCDIYIHESLHNCPGCESIHRPK 414

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038874496.1 | 9.2e-247 | 97.41 | general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida]
KAA0055121.1 | 1.1e-242 | 96.23 | general transcription factor IIH subunit 2 [Cucumis melo var. makuwa] >TYK20738....
XP_004143721.1 | 8.9e-242 | 95.75 | general transcription factor IIH subunit 2 [Cucumis sativus] >XP_031741116.1 gen...
XP_008467294.1 | 8.9e-242 | 95.99 | PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo]
XP_022949453.1 | 1.1e-239 | 94.10 | general transcription factor IIH subunit 2 [Cucurbita moschata] >XP_023525764.1 ...

Match Name | E-value | Identity | Description
Q9ZVN9 | 3.1e-189 | 76.26 | General transcription factor IIH subunit 2 OS=Arabidopsis thaliana OX=3702 GN=GT...
Q2TBV5 | 4.3e-90 | 41.41 | General transcription factor IIH subunit 2 OS=Bos taurus OX=9913 GN=GTF2H2 PE=2 ...
Q13888 | 1.7e-89 | 41.67 | General transcription factor IIH subunit 2 OS=Homo sapiens OX=9606 GN=GTF2H2 PE=...
Q6P1K8 | 2.2e-89 | 41.67 | General transcription factor IIH subunit 2-like protein OS=Homo sapiens OX=9606 ...
A0JN27 | 4.8e-89 | 41.56 | General transcription factor IIH subunit 2 OS=Rattus norvegicus OX=10116 GN=Gtf2...

Match Name | E-value | Identity | Description
A0A5A7UNG8 | 5.1e-243 | 96.23 | General transcription factor IIH subunit OS=Cucumis melo var. makuwa OX=1194695 ...
A0A0A0KPM4 | 4.3e-242 | 95.75 | General transcription factor IIH subunit OS=Cucumis sativus OX=3659 GN=Csa_5G169...
A0A1S3CUH8 | 4.3e-242 | 95.99 | General transcription factor IIH subunit OS=Cucumis melo OX=3656 GN=LOC103504674...
A0A6J1GC53 | 5.3e-240 | 94.10 | General transcription factor IIH subunit OS=Cucurbita moschata OX=3662 GN=LOC111...
A0A6J1DUE2 | 6.9e-240 | 94.58 | General transcription factor IIH subunit OS=Momordica charantia OX=3673 GN=LOC11...

Match Name | E-value | Identity | Description
AT1G05055.1 | 2.2e-190 | 76.26 | general transcription factor II H2
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002035 | von Willebrand factor, type A | SMART | SM00327 | VWA_4 | coord: 84..260; e-value: 6.1E-12; score: 55.7
IPR002035 | von Willebrand factor, type A | PROSITE | PS50234 | VWFA | coord: 86..225; score: 8.791193
IPR004595 | TFIIH C1-like domain | SMART | SM01047 | C1_4_2 | coord: 363..413; e-value: 1.8E-20; score: 84.0
IPR004595 | TFIIH C1-like domain | PFAM | PF07975 | C1_4 | coord: 364..413; e-value: 3.2E-19; score: 68.9
IPR007198 | Ssl1-like | PFAM | PF04056 | Ssl1 | coord: 90..280; e-value: 8.6E-86; score: 286.7
IPR007198 | Ssl1-like | CDD | cd01453 | vWA_transcription_factor_IIH_type | coord: 82..264; e-value: 2.30308E-103; score: 302.712
IPR012170 | TFIIH subunit Ssl1/p44 | PIRSF | PIRSF015919 | TFIIH_SSL1 | coord: 1..414; e-value: 2.9E-162; score: 538.2
IPR012170 | TFIIH subunit Ssl1/p44 | TIGRFAM | TIGR00622 | TIGR00622 | coord: 302..413; e-value: 1.6E-37; score: 126.6
IPR036465 | von Willebrand factor A-like domain superfamily | GENE3D | 3.40.50.410 | von Willebrand factor, type A domain | coord: 80..265; e-value: 5.9E-72; score: 243.2
IPR036465 | von Willebrand factor A-like domain superfamily | SUPERFAMILY | 53300 | vWA-like | coord: 82..249
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 340..418; e-value: 4.7E-21; score: 76.6
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..26
None | No IPR available | PANTHER | PTHR12695 | GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 2 | coord: 19..411
None | No IPR available | SUPERFAMILY | 57889 | Cysteine-rich domain | coord: 337..412
IPR013087 | Zinc finger C2H2-type | PROSITE | PS00028 | ZINC_FINGER_C2H2_1 | coord: 386..406
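
The `coord` values in the InterPro rows above can be compared to see how long each predicted domain is and which ones share residues. The following is a rough Python sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). The helper names `span_length` and `overlaps` are hypothetical.

```python
# (source, source term, start, end) copied from four of the InterPro rows above.
# ASSUMPTION: coordinates are 1-based and inclusive on the protein sequence.
DOMAINS = [
    ("PFAM", "PF04056", 90, 280),        # Ssl1
    ("PFAM", "PF07975", 364, 413),       # C1_4 (TFIIH C1-like domain)
    ("PROSITE", "PS00028", 386, 406),    # ZINC_FINGER_C2H2_1
    ("MOBIDB_LITE", "mobidb-lite", 1, 26),
]


def span_length(start, end):
    """Number of residues in an inclusive start..end range."""
    return end - start + 1


def overlaps(a, b):
    """True if two (source, term, start, end) rows share at least one residue."""
    return a[2] <= b[3] and b[2] <= a[3]


for source, term, start, end in DOMAINS:
    print(source, term, span_length(start, end))

ssl1, c1_like, c2h2, _disorder = DOMAINS
print(overlaps(c1_like, c2h2))  # the C2H2 pattern lies inside the C1-like range
print(overlaps(ssl1, c1_like))  # the Ssl1 and C1-like ranges do not intersect
```

Under that convention the Ssl1 domain spans 191 residues, and the PROSITE C2H2 pattern (386..406) falls entirely within the Pfam C1-like domain (364..413).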

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
ClCG01G004970.1 | ClCG01G004970.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006289 | nucleotide-excision repair
biological_process | GO:0006357 | regulation of transcription by RNA polymerase II
biological_process | GO:0006281 | DNA repair
biological_process | GO:0006351 | transcription, DNA-templated
cellular_component | GO:0000439 | transcription factor TFIIH core complex
cellular_component | GO:0005675 | transcription factor TFIIH holo complex
molecular_function | GO:0008270 | zinc ion binding
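
For downstream use, the GO assignments listed above can be grouped by category. The following is a small Python sketch, with the rows copied by hand from this record. The function name `group_by_category` is hypothetical and is not tied to any GO library.

```python
from collections import defaultdict

# (category, accession, term name) rows copied from the GO assignments above.
GO_ROWS = [
    ("biological_process", "GO:0006289", "nucleotide-excision repair"),
    ("biological_process", "GO:0006357", "regulation of transcription by RNA polymerase II"),
    ("biological_process", "GO:0006281", "DNA repair"),
    ("biological_process", "GO:0006351", "transcription, DNA-templated"),
    ("cellular_component", "GO:0000439", "transcription factor TFIIH core complex"),
    ("cellular_component", "GO:0005675", "transcription factor TFIIH holo complex"),
    ("molecular_function", "GO:0008270", "zinc ion binding"),
]


def group_by_category(rows):
    """Map each GO category to the list of accessions annotated under it."""
    grouped = defaultdict(list)
    for category, accession, _name in rows:
        grouped[category].append(accession)
    return dict(grouped)


grouped = group_by_category(GO_ROWS)
for category, accessions in grouped.items():
    print(category, len(accessions), ", ".join(accessions))
```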