Spg021000 (gene) Sponge gourd (cylindrica) v1

Overview
NameSpg021000
Typegene
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRNA polymerase II transcription factor B subunit 4
Locationscaffold9: 806389 .. 812745 (+)
RNA-Seq ExpressionSpg021000
SyntenySpg021000
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GGGCCCAATTTCCATCCACATCTATTAGAAGTTTAGAACTGGGCGTCTTCTCTCTATTTGCCTAATAGCCATGGCCTCTGCTCCTTCGAAGCTCTATGCAGGTTCTTCTCCAATCCCTCTCTCCTTCTATTACGAGCACTTAGGTGAAGTTTCAGAGTGGCAGAAACTTATTCATTTTTGTAATCTTTTGATTATGTTGTTGCAGATGATGTTAGCCTTTTAGTGGTTTTACTGGATACGAATCCATTTTTCTGGAGCACATCTTCTCTCCCATTCTCCAAGTTTCTCTCTCATGTAATCGATAAATTGTCTCTCCATCTGCTAAAAATTCTACCAAGTAGGTGTTTTAGATAGAAATGTAATGAACATTGGTATTTTTTCTGTTTTGGGCGTAGGTACTTGCTTTTTTGAACTCCATTTTAGTTTTGAATCAACTTAATGAGGTTGTGGTCATTGGTACCGGATACGCCTCATGCAAGTATCTATATAATTCGTCTTCGTACTCTAATCGTGGGCTTGAAGATGGTAGAATGCCTGCTCTTTGCACTCGTTTATTGAAGAATTTGGAGGAATTCATGATTGGGGATGAGCAGTCCATTAAGGAAGACCCCAAAGGAGGAACCATGTCTTCACTTCTTTCTGGATCGCTCTCCATGGCTCTGTGTTCTATCCCAAAGGAGGAACCATGTCTTCACTTCTTTCTGGATCGCTCTCCATGGCTCTGTGTTCTATCCTTGTTGATGATTTTTGGTGCTGATTATTTTCTTGTTTGTGTTGATAGCTTCTTGTCGTCTACGTAAAAGTAAAACTCCATCGTGTTCTAGAATTAGAATGTGTCAATAGACCATTAAAATTGAGTAACAAAACGTAACAGTTATCACTAAACAATATCTATGGAGATGGGAGTAGCATTTGGAAGTTATCTTTCATCTGTACATTTGATCAATTTTTGTGGTAATGAGTAATGGCAGCAAAATACTGTGAAAGTTATAGCTGACACGATAGAATTTGTAGATTGCCACACTCTTTAAATTCAACTTGAGTAATCTAATGTTCACCAACATATTATTTTGCTCCAAATGCTTGACCTTGTGAGTAGATATCCAGAAAGTTTTCCGCTCCGGATCCCTCCATCCCCAACCTCGAGTAAGCACTTGTTTCTGTTCTCATCACTTTATTGCCATTTTTCAAATGATATTTGTGGTCTATATGTTTCCTTTCTTATGTTTGTTGTGTACATATACTATTTGTTCCACAGTAATTTCAATAATCTGCTTTGAATTTTTGGTGAAAATGTGACTAATCTATGTGCATGTATTTACTTTGAGTGTAGATCCTTTGCTTGCAGGGATCCCCAGATGGACCTGAACAGTAAGATTTTTCTGATCTTCTCTGTTGAATTTATTTTATTTATGTAATTTTCTTTAATATTGGAAAACAAACTGTAAAATCTTTCTTCTAGTAATTGTTTTTCTCTTTCAGATATGTTGCAATCATGAATGCCATCTTTTCTGCTCAACGTTCAATGGTGAGTATTTCTTGTTGTCACTTGTGAGTGTTCTTATATGACTTCTTGAACTTATGTCAATTACCAATTATCTTGTATTTAACAGGTTGATTATCTGTAAGTAGTGAAAAATGTAGTCCGTCCCTAAATATTTTATGGAAATGCTCAACATAACTTTCCTCTGTATTAATCTTGTTCTTTGTCTGGTCTTGATGCTAGAAAGTGGTGTAGGACTACATTTTTCTGTAGATTTCTTCCCCTATAATTTTTGGAAAATAATTCAATACATCCCATAAGTGCATTTGAATTTATATGCATCTACGAGACTACAACAAATTATATCAACCTACTAGCTTGGAGTTTAACTGAATATCTGGTTTCTACAGTCAGAATTGAAGTCCAATGTTTAAAAAATGTACTTGTCCCATGGACGTAAAATTTGATGAATATTTGGAAACTTTTCTATTATTGATCTTGTCAAATGTGAAAACCGAAATTGTAGAAAATAAAGATGTATGCATAAAATAAGAAAAAAGAATATCATTAAAGAGCAAAGGTTCTACAAAAAGGACGGGAGATATGGAGTTTCTCTGAAGGGACAGATTCTAGGATGGAGGTCTATTAGAATAACCCCAGTCCAAGTTGTCAATGAAGTAGAGAAGTCACAAAACCCTTGTCTAAGTACTTCAAATAAAAGGTATTCTAACTGTACAAGCTCCAAAAGTTAGGAAAAAGGCCCTAAAAATGTATTTTCAAAATTGAAAATAGAAAAAATAGAGATGCACTGTCAAATTACTCTCTCCAACAAGATCTATCAAGCTCAAAAGATGAAGAACTTTTGAGATTTTTCAATTGTTAGGTTTCTCTTTGTTATGACCGCTTAACATAAAAATGACCAAAAGAATGACTAATATCTTCGTATGAAGTAATTTCACTTGTAATGTTCATGATTGTAATGATCATTTGGTCATAAGCAAAAAGTTCATAGTCAATGAAATTTGGAAAGAAATGCAAATCTCCTTTCTCGTTTGCATTTATGGGTTTGCTTTCAGCTTTCATGTGCACTAGTCATTCATTTGAAGTCTGAGATAGCTGTCTATTGCAGCTCCATGTTGCTTTTAACTATAAACAACATTCAACAATTTCCTTGTGTATAAGTATCATATCATGTATGATTGGGTATTCCTGGTTCAGGTTCCTATAGATTCATGTTACATTGGTTCACACAATTCTGCATTTCTTCAGCAGGTAAGCTATCTGTACATTGTTCTGTTAATGGGTCTTTTACTTTTTCCCCCCAGAATCTTACCGTCATAAGCTGTATCCTCTATTCATTTGCTTGTGGTGATTTGATTTCTGATGTTGTATAGGCTTCTTACATAACTGGCGGAGTTTATCTGAAGCCTCAGCAAATGGATGGGTTGTTTCAATATCTCTCTGTAGGTTATATTTACCAATGGCTTTCTATTGCTCCCTCCAAGTTCAAACTTCAAAGCATCTGTTTACTTTTCTCATCGTTTTAGGATTTTAAGTTACAGATTAGTTAAGTGAATGGATTTTTATTGAAAACTCTGCTTGATCGTGGTACAAGAGGTCTTCCCTATTTTTCATAATCTCTTGCTTACGCTTATGTGATATTGTATTGTTCTTTCTCTGCAGACTGTTTATGCCACTGATTTGCATTCCCGGACCTTCTTACAGCTTCCGAAGTCTGTTGGAGTGGATTTTCGTGCATCGTAAGTTTGGCGTCTTGTGAATATACAATTCTGTGATTTTGTGCTTAAAATTTTTCCAGCTTACTATGGCCTTATACCATTTTACACCTCTCATACTAGCAATTTCCTTCTCTGTTGGTGTGGATTTTTGTATATTGTAAGTTTGGAGTTTTGTGAATATACAATTCTGCGGCTTTGTGCTTAAAATTTGGCCAGCTTACTGTGGCCTTATACTAATTTACACCTCTCATCATAATTTTATCTTTTTCTGTATCCAACAGAGGAGGGAATTGCTAGTACCTAATGAATTGGTAATTTGCAGGTGTTTTTGCCACAAAAAAACAATTGATATGAGCTATGTCTGTTCTGTTTGCCTATCTATATTCTGCAAGCATCACAAGAAATGTTCTACCTGTGGGTAAGTGAAATGTCTAAACCGTCATTATTCTTAAAACTTATTGAAGGTGGTTTCAATTTTGAGAATGTTGTTGCATGTTGAGATTTCTAAGCGTGAAAAGGAAGAGGAAGAAGAAGCTGGATTTTCTAAAATTCATTATGCTAGGATTTTTTTTTTCTTGGCCAATTAATACTTTTACTAGGAAATTTGGGAAGAATACTTGGGTTCTGATGTCATCACTCATGAGGGAGCTTAGAGTTTAGTATATTTTATCTTGCCCATAGATAATGGCTGCCTAAATACAGTCCCAAAAGAATCTGCCGACAAAATCGATGAAACGGAAGAGATCCGAGTGAATGGAAAGGAAAAAACAAAAGATGAATTCCACTTTAAAAGGACGTCTTTAGTTTCTTTATCAGTTTTATCTGCTATCTTGCGGTCATTGTTTCAACTCTTGGATTTAATAAAGAGCACTTGAAACTTGGGATGGAAATTACGGCAATTTAAGGAATCAAGACCCTTCAGCAATTGCTCAAAAAGCAACTTTTGACATTATAGGTTTATTAGTGCTTTGTGGATGTTTCAAACGAGAGTCTGTCTGTAATTCGTTTTTCTCCTTCAAACAAGTCTTTGTGTGTATTATTGATGAGTTTTTGACTTCATTCAGGTCAGTTTTTGGTGAGATCCCGGTAGATGTCGATTCAGTGGCTAAACAGAAGAGAAAAACTCCATAAACATGATTTCACTTCGCTGTAAGTTGTGATAATGCTGATGGATCGTTAGTATGACTTTGAACTGATTACTTTGTCCTTCACTTCTATGGCTATAGAACTTTGTCCTATCTTTCTTTCCTTTCCTGAGCTCTTTGGAACTGGAAAATGTCTTTTTGCTGATAATTGGCTAGTTAAAATCATGGAAGTTGGTTCATATGGCTTGAGCAGGAGATGCCTCAAGGGAAATGGGTGCTGGAAGAGGGTCGGTATTTGCCCTGACATTGTACGACACAGATCATGAGTTCGAAGAAGAGCTAGCATTCGAACTCAGTTGGATGTGGGAATATGGAATCATGGCAAGCAAGGACATGCACCCATATTCATCTTTATCAGTCCCACAAGAATCTGGTAACAAAATTACTATGAAACTTGAGAGACTTTAACTGATACACGTGTACTGAAAGCTTGAAAGGTTTCGAAGCATATACATAAAAATTCATTTTTGCTTACGGTTGAGAGCATTTCAAAGTTTCCTTCCACACTGCTGAGTTCAAAATTTATGAAGTGTCCAAAATTACTTTAGAACCGTGTTATTATAGTTGTCTTCCTCACATTTTGAAATTTCACAAGTTGAAGGGAAATTCGAGAGCGCCCTCCATAATCCATCTTTACCTAAATAGAAAAATGAGATGGTCTTCAGCCTCACAGGTAACAGAAAATAACCTAAGGGTGCGCTCCTTTTCCTTTTTACGCTTATAAAACTTTGTAATTGGGTGCCTAATACTATCAAAGTGAATCATATGAAGTTATTGGATCGATTGATTCAATACGACAATGTGAGCACATATATTTCTTTTTAGGTTACAAGTTTCCAAGGGTTAGTGAGTTTTGAATTCAGTGACTATCTGTGTTGGCAGATGTGTTTATTATTTTCAGATTTTGTGATGCAAATGGTAGAAAAATTCTGAAAACAAACTTTTTTATGATCCAGATTTTGAATTTGAAAATTTTAATTGCAGAATTAAATTTAAAAGAAAAAAGATAAAAACTTTAATAAATATAGTATTTCATGTTATAAACTCAATTCATTTTTGTGAAATTAAAATTTGTATAATATAATGTATTATAAATTATATGATATTATAAAATAATAATTATAAAAAAATTTATAACATATAAAATGTTTATAAATTAATTAATAATAGTTTATATTAAATTTAATATTTAGTTGATAACTACAAATAATTTGCGATTTATAAATATATTGTCAAGTATATTTTGAACAATTTTTTAAATAGAAATCCAATTTTAAATCTACTATCTAACACATTTGAAAACATGAAATACAATTATGTTTTTATTGAATTTATTGTTTTCAAATTTATGTTTTTAGATCACTTGCCAAACAAAAAGGAACGAATTACAGGTACAATAACTAACTAAACAAGCATAATTTTCATGTTTTCAAATGTGTTAAACAAGCATAATTTTCAAAAACCAAAAACCAAAAACCAAATAGTTATCAAACGAGGCCTTAGTTGTCTGGCTTGGTGGGTTCATCTAAATATTAAAGATTGTGGGTTTTACTGATTGATGGTCGGCTTCGAGCCCTTGTGCAGAGTGTTTAATCGACCATTTTTTGCATCTTTTGATATTCAATTGCAAATTAGAACAATCACTTAAACTATATCATCCTTTTCTTCATGGTCGTTTAAATTTACAGGGCCACAGCAATTCCCTCCTCGTAAAGCCCCGCCTGCTATAGTAATCAGAGATTCAAATTTTGCACCTGCACCTGCTCCTCAATTTGATAGCAGTGGATCTCCTGCACCTGGATTGGGATTGATCCCTTCAACAACTGGTTCTCTCCCCGGATTCTCGCCCGCTAGTGGCCCTGTTGGTAGTACTTCTGGGTTGAATTCTGCACCTTCTGCTGCTTCCTCATTGGTGTATCCATTAGCGAGTATCTCTGTTGCCTTTCTGTTATTTATCATCGCCAACTTCTTTACATCATTTTCATTGTAA

mRNA sequence

ATGGCCTCTGCTCCTTCGAAGCTCTATGCAGATGATGTTAGCCTTTTAGTGGTTTTACTGGATACGAATCCATTTTTCTGGAGCACATCTTCTCTCCCATTCTCCAAGTTTCTCTCTCATGTACTTGCTTTTTTGAACTCCATTTTAGTTTTGAATCAACTTAATGAGGTTGTGGTCATTGGTACCGGATACGCCTCATGCAAGTATCTATATAATTCGTCTTCGTACTCTAATCGTGGGCTTGAAGATGGTAGAATGCCTGCTCTTTGCACTCGTTTATTGAAGAATTTGGAGGAATTCATGATTGGGGATGAGCAGTCCATTAAGGAAGACCCCAAAGGAGGAACCATGTCTTCACTTCTTTCTGGATCGCTCTCCATGGCTCTGTGTTCTATCCCAAAGGAGGAACCATGTCTTCACTTCTTTCTGGATCGCTCTCCATGGCTCTGTGTTCTATCCTTGTTGATGATTTTTGGTGCTGATTATTTTCTTGTTTGTGTTGATAGCTTCTTGTCGTCTACCATTTGGAAGTTATCTTTCATCTATATCCAGAAAGTTTTCCGCTCCGGATCCCTCCATCCCCAACCTCGAATCCTTTGCTTGCAGGGATCCCCAGATGGACCTGAACAATATGTTGCAATCATGAATGCCATCTTTTCTGCTCAACGTTCAATGGTTCCTATAGATTCATGTTACATTGGTTCACACAATTCTGCATTTCTTCAGCAGGCTTCTTACATAACTGGCGGAGTTTATCTGAAGCCTCAGCAAATGGATGGGTTGTTTCAATATCTCTCTACTGTTTATGCCACTGATTTGCATTCCCGGACCTTCTTACAGCTTCCGAAGTCTGTTGGAGTGGATTTTCGTGCATCGTGTTTTTGCCACAAAAAAACAATTGATATGAGCTATGTCTGTTCTGTTTGCCTATCTATATTCTGCAAGCATCACAAGAAATGTTCTACCTGTGGGTCAGTTTTTGGAGATGCCTCAAGGGAAATGGGTGCTGGAAGAGGGTCGGTATTTGCCCTGACATTGTACGACACAGATCATGAGTTCGAAGAAGAGCTAGCATTCGAACTCAGTTGGATGTGGGAATATGGAATCATGGCAAGCAAGGACATGCACCCATATTCATCTTTATCAGTCCCACAAGAATCTGGGCCACAGCAATTCCCTCCTCGTAAAGCCCCGCCTGCTATAGTAATCAGAGATTCAAATTTTGCACCTGCACCTGCTCCTCAATTTGATAGCAGTGGATCTCCTGCACCTGGATTGGGATTGATCCCTTCAACAACTGGTTCTCTCCCCGGATTCTCGCCCGCTAGTGGCCCTGTTGGTAGTACTTCTGGGTTGAATTCTGCACCTTCTGCTGCTTCCTCATTGGTGTATCCATTAGCGAGTATCTCTGTTGCCTTTCTGTTATTTATCATCGCCAACTTCTTTACATCATTTTCATTGTAA

Coding sequence (CDS)

ATGGCCTCTGCTCCTTCGAAGCTCTATGCAGATGATGTTAGCCTTTTAGTGGTTTTACTGGATACGAATCCATTTTTCTGGAGCACATCTTCTCTCCCATTCTCCAAGTTTCTCTCTCATGTACTTGCTTTTTTGAACTCCATTTTAGTTTTGAATCAACTTAATGAGGTTGTGGTCATTGGTACCGGATACGCCTCATGCAAGTATCTATATAATTCGTCTTCGTACTCTAATCGTGGGCTTGAAGATGGTAGAATGCCTGCTCTTTGCACTCGTTTATTGAAGAATTTGGAGGAATTCATGATTGGGGATGAGCAGTCCATTAAGGAAGACCCCAAAGGAGGAACCATGTCTTCACTTCTTTCTGGATCGCTCTCCATGGCTCTGTGTTCTATCCCAAAGGAGGAACCATGTCTTCACTTCTTTCTGGATCGCTCTCCATGGCTCTGTGTTCTATCCTTGTTGATGATTTTTGGTGCTGATTATTTTCTTGTTTGTGTTGATAGCTTCTTGTCGTCTACCATTTGGAAGTTATCTTTCATCTATATCCAGAAAGTTTTCCGCTCCGGATCCCTCCATCCCCAACCTCGAATCCTTTGCTTGCAGGGATCCCCAGATGGACCTGAACAATATGTTGCAATCATGAATGCCATCTTTTCTGCTCAACGTTCAATGGTTCCTATAGATTCATGTTACATTGGTTCACACAATTCTGCATTTCTTCAGCAGGCTTCTTACATAACTGGCGGAGTTTATCTGAAGCCTCAGCAAATGGATGGGTTGTTTCAATATCTCTCTACTGTTTATGCCACTGATTTGCATTCCCGGACCTTCTTACAGCTTCCGAAGTCTGTTGGAGTGGATTTTCGTGCATCGTGTTTTTGCCACAAAAAAACAATTGATATGAGCTATGTCTGTTCTGTTTGCCTATCTATATTCTGCAAGCATCACAAGAAATGTTCTACCTGTGGGTCAGTTTTTGGAGATGCCTCAAGGGAAATGGGTGCTGGAAGAGGGTCGGTATTTGCCCTGACATTGTACGACACAGATCATGAGTTCGAAGAAGAGCTAGCATTCGAACTCAGTTGGATGTGGGAATATGGAATCATGGCAAGCAAGGACATGCACCCATATTCATCTTTATCAGTCCCACAAGAATCTGGGCCACAGCAATTCCCTCCTCGTAAAGCCCCGCCTGCTATAGTAATCAGAGATTCAAATTTTGCACCTGCACCTGCTCCTCAATTTGATAGCAGTGGATCTCCTGCACCTGGATTGGGATTGATCCCTTCAACAACTGGTTCTCTCCCCGGATTCTCGCCCGCTAGTGGCCCTGTTGGTAGTACTTCTGGGTTGAATTCTGCACCTTCTGCTGCTTCCTCATTGGTGTATCCATTAGCGAGTATCTCTGTTGCCTTTCTGTTATTTATCATCGCCAACTTCTTTACATCATTTTCATTGTAA

Protein sequence

MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVIGTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSLLSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSFIYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTIDMSYVCSVCLSIFCKHHKKCSTCGSVFGDASREMGAGRGSVFALTLYDTDHEFEEELAFELSWMWEYGIMASKDMHPYSSLSVPQESGPQQFPPRKAPPAIVIRDSNFAPAPAPQFDSSGSPAPGLGLIPSTTGSLPGFSPASGPVGSTSGLNSAPSAASSLVYPLASISVAFLLFIIANFFTSFSL
Homology
BLAST of Spg021000 vs. NCBI nr
Match: XP_004152842.1 (general transcription and DNA repair factor IIH subunit TFB4 [Cucumis sativus] >XP_011648981.1 general transcription and DNA repair factor IIH subunit TFB4 [Cucumis sativus])

HSP 1 Score: 530.0 bits (1364), Expect = 2.1e-146
Identity = 272/334 (81.44%), Postives = 277/334 (82.93%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWSTS+LPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSN GLEDGRMPALCTRLLKNLEEF+IGDEQSIKEDPKGGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 121 LSGSLSMALC-------------------------------------------------- 180

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF
Sbjct: 181 -YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYLSTV+ TDLHSRTFLQLPKSVGVDFRASCFCHKKTI
Sbjct: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRASCFCHKKTI 283

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGDASREM 335
           DM YVCSVCLSIFCKHHKKCSTCGSVFG+   E+
Sbjct: 301 DMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVEL 283

BLAST of Spg021000 vs. NCBI nr
Match: XP_008441918.2 (PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II transcription factor B subunit 4 [Cucumis melo])

HSP 1 Score: 524.6 bits (1350), Expect = 8.9e-145
Identity = 270/334 (80.84%), Postives = 275/334 (82.34%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSN GLEDGRMPALCTRLLKNLEEF+IGDEQSIKEDPKGGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 121 LSGSLSMALC-------------------------------------------------- 180

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV IDSCYIGSHNSAF
Sbjct: 181 -YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVSIDSCYIGSHNSAF 240

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYL+TV+ TDLHSRTFLQLPKSVGVDFRASCFCH KTI
Sbjct: 241 LQQASYITGGVYLKPQQMDGLFQYLATVFGTDLHSRTFLQLPKSVGVDFRASCFCHXKTI 283

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGDASREM 335
           DM YVCSVCLSIFCKHHKKCSTCGSVFG+   E+
Sbjct: 301 DMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVEL 283

BLAST of Spg021000 vs. NCBI nr
Match: XP_038886688.1 (general transcription and DNA repair factor IIH subunit TFB4 [Benincasa hispida] >XP_038886689.1 general transcription and DNA repair factor IIH subunit TFB4 [Benincasa hispida])

HSP 1 Score: 524.6 bits (1350), Expect = 8.9e-145
Identity = 269/334 (80.54%), Postives = 275/334 (82.34%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAP KLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPPKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSN GLEDGRMPALCTRLL NLEEF+I DEQS+KEDPKGGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLNNLEEFVISDEQSLKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 121 LSGSLSMALC-------------------------------------------------- 180

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF
Sbjct: 181 -YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYLSTV+ TDLHSRTFLQLPKSVGVDFRASCFCHKKTI
Sbjct: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRASCFCHKKTI 283

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGDASREM 335
           DM YVCSVCLSIFCKHHKKCSTCGSVFG++  E+
Sbjct: 301 DMGYVCSVCLSIFCKHHKKCSTCGSVFGESPIEL 283

BLAST of Spg021000 vs. NCBI nr
Match: KAG6578972.1 (LysM domain-containing GPI-anchored protein 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 522.3 bits (1344), Expect = 4.4e-144
Identity = 268/329 (81.46%), Postives = 273/329 (82.98%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWST SLPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 552 MASAPSKLYADDVSLLVVLLDTNPFFWSTYSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 611

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSNR LEDGRMPALCTRLL NLEEFMIGDEQSIKEDP+GGTMSSL
Sbjct: 612 GTGYASCKYLYNSSSYSNRSLEDGRMPALCTRLLNNLEEFMIGDEQSIKEDPRGGTMSSL 671

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 672 LSGSLSMALC-------------------------------------------------- 731

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            YIQKVFRSGSLHP PRILCLQGSPDGPEQYVAIMN+IFSAQRSMVPIDSCYIGSHNSAF
Sbjct: 732 -YIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNSIFSAQRSMVPIDSCYIGSHNSAF 791

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYLSTV+ATDLHSRTFLQLPKSVGVDFRASCFCHKKTI
Sbjct: 792 LQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 829

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGD 330
           DM +VCSVCLSIFCKHHKKCSTCGSVFG+
Sbjct: 852 DMGFVCSVCLSIFCKHHKKCSTCGSVFGE 829

BLAST of Spg021000 vs. NCBI nr
Match: XP_023550583.1 (RNA polymerase II transcription factor B subunit 4 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 522.3 bits (1344), Expect = 4.4e-144
Identity = 268/329 (81.46%), Postives = 273/329 (82.98%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWST SLPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTYSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSNR LEDGRMPALCTRLL NLEEFMIGDEQSIKEDP+GGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNRSLEDGRMPALCTRLLNNLEEFMIGDEQSIKEDPRGGTMSSL 120

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 121 LSGSLSMALC-------------------------------------------------- 180

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            YIQKVFRSGSLHP PRILCLQGSPDGPEQYVAIMN+IFSAQRSMVPIDSCYIGSHNSAF
Sbjct: 181 -YIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNSIFSAQRSMVPIDSCYIGSHNSAF 240

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYLSTV+ATDLHSRTFLQLPKSVGVDFRASCFCHKKTI
Sbjct: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 278

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGD 330
           DM +VCSVCLSIFCKHHKKCSTCGSVFG+
Sbjct: 301 DMGFVCSVCLSIFCKHHKKCSTCGSVFGE 278

BLAST of Spg021000 vs. ExPASy Swiss-Prot
Match: Q8LF41 (General transcription and DNA repair factor IIH subunit TFB4 OS=Arabidopsis thaliana OX=3702 GN=TFB4 PE=2 SV=1)

HSP 1 Score: 394.8 bits (1013), Expect = 1.4e-108
Identity = 207/334 (61.98%), Postives = 239/334 (71.56%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           M +  SK Y+DDVSLLV+LLDTNP FWST+S+ FS+FLSHVLAFLN++L LNQLN+VVVI
Sbjct: 1   MPAIASKQYSDDVSLLVLLLDTNPLFWSTTSITFSQFLSHVLAFLNAVLGLNQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGR---MPALCTRLLKNLEEFMIGDEQSIKEDPKGGTM 120
            TGY+SC Y+Y+SS  SN G  +     MPA+   LLK LEEF+  DE+  KE+     +
Sbjct: 61  ATGYSSCDYIYDSSLTSNHGNFESNGTGMPAIFGSLLKKLEEFVTKDEELSKEEVSEDRI 120

Query: 121 -SSLLSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIW 180
            S LLSGSLSMALC                                              
Sbjct: 121 PSCLLSGSLSMALC---------------------------------------------- 180

Query: 181 KLSFIYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSH 240
                YIQ+VFRSG LHPQPRILCLQGSPDGPEQYVA+MN+IFSAQR MVPIDSCYIG  
Sbjct: 181 -----YIQRVFRSGHLHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRLMVPIDSCYIGVQ 240

Query: 241 NSAFLQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCH 300
           NSAFLQQASYITGGV+  P+Q+DGLFQYL+T++ATDLHSR F+QLPK +GVDFRASCFCH
Sbjct: 241 NSAFLQQASYITGGVHHTPKQLDGLFQYLTTIFATDLHSRGFVQLPKPIGVDFRASCFCH 283

Query: 301 KKTIDMSYVCSVCLSIFCKHHKKCSTCGSVFGDA 331
           KKTIDM Y+CSVCLSIFC+HHKKCSTCGSVFG +
Sbjct: 301 KKTIDMGYICSVCLSIFCEHHKKCSTCGSVFGQS 283

BLAST of Spg021000 vs. ExPASy Swiss-Prot
Match: Q86IB5 (General transcription factor IIH subunit 3 OS=Dictyostelium discoideum OX=44689 GN=gtf2h3 PE=3 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 7.3e-41
Identity = 100/294 (34.01%), Postives = 150/294 (51.02%), Query Frame = 0

Query: 34  FSKFLSHVLAFLNSILVLNQLNEVVVIGTGYASCKYLYNSSSYSNRGLEDGRMPALCTRL 93
           F+KFL H + F+N+ L+LNQ N++ +I +      +++  S+      E           
Sbjct: 92  FNKFLEHFMVFINAYLMLNQENQLAIICSKIGESSFVFPQSNIDQYQQEQ---------- 151

Query: 94  LKNLEEFMIGDEQSIKEDPKGGTMSSLLSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLS 153
            + LE+  + +   +   P     +  + G +   L  +  E                  
Sbjct: 152 -QELEQRQLNENGELLPTP-----NKTIQGQILAKLQKLDLE------------------ 211

Query: 154 LLMIFGADYFLVCVDSFLSSTIWKLSFIYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVA 213
                  D   +   SF +S    ++  YI ++ R  +   +PRIL    SPD   QY++
Sbjct: 212 ----IKHDQTDILSSSFSAS--MSIALCYINRIKRE-TPTIKPRILVFNISPDVSSQYIS 271

Query: 214 IMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVYATDL 273
           +MN IFS+Q+  +P+DSC +   +S FLQQAS++T G+YLKPQ+ + L QYL T +  D 
Sbjct: 272 VMNCIFSSQKQSIPVDSCILSQSDSTFLQQASHLTSGIYLKPQKQELLSQYLLTTFLLDT 331

Query: 274 HSRTFLQLPKSVGVDFRASCFCHKKTIDMSYVCSVCLSIFCKHHKKCSTCGSVF 328
            SR  L  P    VD+RASCFCHK+ +D+ YVCSVCLSIFC H   CSTCG+ F
Sbjct: 332 LSRKSLAYPTLKSVDYRASCFCHKRIVDIGYVCSVCLSIFCGHSSSCSTCGTKF 344

BLAST of Spg021000 vs. ExPASy Swiss-Prot
Match: Q13889 (General transcription factor IIH subunit 3 OS=Homo sapiens OX=9606 GN=GTF2H3 PE=1 SV=2)

HSP 1 Score: 154.1 bits (388), Expect = 4.2e-36
Identity = 106/339 (31.27%), Postives = 161/339 (47.49%), Query Frame = 0

Query: 11  DDVSLLVVLLDTNPFFWSTSSLPFSKF-----LSHVLAFLNSILVLNQLNEVVVIGTGYA 70
           D+++LLV+++D NP +W   +L  S+F     +  V+   NS L +N+ N++ VI +   
Sbjct: 6   DELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHIQ 65

Query: 71  SCKYLY---------------NSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSI-- 130
             ++LY               N   ++  G +DG+       LL +  E ++ + + +  
Sbjct: 66  ESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKY-----ELLTSANEVIVEEIKDLMT 125

Query: 131 KEDPKGGTMSSLLSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVD 190
           K D KG    +LL+GSL+ ALC                                      
Sbjct: 126 KSDIKGQHTETLLAGSLAKALC-------------------------------------- 185

Query: 191 SFLSSTIWKLSFIYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPI 250
                      +I+        +   + RIL ++ + D   QY+  MN IF+AQ+  + I
Sbjct: 186 -----------YIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAAQKQNILI 245

Query: 251 DSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVD 310
           D+C + S +S  LQQA  ITGG+YLK  QM  L QYL  V+  D   R+ L LP  V VD
Sbjct: 246 DACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILPPPVHVD 289

Query: 311 FRASCFCHKKTIDMSYVCSVCLSIFCKHHKKCSTCGSVF 328
           +RA+CFCH+  I++ YVCSVCLSIFC     C+TC + F
Sbjct: 306 YRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289

BLAST of Spg021000 vs. ExPASy Swiss-Prot
Match: Q05B56 (General transcription factor IIH subunit 3 OS=Bos taurus OX=9913 GN=GTF2H3 PE=2 SV=1)

HSP 1 Score: 153.7 bits (387), Expect = 5.4e-36
Identity = 107/338 (31.66%), Postives = 158/338 (46.75%), Query Frame = 0

Query: 11  DDVSLLVVLLDTNPFFWSTSSLPFSKF-----LSHVLAFLNSILVLNQLNEVVVIGTGYA 70
           D+++LLV+++DTNP +W   +L  S+F     +  V+   NS L +N+ N++ VI +   
Sbjct: 6   DELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHIQ 65

Query: 71  SCKYLY----------------NSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIK 130
             ++LY                 SS ++  G +DG+   L        EE     +   K
Sbjct: 66  ESRFLYPGKNGRLGDFFGDPGNPSSEFTPSGSKDGKYELLTAANEVIAEEI---KDLMTK 125

Query: 131 EDPKGGTMSSLLSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDS 190
            D +G    +LL+GSL+ ALC                                       
Sbjct: 126 SDIEGQHTETLLAGSLAKALC--------------------------------------- 185

Query: 191 FLSSTIWKLSFIYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 250
                     +I+        +   + RIL ++ + D   QY+  MN IF+AQ+  + ID
Sbjct: 186 ----------YIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAAQKQNILID 245

Query: 251 SCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDF 310
           +C + S +S  LQQA  ITGG+YLK  QM  L QYL  V+  D   R+ L LP  V VD+
Sbjct: 246 ACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDY 290

Query: 311 RASCFCHKKTIDMSYVCSVCLSIFCKHHKKCSTCGSVF 328
           RA+CFCH+  I++ YVCSVCLSIFC     C+TC + F
Sbjct: 306 RAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290

BLAST of Spg021000 vs. ExPASy Swiss-Prot
Match: Q561R7 (General transcription factor IIH subunit 3 OS=Rattus norvegicus OX=10116 GN=Gtf2h3 PE=2 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 9.3e-36
Identity = 104/338 (30.77%), Postives = 156/338 (46.15%), Query Frame = 0

Query: 11  DDVSLLVVLLDTNPFFWSTSSLPFSKF-----LSHVLAFLNSILVLNQLNEVVVIGTGYA 70
           D+++LLV+++DTNP +W   +L  S+F     +  V+   N+ L +N+ N++ VI +   
Sbjct: 6   DELNLLVIIVDTNPIWWGKQALKESQFTLSKCMDAVMVLANAHLFMNRSNQLAVIASHIQ 65

Query: 71  SCKYLYNSSS----------------YSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIK 130
             ++LY   +                 +  G +DG+   L        EE     +   K
Sbjct: 66  ESRFLYPGKNGRLGDFFGDPGNALPDCNPSGSKDGKYELLTAANEVIAEEI---KDLMTK 125

Query: 131 EDPKGGTMSSLLSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDS 190
            D KG    +LL+GSL+ ALC I +    +                              
Sbjct: 126 SDIKGQHTETLLAGSLAKALCYIHRASKAV------------------------------ 185

Query: 191 FLSSTIWKLSFIYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 250
                                +   + RIL ++ + D   QY+  MN IF+AQ+  + ID
Sbjct: 186 -------------------KDNQEMKSRILVIKAAEDSALQYMNFMNVIFAAQKQNILID 245

Query: 251 SCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDF 310
           +C + S +S  LQQA  ITGG+YLK  QM  L QYL  V+  D   R+ L LP  + VD+
Sbjct: 246 ACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILPPPIHVDY 290

Query: 311 RASCFCHKKTIDMSYVCSVCLSIFCKHHKKCSTCGSVF 328
           RA+CFCH+  I++ YVCSVCLSIFC     C+TC + F
Sbjct: 306 RAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290

BLAST of Spg021000 vs. ExPASy TrEMBL
Match: A0A0A0LMM2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G072450 PE=3 SV=1)

HSP 1 Score: 530.0 bits (1364), Expect = 1.0e-146
Identity = 272/334 (81.44%), Postives = 277/334 (82.93%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWSTS+LPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSN GLEDGRMPALCTRLLKNLEEF+IGDEQSIKEDPKGGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 121 LSGSLSMALC-------------------------------------------------- 180

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF
Sbjct: 181 -YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYLSTV+ TDLHSRTFLQLPKSVGVDFRASCFCHKKTI
Sbjct: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRASCFCHKKTI 283

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGDASREM 335
           DM YVCSVCLSIFCKHHKKCSTCGSVFG+   E+
Sbjct: 301 DMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVEL 283

BLAST of Spg021000 vs. ExPASy TrEMBL
Match: A0A1S3B4H7 (LOW QUALITY PROTEIN: RNA polymerase II transcription factor B subunit 4 OS=Cucumis melo OX=3656 GN=LOC103485913 PE=3 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 4.3e-145
Identity = 270/334 (80.84%), Postives = 275/334 (82.34%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSN GLEDGRMPALCTRLLKNLEEF+IGDEQSIKEDPKGGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 121 LSGSLSMALC-------------------------------------------------- 180

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV IDSCYIGSHNSAF
Sbjct: 181 -YIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVSIDSCYIGSHNSAF 240

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYL+TV+ TDLHSRTFLQLPKSVGVDFRASCFCH KTI
Sbjct: 241 LQQASYITGGVYLKPQQMDGLFQYLATVFGTDLHSRTFLQLPKSVGVDFRASCFCHXKTI 283

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGDASREM 335
           DM YVCSVCLSIFCKHHKKCSTCGSVFG+   E+
Sbjct: 301 DMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVEL 283

BLAST of Spg021000 vs. ExPASy TrEMBL
Match: A0A6J1FJQ6 (RNA polymerase II transcription factor B subunit 4 OS=Cucurbita moschata OX=3662 GN=LOC111444867 PE=3 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 6.2e-144
Identity = 267/329 (81.16%), Postives = 273/329 (82.98%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWST SLPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTYSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSNR LEDGRMPALCTRLL NLEEFMIGDEQSI+EDP+GGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNRSLEDGRMPALCTRLLNNLEEFMIGDEQSIEEDPRGGTMSSL 120

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 121 LSGSLSMALC-------------------------------------------------- 180

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            YIQKVFRSGSLHP PRILCLQGSPDGPEQYVAIMN+IFSAQRSMVPIDSCYIGSHNSAF
Sbjct: 181 -YIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNSIFSAQRSMVPIDSCYIGSHNSAF 240

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYLSTV+ATDLHSRTFLQLPKSVGVDFRASCFCHKKTI
Sbjct: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 278

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGD 330
           DM +VCSVCLSIFCKHHKKCSTCGSVFG+
Sbjct: 301 DMGFVCSVCLSIFCKHHKKCSTCGSVFGE 278

BLAST of Spg021000 vs. ExPASy TrEMBL
Match: A0A6J1JZK7 (RNA polymerase II transcription factor B subunit 4 OS=Cucurbita maxima OX=3661 GN=LOC111489720 PE=3 SV=1)

HSP 1 Score: 518.5 bits (1334), Expect = 3.1e-143
Identity = 266/329 (80.85%), Postives = 272/329 (82.67%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWST SLPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTYSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSNR LEDGRMPALCTRLL NLEEFMIGDEQSIKEDP+ GTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNRSLEDGRMPALCTRLLNNLEEFMIGDEQSIKEDPREGTMSSL 120

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 121 LSGSLSMALC-------------------------------------------------- 180

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            Y+QKVFRSGSLHP PRILCLQGSPDGPEQYVAIMN+IFSAQRSMVPIDSCYIGSHNSAF
Sbjct: 181 -YLQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNSIFSAQRSMVPIDSCYIGSHNSAF 240

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYLSTV+ATDLHSRTFLQLPKSVGVDFRASCFCHKKTI
Sbjct: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 278

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGD 330
           DM +VCSVCLSIFCKHHKKCSTCGSVFG+
Sbjct: 301 DMGFVCSVCLSIFCKHHKKCSTCGSVFGE 278

BLAST of Spg021000 vs. ExPASy TrEMBL
Match: A0A6J1CM09 (RNA polymerase II transcription factor B subunit 4 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111012196 PE=3 SV=1)

HSP 1 Score: 518.1 bits (1333), Expect = 4.0e-143
Identity = 265/329 (80.55%), Postives = 272/329 (82.67%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MAS PSKLYADDVSLL+VLLDTNPFFWSTSSLPFSKFLSHVLAFLNSIL LNQLNEVVVI
Sbjct: 1   MASVPSKLYADDVSLLMVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILDLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGRMPALCTRLLKNLEEFMIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSS+SNRGLEDGRMPALCTRLLKNLEEF+I DEQS+KEDP+GGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEFVIADEQSVKEDPRGGTMSSL 120

Query: 121 LSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIWKLSF 180
           LSGSLSMALC                                                  
Sbjct: 121 LSGSLSMALC-------------------------------------------------- 180

Query: 181 IYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240
            YIQKVFRSGSLHP PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF
Sbjct: 181 -YIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAF 240

Query: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCHKKTI 300
           LQQASYITGGVYLKPQQMDGLFQYLSTV+ATDLHSR FLQLPKSVGVDFRASCFCHKKTI
Sbjct: 241 LQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTI 278

Query: 301 DMSYVCSVCLSIFCKHHKKCSTCGSVFGD 330
           DM YVCSVCLSIFCKHHKKCSTCGSVFG+
Sbjct: 301 DMGYVCSVCLSIFCKHHKKCSTCGSVFGE 278

BLAST of Spg021000 vs. TAIR 10
Match: AT1G18340.1 (basal transcription factor complex subunit-related )

HSP 1 Score: 394.8 bits (1013), Expect = 9.9e-110
Identity = 207/334 (61.98%), Postives = 239/334 (71.56%), Query Frame = 0

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           M +  SK Y+DDVSLLV+LLDTNP FWST+S+ FS+FLSHVLAFLN++L LNQLN+VVVI
Sbjct: 1   MPAIASKQYSDDVSLLVLLLDTNPLFWSTTSITFSQFLSHVLAFLNAVLGLNQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNRGLEDGR---MPALCTRLLKNLEEFMIGDEQSIKEDPKGGTM 120
            TGY+SC Y+Y+SS  SN G  +     MPA+   LLK LEEF+  DE+  KE+     +
Sbjct: 61  ATGYSSCDYIYDSSLTSNHGNFESNGTGMPAIFGSLLKKLEEFVTKDEELSKEEVSEDRI 120

Query: 121 -SSLLSGSLSMALCSIPKEEPCLHFFLDRSPWLCVLSLLMIFGADYFLVCVDSFLSSTIW 180
            S LLSGSLSMALC                                              
Sbjct: 121 PSCLLSGSLSMALC---------------------------------------------- 180

Query: 181 KLSFIYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSH 240
                YIQ+VFRSG LHPQPRILCLQGSPDGPEQYVA+MN+IFSAQR MVPIDSCYIG  
Sbjct: 181 -----YIQRVFRSGHLHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRLMVPIDSCYIGVQ 240

Query: 241 NSAFLQQASYITGGVYLKPQQMDGLFQYLSTVYATDLHSRTFLQLPKSVGVDFRASCFCH 300
           NSAFLQQASYITGGV+  P+Q+DGLFQYL+T++ATDLHSR F+QLPK +GVDFRASCFCH
Sbjct: 241 NSAFLQQASYITGGVHHTPKQLDGLFQYLTTIFATDLHSRGFVQLPKPIGVDFRASCFCH 283

Query: 301 KKTIDMSYVCSVCLSIFCKHHKKCSTCGSVFGDA 331
           KKTIDM Y+CSVCLSIFC+HHKKCSTCGSVFG +
Sbjct: 301 KKTIDMGYICSVCLSIFCEHHKKCSTCGSVFGQS 283

BLAST of Spg021000 vs. TAIR 10
Match: AT1G21880.2 (lysm domain GPI-anchored protein 1 precursor )

HSP 1 Score: 49.3 bits (116), Expect = 1.0e-05
Identity = 45/106 (42.45%), Postives = 59/106 (55.66%), Query Frame = 0

Query: 384 PQESGPQQFPPRKAPPAIVIRDSNFAPAPAPQFDSSGSPA--PGLGLIPSTTGSLPGFSP 443
           P+  GPQQF P  APP  V RD  +APAP+P FD  GS A  P   ++P   G LPG +P
Sbjct: 323 PRCPGPQQFAPLLAPPDTVPRDVMYAPAPSPDFDGPGSIASSPRSSMLPG-GGILPG-NP 382

Query: 444 ASGPVGSTSGLNSAPSAASSLVYPLASISVAFLLFIIANFFTSFSL 488
           A+GP GS S  ++            +S+S  F+ F+I+    SFSL
Sbjct: 383 ANGPAGSISTASA------------SSVSYFFITFLIS--IASFSL 412

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004152842.12.1e-14681.44general transcription and DNA repair factor IIH subunit TFB4 [Cucumis sativus] >... [more]
XP_008441918.28.9e-14580.84PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II transcription factor B subunit... [more]
XP_038886688.18.9e-14580.54general transcription and DNA repair factor IIH subunit TFB4 [Benincasa hispida]... [more]
KAG6578972.14.4e-14481.46LysM domain-containing GPI-anchored protein 1, partial [Cucurbita argyrosperma s... [more]
XP_023550583.14.4e-14481.46RNA polymerase II transcription factor B subunit 4 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q8LF411.4e-10861.98General transcription and DNA repair factor IIH subunit TFB4 OS=Arabidopsis thal... [more]
Q86IB57.3e-4134.01General transcription factor IIH subunit 3 OS=Dictyostelium discoideum OX=44689 ... [more]
Q138894.2e-3631.27General transcription factor IIH subunit 3 OS=Homo sapiens OX=9606 GN=GTF2H3 PE=... [more]
Q05B565.4e-3631.66General transcription factor IIH subunit 3 OS=Bos taurus OX=9913 GN=GTF2H3 PE=2 ... [more]
Q561R79.3e-3630.77General transcription factor IIH subunit 3 OS=Rattus norvegicus OX=10116 GN=Gtf2... [more]
Match NameE-valueIdentityDescription
A0A0A0LMM21.0e-14681.44Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G072450 PE=3 SV=1[more]
A0A1S3B4H74.3e-14580.84LOW QUALITY PROTEIN: RNA polymerase II transcription factor B subunit 4 OS=Cucum... [more]
A0A6J1FJQ66.2e-14481.16RNA polymerase II transcription factor B subunit 4 OS=Cucurbita moschata OX=3662... [more]
A0A6J1JZK73.1e-14380.85RNA polymerase II transcription factor B subunit 4 OS=Cucurbita maxima OX=3661 G... [more]
A0A6J1CM094.0e-14380.55RNA polymerase II transcription factor B subunit 4 isoform X1 OS=Momordica chara... [more]
Match NameE-valueIdentityDescription
AT1G18340.19.9e-11061.98basal transcription factor complex subunit-related [more]
AT1G21880.21.0e-0542.45lysm domain GPI-anchored protein 1 precursor [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004600TFIIH subunit Tfb4/GTF2H3PFAMPF03850Tfb4coord: 13..324
e-value: 5.0E-84
score: 282.1
IPR004600TFIIH subunit Tfb4/GTF2H3PANTHERPTHR12831TRANSCRIPTION INITIATION FACTOR IIH TFIIH , POLYPEPTIDE 3-RELATEDcoord: 178..332
coord: 1..134
IPR036465von Willebrand factor A-like domain superfamilyGENE3D3.40.50.410von Willebrand factor, type A domaincoord: 10..272
e-value: 6.0E-57
score: 194.8

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spg021000.1Spg021000.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006289 nucleotide-excision repair
biological_process GO:0070816 phosphorylation of RNA polymerase II C-terminal domain
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0000439 transcription factor TFIIH core complex
cellular_component GO:0005675 transcription factor TFIIH holo complex
molecular_function GO:0046872 metal ion binding