Spg033974 (gene) Sponge gourd (cylindrica) v1

Overview
NameSpg033974
Typegene
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionGATA transcription factor
Locationscaffold13: 32789594 .. 32791889 (+)
RNA-Seq ExpressionSpg033974
SyntenySpg033974
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCCAAAAATAATTTCCCAAAACTAATTTCTCTTTACCCTCAACACACACACAAACATCCATGGAAGCTCCAGAATATTTCCAGAACAATGGCTACTGCTCCCAATTCGCCGCCGACAACGACGCCCCCGCCGCCGTCGGAGACCATTTCATCGTCGAGGAGCTTCTCGATTTCTCCAACCACGACGCCGACGCCGGAGGATTGTTCAACAATGCCACCGGCTGCTTCCTCAATAATGGAAACTCCGCCTCCGCCTCCGCCGACTCCTCCGCCCTCACCGTCGTCGAGAGCTGCAATTCCTCCAATTCCTTTTCGGTTTCCGAACCCAATTCCTTTCTCGAAGACATTAGTGCCTCTAATTTAGCCGACGCCCATTTCTCCGACGAACTCTGCATTCCGGTAATTGTTGTTATTATCATTAATTATTTTTTTTTTCTTTAAATAGGATGACGAGGAACTTTTCTTTTGGGGATTTCTTGCTTTTTTTGTTTTTTTGTTTTTTGTTTTTGTTACGTACATTGAACATTTTTCTGTGTATTAATTATATAATTTTGTTGAGGAATAGTAATTAATGACTAATGTATGAATATATATTCATTTGAAATTACACTCCCATGGCTTTTTTTTTTTTTTTTTTTGGGTGTGTTTTCTTCTTAATAGTAAGCTTAGAGTATTTACCTTATTTGTGATATATAAAAAGTTAAGATGGTTAAAAATATTTTTCAATTGGGGTTTGTTTTATTTTTGGTGAATGTATAAAATGGTCTCTGTTTTTTTTTTCTTAAATTTCTTTTAGTTATTTGATATTTAATCTTCTAAAAGTTCACAATTCTATCTTGATACATTGGGAAACGATTGATGGGTCATTCAATTGTTGTTTTATTAACATATTCGAACATAAAATCATGCTACTTTTATATAACATTGAAGAAGTAGCTTTTGTTGATCAAAAGGCAAAACTCTACCGAATAAACTAGTTGGATTCTGTGTGTTCTAGTTGAATTAAGTGACATATTTGTCCGTCCAAATATGTTTTGTTTAGTTCAACAAGAGGTGAGAGATTTGAACACCAGAAAAAATGAACCTACCTGAACACACTAACTTTTAACCCGCCCTTTTTTGACCTTTTGGTCGAGAGTACATGTCAATTACCGCTAAACTATGCTCACTTTGGCAACCATATAGTATTAATAAAGTAAAATAAGATGGTAAATTTTTGTAATGAAGATTATTATTATTATTATTAACACATTAGCTTTTTTTATTTGTGTGTTTTTACAGTTAGATGATTTAGCTGAGTTGGAATGGCTTTCAAATTTCGTAGAGGAATCATTTTCCAGTGAGGACATGCAAAAGTTAGAACTCATCTCCGGAGTCAAAGTCAAATCCGACGGACCCTCCCACTCCCGACAACCCACCACCGCCACCGCCGCCGTCTCCACCACCACCCACGCCCGAAACGCAGCCGAAATCTTCAAACCCGACATTGTCTCAGTTCCGGCGAAGGCCCGCAGCAAACGCTCACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTCCCGCTCTCTCCGACCACCTCCTCGTCGGAATCCGACATCGCCCCCGCCGGACCACCGCAGCCGGTCAAAAAAATCCCGCCGAAGGCGGCGGCGACGGTGAAGAAGAAGGACTGCCCGGAGGCCGGAGCGTCCGCCGGAGAGGGGCGGAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCGGACGGGCCCAATGGGCCCAAAGACGCTGTGTAACGCTTGCGGCGTTCGGTACAAATCCGGGCGCCTGGTGCCGGAGTACCGCCCCGCCGCCAGCCCCACCTTCGTCCTCACCAAACACTCCAACTCCCACAGGAAAGTTTTGGAGCTCCGGCGGCAGAAGGAGCTTCTCAGAGCCCAACAACAGCAACAGCAACAAGTGCTTTTGGATCACCATCACCATCACCGTCATCAGGATATGATCTTTGATTCATCCAACGGTGACGATTATCTCATCCATCAACACGTGGGCCCCGATTACCGGCAGCTGATCTGACCTCCACCGCCGCCGCCGCCGCTAGGGGTAGAGTTCGGCCAATCTGATCCATTTGATTTTTTTTTTTTTTCCCTCAAATTTTTAGATGATTTTGTTGGAGTTTCTAACATTAAGATTTTGATTCTTTTTTCTCTTCCCATAGACATTATTTTTGTTGTCTGGAGAACAAAGGCAAATTACAATTTACAATTGATGAAAAAGTGTTGTCTAATTAATGAAACAACATTAAATC

mRNA sequence

ATGGAAGCTCCAGAATATTTCCAGAACAATGGCTACTGCTCCCAATTCGCCGCCGACAACGACGCCCCCGCCGCCGTCGGAGACCATTTCATCGTCGAGGAGCTTCTCGATTTCTCCAACCACGACGCCGACGCCGGAGGATTGTTCAACAATGCCACCGGCTGCTTCCTCAATAATGGAAACTCCGCCTCCGCCTCCGCCGACTCCTCCGCCCTCACCGTCGTCGAGAGCTGCAATTCCTCCAATTCCTTTTCGGTTTCCGAACCCAATTCCTTTCTCGAAGACATTAGTGCCTCTAATTTAGCCGACGCCCATTTCTCCGACGAACTCTGCATTCCGTTAGATGATTTAGCTGAGTTGGAATGGCTTTCAAATTTCGTAGAGGAATCATTTTCCAGTGAGGACATGCAAAAGTTAGAACTCATCTCCGGAGTCAAAGTCAAATCCGACGGACCCTCCCACTCCCGACAACCCACCACCGCCACCGCCGCCGTCTCCACCACCACCCACGCCCGAAACGCAGCCGAAATCTTCAAACCCGACATTGTCTCAGTTCCGGCGAAGGCCCGCAGCAAACGCTCACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTCCCGCTCTCTCCGACCACCTCCTCGTCGGAATCCGACATCGCCCCCGCCGGACCACCGCAGCCGGTCAAAAAAATCCCGCCGAAGGCGGCGGCGACGGTGAAGAAGAAGGACTGCCCGGAGGCCGGAGCGTCCGCCGGAGAGGGGCGGAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCGGACGGGCCCAATGGGCCCAAAGACGCTGTGTAACGCTTGCGGCGTTCGGTACAAATCCGGGCGCCTGGTGCCGGAGTACCGCCCCGCCGCCAGCCCCACCTTCGTCCTCACCAAACACTCCAACTCCCACAGGAAAGTTTTGGAGCTCCGGCGGCAGAAGGAGCTTCTCAGAGCCCAACAACAGCAACAGCAACAAGTGCTTTTGGATCACCATCACCATCACCGTCATCAGGATATGATCTTTGATTCATCCAACGGTGACGATTATCTCATCCATCAACACGTGGGCCCCGATTACCGGCAGCTGATCTGA

Coding sequence (CDS)

ATGGAAGCTCCAGAATATTTCCAGAACAATGGCTACTGCTCCCAATTCGCCGCCGACAACGACGCCCCCGCCGCCGTCGGAGACCATTTCATCGTCGAGGAGCTTCTCGATTTCTCCAACCACGACGCCGACGCCGGAGGATTGTTCAACAATGCCACCGGCTGCTTCCTCAATAATGGAAACTCCGCCTCCGCCTCCGCCGACTCCTCCGCCCTCACCGTCGTCGAGAGCTGCAATTCCTCCAATTCCTTTTCGGTTTCCGAACCCAATTCCTTTCTCGAAGACATTAGTGCCTCTAATTTAGCCGACGCCCATTTCTCCGACGAACTCTGCATTCCGTTAGATGATTTAGCTGAGTTGGAATGGCTTTCAAATTTCGTAGAGGAATCATTTTCCAGTGAGGACATGCAAAAGTTAGAACTCATCTCCGGAGTCAAAGTCAAATCCGACGGACCCTCCCACTCCCGACAACCCACCACCGCCACCGCCGCCGTCTCCACCACCACCCACGCCCGAAACGCAGCCGAAATCTTCAAACCCGACATTGTCTCAGTTCCGGCGAAGGCCCGCAGCAAACGCTCACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTCCCGCTCTCTCCGACCACCTCCTCGTCGGAATCCGACATCGCCCCCGCCGGACCACCGCAGCCGGTCAAAAAAATCCCGCCGAAGGCGGCGGCGACGGTGAAGAAGAAGGACTGCCCGGAGGCCGGAGCGTCCGCCGGAGAGGGGCGGAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCGGACGGGCCCAATGGGCCCAAAGACGCTGTGTAACGCTTGCGGCGTTCGGTACAAATCCGGGCGCCTGGTGCCGGAGTACCGCCCCGCCGCCAGCCCCACCTTCGTCCTCACCAAACACTCCAACTCCCACAGGAAAGTTTTGGAGCTCCGGCGGCAGAAGGAGCTTCTCAGAGCCCAACAACAGCAACAGCAACAAGTGCTTTTGGATCACCATCACCATCACCGTCATCAGGATATGATCTTTGATTCATCCAACGGTGACGATTATCTCATCCATCAACACGTGGGCCCCGATTACCGGCAGCTGATCTGA

Protein sequence

MEAPEYFQNNGYCSQFAADNDAPAAVGDHFIVEELLDFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDIAPAGPPQPVKKIPPKAAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRHQDMIFDSSNGDDYLIHQHVGPDYRQLI
Homology
BLAST of Spg033974 vs. NCBI nr
Match: XP_038886306.1 (GATA transcription factor 12-like [Benincasa hispida])

HSP 1 Score: 519.2 bits (1336), Expect = 2.9e-143
Identity = 293/387 (75.71%), Postives = 313/387 (80.88%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQF----AADNDAPAAVG----DHFIVEELLDFSNHD----ADAGGL 60
           MEAPEYFQ NGYCSQF    ++D D   A      +HFIVEELLDFSN D     D GGL
Sbjct: 1   MEAPEYFQINGYCSQFSTHSSSDTDTTTATATAGPEHFIVEELLDFSNDDDGVVGDGGGL 60

Query: 61  FNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPN-SFLEDISASNLADAHFS 120
           F N      N  N+ + S +SSA+TV+ESCNSS SFS  EPN SFLEDIS SNLADAHFS
Sbjct: 61  FYNTN----NGNNNNNNSTESSAVTVIESCNSS-SFSGCEPNSSFLEDISGSNLADAHFS 120

Query: 121 DELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTATAAVST 180
            ELC+P DDLAELEWLS+FVEESFSSEDMQKLELISGVKV+SD P++SRQPT        
Sbjct: 121 SELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVRSDEPTNSRQPTA------- 180

Query: 181 TTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDI-APAGPP 240
               RNAA IFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTT   E +I A AGPP
Sbjct: 181 ---TRNAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTT---EPEITATAGPP 240

Query: 241 QPVKKIPPKAAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVR 300
            P+KK PPK AAT KKKD PE G S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVR
Sbjct: 241 HPIKKNPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVR 300

Query: 301 YKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRH 360
           YKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQ +LLDH     H
Sbjct: 301 YKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDH-----H 360

Query: 361 QDMIFDSSNGDDYLIHQHVGPDYRQLI 374
           QDMIFD+SNGDDYLIHQH+GPD+RQLI
Sbjct: 361 QDMIFDASNGDDYLIHQHMGPDFRQLI 363

BLAST of Spg033974 vs. NCBI nr
Match: KAG6585379.1 (GATA transcription factor 12, partial [Cucurbita argyrosperma subsp. sororia] >KAG7020295.1 GATA transcription factor 12, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 493.0 bits (1268), Expect = 2.2e-135
Identity = 280/397 (70.53%), Postives = 310/397 (78.09%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPA------AVGDHFIVEELLDFSNHD----ADAGGLFN 60
           MEAPEYF NN YCSQF +D DA A      A  DHFIVEELLDFSN D    AD+GG FN
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAAATTASTATADHFIVEELLDFSNDDDSAIADSGGFFN 60

Query: 61  NATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDEL 120
           N T CFL NGN    SA+SSA T VES NSS SFS SE  SF +D+SAS+LAD  FSD++
Sbjct: 61  NVT-CFL-NGN----SAESSAATAVESSNSS-SFSGSERTSFFDDVSASSLADVRFSDDI 120

Query: 121 CIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTATAAVSTTTH 180
            IP ++L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S  PTTA +AVS  +H
Sbjct: 121 FIPYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPPQSHHPTTAVSAVSAASH 180

Query: 181 ARN-AAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDI-APAGPPQP 240
            RN AA IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE DI A   PP P
Sbjct: 181 GRNAAAAIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSELDIPATEPPPHP 240

Query: 241 VKKIPPKAAAT---------VKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGP 300
           VKK+PPK AAT         VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGP
Sbjct: 241 VKKVPPKVAATATATASTAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGP 300

Query: 301 KTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQV 360
           KTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +
Sbjct: 301 KTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQAL 360

Query: 361 LLDHHHHHRHQDMIFDSSNGDDYLIHQHVGPDYRQLI 374
           ++DHHHHH HQ+M+FDSSNG+DYL+ Q+V  DY  LI
Sbjct: 361 MMDHHHHHHHQEMMFDSSNGEDYLMKQNVAHDYLHLI 388

BLAST of Spg033974 vs. NCBI nr
Match: XP_023002390.1 (GATA transcription factor 12-like [Cucurbita maxima])

HSP 1 Score: 490.7 bits (1262), Expect = 1.1e-134
Identity = 276/388 (71.13%), Postives = 307/388 (79.12%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPA------AVGDHFIVEELLDFSNHD----ADAGGLFN 60
           MEAPEYF NN YCSQF +D DA A      A  DHFIVEELLDFSN D    AD+GG FN
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAASTATATATADHFIVEELLDFSNDDDSAIADSGGFFN 60

Query: 61  NATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDEL 120
           N T CFL NGN    S++SSA T VES NSS SFS  E  SF +D+S S+LAD  FSD++
Sbjct: 61  NVT-CFL-NGN----SSESSAATAVESSNSS-SFSGCERTSFFDDVSGSSLADVRFSDDI 120

Query: 121 CIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTATAAVSTTTH 180
            IP ++L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S+ PTT   AVS  +H
Sbjct: 121 FIPYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPLQSQHPTT---AVSAASH 180

Query: 181 ARN-AAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDI-APAGPPQP 240
            RN AAEIFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE DI A   PP P
Sbjct: 181 GRNAAAEIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSEQDIPATEPPPHP 240

Query: 241 VKKIPPKAAATVKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGV 300
           VKK+PPK AA VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGV
Sbjct: 241 VKKVPPKVAAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGV 300

Query: 301 RYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHR 360
           RYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++DHHHHH 
Sbjct: 301 RYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMDHHHHHH 360

Query: 361 HQDMIFDSSNGDDYLIHQHVGPDYRQLI 374
           HQ+M+FDSSNG+DYL+ Q+V  DY  LI
Sbjct: 361 HQEMMFDSSNGEDYLMKQNVAHDYLHLI 376

BLAST of Spg033974 vs. NCBI nr
Match: XP_023538437.1 (GATA transcription factor 12-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 490.3 bits (1261), Expect = 1.4e-134
Identity = 278/394 (70.56%), Postives = 309/394 (78.43%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPA------AVGDHFIVEELLDFSNHD----ADAGGLFN 60
           MEAPEYF  N YCSQF +D DA A      A  DHFIVEELLDFSN D    AD+GG FN
Sbjct: 1   MEAPEYFHKNAYCSQFTSDKDAAASTATASATADHFIVEELLDFSNDDDSAIADSGGFFN 60

Query: 61  NATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDEL 120
           N T CFL NGN    SA+SSA T VES NSS SFS SE  SF +D+S S+LAD  FSD++
Sbjct: 61  NVT-CFL-NGN----SAESSAATAVESSNSS-SFSGSERTSFFDDVSGSSLADVRFSDDI 120

Query: 121 CIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTATAAVSTTTH 180
            IP ++L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S+ PTTA +AVS  +H
Sbjct: 121 FIPYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPPQSQHPTTAPSAVSAASH 180

Query: 181 ARN-AAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDI-APAGPPQP 240
            RN AA IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE DI A   PP P
Sbjct: 181 GRNAAAAIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSELDIPATEPPPHP 240

Query: 241 VKKIPPKAAAT------VKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTL 300
           VKK+PPK AAT      VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTL
Sbjct: 241 VKKVPPKVAATATATAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTL 300

Query: 301 CNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLD 360
           CNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++D
Sbjct: 301 CNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMD 360

Query: 361 HHHHHRHQDMIFDSSNGDDYLIHQHVGPDYRQLI 374
           HHHHH HQ+M+FDSSNG+DYL+ Q+V  DY  LI
Sbjct: 361 HHHHHHHQEMMFDSSNGEDYLMKQNVAHDYLHLI 385

BLAST of Spg033974 vs. NCBI nr
Match: XP_022951637.1 (GATA transcription factor 12-like [Cucurbita moschata])

HSP 1 Score: 489.6 bits (1259), Expect = 2.4e-134
Identity = 278/387 (71.83%), Postives = 308/387 (79.59%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPAA-VGDHFIVEELLDFSNHD----ADAGGLFNNATGC 60
           MEAPEYF NN YCSQF +D DA AA   DHFIVEELLDFSN D    AD+GG FNN T C
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAAATTADHFIVEELLDFSNDDDSAIADSGGFFNNVT-C 60

Query: 61  FLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDELCIPLD 120
           FL NGN    SA+SSA T VES NSS SFS SE  SF +D+SAS+LAD  FSD++ IP +
Sbjct: 61  FL-NGN----SAESSAATAVESSNSS-SFSGSERTSFFDDVSASSLADVRFSDDIFIPYN 120

Query: 121 DLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTATAAVSTTTHARN-A 180
           +L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S  PT A +A+S   H RN A
Sbjct: 121 ELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPPQSHHPTNAVSALS---HGRNAA 180

Query: 181 AEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDI-APAGPPQPVKKIP 240
           A IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE DI A   PP PVKK+P
Sbjct: 181 AAIFKPDIVAVPAKARSKRSRTIPSNWNNSRLLPLSPTSSSSELDIPATEPPPHPVKKVP 240

Query: 241 PKAAAT----VKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVR 300
           PK AAT    VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVR
Sbjct: 241 PKVAATATAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVR 300

Query: 301 YKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRH 360
           YKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++DHHHHH H
Sbjct: 301 YKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMDHHHHHHH 360

Query: 361 QDMIFDSSNGDDYLIHQHVGPDYRQLI 374
           Q+M+FDSSNG+DYL+ Q+V  DY  LI
Sbjct: 361 QEMMFDSSNGEDYLMKQNVAHDYLHLI 375

BLAST of Spg033974 vs. ExPASy Swiss-Prot
Match: P69781 (GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 9.8e-70
Identity = 179/359 (49.86%), Postives = 214/359 (59.61%), Query Frame = 0

Query: 30  FIVEELL-DFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSE 89
           F V++LL DFSN D +   +                  ADS+  T +     S++FS ++
Sbjct: 14  FAVDDLLVDFSNDDDEENDVV-----------------ADSTTTTTI---TDSSNFSAAD 73

Query: 90  PNSFLEDISASNLADAHFSDELCIPLDDLA-ELEWLSNFVEESFSSEDMQKLELISGVKV 149
             SF  D+         FS +LCIP DDLA ELEWLSN V+ES S ED+ KLELISG K 
Sbjct: 74  LPSFHGDVQDG----TSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHKLELISGFKS 133

Query: 150 KSDGPSHSRQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLL 209
           + D  S +  P           +  +++ IF  D VSVPAKARSKRSRA   NW +  LL
Sbjct: 134 RPDPKSDTGSP----------ENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLL 193

Query: 210 -------PLSPTT--SSSESDIAPAGPPQPVKKIPPKAAAT---VKKKDCPEAGASAGEG 269
                  P +  T  SS +    P  PP  +  +  K A      +KKD     +   E 
Sbjct: 194 KETFYDSPFTGETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAEE 253

Query: 270 RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRK 329
           R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRK
Sbjct: 254 RRCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRK 313

Query: 330 VLELRRQKELLRAQQQQQQQVLLDHHHHHRHQDMIFD-SSNGDDYLIHQHVGPDYRQLI 374
           V+ELRRQKE+ RA  +        HHHH     MIFD SS+GDDYLIH +VGPD+RQLI
Sbjct: 314 VMELRRQKEMSRAHHE------FIHHHHGTDTAMIFDVSSDGDDYLIHHNVGPDFRQLI 331

BLAST of Spg033974 vs. ExPASy Swiss-Prot
Match: O82632 (GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1)

HSP 1 Score: 225.7 bits (574), Expect = 8.6e-58
Identity = 155/350 (44.29%), Postives = 197/350 (56.29%), Query Frame = 0

Query: 28  DHFIVEELLDFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVS 87
           D F+V++LLDFSN D +     N                 DSS L+     +SSNS S+ 
Sbjct: 16  DSFVVDDLLDFSNDDGEVDDGLNTL--------------PDSSTLSTGTLTDSSNSSSL- 75

Query: 88  EPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKV 147
                          D     +L IP DD+AELEWLSNFVEESF+ ED  KL L SG+K 
Sbjct: 76  -------------FTDGTGFSDLYIPNDDIAELEWLSNFVEESFAGEDQDKLHLFSGLK- 135

Query: 148 KSDGPSHSRQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLL 207
                +     +T T  +       +         V+VPAKARSKRSR+  S W  SRLL
Sbjct: 136 -----NPQTTGSTLTHLIKPEPELDHQFIDIDESNVAVPAKARSKRSRSAASTW-ASRLL 195

Query: 208 PLSPTTSSSESDIAPAGPPQPVKKIPPKAAATVKKKDCPEAGASAGEGRKCMHCATDKTP 267
            L        +D     P +  +++  +  A     DC E+G     GR+C+HCAT+KTP
Sbjct: 196 SL--------ADSDETNPKKKQRRVKEQDFAGDMDVDCGESGG----GRRCLHCATEKTP 255

Query: 268 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLR 327
           QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+  
Sbjct: 256 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEM-- 308

Query: 328 AQQQQQQQVLLDHHHHHRHQDMIFD-SSNGDDYLIH---QHVGPDYRQLI 374
                + + LL      R ++++ D  SNG+D+L+H    HV PD+R LI
Sbjct: 316 -----RDEHLLS---QLRCENLLMDIRSNGEDFLMHNNTNHVAPDFRHLI 308

BLAST of Spg033974 vs. ExPASy Swiss-Prot
Match: O49741 (GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1)

HSP 1 Score: 174.9 bits (442), Expect = 1.7e-42
Identity = 129/313 (41.21%), Postives = 164/313 (52.40%), Query Frame = 0

Query: 32  VEELLDFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNS 91
           +++LLDFSN D     +F            SAS+S  S+A T      SS+SF   +  S
Sbjct: 14  IDDLLDFSNED-----IF------------SASSSGGSTAAT------SSSSFPPPQNPS 73

Query: 92  FLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG 151
           F      S+     F  ++C+P DD A LEWLS FV++SF                 +D 
Sbjct: 74  FHHHHLPSSADHHSFLHDICVPSDDAAHLEWLSQFVDDSF-----------------ADF 133

Query: 152 PSHSRQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSP 211
           P++    T  +    T                S P K RSKRSRA          +PL  
Sbjct: 134 PANPLGGTMTSVKTET----------------SFPGKPRSKRSRAPAPFAGTWSPMPL-- 193

Query: 212 TTSSSESDIAPAGPPQPVKKIPPKAAATVKKKDCPEAGASAGEG-RKCMHCATDKTPQWR 271
              S    +  A   +P K+          +     +  + G G R+C HCA++KTPQWR
Sbjct: 194 --ESEHQQLHSAAKFKPKKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWR 253

Query: 272 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQ 331
           TGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R   
Sbjct: 254 TGPLGPKTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMR--- 262

Query: 332 QQQQQVLLDHHHH 344
            Q QQV L HHHH
Sbjct: 314 -QPQQVQLHHHHH 262

BLAST of Spg033974 vs. ExPASy Swiss-Prot
Match: O49743 (GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1)

HSP 1 Score: 159.1 bits (401), Expect = 9.9e-38
Identity = 119/293 (40.61%), Postives = 151/293 (51.54%), Query Frame = 0

Query: 32  VEELLDFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNS 91
           +++LLDFSN +     +F            S+S++  SSA        +S++ S   P S
Sbjct: 14  IDDLLDFSNDE-----IF------------SSSSTVTSSA--------ASSAASSENPFS 73

Query: 92  FLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG 151
           F      S      F+ +LC+P DD A LEWLS FV++SF                 SD 
Sbjct: 74  FPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSF-----------------SDF 133

Query: 152 PSHSRQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSP 211
           P++   P T T                +P+I S   K RS+RSRA              P
Sbjct: 134 PAN---PLTMTV---------------RPEI-SFTGKPRSRRSRA--------------P 193

Query: 212 TTSSSESDIAPAGPPQPVKKIPPKAAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRT 271
             S     +A    P    ++    A    KK       +A   R+C HCA++KTPQWRT
Sbjct: 194 APS-----VAGTWAPMSESELCHSVAKPKPKKVYNAESVTADGARRCTHCASEKTPQWRT 226

Query: 272 GPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE 325
           GP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE
Sbjct: 254 GPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKE 226

BLAST of Spg033974 vs. ExPASy Swiss-Prot
Match: Q9FH57 (GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 2.1e-32
Identity = 114/313 (36.42%), Postives = 147/313 (46.96%), Query Frame = 0

Query: 28  DHFIVEELLDFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVS 87
           D F V++LLD SN D             F +      A  +   ++  E  +  ++   S
Sbjct: 39  DDFSVDDLLDLSNDDV------------FADEETDLKAQHEMVRVSSEEPNDDGDALRRS 98

Query: 88  EPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKV 147
              S  +D  +        + EL +P DDLA LEWLS+FVE+SF+               
Sbjct: 99  SDFSGCDDFGSLP------TSELSLPADDLANLEWLSHFVEDSFT--------------- 158

Query: 148 KSDGPSHSRQPTTATA-AVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRL 207
           +  GP+ +  PT   A       H   A          VPAKARSKR+R     W+    
Sbjct: 159 EYSGPNLTGTPTEKPAWLTGDRKHPVTAVTEETCFKSPVPAKARSKRNRNGLKVWSLGSS 218

Query: 208 LPLSPTTSSSESDIAPAGPPQP-------VKKIPPKAAATVKKKDCPEAGASAGEG---- 267
               P++S S S  + +GP  P       ++ +         KK    +  S   G    
Sbjct: 219 SSSGPSSSGSTSS-SSSGPSSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQ 278

Query: 268 ----RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN 325
               RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN
Sbjct: 279 LQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSN 317

BLAST of Spg033974 vs. ExPASy TrEMBL
Match: A0A6J1KNT0 (GATA transcription factor OS=Cucurbita maxima OX=3661 GN=LOC111496245 PE=3 SV=1)

HSP 1 Score: 490.7 bits (1262), Expect = 5.3e-135
Identity = 276/388 (71.13%), Postives = 307/388 (79.12%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPA------AVGDHFIVEELLDFSNHD----ADAGGLFN 60
           MEAPEYF NN YCSQF +D DA A      A  DHFIVEELLDFSN D    AD+GG FN
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAASTATATATADHFIVEELLDFSNDDDSAIADSGGFFN 60

Query: 61  NATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDEL 120
           N T CFL NGN    S++SSA T VES NSS SFS  E  SF +D+S S+LAD  FSD++
Sbjct: 61  NVT-CFL-NGN----SSESSAATAVESSNSS-SFSGCERTSFFDDVSGSSLADVRFSDDI 120

Query: 121 CIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTATAAVSTTTH 180
            IP ++L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S+ PTT   AVS  +H
Sbjct: 121 FIPYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPLQSQHPTT---AVSAASH 180

Query: 181 ARN-AAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDI-APAGPPQP 240
            RN AAEIFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE DI A   PP P
Sbjct: 181 GRNAAAEIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSEQDIPATEPPPHP 240

Query: 241 VKKIPPKAAATVKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGV 300
           VKK+PPK AA VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGV
Sbjct: 241 VKKVPPKVAAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGV 300

Query: 301 RYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHR 360
           RYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++DHHHHH 
Sbjct: 301 RYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMDHHHHHH 360

Query: 361 HQDMIFDSSNGDDYLIHQHVGPDYRQLI 374
           HQ+M+FDSSNG+DYL+ Q+V  DY  LI
Sbjct: 361 HQEMMFDSSNGEDYLMKQNVAHDYLHLI 376

BLAST of Spg033974 vs. ExPASy TrEMBL
Match: A0A6J1GI87 (GATA transcription factor OS=Cucurbita moschata OX=3662 GN=LOC111454392 PE=3 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 1.2e-134
Identity = 278/387 (71.83%), Postives = 308/387 (79.59%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPAA-VGDHFIVEELLDFSNHD----ADAGGLFNNATGC 60
           MEAPEYF NN YCSQF +D DA AA   DHFIVEELLDFSN D    AD+GG FNN T C
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAAATTADHFIVEELLDFSNDDDSAIADSGGFFNNVT-C 60

Query: 61  FLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDELCIPLD 120
           FL NGN    SA+SSA T VES NSS SFS SE  SF +D+SAS+LAD  FSD++ IP +
Sbjct: 61  FL-NGN----SAESSAATAVESSNSS-SFSGSERTSFFDDVSASSLADVRFSDDIFIPYN 120

Query: 121 DLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTATAAVSTTTHARN-A 180
           +L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S  PT A +A+S   H RN A
Sbjct: 121 ELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPPQSHHPTNAVSALS---HGRNAA 180

Query: 181 AEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDI-APAGPPQPVKKIP 240
           A IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE DI A   PP PVKK+P
Sbjct: 181 AAIFKPDIVAVPAKARSKRSRTIPSNWNNSRLLPLSPTSSSSELDIPATEPPPHPVKKVP 240

Query: 241 PKAAAT----VKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVR 300
           PK AAT    VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVR
Sbjct: 241 PKVAATATAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVR 300

Query: 301 YKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRH 360
           YKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++DHHHHH H
Sbjct: 301 YKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMDHHHHHHH 360

Query: 361 QDMIFDSSNGDDYLIHQHVGPDYRQLI 374
           Q+M+FDSSNG+DYL+ Q+V  DY  LI
Sbjct: 361 QEMMFDSSNGEDYLMKQNVAHDYLHLI 375

BLAST of Spg033974 vs. ExPASy TrEMBL
Match: A0A0A0LPR5 (GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_2G373450 PE=3 SV=1)

HSP 1 Score: 482.6 bits (1241), Expect = 1.4e-132
Identity = 274/398 (68.84%), Postives = 300/398 (75.38%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADND-------APAAVGDHFIVEELLDFSNHDADA-------- 60
           MEAPEYFQ N Y SQF++ +D       A AA  DHFIVEELLDFSN++ DA        
Sbjct: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60

Query: 61  ---------GGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDI 120
                    GGLF N      N+ N+ + S +SSA+TV+ESCNSS        +SF EDI
Sbjct: 61  GGGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSS--------SSFFEDI 120

Query: 121 SASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSD-GPSHS 180
           S SNL DAHFS ELC+P DDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSD  P+ S
Sbjct: 121 SGSNLGDAHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQS 180

Query: 181 RQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSS 240
            QPT            R+AA IFKP+IVSVPAKARSKRSRA+PSNWNNS LLPLS  T+ 
Sbjct: 181 PQPTA----------TRSAAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAE 240

Query: 241 SESDIAPAGPPQPVKKIPPKAAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMG 300
           SE+   P   P P+KK  PKAAAT KKKD P+ G S+GEGRKCMHCATDKTPQWRTGPMG
Sbjct: 241 SET-TPPIEQPHPIKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMG 300

Query: 301 PKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQ 360
           PKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ Q 
Sbjct: 301 PKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQH 360

Query: 361 VLLDHHHHHRHQDMIFDSSNGDDYLIHQHVGPDYRQLI 374
           +LLDH      QDMIFD+SNGDDYLIHQHVGPD+RQLI
Sbjct: 361 LLLDH-----RQDMIFDASNGDDYLIHQHVGPDFRQLI 374

BLAST of Spg033974 vs. ExPASy TrEMBL
Match: A0A5A7VCX1 (GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003790 PE=3 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 4.2e-132
Identity = 273/391 (69.82%), Postives = 301/391 (76.98%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDA-----PAAVGDHFIVEELLDFSNHDADA---------- 60
           MEAPEYFQ N Y SQF++ + A      AA  +HFIVEELLDFSN++ DA          
Sbjct: 1   MEAPEYFQINAYSSQFSSPDHADASTTAAAAPEHFIVEELLDFSNNEDDAVFTDAGGGGG 60

Query: 61  -GGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADA 120
            GGLF N      N+ N+ + SA+SSA+TV+ESCNSS        +SF EDIS SNL DA
Sbjct: 61  GGGLFYNNNNTTSNDHNNNNNSAESSAITVMESCNSS--------SSFFEDISGSNLGDA 120

Query: 121 HFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG-PSHSRQPTTATA 180
           HFS ELC+P DDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVKSD  P+ S QPT    
Sbjct: 121 HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKSDEIPAQSPQPTA--- 180

Query: 181 AVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDI-AP 240
                   R AA IFKP+IVSVPAKARSKRSRA+PSNWNNS LLPLSPT   +E +I AP
Sbjct: 181 -------TRTAAAIFKPEIVSVPAKARSKRSRALPSNWNNSSLLPLSPT---AEPEITAP 240

Query: 241 AGPPQPVKKIPPKAAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNA 300
            G P  +KK  PK AAT KKKD P+ G S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNA
Sbjct: 241 IGQPYSIKKPLPKVAATAKKKDNPDVGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNA 300

Query: 301 CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHH 360
           CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQ +LLDH  
Sbjct: 301 CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQQQHLLLDH-- 360

Query: 361 HHRHQDMIFDSSNGDDYLIHQHVGPDYRQLI 374
               QDMIFD+SNGDDYLIHQHVGPD+RQ+I
Sbjct: 361 ---RQDMIFDASNGDDYLIHQHVGPDFRQMI 365

BLAST of Spg033974 vs. ExPASy TrEMBL
Match: A0A1S3BBN7 (GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103488171 PE=3 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 4.2e-132
Identity = 273/391 (69.82%), Postives = 301/391 (76.98%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDA-----PAAVGDHFIVEELLDFSNHDADA---------- 60
           MEAPEYFQ N Y SQF++ + A      AA  +HFIVEELLDFSN++ DA          
Sbjct: 1   MEAPEYFQINAYSSQFSSPDHADASTTAAAAPEHFIVEELLDFSNNEDDAVFTDAGGGGG 60

Query: 61  -GGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADA 120
            GGLF N      N+ N+ + SA+SSA+TV+ESCNSS        +SF EDIS SNL DA
Sbjct: 61  GGGLFYNNNNTTSNDHNNNNNSAESSAITVMESCNSS--------SSFFEDISGSNLGDA 120

Query: 121 HFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG-PSHSRQPTTATA 180
           HFS ELC+P DDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVKSD  P+ S QPT    
Sbjct: 121 HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKSDEIPAQSPQPTA--- 180

Query: 181 AVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESDI-AP 240
                   R AA IFKP+IVSVPAKARSKRSRA+PSNWNNS LLPLSPT   +E +I AP
Sbjct: 181 -------TRTAAAIFKPEIVSVPAKARSKRSRALPSNWNNSSLLPLSPT---AEPEITAP 240

Query: 241 AGPPQPVKKIPPKAAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNA 300
            G P  +KK  PK AAT KKKD P+ G S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNA
Sbjct: 241 IGQPYSIKKPLPKVAATAKKKDNPDVGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNA 300

Query: 301 CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHH 360
           CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQ +LLDH  
Sbjct: 301 CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQQQHLLLDH-- 360

Query: 361 HHRHQDMIFDSSNGDDYLIHQHVGPDYRQLI 374
               QDMIFD+SNGDDYLIHQHVGPD+RQ+I
Sbjct: 361 ---RQDMIFDASNGDDYLIHQHVGPDFRQMI 365

BLAST of Spg033974 vs. TAIR 10
Match: AT5G25830.1 (GATA transcription factor 12 )

HSP 1 Score: 265.4 bits (677), Expect = 7.0e-71
Identity = 179/359 (49.86%), Postives = 214/359 (59.61%), Query Frame = 0

Query: 30  FIVEELL-DFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSE 89
           F V++LL DFSN D +   +                  ADS+  T +     S++FS ++
Sbjct: 14  FAVDDLLVDFSNDDDEENDVV-----------------ADSTTTTTI---TDSSNFSAAD 73

Query: 90  PNSFLEDISASNLADAHFSDELCIPLDDLA-ELEWLSNFVEESFSSEDMQKLELISGVKV 149
             SF  D+         FS +LCIP DDLA ELEWLSN V+ES S ED+ KLELISG K 
Sbjct: 74  LPSFHGDVQDG----TSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHKLELISGFKS 133

Query: 150 KSDGPSHSRQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLL 209
           + D  S +  P           +  +++ IF  D VSVPAKARSKRSRA   NW +  LL
Sbjct: 134 RPDPKSDTGSP----------ENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLL 193

Query: 210 -------PLSPTT--SSSESDIAPAGPPQPVKKIPPKAAAT---VKKKDCPEAGASAGEG 269
                  P +  T  SS +    P  PP  +  +  K A      +KKD     +   E 
Sbjct: 194 KETFYDSPFTGETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAEE 253

Query: 270 RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRK 329
           R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRK
Sbjct: 254 RRCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRK 313

Query: 330 VLELRRQKELLRAQQQQQQQVLLDHHHHHRHQDMIFD-SSNGDDYLIHQHVGPDYRQLI 374
           V+ELRRQKE+ RA  +        HHHH     MIFD SS+GDDYLIH +VGPD+RQLI
Sbjct: 314 VMELRRQKEMSRAHHE------FIHHHHGTDTAMIFDVSSDGDDYLIHHNVGPDFRQLI 331

BLAST of Spg033974 vs. TAIR 10
Match: AT4G32890.1 (GATA transcription factor 9 )

HSP 1 Score: 225.7 bits (574), Expect = 6.1e-59
Identity = 155/350 (44.29%), Postives = 197/350 (56.29%), Query Frame = 0

Query: 28  DHFIVEELLDFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVS 87
           D F+V++LLDFSN D +     N                 DSS L+     +SSNS S+ 
Sbjct: 16  DSFVVDDLLDFSNDDGEVDDGLNTL--------------PDSSTLSTGTLTDSSNSSSL- 75

Query: 88  EPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKV 147
                          D     +L IP DD+AELEWLSNFVEESF+ ED  KL L SG+K 
Sbjct: 76  -------------FTDGTGFSDLYIPNDDIAELEWLSNFVEESFAGEDQDKLHLFSGLK- 135

Query: 148 KSDGPSHSRQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLL 207
                +     +T T  +       +         V+VPAKARSKRSR+  S W  SRLL
Sbjct: 136 -----NPQTTGSTLTHLIKPEPELDHQFIDIDESNVAVPAKARSKRSRSAASTW-ASRLL 195

Query: 208 PLSPTTSSSESDIAPAGPPQPVKKIPPKAAATVKKKDCPEAGASAGEGRKCMHCATDKTP 267
            L        +D     P +  +++  +  A     DC E+G     GR+C+HCAT+KTP
Sbjct: 196 SL--------ADSDETNPKKKQRRVKEQDFAGDMDVDCGESGG----GRRCLHCATEKTP 255

Query: 268 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLR 327
           QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+  
Sbjct: 256 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEM-- 308

Query: 328 AQQQQQQQVLLDHHHHHRHQDMIFD-SSNGDDYLIH---QHVGPDYRQLI 374
                + + LL      R ++++ D  SNG+D+L+H    HV PD+R LI
Sbjct: 316 -----RDEHLLS---QLRCENLLMDIRSNGEDFLMHNNTNHVAPDFRHLI 308

BLAST of Spg033974 vs. TAIR 10
Match: AT2G45050.1 (GATA transcription factor 2 )

HSP 1 Score: 174.9 bits (442), Expect = 1.2e-43
Identity = 129/313 (41.21%), Postives = 164/313 (52.40%), Query Frame = 0

Query: 32  VEELLDFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNS 91
           +++LLDFSN D     +F            SAS+S  S+A T      SS+SF   +  S
Sbjct: 14  IDDLLDFSNED-----IF------------SASSSGGSTAAT------SSSSFPPPQNPS 73

Query: 92  FLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG 151
           F      S+     F  ++C+P DD A LEWLS FV++SF                 +D 
Sbjct: 74  FHHHHLPSSADHHSFLHDICVPSDDAAHLEWLSQFVDDSF-----------------ADF 133

Query: 152 PSHSRQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSP 211
           P++    T  +    T                S P K RSKRSRA          +PL  
Sbjct: 134 PANPLGGTMTSVKTET----------------SFPGKPRSKRSRAPAPFAGTWSPMPL-- 193

Query: 212 TTSSSESDIAPAGPPQPVKKIPPKAAATVKKKDCPEAGASAGEG-RKCMHCATDKTPQWR 271
              S    +  A   +P K+          +     +  + G G R+C HCA++KTPQWR
Sbjct: 194 --ESEHQQLHSAAKFKPKKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWR 253

Query: 272 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQ 331
           TGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R   
Sbjct: 254 TGPLGPKTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMR--- 262

Query: 332 QQQQQVLLDHHHH 344
            Q QQV L HHHH
Sbjct: 314 -QPQQVQLHHHHH 262

BLAST of Spg033974 vs. TAIR 10
Match: AT3G60530.1 (GATA transcription factor 4 )

HSP 1 Score: 159.1 bits (401), Expect = 7.0e-39
Identity = 119/293 (40.61%), Postives = 151/293 (51.54%), Query Frame = 0

Query: 32  VEELLDFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVSEPNS 91
           +++LLDFSN +     +F            S+S++  SSA        +S++ S   P S
Sbjct: 14  IDDLLDFSNDE-----IF------------SSSSTVTSSA--------ASSAASSENPFS 73

Query: 92  FLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG 151
           F      S      F+ +LC+P DD A LEWLS FV++SF                 SD 
Sbjct: 74  FPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSF-----------------SDF 133

Query: 152 PSHSRQPTTATAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSP 211
           P++   P T T                +P+I S   K RS+RSRA              P
Sbjct: 134 PAN---PLTMTV---------------RPEI-SFTGKPRSRRSRA--------------P 193

Query: 212 TTSSSESDIAPAGPPQPVKKIPPKAAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRT 271
             S     +A    P    ++    A    KK       +A   R+C HCA++KTPQWRT
Sbjct: 194 APS-----VAGTWAPMSESELCHSVAKPKPKKVYNAESVTADGARRCTHCASEKTPQWRT 226

Query: 272 GPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE 325
           GP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE
Sbjct: 254 GPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKE 226

BLAST of Spg033974 vs. TAIR 10
Match: AT5G66320.1 (GATA transcription factor 5 )

HSP 1 Score: 141.4 bits (355), Expect = 1.5e-33
Identity = 114/313 (36.42%), Postives = 147/313 (46.96%), Query Frame = 0

Query: 28  DHFIVEELLDFSNHDADAGGLFNNATGCFLNNGNSASASADSSALTVVESCNSSNSFSVS 87
           D F V++LLD SN D             F +      A  +   ++  E  +  ++   S
Sbjct: 39  DDFSVDDLLDLSNDDV------------FADEETDLKAQHEMVRVSSEEPNDDGDALRRS 98

Query: 88  EPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKV 147
              S  +D  +        + EL +P DDLA LEWLS+FVE+SF+               
Sbjct: 99  SDFSGCDDFGSLP------TSELSLPADDLANLEWLSHFVEDSFT--------------- 158

Query: 148 KSDGPSHSRQPTTATA-AVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRL 207
           +  GP+ +  PT   A       H   A          VPAKARSKR+R     W+    
Sbjct: 159 EYSGPNLTGTPTEKPAWLTGDRKHPVTAVTEETCFKSPVPAKARSKRNRNGLKVWSLGSS 218

Query: 208 LPLSPTTSSSESDIAPAGPPQP-------VKKIPPKAAATVKKKDCPEAGASAGEG---- 267
               P++S S S  + +GP  P       ++ +         KK    +  S   G    
Sbjct: 219 SSSGPSSSGSTSS-SSSGPSSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQ 278

Query: 268 ----RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN 325
               RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN
Sbjct: 279 LQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSN 317

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038886306.12.9e-14375.71GATA transcription factor 12-like [Benincasa hispida][more]
KAG6585379.12.2e-13570.53GATA transcription factor 12, partial [Cucurbita argyrosperma subsp. sororia] >K... [more]
XP_023002390.11.1e-13471.13GATA transcription factor 12-like [Cucurbita maxima][more]
XP_023538437.11.4e-13470.56GATA transcription factor 12-like [Cucurbita pepo subsp. pepo][more]
XP_022951637.12.4e-13471.83GATA transcription factor 12-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
P697819.8e-7049.86GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1[more]
O826328.6e-5844.29GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1[more]
O497411.7e-4241.21GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1[more]
O497439.9e-3840.61GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1[more]
Q9FH572.1e-3236.42GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1KNT05.3e-13571.13GATA transcription factor OS=Cucurbita maxima OX=3661 GN=LOC111496245 PE=3 SV=1[more]
A0A6J1GI871.2e-13471.83GATA transcription factor OS=Cucurbita moschata OX=3662 GN=LOC111454392 PE=3 SV=... [more]
A0A0A0LPR51.4e-13268.84GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_2G373450 PE=3 SV=1[more]
A0A5A7VCX14.2e-13269.82GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffo... [more]
A0A1S3BBN74.2e-13269.82GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103488171 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G25830.17.0e-7149.86GATA transcription factor 12 [more]
AT4G32890.16.1e-5944.29GATA transcription factor 9 [more]
AT2G45050.11.2e-4341.21GATA transcription factor 2 [more]
AT3G60530.17.0e-3940.61GATA transcription factor 4 [more]
AT5G66320.11.5e-3336.42GATA transcription factor 5 [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 316..336
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 195..221
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 189..239
NoneNo IPR availablePANTHERPTHR45658:SF43GATA TRANSCRIPTION FACTORcoord: 1..373
NoneNo IPR availablePANTHERPTHR45658GATA TRANSCRIPTION FACTORcoord: 1..373
NoneNo IPR availableSUPERFAMILY57716Glucocorticoid receptor-like (DNA-binding domain)coord: 254..316
IPR000679Zinc finger, GATA-typeSMARTSM00401GATA_3coord: 252..302
e-value: 7.5E-18
score: 75.3
IPR000679Zinc finger, GATA-typePFAMPF00320GATAcoord: 258..291
e-value: 1.9E-15
score: 56.1
IPR000679Zinc finger, GATA-typePROSITEPS00344GATA_ZN_FINGER_1coord: 258..283
IPR000679Zinc finger, GATA-typePROSITEPS50114GATA_ZN_FINGER_2coord: 252..288
score: 12.545015
IPR000679Zinc finger, GATA-typeCDDcd00202ZnF_GATAcoord: 257..304
e-value: 5.16017E-15
score: 66.6274
IPR016679Transcription factor, GATA, plantPIRSFPIRSF016992Txn_fac_GATA_plantcoord: 14..349
e-value: 3.6E-73
score: 244.7
IPR013088Zinc finger, NHR/GATA-typeGENE3D3.30.50.10coord: 250..336
e-value: 6.8E-16
score: 59.7

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spg033974.1Spg033974.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030154 cell differentiation
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003677 DNA binding