CmoCh02G016520 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh02G016520
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: AT-hook motif nuclear-localized protein
Location: Cmo_Chr02: 9527908 .. 9532190 (-)
RNA-Seq Expression: CmoCh02G016520
Synteny: CmoCh02G016520
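The Location field can be read mechanically. The sketch below is illustrative only: the function name `parse_location` and the returned dictionary keys are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Pattern for a location string such as "Cmo_Chr02: 9527908 .. 9532190 (-)".
LOCATION_RE = re.compile(
    r"(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)"
)


def parse_location(text: str) -> dict:
    """Split a location string into sequence id, start, end, strand and span length."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }


location = parse_location("Cmo_Chr02: 9527908 .. 9532190 (-)")
print(location["seqid"], location["strand"], location["length"])
```

Under the inclusive-coordinate assumption the gene spans 4283 bp on the minus strand of Cmo_Chr02.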
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTTTTTTTTTTTTTTTTTTTTTTTTAATGGATTCACTTGATACTCCTCCGTCACTCTCAGCACCGTCCAACATGACGGTCGGAGTACCGACGGCGTATTCGCCAGGCATGTCCAACGCCAACAACAATGCCTCTTCCACGCTTGGTCTAAATCCGGCCACGGCACAGATGATACAGCCTTCTGCGCGATTTCCGTTTAACTCTGTGATCGCCCCTGCGTCCGTGCCTTTGGATTCTATGAATGTTTCTCCGTACGACGGATCTCATTCTGGGAGTTTCAACGCTGATTCGGGGAAGAAGAAGAGAGGACGGCCGAGGAAGTACACGCCTGATGGTAACATTGCCTTAGGCTTAGCACCTACCACTATGGCGTCTTCTGTCGGTCACGGGGATCTGAGCGGCACTCCTGATTTGGAGCAGCCGGCGAAGAAAGCGAGGGGAAGGCCACCAGGCTCGGGGAAGAAACAGATGAATGCTATTGGTATCGTCAATTGAATTACTCTTTTGTTTTTCTTTGTGTTGGAACTGCATTTTTTTGTTCATATTAATTGATATTATCTGGACGTTTTGTTGGATAGGATCAAGCGGTGTTGGTTTTACTCCTCACGTAGTATGGGCGAAGCCTGGTGAGGTGCGTATTCTGTATAATTCTTAGCTTGTTTTTTATATAATTATTGAAATTTGAAAGATTGTTCTTTTTAAAGTGATCTTTAGTTTAAGCGTTCAAGATTATGCGCGTTTATTAGAATGACTGCAATGCACGTAAAAGAACATGCTGTAATTGGGAAGTGATGCAGTAGCCCCTTGTTTCTGTTTCTAGTTCTAGGACAACTACGGATTTTGTATAAAACTACATTCAGAATATGTTGTATTAAATGGGATACTGGATTTCTGGATGTCTCCTTGGCTGGTTCATTCTAGTTATATCCATATTAGAGTTCTTCATTTATCAATTTTTGTGGAGAAAATGTGACATTCCTGAATTTATCTGTTAGATCATGTGATATCTACAACCAAAGATGAATTATTTCACGACGCAGAAATTATTTTGCATTGTATATGAAATCTAATGTAACTTTTTTTATCCTTTTTAAGGAACCGAATATCCTTGACTTCGATATTGATAAACGATAGACTGAACTATTAGCTAAATTATGACGGCATTGTCATAAACTGATTTGAGCTTGAGTATTAATGCATTGATATCATTCCATGAAGTTTTAGCTTGTTGTTGGAGTGAATTCAGTCAATTTAGGTATGTTATGGAGGCTTTGTATATTTGATTTAAGAAATGATTATAAGTAATATTGAAAACAAAGGCAACATTCTTCTCAAAAGGAAACAAAAGTAGCACAAAAAGAATGAATTTTGTAGTCGCGTAAACTTAGCAACTACCTTGTGAACATGGTTTTTTTTTAGGTAGAAGTTGCCAGAATGCAAATTGCGTGCAATGAATAAGCGTAAAATAAATTGTATGTTATTGATGTAGTTTTTTTGCTTGGAAGCTGTCAAATTTCCTGTTAAAAGCATCAAAATGAATATCTTGTTATTTGCAACATGATTTTAAGATTTACATTGTTCCTCTGAGCTTTGGATTAATTTTACTTCTCTCTCGGTAGTCCTTTTTTTATTTCTTGACAAATCTTTTTATCAGTGGTGTTGGGACTTCTATTTTTTAGATTATCCTTTTCATGGTTTATTGTTAATTACAACTCTAGTCGGTTTGGAGAAGTTAGAACATAGTCCCTACAATTTTAAAAGTTAGAATTTCTTATGGTTTGATAAAACTTTATAAGTGGTCCATATGGTTCGATAATCTCCACGAATACAGTAAGTACTACTTTTTCCAGTTTTATTAAACTAAAGGACTAATTCTAACATTTAAAAGTACAAAGACTAAATTCCAACTTTCTAATAGGTCAACTAATCTGGTAGAAAGTATGTAAGCATAGCTATGATTCATCTCTAATAGGTCATTAGTTTGTAATTCTATGTATTG
CACGAGTTTGGAGGTGAGGATGCCTGCAAGTTTGATGGTTCCTTTTCTTCCTTTTAGTGGAGAGTTATTGATAGAAGTAATATTTTATTGGAGACTCTTAGGTCGTAATGCGGGTTTCATGCTTACATGGTTGCAGGACGTAGCAGCAAAAATTTTGGCTTTCTCACAGCAAGGACCACGAACTGTCTTCATTCTCTCTGCAAATGGTTCCATCAGTAATGCTACTCTTCGACACTCAATGACATCTGGTGGTTCTGTGACATATGAGGTGCTTATAATTGTTTTTGAATTGCATAGCTGATTGACTCTCTGCATGGATATAATTACTAATGTTTTATCATATCTAGTATTTGCCTCTTTGAATTAATCATCAACTCATGTAAACTTGAAGCCTTCGCTGAATTCCGATCTGATAGCTATTAAATCAACTACATGGACCTTGGATAGCAAATGGAAAAGGATTGTTCAATTTAAGTGTGCTCAGGTTAGGATAACATTTGTTTCCAATAAGTAGTAGGGACATGCTCCCTCCCAAGTATGCACAGTCCTTTTCAGTTCTGATGTTGTAGAGATTCTGTGATTCATAGGCAGTGTTTCTAAGGTTAGGCAGGAGCCACATGTGCCTGCAAAGTGCATTCACGTGGACGATCGTCTGGGTCACCTTTTCTATAACCATTTGCACTACAGTAAAATAAAGGCAATCACCAAGTTTCTTTTTGCCTTTTCTTCTCCTTTTCTACTACACTAATTAAGGGATCCTGATCTCTTTCAGCATCTCTTGAAGGTAAAAACGGTATAAAAAACCTATATTATTAAAAATATATATTATTCCATTGCTAAATCATCTTATGTTATGTTACCATGTATTCTAATTTTGATGTAGAACTGAATTTTATATTTCGAGAAATGCACTTTACTTGGCACAGGTGACATTGTGTGCCTTCCCATGATTTAGCAATAAACTTATCGTCACCAAGTGAGAACGTTTCTGATAATTTTCTAGACTTGAAGTTATCACCATTAAGCTAAGGGTTAAGGAAAGGGACTCCCCGGTTGATAAAGGAGAGAGTGGAAGAAAAAATGCAGATGATTATTGCAAGATGGACAGATAAAAGAAAAAAGGGCTTGGTCAATATGAAGATAGGCCTAGTGGCCTTTCTTGGAAGGGCTCTCAACAATTCCTACACAATCGATCTTTCCACCGGAAATTAAATAGGTGCCAGTTGATATTAAAGAATATGCCGAGATGATTCGTAGAGGTTATGGTTGTAGGTCGTAAAAGTATAAAACTAGAAAATATAGACTTTGTAACAATAAAACTCCGTACTAGTGCCAGTAGAACAGGTGCAATATGCTCTGCTTTGAGTATGTGATCAATCCCCTATGAAAATAGGCGCAACCTTATATAGGATCAGTTTGAGATCGATTCTAAAAGAAAAACATTTATCTAATCTTATGCAGGGGCAGTATGAGATAATCTCTCTGTCAGGCTCCTTTATGCTCTCGGAGAATAATGGAACTCGAAGTAGAACAGGTGGTTTGAGTGTGCTGCTGGCTGGGTCAGACGGACAGGTTCTTGGTGGTGGAGTTGCAGGAATGCTAATGGCAAGTTCCCAAGTACAGGTACTAATTTTACCTGGGTATCCTTATTTCAAAGAAACTGTTTTTTAACACTCATATAACTTATGTCCGAAGCGCTTCAAAGGCGTCATGTACTTGATAGACTTCACAGATCATGCTTTACAAACACTCTGATTTCATTGCATCCTCAGGTGGTTGTGGGAAGTTTTCTCGAGAATGATAAAAAGTCCAACAACACAGGTATGCTGAATTCTGGATCCTCTGCTTCACCATCTCAAATGATAAACTTTGGTGGTGCAGCAGCAGCAGCTGCAGCCAGCCCTCCATCGTTAGGGGCATCGAGTGGCGAGTCGTCTGCCGACAACGGAGGCAGCCCTCTTAACAATAGGCATCCCGGAATGTTCAGTAATAGCAGCCAG
CCAATCCACAACATGCAGATGTACCACCAATTGTGGGCAGGCCAAACACAGCAATGAGGATTTATTTTGCAACTAACAGTGAGAAGTGCATTAGGATGATGATGATGATGCTCTTCAGTCTACACACATTCAACATCGTTTTGTGAATCTAACTTTCGCAATTAATTACCAAAAGTATTTTATAGAGGGGCTTATGTTTATTTATTTAGGTAATTATTAAGAAAGATTATTATTTCAATCTTGCTTTCTGTTTTCCCCCCCTCTTTCCGCAGCACTGAAGGAA

mRNA sequence

TTTTTTTTTTTTTTTTTTTTTTTTTTTAATGGATTCACTTGATACTCCTCCGTCACTCTCAGCACCGTCCAACATGACGGTCGGAGTACCGACGGCGTATTCGCCAGGCATGTCCAACGCCAACAACAATGCCTCTTCCACGCTTGGTCTAAATCCGGCCACGGCACAGATGATACAGCCTTCTGCGCGATTTCCGTTTAACTCTGTGATCGCCCCTGCGTCCGTGCCTTTGGATTCTATGAATGTTTCTCCGTACGACGGATCTCATTCTGGGAGTTTCAACGCTGATTCGGGGAAGAAGAAGAGAGGACGGCCGAGGAAGTACACGCCTGATGGTAACATTGCCTTAGGCTTAGCACCTACCACTATGGCGTCTTCTGTCGGTCACGGGGATCTGAGCGGCACTCCTGATTTGGAGCAGCCGGCGAAGAAAGCGAGGGGAAGGCCACCAGGCTCGGGGAAGAAACAGATGAATGCTATTGGATCAAGCGGTGTTGGTTTTACTCCTCACGTAGTATGGGCGAAGCCTGGTGAGGACGTAGCAGCAAAAATTTTGGCTTTCTCACAGCAAGGACCACGAACTGTCTTCATTCTCTCTGCAAATGGTTCCATCAGTAATGCTACTCTTCGACACTCAATGACATCTGGTGGTTCTGTGACATATGAGCCTTCGCTGAATTCCGATCTGATAGCTATTAAATCAACTACATGGACCTTGGATAGCAAATGGAAAAGGATTGTTCAATTTAAGTGTGCTCAGGGGCAGTATGAGATAATCTCTCTGTCAGGCTCCTTTATGCTCTCGGAGAATAATGGAACTCGAAGTAGAACAGGTGGTTTGAGTGTGCTGCTGGCTGGGTCAGACGGACAGGTTCTTGGTGGTGGAGTTGCAGGAATGCTAATGGCAAGTTCCCAAGTACAGGTGGTTGTGGGAAGTTTTCTCGAGAATGATAAAAAGTCCAACAACACAGGTATGCTGAATTCTGGATCCTCTGCTTCACCATCTCAAATGATAAACTTTGGTGGTGCAGCAGCAGCAGCTGCAGCCAGCCCTCCATCGTTAGGGGCATCGAGTGGCGAGTCGTCTGCCGACAACGGAGGCAGCCCTCTTAACAATAGGCATCCCGGAATGTTCAGTAATAGCAGCCAGCCAATCCACAACATGCAGATGTACCACCAATTGTGGGCAGGCCAAACACAGCAATGAGGATTTATTTTGCAACTAACAGTGAGAAGTGCATTAGGATGATGATGATGATGCTCTTCAGTCTACACACATTCAACATCGTTTTGTGAATCTAACTTTCGCAATTAATTACCAAAAGTATTTTATAGAGGGGCTTATGTTTATTTATTTAGGTAATTATTAAGAAAGATTATTATTTCAATCTTGCTTTCTGTTTTCCCCCCCTCTTTCCGCAGCACTGAAGGAA

Coding sequence (CDS)

ATGGATTCACTTGATACTCCTCCGTCACTCTCAGCACCGTCCAACATGACGGTCGGAGTACCGACGGCGTATTCGCCAGGCATGTCCAACGCCAACAACAATGCCTCTTCCACGCTTGGTCTAAATCCGGCCACGGCACAGATGATACAGCCTTCTGCGCGATTTCCGTTTAACTCTGTGATCGCCCCTGCGTCCGTGCCTTTGGATTCTATGAATGTTTCTCCGTACGACGGATCTCATTCTGGGAGTTTCAACGCTGATTCGGGGAAGAAGAAGAGAGGACGGCCGAGGAAGTACACGCCTGATGGTAACATTGCCTTAGGCTTAGCACCTACCACTATGGCGTCTTCTGTCGGTCACGGGGATCTGAGCGGCACTCCTGATTTGGAGCAGCCGGCGAAGAAAGCGAGGGGAAGGCCACCAGGCTCGGGGAAGAAACAGATGAATGCTATTGGATCAAGCGGTGTTGGTTTTACTCCTCACGTAGTATGGGCGAAGCCTGGTGAGGACGTAGCAGCAAAAATTTTGGCTTTCTCACAGCAAGGACCACGAACTGTCTTCATTCTCTCTGCAAATGGTTCCATCAGTAATGCTACTCTTCGACACTCAATGACATCTGGTGGTTCTGTGACATATGAGCCTTCGCTGAATTCCGATCTGATAGCTATTAAATCAACTACATGGACCTTGGATAGCAAATGGAAAAGGATTGTTCAATTTAAGTGTGCTCAGGGGCAGTATGAGATAATCTCTCTGTCAGGCTCCTTTATGCTCTCGGAGAATAATGGAACTCGAAGTAGAACAGGTGGTTTGAGTGTGCTGCTGGCTGGGTCAGACGGACAGGTTCTTGGTGGTGGAGTTGCAGGAATGCTAATGGCAAGTTCCCAAGTACAGGTGGTTGTGGGAAGTTTTCTCGAGAATGATAAAAAGTCCAACAACACAGGTATGCTGAATTCTGGATCCTCTGCTTCACCATCTCAAATGATAAACTTTGGTGGTGCAGCAGCAGCAGCTGCAGCCAGCCCTCCATCGTTAGGGGCATCGAGTGGCGAGTCGTCTGCCGACAACGGAGGCAGCCCTCTTAACAATAGGCATCCCGGAATGTTCAGTAATAGCAGCCAGCCAATCCACAACATGCAGATGTACCACCAATTGTGGGCAGGCCAAACACAGCAATGA

Protein sequence

MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSVIAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGHGDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQQGPRTVFILSANGSISNATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVVVGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSPLNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ
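The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and not part of this database. Only the first and last few codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 and last 18 nucleotides of the CDS listed above.
print(translate("ATGGATTCACTTGATACTCCTCCG"))  # MDSLDTPP
print(translate("GGCCAAACACAGCAATGA"))        # GQTQQ
```

Both fragments agree with the start (MDSLDTPP...) and end (...GQTQQ) of the protein sequence shown above.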
Homology
BLAST of CmoCh02G016520 vs. ExPASy Swiss-Prot
Match: Q940I0 (AT-hook motif nuclear-localized protein 13 OS=Arabidopsis thaliana OX=3702 GN=AHL13 PE=1 SV=1)

HSP 1 Score: 208.0 bits (528), Expect = 2.0e-52
Identity = 172/427 (40.28%), Positives = 227/427 (53.16%), Query Frame = 0

Query: 30  NANNNASSTLGLNPATAQMIQPSARFPFNSVIAP--------------ASVPLDSMNVSP 89
           N N  A+  +G N +T+Q +    R PF   ++P                +   ++    
Sbjct: 48  NPNAAAAVLMGHNTSTSQAMH--QRLPFGGSMSPHQPQQHQYHHPQPQQQIDQKTLESLG 107

Query: 90  YDGS---------HSGSFNAD--SGKKKRGRPRKYTPDG--------NIALGLAPTTMAS 149
           +DGS         HS  F  D    KKKRGRPRKY  DG        NIALGLAPT+   
Sbjct: 108 FDGSPSSVAATQQHSMRFGIDHQQVKKKRGRPRKYAADGGGGGGGGSNIALGLAPTSPLP 167

Query: 150 SV-----------GHGDLSG--TPDLEQPAKKARGRPPGSGKKQMNAI-GSSGVGFTPHV 209
           S            G GD +G      + PAK+ RGRPPGSGKKQ++A+ G+ GVGFTPHV
Sbjct: 168 SASNSYGGGNEGGGGGDSAGANANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVGFTPHV 227

Query: 210 VWAKPGEDVAAKILAFSQQGPRTVFILSANGSISNATLRHSMTSG--GSVTYEPSLNSDL 269
           +  K GED+A KILAF+ QGPR + ILSA G+++N  LR +  S   G+V YE       
Sbjct: 228 IEVKTGEDIATKILAFTNQGPRAICILSATGAVTNVMLRQANNSNPTGTVKYE------- 287

Query: 270 IAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDG 329
                                   G++EIISLSGSF+ SE+NGT ++TG LSV LAG +G
Sbjct: 288 ------------------------GRFEIISLSGSFLNSESNGTVTKTGNLSVSLAGHEG 347

Query: 330 QVLGGGVAGMLMASSQVQVVVGSFLENDKKSNNTG--MLNSGSSAS-PSQMINFGGAAAA 389
           +++GG V GML+A SQVQV+VGSF+ + +K   +     N+   AS P+ M++FGG    
Sbjct: 348 RIVGGCVDGMLVAGSQVQVIVGSFVPDGRKQKQSAGRAQNTPEPASAPANMLSFGG--VG 407

Query: 390 AAASPPSLGAS-SGESSADN-GGSPL-------NNRHPGMFSNSS-QPIHN--MQMYHQL 393
              SP S G   S ESS +N   SPL       N+ + G+F NS+ QP+H   MQMY  L
Sbjct: 408 GPGSPRSQGQQHSSESSEENESNSPLHRRSNNNNSNNHGIFGNSTPQPLHQIPMQMYQNL 439
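
The percentages in an HSP statistics line are the matched counts divided by the alignment length. As a rough sketch (the function name `recompute_percentages` is hypothetical, and the regular expression only assumes the `label = n/m` layout seen in the reports here), they can be recomputed like this:

```python
import re

# Matches "<label> = <count>/<alignment length>", e.g. "Identity = 172/427".
FRACTION_RE = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def recompute_percentages(stats_line: str) -> dict:
    """Return each label's percentage (two decimals) from an HSP statistics line."""
    return {
        label: round(100 * int(count) / int(total), 2)
        for label, count, total in FRACTION_RE.findall(stats_line)
    }


line = "Identity = 172/427 (40.28%), Positives = 227/427 (53.16%), Query Frame = 0"
print(recompute_percentages(line))
```

For the AHL13 hit this gives 40.28 for Identity and 53.16 for Positives, matching the reported values. The alignment length (427) exceeds the 393-residue query because it includes gap columns.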

BLAST of CmoCh02G016520 vs. ExPASy Swiss-Prot
Match: Q9FIR1 (AT-hook motif nuclear-localized protein 8 OS=Arabidopsis thaliana OX=3702 GN=AHL8 PE=2 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 5.3e-50
Identity = 170/441 (38.55%), Positives = 228/441 (51.70%), Query Frame = 0

Query: 1   MDSLDTPPS---LSAPSNMTVGVPTAYSPGMSNAN----NNASSTLGLNPATAQMIQPSA 60
           MDS D PPS   L  P            PGM  ++    N A+S L +  +T+Q IQ   
Sbjct: 1   MDSRDIPPSHNQLQPP------------PGMLMSHYRNPNAAASPLMVPTSTSQPIQ-HP 60

Query: 61  RFPFNSVIAPASV-----------PLDSMNVSPYDGSHSGS---FNADSG------KKKR 120
           R PF +     +             L+S+     DGS S     F  D        KKKR
Sbjct: 61  RLPFGNQQQSQTFHQQQQQQMDQKTLESLGFG--DGSPSSQPMRFGIDDQNQQLQVKKKR 120

Query: 121 GRPRKYTPDGNIALGLAPTTMASSV--------GHGDLSGTPD-LEQPAKKARGRPPGSG 180
           GRPRKYTPDG+IALGLAPT+   S         G GD  G  + ++ P K+ RGRPPGS 
Sbjct: 121 GRPRKYTPDGSIALGLAPTSPLLSAASNSYGEGGVGDSGGNGNSVDPPVKRNRGRPPGSS 180

Query: 181 KKQMNAI-GSSGVGFTPHVVWAKPGEDVAAKILAFSQQGPRTVFILSANGSISNATLRHS 240
           KKQ++A+ G+SGVGFTPHV+    GED+A+K++AFS QG RT+ ILSA+G++S   LR +
Sbjct: 181 KKQLDALGGTSGVGFTPHVIEVNTGEDIASKVMAFSDQGSRTICILSASGAVSRVMLRQA 240

Query: 241 MTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFMLSENNG 300
             S G VTYE                               G++EII+LSGS +  E NG
Sbjct: 241 SHSSGIVTYE-------------------------------GRFEIITLSGSVLNYEVNG 300

Query: 301 TRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVVVGSFLENDKKSNNTGM-----LN 360
           + +R+G LSV LAG DG ++GG V G L+A++QVQV+VGSF+   KK   + +      N
Sbjct: 301 STNRSGNLSVALAGPDGGIVGGSVVGNLVAATQVQVIVGSFVAEAKKPKQSSVNIARGQN 360

Query: 361 SGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSPLNNR--HPGMF----SNS 392
              +++P+ M+NFG           S G SS  S  +  GSP  +R  + G++       
Sbjct: 361 PEPASAPANMLNFGSV---------SQGPSSESSEENESGSPAMHRDNNNGIYGAQQQQQ 386

BLAST of CmoCh02G016520 vs. ExPASy Swiss-Prot
Match: O22812 (AT-hook motif nuclear-localized protein 10 OS=Arabidopsis thaliana OX=3702 GN=AHL10 PE=1 SV=2)

HSP 1 Score: 186.4 bits (472), Expect = 6.1e-46
Identity = 141/361 (39.06%), Positives = 190/361 (52.63%), Query Frame = 0

Query: 24  YSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSVIAPASVPLDSMNVSPYDGSHSGS 83
           +S      + N   + G +  TA   QP           P S   +S+      G  SG 
Sbjct: 28  HSQAQPQQSQNRPLSFGGDDGTALYKQPMRSVSPPQQYQPNSAGENSVLNMNLPGGESGG 87

Query: 84  F---NADSGKKKRGRPRKYTPD-GNIALGLAPTTMASSVGHGDLSGTPDLEQPAKKARGR 143
                ++  KK+RGRPRKY PD G ++LGL P   + +V      G        +K RGR
Sbjct: 88  MTGTGSEPVKKRRGRPRKYGPDSGEMSLGLNPGAPSFTVSQPSSGG-----DGGEKKRGR 147

Query: 144 PPGSGKK--QMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQQGPRTVFILSANGSISN 203
           PPGS  K  ++ A+GS+G+GFTPHV+    GEDV++KI+A +  GPR V +LSANG+ISN
Sbjct: 148 PPGSSSKRLKLQALGSTGIGFTPHVLTVLAGEDVSSKIMALTHNGPRAVCVLSANGAISN 207

Query: 204 ATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFM 263
            TLR S TSGG+VTYE                               G++EI+SLSGSF 
Sbjct: 208 VTLRQSATSGGTVTYE-------------------------------GRFEILSLSGSFH 267

Query: 264 LSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVVVGSFLENDKKSNNTGML 323
           L ENNG RSRTGGLSV L+  DG VLGG VAG+L+A+S VQ+VVGSFL + +K     + 
Sbjct: 268 LLENNGQRSRTGGLSVSLSSPDGNVLGGSVAGLLIAASPVQIVVGSFLPDGEKEPKQHVG 327

Query: 324 NSGSSA------SPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSPLNNRHPGMFSN 373
             G S+      +P+Q++          +SP S G  S  S     GSP++    G ++N
Sbjct: 328 QMGLSSPVLPRVAPTQVL-------MTPSSPQSRGTMSESSCGGGHGSPIHQSTGGPYNN 345

BLAST of CmoCh02G016520 vs. ExPASy Swiss-Prot
Match: Q9SB31 (AT-hook motif nuclear-localized protein 3 OS=Arabidopsis thaliana OX=3702 GN=AHL3 PE=1 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 2.7e-38
Identity = 119/319 (37.30%), Positives = 164/319 (51.41%), Query Frame = 0

Query: 29  SNANNNASSTLGLNPATAQMIQ---------PSARFPFNSVIAPASVPLDSMNVSPYDGS 88
           +N NNN +S+ GL                  P    P   ++ P +VP  +   +    +
Sbjct: 7   TNINNNITSSFGLKQQHEAAASDGGYSMDPPPRPENPNPFLVPPTTVPAAATVAAAVTEN 66

Query: 89  HSGSF---------NADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGHGDLSGTPDLE 148
            +  F         +A+  KKKRGRPRKY PDG + + L+P  ++SSV     S  P   
Sbjct: 67  AATPFSLTMPTENTSAEQLKKKRGRPRKYNPDGTLVVTLSPMPISSSVPL--TSEFP--- 126

Query: 149 QPAKKARGRPPGS---GKKQM------------NAIGSS---GVGFTPHVVWAKPGEDVA 208
            P K+ RGR   +    K QM              +G++   G  FTPHV+    GEDV 
Sbjct: 127 -PRKRGRGRGKSNRWLKKSQMFQFDRSPVDTNLAGVGTADFVGANFTPHVLIVNAGEDVT 186

Query: 209 AKILAFSQQGPRTVFILSANGSISNATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDS 268
            KI+ FSQQG R + ILSANG ISN TLR SMTSGG++TYE                   
Sbjct: 187 MKIMTFSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYE------------------- 246

Query: 269 KWKRIVQFKCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLM 312
                       G++EI+SL+GSFM +++ GTRSR GG+SV LAG DG+V GGG+AG+ +
Sbjct: 247 ------------GRFEILSLTGSFMQNDSGGTRSRAGGMSVCLAGPDGRVFGGGLAGLFL 288

BLAST of CmoCh02G016520 vs. ExPASy Swiss-Prot
Match: Q8VYJ2 (AT-hook motif nuclear-localized protein 1 OS=Arabidopsis thaliana OX=3702 GN=AHL1 PE=1 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 8.0e-38
Identity = 114/297 (38.38%), Positives = 163/297 (54.88%), Query Frame = 0

Query: 49  IQPSARFPFNSVIAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGN-IAL 108
           + P    P +   AP  + + ++  +    +  G  +    KKKRGRPRKY PDG  +AL
Sbjct: 49  VTPPPPQPSSHHTAPPPLQISTVTTTTTTAAMEG-ISGGLMKKKRGRPRKYGPDGTVVAL 108

Query: 109 GLAPTTMASSVGH--GDLSGTPDLEQPAKKARGRPPGSGKK-----QMNAIG-----SSG 168
              P + A +  H     S   D     K+++ +P  S  +     Q+  +G     S G
Sbjct: 109 SPKPISSAPAPSHLPPPSSHVIDFSASEKRSKVKPTNSFNRTKYHHQVENLGEWAPCSVG 168

Query: 169 VGFTPHVVWAKPGEDVAAKILAFSQQGPRTVFILSANGSISNATLRHSMTSGGSVTYEPS 228
             FTPH++    GEDV  KI++FSQQGPR++ +LSANG IS+ TLR   +SGG++TYE  
Sbjct: 169 GNFTPHIITVNTGEDVTMKIISFSQQGPRSICVLSANGVISSVTLRQPDSSGGTLTYE-- 228

Query: 229 LNSDLIAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLL 288
                                        G++EI+SLSGSFM +++ GTRSRTGG+SV L
Sbjct: 229 -----------------------------GRFEILSLSGSFMPNDSGGTRSRTGGMSVSL 288

Query: 289 AGSDGQVLGGGVAGMLMASSQVQVVVGSFL-------ENDKKSNNTGMLNSGSSASP 326
           A  DG+V+GGG+AG+L+A+S VQVVVGSFL       +  KK+ +  ML+S ++A P
Sbjct: 289 ASPDGRVVGGGLAGLLVAASPVQVVVGSFLAGTDHQDQKPKKNKHDFMLSSPTAAIP 313

BLAST of CmoCh02G016520 vs. ExPASy TrEMBL
Match: A0A6J1EXY5 (AT-hook motif nuclear-localized protein OS=Cucurbita moschata OX=3662 GN=LOC111437488 PE=4 SV=1)

HSP 1 Score: 667.5 bits (1721), Expect = 3.3e-188
Identity = 361/392 (92.09%), Positives = 361/392 (92.09%), Query Frame = 0

Query: 1   MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSV 60
           MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSV
Sbjct: 1   MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSV 60

Query: 61  IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH 120
           IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH
Sbjct: 61  IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH 120

Query: 121 GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ 180
           GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ
Sbjct: 121 GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ 180

Query: 181 QGPRTVFILSANGSISNATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQF 240
           QGPRTVFILSANGSISNATLRHSMTSGGSVTYE                           
Sbjct: 181 QGPRTVFILSANGSISNATLRHSMTSGGSVTYE--------------------------- 240

Query: 241 KCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV 300
               GQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV
Sbjct: 241 ----GQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV 300

Query: 301 VGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSP 360
           VGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSP
Sbjct: 301 VGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSP 360

Query: 361 LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ 393
           LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ
Sbjct: 361 LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ 361

BLAST of CmoCh02G016520 vs. ExPASy TrEMBL
Match: A0A6J1K244 (AT-hook motif nuclear-localized protein OS=Cucurbita maxima OX=3661 GN=LOC111491025 PE=4 SV=1)

HSP 1 Score: 665.2 bits (1715), Expect = 1.6e-187
Identity = 360/392 (91.84%), Positives = 360/392 (91.84%), Query Frame = 0

Query: 1   MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSV 60
           MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMI PSARFPFNSV
Sbjct: 1   MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIPPSARFPFNSV 60

Query: 61  IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH 120
           IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH
Sbjct: 61  IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH 120

Query: 121 GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ 180
           GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ
Sbjct: 121 GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ 180

Query: 181 QGPRTVFILSANGSISNATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQF 240
           QGPRTVFILSANGSISNATLRHSMTSGGSVTYE                           
Sbjct: 181 QGPRTVFILSANGSISNATLRHSMTSGGSVTYE--------------------------- 240

Query: 241 KCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV 300
               GQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV
Sbjct: 241 ----GQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV 300

Query: 301 VGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSP 360
           VGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSP
Sbjct: 301 VGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSP 360

Query: 361 LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ 393
           LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ
Sbjct: 361 LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ 361

BLAST of CmoCh02G016520 vs. ExPASy TrEMBL
Match: A0A6J1ETT7 (AT-hook motif nuclear-localized protein OS=Cucurbita moschata OX=3662 GN=LOC111437488 PE=4 SV=1)

HSP 1 Score: 620.9 bits (1600), Expect = 3.6e-174
Identity = 341/392 (86.99%), Positives = 341/392 (86.99%), Query Frame = 0

Query: 1   MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSV 60
           MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSV
Sbjct: 1   MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSV 60

Query: 61  IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH 120
           IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH
Sbjct: 61  IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH 120

Query: 121 GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ 180
           GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ
Sbjct: 121 GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ 180

Query: 181 QGPRTVFILSANGSISNATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQF 240
           QGPRTVFILSANGSISNATLRHSMTSGGSVTYE                           
Sbjct: 181 QGPRTVFILSANGSISNATLRHSMTSGGSVTYE--------------------------- 240

Query: 241 KCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV 300
               GQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV
Sbjct: 241 ----GQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV 300

Query: 301 VGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSP 360
           VGSFLENDKKSNNT                    AAAAAASPPSLGASSGESSADNGGSP
Sbjct: 301 VGSFLENDKKSNNT--------------------AAAAAASPPSLGASSGESSADNGGSP 341

Query: 361 LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ 393
           LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ
Sbjct: 361 LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ 341

BLAST of CmoCh02G016520 vs. ExPASy TrEMBL
Match: A0A6J1JZ43 (AT-hook motif nuclear-localized protein OS=Cucurbita maxima OX=3661 GN=LOC111491025 PE=4 SV=1)

HSP 1 Score: 618.6 bits (1594), Expect = 1.8e-173
Identity = 340/392 (86.73%), Positives = 340/392 (86.73%), Query Frame = 0

Query: 1   MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSV 60
           MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMI PSARFPFNSV
Sbjct: 1   MDSLDTPPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIPPSARFPFNSV 60

Query: 61  IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH 120
           IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH
Sbjct: 61  IAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVGH 120

Query: 121 GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ 180
           GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ
Sbjct: 121 GDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQ 180

Query: 181 QGPRTVFILSANGSISNATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQF 240
           QGPRTVFILSANGSISNATLRHSMTSGGSVTYE                           
Sbjct: 181 QGPRTVFILSANGSISNATLRHSMTSGGSVTYE--------------------------- 240

Query: 241 KCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV 300
               GQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV
Sbjct: 241 ----GQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVV 300

Query: 301 VGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSP 360
           VGSFLENDKKSNNT                    AAAAAASPPSLGASSGESSADNGGSP
Sbjct: 301 VGSFLENDKKSNNT--------------------AAAAAASPPSLGASSGESSADNGGSP 341

Query: 361 LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ 393
           LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ
Sbjct: 361 LNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ 341

BLAST of CmoCh02G016520 vs. ExPASy TrEMBL
Match: A0A6J1E3M0 (AT-hook motif nuclear-localized protein OS=Momordica charantia OX=3673 GN=LOC111026175 PE=4 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 8.0e-150
Identity = 302/393 (76.84%), Positives = 328/393 (83.46%), Query Frame = 0

Query: 1   MDSLDT-PPSLSAPSNMTVGVPTAYSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNS 60
           MDSL+T PP LSA SNM VG  TAYS  MSNANNNASST+GLNP T QM+ P+ RFPFNS
Sbjct: 1   MDSLETPPPPLSAASNMAVGGQTAYSAAMSNANNNASSTIGLNPTTTQMMAPAPRFPFNS 60

Query: 61  VIAPASVPLDSMNVSPYDGSHSGSFNADSGKKKRGRPRKYTPDGNIALGLAPTTMASSVG 120
           VIAPASVPLDS+NV+PYDGSHSG+FN DSGKKKRGRPRKYTPDGNIALGLAPTT+ASSVG
Sbjct: 61  VIAPASVPLDSLNVTPYDGSHSGTFNIDSGKKKRGRPRKYTPDGNIALGLAPTTVASSVG 120

Query: 121 HGDLSGTPDLEQPAKKARGRPPGSGKKQMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFS 180
           HGDL+ TPD EQPAKKARGRPPGSGKKQMNA GS G+GFTPHVV  KPGEDVAAKI++F+
Sbjct: 121 HGDLTATPDSEQPAKKARGRPPGSGKKQMNAHGSGGIGFTPHVVLVKPGEDVAAKIVSFT 180

Query: 181 QQGPRTVFILSANGSISNATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQ 240
           QQGPR VFILSANG++S+ATLRH  TSGGSVTYE                          
Sbjct: 181 QQGPRAVFILSANGTVSSATLRHPATSGGSVTYE-------------------------- 240

Query: 241 FKCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQV 300
                GQYEIISLSGSF+LSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMA SQVQ+
Sbjct: 241 -----GQYEIISLSGSFLLSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMAGSQVQL 300

Query: 301 VVGSFLENDKKSNNTGMLNSGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGS 360
           +VGSFLE+DKKSN++ MLNS SSA P QMINF GA AA AASPPSLGASSGESSA+NG S
Sbjct: 301 IVGSFLEDDKKSNSS-MLNSASSAGPPQMINF-GAPAATAASPPSLGASSGESSAENGDS 358

Query: 361 PLNNRHPGMFSNSSQPIHNMQMYHQLWAGQTQQ 393
           PL NRHPGMF+N+SQPI N+QMYH LWAGQTQQ
Sbjct: 361 PL-NRHPGMFNNTSQPI-NVQMYHHLWAGQTQQ 358

BLAST of CmoCh02G016520 vs. TAIR 10
Match: AT4G17950.1 (AT hook motif DNA-binding family protein)

HSP 1 Score: 208.0 bits (528), Expect = 1.4e-53
Identity = 172/427 (40.28%), Positives = 227/427 (53.16%), Query Frame = 0

Query: 30  NANNNASSTLGLNPATAQMIQPSARFPFNSVIAP--------------ASVPLDSMNVSP 89
           N N  A+  +G N +T+Q +    R PF   ++P                +   ++    
Sbjct: 48  NPNAAAAVLMGHNTSTSQAMH--QRLPFGGSMSPHQPQQHQYHHPQPQQQIDQKTLESLG 107

Query: 90  YDGS---------HSGSFNAD--SGKKKRGRPRKYTPDG--------NIALGLAPTTMAS 149
           +DGS         HS  F  D    KKKRGRPRKY  DG        NIALGLAPT+   
Sbjct: 108 FDGSPSSVAATQQHSMRFGIDHQQVKKKRGRPRKYAADGGGGGGGGSNIALGLAPTSPLP 167

Query: 150 SV-----------GHGDLSG--TPDLEQPAKKARGRPPGSGKKQMNAI-GSSGVGFTPHV 209
           S            G GD +G      + PAK+ RGRPPGSGKKQ++A+ G+ GVGFTPHV
Sbjct: 168 SASNSYGGGNEGGGGGDSAGANANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVGFTPHV 227

Query: 210 VWAKPGEDVAAKILAFSQQGPRTVFILSANGSISNATLRHSMTSG--GSVTYEPSLNSDL 269
           +  K GED+A KILAF+ QGPR + ILSA G+++N  LR +  S   G+V YE       
Sbjct: 228 IEVKTGEDIATKILAFTNQGPRAICILSATGAVTNVMLRQANNSNPTGTVKYE------- 287

Query: 270 IAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFMLSENNGTRSRTGGLSVLLAGSDG 329
                                   G++EIISLSGSF+ SE+NGT ++TG LSV LAG +G
Sbjct: 288 ------------------------GRFEIISLSGSFLNSESNGTVTKTGNLSVSLAGHEG 347

Query: 330 QVLGGGVAGMLMASSQVQVVVGSFLENDKKSNNTG--MLNSGSSAS-PSQMINFGGAAAA 389
           +++GG V GML+A SQVQV+VGSF+ + +K   +     N+   AS P+ M++FGG    
Sbjct: 348 RIVGGCVDGMLVAGSQVQVIVGSFVPDGRKQKQSAGRAQNTPEPASAPANMLSFGG--VG 407

Query: 390 AAASPPSLGAS-SGESSADN-GGSPL-------NNRHPGMFSNSS-QPIHN--MQMYHQL 393
              SP S G   S ESS +N   SPL       N+ + G+F NS+ QP+H   MQMY  L
Sbjct: 408 GPGSPRSQGQQHSSESSEENESNSPLHRRSNNNNSNNHGIFGNSTPQPLHQIPMQMYQNL 439

BLAST of CmoCh02G016520 vs. TAIR 10
Match: AT5G46640.1 (AT hook motif DNA-binding family protein)

HSP 1 Score: 199.9 bits (507), Expect = 3.8e-51
Identity = 170/441 (38.55%), Positives = 228/441 (51.70%), Query Frame = 0

Query: 1   MDSLDTPPS---LSAPSNMTVGVPTAYSPGMSNAN----NNASSTLGLNPATAQMIQPSA 60
           MDS D PPS   L  P            PGM  ++    N A+S L +  +T+Q IQ   
Sbjct: 1   MDSRDIPPSHNQLQPP------------PGMLMSHYRNPNAAASPLMVPTSTSQPIQ-HP 60

Query: 61  RFPFNSVIAPASV-----------PLDSMNVSPYDGSHSGS---FNADSG------KKKR 120
           R PF +     +             L+S+     DGS S     F  D        KKKR
Sbjct: 61  RLPFGNQQQSQTFHQQQQQQMDQKTLESLGFG--DGSPSSQPMRFGIDDQNQQLQVKKKR 120

Query: 121 GRPRKYTPDGNIALGLAPTTMASSV--------GHGDLSGTPD-LEQPAKKARGRPPGSG 180
           GRPRKYTPDG+IALGLAPT+   S         G GD  G  + ++ P K+ RGRPPGS 
Sbjct: 121 GRPRKYTPDGSIALGLAPTSPLLSAASNSYGEGGVGDSGGNGNSVDPPVKRNRGRPPGSS 180

Query: 181 KKQMNAI-GSSGVGFTPHVVWAKPGEDVAAKILAFSQQGPRTVFILSANGSISNATLRHS 240
           KKQ++A+ G+SGVGFTPHV+    GED+A+K++AFS QG RT+ ILSA+G++S   LR +
Sbjct: 181 KKQLDALGGTSGVGFTPHVIEVNTGEDIASKVMAFSDQGSRTICILSASGAVSRVMLRQA 240

Query: 241 MTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFMLSENNG 300
             S G VTYE                               G++EII+LSGS +  E NG
Sbjct: 241 SHSSGIVTYE-------------------------------GRFEIITLSGSVLNYEVNG 300

Query: 301 TRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVVVGSFLENDKKSNNTGM-----LN 360
           + +R+G LSV LAG DG ++GG V G L+A++QVQV+VGSF+   KK   + +      N
Sbjct: 301 STNRSGNLSVALAGPDGGIVGGSVVGNLVAATQVQVIVGSFVAEAKKPKQSSVNIARGQN 360

Query: 361 SGSSASPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSPLNNR--HPGMF----SNS 392
              +++P+ M+NFG           S G SS  S  +  GSP  +R  + G++       
Sbjct: 361 PEPASAPANMLNFGSV---------SQGPSSESSEENESGSPAMHRDNNNGIYGAQQQQQ 386

BLAST of CmoCh02G016520 vs. TAIR 10
Match: AT2G33620.1 (AT hook motif DNA-binding family protein)

HSP 1 Score: 186.4 bits (472), Expect = 4.3e-47
Identity = 141/361 (39.06%), Positives = 190/361 (52.63%), Query Frame = 0

Query: 24  YSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSVIAPASVPLDSMNVSPYDGSHSGS 83
           +S      + N   + G +  TA   QP           P S   +S+      G  SG 
Sbjct: 28  HSQAQPQQSQNRPLSFGGDDGTALYKQPMRSVSPPQQYQPNSAGENSVLNMNLPGGESGG 87

Query: 84  F---NADSGKKKRGRPRKYTPD-GNIALGLAPTTMASSVGHGDLSGTPDLEQPAKKARGR 143
                ++  KK+RGRPRKY PD G ++LGL P   + +V      G        +K RGR
Sbjct: 88  MTGTGSEPVKKRRGRPRKYGPDSGEMSLGLNPGAPSFTVSQPSSGG-----DGGEKKRGR 147

Query: 144 PPGSGKK--QMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQQGPRTVFILSANGSISN 203
           PPGS  K  ++ A+GS+G+GFTPHV+    GEDV++KI+A +  GPR V +LSANG+ISN
Sbjct: 148 PPGSSSKRLKLQALGSTGIGFTPHVLTVLAGEDVSSKIMALTHNGPRAVCVLSANGAISN 207

Query: 204 ATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFM 263
            TLR S TSGG+VTYE                               G++EI+SLSGSF 
Sbjct: 208 VTLRQSATSGGTVTYE-------------------------------GRFEILSLSGSFH 267

Query: 264 LSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVVVGSFLENDKKSNNTGML 323
           L ENNG RSRTGGLSV L+  DG VLGG VAG+L+A+S VQ+VVGSFL + +K     + 
Sbjct: 268 LLENNGQRSRTGGLSVSLSSPDGNVLGGSVAGLLIAASPVQIVVGSFLPDGEKEPKQHVG 327

Query: 324 NSGSSA------SPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSPLNNRHPGMFSN 373
             G S+      +P+Q++          +SP S G  S  S     GSP++    G ++N
Sbjct: 328 QMGLSSPVLPRVAPTQVL-------MTPSSPQSRGTMSESSCGGGHGSPIHQSTGGPYNN 345

BLAST of CmoCh02G016520 vs. TAIR 10
Match: AT2G33620.2 (AT hook motif DNA-binding family protein)

HSP 1 Score: 186.4 bits (472), Expect = 4.3e-47
Identity = 141/361 (39.06%), Positives = 190/361 (52.63%), Query Frame = 0

Query: 24  YSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSVIAPASVPLDSMNVSPYDGSHSGS 83
           +S      + N   + G +  TA   QP           P S   +S+      G  SG 
Sbjct: 28  HSQAQPQQSQNRPLSFGGDDGTALYKQPMRSVSPPQQYQPNSAGENSVLNMNLPGGESGG 87

Query: 84  F---NADSGKKKRGRPRKYTPD-GNIALGLAPTTMASSVGHGDLSGTPDLEQPAKKARGR 143
                ++  KK+RGRPRKY PD G ++LGL P   + +V      G        +K RGR
Sbjct: 88  MTGTGSEPVKKRRGRPRKYGPDSGEMSLGLNPGAPSFTVSQPSSGG-----DGGEKKRGR 147

Query: 144 PPGSGKK--QMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQQGPRTVFILSANGSISN 203
           PPGS  K  ++ A+GS+G+GFTPHV+    GEDV++KI+A +  GPR V +LSANG+ISN
Sbjct: 148 PPGSSSKRLKLQALGSTGIGFTPHVLTVLAGEDVSSKIMALTHNGPRAVCVLSANGAISN 207

Query: 204 ATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFM 263
            TLR S TSGG+VTYE                               G++EI+SLSGSF 
Sbjct: 208 VTLRQSATSGGTVTYE-------------------------------GRFEILSLSGSFH 267

Query: 264 LSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVVVGSFLENDKKSNNTGML 323
           L ENNG RSRTGGLSV L+  DG VLGG VAG+L+A+S VQ+VVGSFL + +K     + 
Sbjct: 268 LLENNGQRSRTGGLSVSLSSPDGNVLGGSVAGLLIAASPVQIVVGSFLPDGEKEPKQHVG 327

Query: 324 NSGSSA------SPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSPLNNRHPGMFSN 373
             G S+      +P+Q++          +SP S G  S  S     GSP++    G ++N
Sbjct: 328 QMGLSSPVLPRVAPTQVL-------MTPSSPQSRGTMSESSCGGGHGSPIHQSTGGPYNN 345

BLAST of CmoCh02G016520 vs. TAIR 10
Match: AT2G33620.3 (AT hook motif DNA-binding family protein)

HSP 1 Score: 186.4 bits (472), Expect = 4.3e-47
Identity = 141/361 (39.06%), Positives = 190/361 (52.63%), Query Frame = 0

Query: 24  YSPGMSNANNNASSTLGLNPATAQMIQPSARFPFNSVIAPASVPLDSMNVSPYDGSHSGS 83
           +S      + N   + G +  TA   QP           P S   +S+      G  SG 
Sbjct: 28  HSQAQPQQSQNRPLSFGGDDGTALYKQPMRSVSPPQQYQPNSAGENSVLNMNLPGGESGG 87

Query: 84  F---NADSGKKKRGRPRKYTPD-GNIALGLAPTTMASSVGHGDLSGTPDLEQPAKKARGR 143
                ++  KK+RGRPRKY PD G ++LGL P   + +V      G        +K RGR
Sbjct: 88  MTGTGSEPVKKRRGRPRKYGPDSGEMSLGLNPGAPSFTVSQPSSGG-----DGGEKKRGR 147

Query: 144 PPGSGKK--QMNAIGSSGVGFTPHVVWAKPGEDVAAKILAFSQQGPRTVFILSANGSISN 203
           PPGS  K  ++ A+GS+G+GFTPHV+    GEDV++KI+A +  GPR V +LSANG+ISN
Sbjct: 148 PPGSSSKRLKLQALGSTGIGFTPHVLTVLAGEDVSSKIMALTHNGPRAVCVLSANGAISN 207

Query: 204 ATLRHSMTSGGSVTYEPSLNSDLIAIKSTTWTLDSKWKRIVQFKCAQGQYEIISLSGSFM 263
            TLR S TSGG+VTYE                               G++EI+SLSGSF 
Sbjct: 208 VTLRQSATSGGTVTYE-------------------------------GRFEILSLSGSFH 267

Query: 264 LSENNGTRSRTGGLSVLLAGSDGQVLGGGVAGMLMASSQVQVVVGSFLENDKKSNNTGML 323
           L ENNG RSRTGGLSV L+  DG VLGG VAG+L+A+S VQ+VVGSFL + +K     + 
Sbjct: 268 LLENNGQRSRTGGLSVSLSSPDGNVLGGSVAGLLIAASPVQIVVGSFLPDGEKEPKQHVG 327

Query: 324 NSGSSA------SPSQMINFGGAAAAAAASPPSLGASSGESSADNGGSPLNNRHPGMFSN 373
             G S+      +P+Q++          +SP S G  S  S     GSP++    G ++N
Sbjct: 328 QMGLSSPVLPRVAPTQVL-------MTPSSPQSRGTMSESSCGGGHGSPIHQSTGGPYNN 345
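
The percentages in the HSP header follow directly from the match counts and the alignment length (for example, 141 identical positions out of 361 aligned columns). A minimal sketch of that arithmetic, assuming the statistics line has the shape shown above; the function names are illustrative and not part of any BLAST tool:

```python
import re

# Statistics line copied from the HSP header above.
STATS = "Identity = 141/361 (39.06%), Positives = 190/361 (52.63%), Query Frame = 0"

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    # Matches "<label> = <n>/<total> (<pct>%)"; "Query Frame = 0" is skipped
    # because it has no count/length pair.
    pattern = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in pattern.findall(line)
    }

def recomputed_percent(matches, aligned_length):
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * matches / aligned_length, 2)

stats = parse_hsp_stats(STATS)
for label, (n, total, reported) in stats.items():
    print(label, f"{n}/{total}", "reported:", reported,
          "recomputed:", recomputed_percent(n, total))
```

Note that the denominator is the alignment length including gap columns, not the length of either protein.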

The following BLAST results are available for this feature:
Match Name    E-value     Identity    Description
Q940I0        2.0e-52     40.28       AT-hook motif nuclear-localized protein 13 OS=Arabidopsis thaliana OX=3702 GN=AH...
Q9FIR1        5.3e-50     38.55       AT-hook motif nuclear-localized protein 8 OS=Arabidopsis thaliana OX=3702 GN=AHL...
O22812        6.1e-46     39.06       AT-hook motif nuclear-localized protein 10 OS=Arabidopsis thaliana OX=3702 GN=AH...
Q9SB31        2.7e-38     37.30       AT-hook motif nuclear-localized protein 3 OS=Arabidopsis thaliana OX=3702 GN=AHL...
Q8VYJ2        8.0e-38     38.38       AT-hook motif nuclear-localized protein 1 OS=Arabidopsis thaliana OX=3702 GN=AHL...

Match Name    E-value     Identity    Description
A0A6J1EXY5    3.3e-188    92.09       AT-hook motif nuclear-localized protein OS=Cucurbita moschata OX=3662 GN=LOC1114...
A0A6J1K244    1.6e-187    91.84       AT-hook motif nuclear-localized protein OS=Cucurbita maxima OX=3661 GN=LOC111491...
A0A6J1ETT7    3.6e-174    86.99       AT-hook motif nuclear-localized protein OS=Cucurbita moschata OX=3662 GN=LOC1114...
A0A6J1JZ43    1.8e-173    86.73       AT-hook motif nuclear-localized protein OS=Cucurbita maxima OX=3661 GN=LOC111491...
A0A6J1E3M0    8.0e-150    76.84       AT-hook motif nuclear-localized protein OS=Momordica charantia OX=3673 GN=LOC111...

Match Name    E-value     Identity    Description
AT4G17950.1   1.4e-53     40.28       AT hook motif DNA-binding family protein
AT5G46640.1   3.8e-51     38.55       AT hook motif DNA-binding family protein
AT2G33620.1   4.3e-47     39.06       AT hook motif DNA-binding family protein
AT2G33620.2   4.3e-47     39.06       AT hook motif DNA-binding family protein
AT2G33620.3   4.3e-47     39.06       AT hook motif DNA-binding family protein
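
Three of the five TAIR 10 hits are gene models of the same locus (AT2G33620.1 to .3) with identical statistics. A minimal sketch of how the hits could be ranked by E-value and collapsed to one entry per locus; the values are copied from the TAIR 10 summary above, and the helper names are hypothetical:

```python
import math

# (match name, E-value string, percent identity) copied from the TAIR 10 summary rows.
TAIR_HITS = [
    ("AT4G17950.1", "1.4e-53", 40.28),
    ("AT5G46640.1", "3.8e-51", 38.55),
    ("AT2G33620.1", "4.3e-47", 39.06),
    ("AT2G33620.2", "4.3e-47", 39.06),
    ("AT2G33620.3", "4.3e-47", 39.06),
]

def neg_log10(evalue):
    """Convert an E-value string such as '1.4e-53' to -log10(E)."""
    return -math.log10(float(evalue))

def locus_of(match_name):
    """Strip the gene-model suffix: 'AT2G33620.2' -> 'AT2G33620'."""
    return match_name.rsplit(".", 1)[0]

def best_hit_per_locus(hits):
    """Keep the lowest-E-value hit for each locus, ordered from best to worst."""
    best = {}
    # sorted() is stable, so tied gene models keep their original order.
    for name, evalue, identity in sorted(hits, key=lambda h: float(h[1])):
        best.setdefault(locus_of(name), (name, evalue, identity))
    return list(best.values())

for name, evalue, identity in best_hit_per_locus(TAIR_HITS):
    print(f"{locus_of(name)}\t{evalue}\t-log10(E)={neg_log10(evalue):.2f}\t{identity}%")
```

Working with -log10(E) rather than the raw E-value makes the hits easier to compare, since the values span several orders of magnitude.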
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                          Source       Source Term     Source Description                          Alignment
IPR017956  AT hook, DNA-binding motif               SMART        SM00384         AT_hook_2                                   coord: 134..146; e-value: 16.0; score: 8.5
IPR017956  AT hook, DNA-binding motif               SMART        SM00384         AT_hook_2                                   coord: 90..102; e-value: 0.32; score: 18.0
IPR005175  PPC domain                               PFAM         PF03479         PCC                                         coord: 160..304; e-value: 6.5E-24; score: 84.4
IPR005175  PPC domain                               PROSITE      PS51742         PPC                                         coord: 152..326; score: 22.750118
IPR005175  PPC domain                               CDD          cd11378         DUF296                                      coord: 159..304; e-value: 5.15136E-26; score: 99.1969
None       No IPR available                         GENE3D       3.30.1330.80    -                                           coord: 160..319; e-value: 5.1E-22; score: 80.2
None       No IPR available                         MOBIDB_LITE  mobidb-lite     disorder_prediction                         coord: 1..40
None       No IPR available                         MOBIDB_LITE  mobidb-lite     disorder_prediction                         coord: 340..374
None       No IPR available                         MOBIDB_LITE  mobidb-lite     disorder_prediction                         coord: 79..148
None       No IPR available                         MOBIDB_LITE  mobidb-lite     disorder_prediction                         coord: 345..374
None       No IPR available                         PANTHER      PTHR31500:SF57  AT-HOOK MOTIF NUCLEAR-LOCALIZED PROTEIN 10  coord: 71..354
None       No IPR available                         SUPERFAMILY  117856          AF0104/ALDC/Ptd012-like                     coord: 158..307
IPR039605  AT-hook motif nuclear-localized protein  PANTHER      PTHR31500       AT-HOOK MOTIF NUCLEAR-LOCALIZED PROTEIN 9   coord: 71..354
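
The alignment coordinates above can be compared directly to see how the predicted features sit relative to each other. A minimal sketch, assuming the coordinates are 1-based and inclusive positions on the protein (an assumption about this report, not stated in it); the helper names are illustrative:

```python
# (source, label, start, end) taken from the alignment column above.
# Assumption: coordinates are 1-based and inclusive on the protein sequence.
FEATURES = [
    ("SMART", "AT_hook_2", 90, 102),
    ("SMART", "AT_hook_2", 134, 146),
    ("PFAM", "PCC", 160, 304),
    ("PROSITE", "PPC", 152, 326),
    ("MOBIDB_LITE", "disorder_prediction", 79, 148),
]

def length(start, end):
    """Number of residues covered by an inclusive start..end range."""
    return end - start + 1

def contains(outer, inner):
    """True if the inner (start, end) range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

disorder = (79, 148)
for source, label, start, end in FEATURES:
    print(source, label, f"{start}..{end}",
          "length:", length(start, end),
          "inside disordered 79..148:", contains(disorder, (start, end)))
```

Under that assumption, both SMART AT-hook matches fall inside the 79..148 disordered region, while the Pfam PPC match (160..304) lies inside the broader PROSITE PPC profile match (152..326).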

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmoCh02G016520.1    CmoCh02G016520.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
cellular_component    GO:0005634        nucleus
molecular_function    GO:0003680        minor groove of adenine-thymine-rich DNA binding
molecular_function    GO:0003677        DNA binding