HG10005305 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10005305
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: transcription factor SPT20 homolog isoform X1
Location: Chr07: 1439943 .. 1442570 (-)
RNA-Seq Expression: HG10005305
Synteny: HG10005305
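
The Location field can be split into chromosome, start, end, and strand for downstream use. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr07: 1439943 .. 1442570 (-)'.

    Returns (chromosome, start, end, strand).
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both ends are counted.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr07: 1439943 .. 1442570 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 2628 bases on the minus strand of Chr07.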
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGTCTGGTTCAGCAGGTCGCCCTAACTCCTCCCCCAAATCGTTTGATTTTGGTTCTGATGATATCCTTTGCTCATTTGAAGACTACGGTAAACAGGACGCTTCAAACGGTAGCCATACTGATCCCGTTTCCCTTACCAATTCTAGCAAGGTTCGTGTTGGTATTACTTGTTCTTAGGTTTCTGTGGCTAGTTTCCTGATGTGGGTTATTGATGCTCGAAACCAATTTTCGATTCTTCGTTCTTATGTGCGATTAGGGTTTGTGATTTTTTATATTAATATTTAAGTTTTGATCTGAGTTTCATTGGGTGCTGATTATTTTTAGCTTAAGCAACGAGTACTTGACTGCTCGGGGCAGGGGTGATGATGATCCATGCCAATCCTTAAAATTAGCTGACTTCTTATCCATATGTCTGTCTGGTTTATTATTGAGTAAACATTGTTTGTGCTTTCCAGTCTTTAGAATGAAATTAGTTGTTCAATTCAATTTAGCTTTAACCTTGAAAAGTTCAGTCATTAGTCCTGTGAATGGTCTTACCGAGTGAGTTGAAATGTTGATACAAAGCTCCTCCTTCTACATGATGTGCAAGAAATTTGATCAAAATTTGAATAATTATTCTAGAGGAGGAGTACATTTTCTTTTACGTGGAATTGGGTTTTGTATCTTATCTGTATATCCGTCTTGGTCTTTTATTTATTATTGTAAATGAATGGTTTCATGAACTGATAGAGTGGAAGAGTTAACTCAACTATTATCATTTATTATTGTAAATGAATGGTTTCATGAACTGATGGTTTCATGAACTGATAGTATTTGCTGTTCAGTAGCACGTCTCATAAAAATGATTAAAGCTGGAATTAATGATGAAGAAACCCATAGGATTATTAAGCTGCATTGCAGTGGCAACAAATCTTGTTTCTAGTCTTATTGTGGCTCAAGCTTTGAAAAATTAATCCTGACAAAATTTTGTACTTTCTAATTGCCTTTGATTTATATTAAACCATCTGTCAGTTTTATGTCTTCTTGAATCAATGCTAAAATAGGAATTGGTTTATATGTATTATTTTATGAGTAATCAGTATTGTGCATATTTGTACGAGCAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGCTGCAGCCTATGGTCAAGCAGATGATTCCCTTAGTCAAAATTTGATTTCCACTGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTCCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAAATGCGGTCTGAATTAGCCCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTACAAGAGGTAAGCTACAAATTTAGAACTTTGATGAGTTTATAGATCTTTGAAAAAAGAGTTTGGAAATAGTATCTTATCAAAAAGAGTTTGCAACTTAAACCTATCACTTATTGACTCTTGAATCATGCACTTGATGTGAACATTGACTGTTCTTATTCCTTTTCAAGCCCTGTATTGTTATCATCATTTTATTCCTTAAAATTTGATACTGATCAGGAGGATTAGTTATGTTATAATTGAATTTGTACTTGCATTTCTGTAGGTCCACAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCGTCGAGCCATTCGCAGTCAAATGAGGAGAGGACTTCATCAGTTGCCTCTGATCCTAAAAAGAATGAAAATCCATCTGAGATTCACAACCAGCAGTTAGCTTTGGCCTTGCCACATCAGATCGTCCCACAGCAAAATCCTATAACTCCCCCTTCAGCAGTTTTGCCTCAGAATATGCCTCAACAACAGCAATCTTACTACATCTCTGCATCTCAATTACCTGGTCAACCACCCCATATCCAGCATGCTCAGGGCCAATA
TATCTCACCTGATTCCCAGCACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCCCAACTAAGTCAAACTCAACCACAACCATTCAATCAGTATCAACAACAATGGGCGCAGCCACCATCTCAGCAGCAACAACCTCCTCAACAGCCTTCTATGCAACCTCAGATCAGACCACCCCCCAGTTCAGTCTACCCTTCTCCTTATCCACCAAATCAACCGTCTTCTATGACCGAGACACTGTCAAGCAGCATGCCCATGCAAATGTCCTTTCCATCTATTCCTCAACCCGGCTCAAGCCGCATGGATGCAGGGCCTTATGGGTATGCTGCTGCAAGTGGTGGTTCTGCTCCACAGCAGCCTCCTCAAGTGAAAAATGCTTATGGTTCAGCAACAGGTGAGGGATATATGCCTCCTGGACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCGCCACACCATTCGCCTCAACAACCACACCATCCGTCTCAACAACCGCACTTCAATCAAAGTGGATATCCTCCGGCCAATGCACCTCATCAGGTTCCTCCTCAGGCTCCATCAGGCCCCATGTTTCAGCCAGGAATCCAAGCCATTCACATCTAA

mRNA sequence

ATGGCGTCTGGTTCAGCAGGTCGCCCTAACTCCTCCCCCAAATCGTTTGATTTTGGTTCTGATGATATCCTTTGCTCATTTGAAGACTACGGTAAACAGGACGCTTCAAACGGTAGCCATACTGATCCCGTTTCCCTTACCAATTCTAGCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGCTGCAGCCTATGGTCAAGCAGATGATTCCCTTAGTCAAAATTTGATTTCCACTGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTCCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAAATGCGGTCTGAATTAGCCCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTACAAGAGGTCCACAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCGTCGAGCCATTCGCAGTCAAATGAGGAGAGGACTTCATCAGTTGCCTCTGATCCTAAAAAGAATGAAAATCCATCTGAGATTCACAACCAGCAGTTAGCTTTGGCCTTGCCACATCAGATCGTCCCACAGCAAAATCCTATAACTCCCCCTTCAGCAGTTTTGCCTCAGAATATGCCTCAACAACAGCAATCTTACTACATCTCTGCATCTCAATTACCTGGTCAACCACCCCATATCCAGCATGCTCAGGGCCAATATATCTCACCTGATTCCCAGCACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCCCAACTAAGTCAAACTCAACCACAACCATTCAATCAGTATCAACAACAATGGGCGCAGCCACCATCTCAGCAGCAACAACCTCCTCAACAGCCTTCTATGCAACCTCAGATCAGACCACCCCCCAGTTCAGTCTACCCTTCTCCTTATCCACCAAATCAACCGTCTTCTATGACCGAGACACTGTCAAGCAGCATGCCCATGCAAATGTCCTTTCCATCTATTCCTCAACCCGGCTCAAGCCGCATGGATGCAGGGCCTTATGGGTATGCTGCTGCAAGTGGTGGTTCTGCTCCACAGCAGCCTCCTCAAGTGAAAAATGCTTATGGTTCAGCAACAGGTGAGGGATATATGCCTCCTGGACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCGCCACACCATTCGCCTCAACAACCACACCATCCGTCTCAACAACCGCACTTCAATCAAAGTGGATATCCTCCGGCCAATGCACCTCATCAGGTTCCTCCTCAGGCTCCATCAGGCCCCATGTTTCAGCCAGGAATCCAAGCCATTCACATCTAA

Coding sequence (CDS)

ATGGCGTCTGGTTCAGCAGGTCGCCCTAACTCCTCCCCCAAATCGTTTGATTTTGGTTCTGATGATATCCTTTGCTCATTTGAAGACTACGGTAAACAGGACGCTTCAAACGGTAGCCATACTGATCCCGTTTCCCTTACCAATTCTAGCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGCTGCAGCCTATGGTCAAGCAGATGATTCCCTTAGTCAAAATTTGATTTCCACTGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTCCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAAATGCGGTCTGAATTAGCCCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTACAAGAGGTCCACAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCGTCGAGCCATTCGCAGTCAAATGAGGAGAGGACTTCATCAGTTGCCTCTGATCCTAAAAAGAATGAAAATCCATCTGAGATTCACAACCAGCAGTTAGCTTTGGCCTTGCCACATCAGATCGTCCCACAGCAAAATCCTATAACTCCCCCTTCAGCAGTTTTGCCTCAGAATATGCCTCAACAACAGCAATCTTACTACATCTCTGCATCTCAATTACCTGGTCAACCACCCCATATCCAGCATGCTCAGGGCCAATATATCTCACCTGATTCCCAGCACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCCCAACTAAGTCAAACTCAACCACAACCATTCAATCAGTATCAACAACAATGGGCGCAGCCACCATCTCAGCAGCAACAACCTCCTCAACAGCCTTCTATGCAACCTCAGATCAGACCACCCCCCAGTTCAGTCTACCCTTCTCCTTATCCACCAAATCAACCGTCTTCTATGACCGAGACACTGTCAAGCAGCATGCCCATGCAAATGTCCTTTCCATCTATTCCTCAACCCGGCTCAAGCCGCATGGATGCAGGGCCTTATGGGTATGCTGCTGCAAGTGGTGGTTCTGCTCCACAGCAGCCTCCTCAAGTGAAAAATGCTTATGGTTCAGCAACAGGTGAGGGATATATGCCTCCTGGACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCGCCACACCATTCGCCTCAACAACCACACCATCCGTCTCAACAACCGCACTTCAATCAAAGTGGATATCCTCCGGCCAATGCACCTCATCAGGTTCCTCCTCAGGCTCCATCAGGCCCCATGTTTCAGCCAGGAATCCAAGCCATTCACATCTAA

Protein sequence

MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPMFQPGIQAIHI
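
The relationship between the coding sequence and the protein sequence can be checked by translating codons. The following is a minimal sketch, assuming the standard genetic code; it uses only the first 21 bases of the CDS shown above, and `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGGCGTCTGGTTCAGCAGGT"))  # MASGSAG
```

The output matches the first seven residues of the protein sequence (MASGSAG). Passing the full CDS string to the same function should allow the whole listing to be compared.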
Homology
BLAST of HG10005305 vs. NCBI nr
Match: XP_038888365.1 (ataxin-2 homolog [Benincasa hispida])

HSP 1 Score: 803.5 bits (2074), Expect = 9.6e-229
Identity = 434/463 (93.74%), Positives = 444/463 (95.90%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGSHTDPVS+TNS+KDFHKSRMST
Sbjct: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSITNSTKDFHKSRMST 60

Query: 61  VFPAAAYG--QADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSV 120
           VFPAAAYG  QADDS+SQN+ISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSV
Sbjct: 61  VFPAAAYGQAQADDSISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSV 120

Query: 121 GEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS 180
           GEMRS+LARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS
Sbjct: 121 GEMRSDLARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS 180

Query: 181 SSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQ 240
           SSHSQSNEER SSVASDPKKNENPSEIHNQQLALALPHQIVPQQN IT PSA LPQNMPQ
Sbjct: 181 SSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNSITAPSAALPQNMPQ 240

Query: 241 QQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQY 300
           QQQSYYIS+SQLPGQPPH+QHAQGQYISPDS +RASQPQDVSQMSNPQLSQT PQPFNQY
Sbjct: 241 QQQSYYISSSQLPGQPPHLQHAQGQYISPDS-NRASQPQDVSQMSNPQLSQTPPQPFNQY 300

Query: 301 QQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPS 360
            QQWAQPPSQQ QPPQQPSMQPQIRPPP SVYPS YPPNQP+SM ETLSSSMPM MSFPS
Sbjct: 301 -QQWAQPPSQQPQPPQQPSMQPQIRPPPPSVYPSTYPPNQPTSMPETLSSSMPMPMSFPS 360

Query: 361 IPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRE 420
           IPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYG ATGEGYMPPGQQSGGAYMMYDRE
Sbjct: 361 IPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQSGGAYMMYDRE 420

Query: 421 SGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           SGRPPHH PQQPHHPSQQPHFNQSGYPPAN  HQVPPQAP+GP
Sbjct: 421 SGRPPHHPPQQPHHPSQQPHFNQSGYPPANVSHQVPPQAPTGP 461
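
The percentages on each HSP summary line follow from the reported counts: matches divided by alignment length. The following is a minimal sketch that reproduces the figures for the hit above; `percent` is a hypothetical helper, and the two-decimal formatting is an assumption based on how the values are printed here.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage with two decimals."""
    return f"{100.0 * count / aligned_length:.2f}"

# Counts taken from the HSP summary line of the XP_038888365.1 hit.
print("Identity  =", percent(434, 463))  # 93.74
print("Positives =", percent(444, 463))  # 95.90
```

The denominator is the alignment length including gap columns, which is why it can exceed the aligned query span.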

BLAST of HG10005305 vs. NCBI nr
Match: XP_008455322.1 (PREDICTED: arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo])

HSP 1 Score: 770.8 bits (1989), Expect = 6.9e-219
Identity = 419/461 (90.89%), Positives = 428/461 (92.84%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+TN  KDFHKSRMST
Sbjct: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVTNPGKDFHKSRMST 60

Query: 61  VFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120
           VFPAA YGQADD++SQN+ISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61  VFPAAGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSS 180
           MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+
Sbjct: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSN 180

Query: 181 HSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQ 240
           HSQSNEER SSVASD KK ENPSEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQ
Sbjct: 181 HSQSNEERASSVASDSKKKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQ 240

Query: 241 QSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ 300
           QSYYIS SQLPGQPPHIQHAQ QYIS DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Sbjct: 241 QSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ 300

Query: 301 QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIP 360
           QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS YPPNQP+SM ETL SSMPMQMSFPSIP
Sbjct: 301 QWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIP 360

Query: 361 QPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESG 420
           QPGSSR+DAGPYGYA  SGGSAPQQPPQVKNAYG  TGEGYMPPGQQSGGAYMMYDRESG
Sbjct: 361 QPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESG 420

Query: 421 RPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           RP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Sbjct: 421 RP-------PHHPPQQAHFNQSGYPLANAPHQVPPQAPAGP 453

BLAST of HG10005305 vs. NCBI nr
Match: KAA0031573.1 (arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo var. makuwa] >TYK07025.1 arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo var. makuwa])

HSP 1 Score: 768.8 bits (1984), Expect = 2.6e-218
Identity = 418/461 (90.67%), Positives = 427/461 (92.62%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+TN  KDFHKSRMST
Sbjct: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMST 60

Query: 61  VFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120
           VFPAA Y QADD++SQN+ISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61  VFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSS 180
           MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+
Sbjct: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSN 180

Query: 181 HSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQ 240
           HSQSNEER SSVASD KK ENPSEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQ
Sbjct: 181 HSQSNEERASSVASDSKKKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQ 240

Query: 241 QSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ 300
           QSYYIS SQLPGQPPHIQHAQ QYIS DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Sbjct: 241 QSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ 300

Query: 301 QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIP 360
           QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS YPPNQP+SM ETL SSMPMQMSFPSIP
Sbjct: 301 QWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIP 360

Query: 361 QPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESG 420
           QPGSSR+DAGPYGYA  SGGSAPQQPPQVKNAYG  TGEGYMPPGQQSGGAYMMYDRESG
Sbjct: 361 QPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESG 420

Query: 421 RPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           RP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Sbjct: 421 RP-------PHHPPQQAHFNQSGYPLANAPHQVPPQAPAGP 453

BLAST of HG10005305 vs. NCBI nr
Match: XP_004136824.1 (trithorax group protein osa [Cucumis sativus] >KGN43616.1 hypothetical protein Csa_020468 [Cucumis sativus])

HSP 1 Score: 760.4 bits (1962), Expect = 9.4e-216
Identity = 415/462 (89.83%), Positives = 426/462 (92.21%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+ N  KDFHK RMST
Sbjct: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMST 60

Query: 61  VFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120
           VFPA+ YGQADD++SQN+ISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61  VFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSS 180
           MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS++
Sbjct: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTN 180

Query: 181 HSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQ 240
           HSQSNEER SSVASDPKK EN SEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQ
Sbjct: 181 HSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQ 240

Query: 241 QSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ 300
           QSYYIS SQLPGQPPHIQHAQ QYI  DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Sbjct: 241 QSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ 300

Query: 301 QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPY-PPNQPSSMTETLSSSMPMQMSFPSI 360
           QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS Y PPNQP+SM ETL SSMPMQMSFPSI
Sbjct: 301 QWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSI 360

Query: 361 PQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRES 420
           PQPGSSR+DAGPYGYAA SGGSAPQQPPQVKNAYG  TGEGYMPPGQQSGGAYMMYDRES
Sbjct: 361 PQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRES 420

Query: 421 GRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           GRP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Sbjct: 421 GRP-------PHHPPQQTHFNQSGYPLANAPHQVPPQAPAGP 454

BLAST of HG10005305 vs. NCBI nr
Match: XP_023554446.1 (trithorax group protein osa-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 756.9 bits (1953), Expect = 1.0e-214
Identity = 411/466 (88.20%), Positives = 434/466 (93.13%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+ SNGSH+DPVS+ NSSKDFHKSRMST
Sbjct: 1   MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 60

Query: 61  VFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120
           VFP AAYGQ DDS++Q++I+TVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61  VFPGAAYGQPDDSINQDVITTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSS 180
           MRS+LARDHEEA+SKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SS
Sbjct: 121 MRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 180

Query: 181 HSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQ 240
           HSQ+NEER   V++DPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSA LPQN+PQQQ
Sbjct: 181 HSQTNEER---VSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNVPQQQ 240

Query: 241 QSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQ 300
           QSYYIS+SQLPG QP HIQHAQ QYIS DSQHRASQPQDVS M+NPQLSQT PQPFNQYQ
Sbjct: 241 QSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSHMTNPQLSQT-PQPFNQYQ 300

Query: 301 QQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSI 360
           QQWAQPPSQ  QPPQQ SMQPQIRPPP+SVYPSPYPPNQP+SM ETLSSSMPMQMSF  I
Sbjct: 301 QQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFAYI 360

Query: 361 PQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQ----SGGAYMMY 420
           PQPGSSR DA PYGYAA+SGGSAPQQPPQVKNAYG ATGEGYMPPGQQ    SGGAYMMY
Sbjct: 361 PQPGSSRADAVPYGYAASSGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMY 420

Query: 421 DRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           DRESGRPPHH PQQPHHPSQQ HFNQSGYPPANAPHQVPPQAP+GP
Sbjct: 421 DRESGRPPHHLPQQPHHPSQQSHFNQSGYPPANAPHQVPPQAPAGP 462

BLAST of HG10005305 vs. ExPASy TrEMBL
Match: A0A1S3C1W2 (arginine-glutamic acid dipeptide repeats protein-like OS=Cucumis melo OX=3656 GN=LOC103495513 PE=4 SV=1)

HSP 1 Score: 770.8 bits (1989), Expect = 3.4e-219
Identity = 419/461 (90.89%), Positives = 428/461 (92.84%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+TN  KDFHKSRMST
Sbjct: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVTNPGKDFHKSRMST 60

Query: 61  VFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120
           VFPAA YGQADD++SQN+ISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61  VFPAAGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSS 180
           MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+
Sbjct: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSN 180

Query: 181 HSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQ 240
           HSQSNEER SSVASD KK ENPSEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQ
Sbjct: 181 HSQSNEERASSVASDSKKKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQ 240

Query: 241 QSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ 300
           QSYYIS SQLPGQPPHIQHAQ QYIS DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Sbjct: 241 QSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ 300

Query: 301 QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIP 360
           QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS YPPNQP+SM ETL SSMPMQMSFPSIP
Sbjct: 301 QWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIP 360

Query: 361 QPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESG 420
           QPGSSR+DAGPYGYA  SGGSAPQQPPQVKNAYG  TGEGYMPPGQQSGGAYMMYDRESG
Sbjct: 361 QPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESG 420

Query: 421 RPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           RP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Sbjct: 421 RP-------PHHPPQQAHFNQSGYPLANAPHQVPPQAPAGP 453

BLAST of HG10005305 vs. ExPASy TrEMBL
Match: A0A5D3C6G6 (Arginine-glutamic acid dipeptide repeats protein-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G003420 PE=4 SV=1)

HSP 1 Score: 768.8 bits (1984), Expect = 1.3e-218
Identity = 418/461 (90.67%), Positives = 427/461 (92.62%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+TN  KDFHKSRMST
Sbjct: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSITNPGKDFHKSRMST 60

Query: 61  VFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120
           VFPAA Y QADD++SQN+ISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61  VFPAAGYAQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSS 180
           MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+
Sbjct: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSN 180

Query: 181 HSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQ 240
           HSQSNEER SSVASD KK ENPSEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQ
Sbjct: 181 HSQSNEERASSVASDSKKKENPSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQ 240

Query: 241 QSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ 300
           QSYYIS SQLPGQPPHIQHAQ QYIS DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Sbjct: 241 QSYYISQSQLPGQPPHIQHAQSQYISSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ 300

Query: 301 QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIP 360
           QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS YPPNQP+SM ETL SSMPMQMSFPSIP
Sbjct: 301 QWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPNQPTSMPETLPSSMPMQMSFPSIP 360

Query: 361 QPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESG 420
           QPGSSR+DAGPYGYA  SGGSAPQQPPQVKNAYG  TGEGYMPPGQQSGGAYMMYDRESG
Sbjct: 361 QPGSSRVDAGPYGYAPGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESG 420

Query: 421 RPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           RP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Sbjct: 421 RP-------PHHPPQQAHFNQSGYPLANAPHQVPPQAPAGP 453

BLAST of HG10005305 vs. ExPASy TrEMBL
Match: A0A0A0K720 (DUF1421 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G048000 PE=4 SV=1)

HSP 1 Score: 760.4 bits (1962), Expect = 4.5e-216
Identity = 415/462 (89.83%), Positives = 426/462 (92.21%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQD SNGS +DPVS+ N  KDFHK RMST
Sbjct: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMST 60

Query: 61  VFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120
           VFPA+ YGQADD++SQN+ISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61  VFPASGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSS 180
           MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS++
Sbjct: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTN 180

Query: 181 HSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQ 240
           HSQSNEER SSVASDPKK EN SEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQ
Sbjct: 181 HSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQ 240

Query: 241 QSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQQ 300
           QSYYIS SQLPGQPPHIQHAQ QYI  DSQHRASQPQDVSQMSNPQLSQT PQPFNQYQQ
Sbjct: 241 QSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ 300

Query: 301 QWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPY-PPNQPSSMTETLSSSMPMQMSFPSI 360
           QWAQPPSQQ QPPQQPSMQ QIRPPP SVYPS Y PPNQP+SM ETL SSMPMQMSFPSI
Sbjct: 301 QWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSI 360

Query: 361 PQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRES 420
           PQPGSSR+DAGPYGYAA SGGSAPQQPPQVKNAYG  TGEGYMPPGQQSGGAYMMYDRES
Sbjct: 361 PQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRES 420

Query: 421 GRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           GRP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GP
Sbjct: 421 GRP-------PHHPPQQTHFNQSGYPLANAPHQVPPQAPAGP 454

BLAST of HG10005305 vs. ExPASy TrEMBL
Match: A0A6J1HZW1 (ataxin-2 homolog OS=Cucurbita maxima OX=3661 GN=LOC111468169 PE=4 SV=1)

HSP 1 Score: 755.0 bits (1948), Expect = 1.9e-214
Identity = 411/466 (88.20%), Positives = 433/466 (92.92%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+ SNGSH+DPVS+ NSSKDFHKSRMST
Sbjct: 1   MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 60

Query: 61  VFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120
           VFP AAYGQ DDS++Q++I+TVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61  VFPGAAYGQPDDSINQDVIATVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSS 180
           MRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SS
Sbjct: 121 MRSDLARDHEEADSKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 180

Query: 181 HSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQ 240
           HSQ+NEER   V++DPKKNENPSEIHNQQLALALPHQIVPQQNP+TPPSA LPQN+PQQ 
Sbjct: 181 HSQTNEER---VSTDPKKNENPSEIHNQQLALALPHQIVPQQNPMTPPSAALPQNVPQQH 240

Query: 241 QSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQYQ 300
           QSYYIS+SQLPG QP HIQHAQ QYIS DS HRASQPQDVSQM+NPQLSQT PQPFNQYQ
Sbjct: 241 QSYYISSSQLPGQQPSHIQHAQNQYISSDSHHRASQPQDVSQMTNPQLSQT-PQPFNQYQ 300

Query: 301 QQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSI 360
           QQWAQPPSQ  QPPQQ SMQPQIRPPP+SVYPSPYPPNQP+SM ETLSSSMPMQMSF SI
Sbjct: 301 QQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPNQPTSMPETLSSSMPMQMSFASI 360

Query: 361 PQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQ----SGGAYMMY 420
           PQPGSSR DA PYGYAAASGGSAPQQPPQVKNAYG ATGEGYMPPGQQ    SGGAYMMY
Sbjct: 361 PQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMY 420

Query: 421 DRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           DRESGRPPHH PQQPHHPSQQ HFNQSGYPPANAP QVPPQAP+GP
Sbjct: 421 DRESGRPPHHLPQQPHHPSQQSHFNQSGYPPANAPPQVPPQAPTGP 462

BLAST of HG10005305 vs. ExPASy TrEMBL
Match: A0A6J1GLD5 (class E vacuolar protein-sorting machinery protein hse1-like OS=Cucurbita moschata OX=3662 GN=LOC111455039 PE=4 SV=1)

HSP 1 Score: 750.7 bits (1937), Expect = 3.6e-213
Identity = 413/468 (88.25%), Positives = 435/468 (92.95%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDPVSLTNSSKDFHKSRMST 60
           MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+ SNGSH+DPVS+ NSSKDFHKSRMST
Sbjct: 1   MASGSAGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSDPVSVANSSKDFHKSRMST 60

Query: 61  VFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120
           VFP AAYGQ DDS++Q++I+ VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
Sbjct: 61  VFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE 120

Query: 121 MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSS 180
           MRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SS
Sbjct: 121 MRSDLARDHEEADSKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSS 180

Query: 181 HSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPIT-PPSAVLPQNMPQQ 240
           HSQ+NEER   V++DPKKNENPSEIHNQQLALALPHQIVPQQNPIT PPSA LPQN+PQQ
Sbjct: 181 HSQTNEER---VSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQ 240

Query: 241 QQSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTQPQPFNQY 300
           QQSYYIS+SQLPG QP HIQHAQ QYIS DSQHRASQPQDVSQM+NPQLSQT PQPFNQY
Sbjct: 241 QQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQT-PQPFNQY 300

Query: 301 QQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYPSPY-PPNQPSSMTETLSSSMPMQMSFP 360
           QQQWAQPPSQ  QPPQQ SMQPQIRPPP+SVYPSPY PPNQP+SM ETLSSSMPMQMSF 
Sbjct: 301 QQQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFA 360

Query: 361 SIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQ----SGGAYM 420
           SIPQPGSSR DA PYGYAAASGGSAPQQPPQVKNAYG ATGEGYMPPGQQ    SGGAYM
Sbjct: 361 SIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYM 420

Query: 421 MYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGP 462
           MYDRESGRPPHH PQQPHHPSQQ HF+QSGYPPANAPHQVPPQAP+GP
Sbjct: 421 MYDRESGRPPHHLPQQPHHPSQQSHFSQSGYPPANAPHQVPPQAPTGP 464

BLAST of HG10005305 vs. TAIR 10
Match: AT4G28300.1 (Protein of unknown function (DUF1421))

HSP 1 Score: 328.9 bits (842), Expect = 6.5e-90
Identity = 238/467 (50.96%), Positives = 289/467 (61.88%), Query Frame = 0

Query: 1   MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDASNGSHTDP-VSLTNSSKDFHKSRM- 60
           MASGS+GR NS  K FDFGSDDILCS++DY  QD+SNG H+DP ++ +NS+K+FHK+RM 
Sbjct: 1   MASGSSGRVNSGSKGFDFGSDDILCSYDDYTNQDSSNGPHSDPAIAASNSNKEFHKTRMA 60

Query: 61  -STVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKS 120
            S+VFP ++Y   +DSLSQ++  TVE +MK ++DN++RFLEG+SSRLSQLELYCYNLDK+
Sbjct: 61  RSSVFPTSSYSPPEDSLSQDITDTVERTMKMYADNMMRFLEGLSSRLSQLELYCYNLDKT 120

Query: 121 VGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPS 180
           +GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQELA+TQK+LAKLQ+ QKE S
Sbjct: 121 IGEMRSELTHAHEDADVKLRSLDKHLQEVHRSVQILRDKQELADTQKELAKLQLVQKESS 180

Query: 181 SSSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMP 240
           SSSHSQ  E+R ++   +PKK+EN S+ HNQQLALALPHQI PQ         V PQ  P
Sbjct: 181 SSSHSQHGEDRVATPVPEPKKSENTSDAHNQQLALALPHQIAPQPQ-------VQPQPQP 240

Query: 241 QQQQSYYISASQLPGQPPHIQH--AQGQYISPDSQHRASQPQDVSQMSNP---QLSQTQP 300
           QQ Q Y      +P  P  +Q+  A     +P SQ +A   Q       P     S  Q 
Sbjct: 241 QQHQYY------MPPPPTQLQNTPAPVPVSTPPSQLQAPPAQSQFMPPPPAPSHPSSAQT 300

Query: 301 QPFNQYQQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSSS 360
           Q F QYQQ W         PP     QPQ RP  S  YP  SP PP NQP    E+L SS
Sbjct: 301 QSFPQYQQNW---------PP-----QPQARPQSSGGYPTYSPAPPGNQPP--VESLPSS 360

Query: 361 MPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSG 420
           M MQ  +   PQ          YGY AA    AP  P Q K +Y   TG+GY+P G    
Sbjct: 361 MQMQSPYSGPPQQSMQ-----AYGYGAAPPPQAP--PQQTKMSYSPQTGDGYLPSGPPPP 420

Query: 421 GAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGYPPANAPHQ 453
             Y     E GR   + P QP    QQ H+ Q     GY P   PHQ
Sbjct: 421 SGYANAMYEGGR-MQYPPPQPQQQQQQAHYLQGPQGGGYSP--QPHQ 428

BLAST of HG10005305 vs. TAIR 10
Match: AT4G28300.2 (Protein of unknown function (DUF1421))

HSP 1 Score: 256.1 bits (653), Expect = 5.3e-68
Identity = 199/406 (49.01%), Positives = 239/406 (58.87%), Query Frame = 0

Query: 59  STVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSV 118
           S+VFP ++Y   +DSLSQ++  TVE +MK ++DN++RFLEG+SSRLSQLELYCYNLDK++
Sbjct: 4   SSVFPTSSYSPPEDSLSQDITDTVERTMKMYADNMMRFLEGLSSRLSQLELYCYNLDKTI 63

Query: 119 GEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS 178
           GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQELA+TQK+LAKLQ+ QKE SS
Sbjct: 64  GEMRSELTHAHEDADVKLRSLDKHLQEVHRSVQILRDKQELADTQKELAKLQLVQKESSS 123

Query: 179 SSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQ 238
           SSHSQ  E+R ++   +PKK+EN S+ HNQQLALALPHQI PQ         V PQ  PQ
Sbjct: 124 SSHSQHGEDRVATPVPEPKKSENTSDAHNQQLALALPHQIAPQPQ-------VQPQPQPQ 183

Query: 239 QQQSYYISASQLPGQPPHIQH--AQGQYISPDSQHRASQPQDVSQMSNP---QLSQTQPQ 298
           Q Q Y      +P  P  +Q+  A     +P SQ +A   Q       P     S  Q Q
Sbjct: 184 QHQYY------MPPPPTQLQNTPAPVPVSTPPSQLQAPPAQSQFMPPPPAPSHPSSAQTQ 243

Query: 299 PFNQYQQQWAQPPSQQQQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSSSM 358
            F QYQQ W         PP     QPQ RP  S  YP  SP PP NQP    E+L SSM
Sbjct: 244 SFPQYQQNW---------PP-----QPQARPQSSGGYPTYSPAPPGNQPP--VESLPSSM 303

Query: 359 PMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGG 418
            MQ  +   PQ          YGY AA    AP  P Q K +Y   TG+GY+P G     
Sbjct: 304 QMQSPYSGPPQQSMQ-----AYGYGAAPPPQAP--PQQTKMSYSPQTGDGYLPSGPPPPS 363

Query: 419 AYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGYPPANAPHQ 453
            Y     E GR   + P QP    QQ H+ Q     GY P   PHQ
Sbjct: 364 GYANAMYEGGR-MQYPPPQPQQQQQQAHYLQGPQGGGYSP--QPHQ 370

BLAST of HG10005305 vs. TAIR 10
Match: AT5G14540.1 (Protein of unknown function (DUF1421))

HSP 1 Score: 75.1 bits (183), Expect = 1.7e-13
Identity = 123/433 (28.41%), Positives = 184/433 (42.49%), Query Frame = 0

Query: 41  TDPVSLTNSSKDFHKSRMSTVFPAAAYGQAD-DSLSQNLISTVENSMKKHSDNLLRFLEG 100
           +DP  ++ SS   + S M ++ P+  + + D +S    +IS ++ +MK H+D LL  +EG
Sbjct: 89  SDPKPVSASSARSYGS-MDSLEPSKLFAEKDRNSPESAIISAIDRTMKAHADKLLHVMEG 148

Query: 101 ISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQEL 160
           +S+RL+QLE    +L+  V +++  +   H + D KL+ LE  + EV   VQ+++DKQE+
Sbjct: 149 VSARLTQLETRTRDLENLVDDVKVSVGNSHGKTDGKLRQLENIMLEVQNGVQLLKDKQEI 208

Query: 161 AETQKDLAKLQVSQKEPSSSSHSQSNEERTSSVASDPKKNENPSEIHNQQLALALPHQIV 220
            E Q  L+KLQ+S+      +HS   E      AS P+                      
Sbjct: 209 VEAQLQLSKLQLSKVNQQPETHSTHVEPTAQPPASLPQ---------------------- 268

Query: 221 PQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDV 280
           P  +   PPS +  Q +P QQ   +I       QPP  QH     +SP S      P   
Sbjct: 269 PPASAAAPPS-LTQQGLPPQQ---FI-------QPPASQHG----LSPPSLQLPQLPNQF 328

Query: 281 SQMSNPQL---SQTQPQPFNQYQQQWAQPPSQQQQPP-QQPSMQPQI--RPPPSSVYPSP 340
           S    P      Q+QP P  Q   Q   P     QPP Q P  QPQ   +PPP   +PS 
Sbjct: 329 SPQQEPYFPPSGQSQPPPTIQPPYQPPPPTQSLHQPPYQPPPQQPQYPQQPPPQLQHPSG 388

Query: 341 YPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAY 400
           Y P +P    ++   + P Q   PS P PGS+          +    +AP  PP + +  
Sbjct: 389 YNPEEPPYPQQSYPPNPPRQP--PSHPPPGSA---------PSQQYYNAPPTPPSMYDGP 448

Query: 401 GSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQV 460
           G  +  G+  P   S  +Y      +G P  +       P+ Q       YP       +
Sbjct: 449 GGRSNSGF--PSGYSPESYPY----TGPPSQYGNTPSVKPTHQSGSGSGAYPQLPMARPL 466

Query: 461 PPQAPSGPMFQPG 467
           P   P       G
Sbjct: 509 PQGLPMASAISSG 466

BLAST of HG10005305 vs. TAIR 10
Match: AT3G01560.1 (Protein of unknown function (DUF1421) )

HSP 1 Score: 68.2 bits (165), Expect = 2.1e-11
Identity = 109/370 (29.46%), Positives = 166/370 (44.86%), Query Frame = 0

Query: 37  NGSHTDPVSLTNSSKDFHKSRMSTVFPAAAYGQADDSLSQNLIST--VENSMKKHSDNLL 96
           + S   PVS T+ + +F    + ++ P+        ++    I +  ++ +MKKH+D LL
Sbjct: 89  SASDYKPVSTTSPNTNF--GSLDSIEPSKLVPDKGQNVFNTTIMSEIIDRTMKKHTDTLL 148

Query: 97  RFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIR 156
             +EG+S+RLSQLE   +NL+  V +++  +   H   D K++ L+  + EV   VQ+++
Sbjct: 149 HVMEGVSARLSQLETRTHNLENLVDDLKVSVDNSHGSTDGKMRQLKNILVEVQSGVQLLK 208

Query: 157 DKQELAETQKDLAKLQVSQK---------EPSSSSHSQSNEER--TSSVASDPKKNENPS 216
           DKQE+ E Q  L+K QVS +         +P++ S +    ++   +S    P     PS
Sbjct: 209 DKQEILEAQ--LSKHQVSNQHAKTHSLHVDPTAQSPAPVPMQQFPLTSFPQPPSSTAAPS 268

Query: 217 EIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQ 276
           +  + QL   LP Q   QQ P  PP +  PQ  P     Y    +Q P QP         
Sbjct: 269 QPPSSQLPPQLPTQFSSQQEPYCPPPS-HPQPPPSNPPPYQAPQTQTPHQP--------S 328

Query: 277 YISPDSQHRASQPQDVSQMSNP------QLSQTQPQPFNQYQQQWAQPPSQQQQPPQ-QP 336
           Y SP  Q +  Q    S   NP      Q+    P P  Q     + P  Q   PPQ QP
Sbjct: 329 YQSPPQQPQYPQQPPPSSGYNPEEQPPYQMQSYPPNPPRQQPPAGSTPSQQFYNPPQPQP 388

Query: 337 SMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSF--PSIPQPGSSR--MDAGPY 383
           SM        +S +PS Y     +     +SS+ P  +S      PQ  +SR    A P 
Sbjct: 389 SMYDGAGGRSNSGFPSGYLSEPYTYSGSPMSSAKPPHISSNGTGYPQLSNSRPLPHALPM 445
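
The percentages in each HSP header are the match count divided by the alignment length; for example, 123/433 gives 28.41%. The following is a minimal Python sketch that recomputes them as a sanity check. The function names are hypothetical, the header strings are copied from the AT5G14540.1 and AT3G01560.1 HSPs above, and the pattern deliberately tolerates variant spellings of the "Positives" label.

```python
import re

# Header lines copied from the AT5G14540.1 and AT3G01560.1 HSPs.
HSP_HEADERS = [
    "Identity = 123/433 (28.41%), Positives = 184/433 (42.49%), Query Frame = 0",
    "Identity = 109/370 (29.46%), Positives = 166/370 (44.86%), Query Frame = 0",
]

# Matches "<label> = <count>/<length> (<percent>%)". The label alternative
# "Pos" followed by any word characters tolerates spelling variants.
FIELD = re.compile(r"(Identity|Pos\w*) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_header(line):
    """Return [(label, count, length, reported_percent), ...] (hypothetical helper)."""
    return [
        (label, int(count), int(length), float(pct))
        for label, count, length, pct in FIELD.findall(line)
    ]


def recompute_percent(count, length):
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * count / length, 2)


for header in HSP_HEADERS:
    for label, count, length, reported in parse_hsp_header(header):
        print(label, f"{count}/{length}", reported, recompute_percent(count, length))
```

If a recomputed value disagrees with the reported one, the header was most likely damaged during extraction rather than by BLAST itself.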

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038888365.1  9.6e-229  93.74         ataxin-2 homolog [Benincasa hispida]
XP_008455322.1  6.9e-219  90.89         PREDICTED: arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo]
KAA0031573.1    2.6e-218  90.67         arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo var. makuwa]...
XP_004136824.1  9.4e-216  89.83         trithorax group protein osa [Cucumis sativus] >KGN43616.1 hypothetical protein C...
XP_023554446.1  1.0e-214  88.20         trithorax group protein osa-like [Cucurbita pepo subsp. pepo]
Match Name  E-value   Identity (%)  Description
A0A1S3C1W2  3.4e-219  90.89         arginine-glutamic acid dipeptide repeats protein-like OS=Cucumis melo OX=3656 GN...
A0A5D3C6G6  1.3e-218  90.67         Arginine-glutamic acid dipeptide repeats protein-like OS=Cucumis melo var. makuw...
A0A0A0K720  4.5e-216  89.83         DUF1421 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G048000 PE=...
A0A6J1HZW1  1.9e-214  88.20         ataxin-2 homolog OS=Cucurbita maxima OX=3661 GN=LOC111468169 PE=4 SV=1
A0A6J1GLD5  3.6e-213  88.25         class E vacuolar protein-sorting machinery protein hse1-like OS=Cucurbita moscha...
Match Name   E-value  Identity (%)  Description
AT4G28300.1  6.5e-90  50.96         Protein of unknown function (DUF1421)
AT4G28300.2  5.3e-68  49.01         Protein of unknown function (DUF1421)
AT5G14540.1  1.7e-13  28.41         Protein of unknown function (DUF1421)
AT3G01560.1  2.1e-11  29.46         Protein of unknown function (DUF1421)
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term     Source Description                                Alignment
None      No IPR available  COILS        Coil            Coil                                              coord: 122..149
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                               coord: 168..190
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                               coord: 232..308
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                               coord: 339..362
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                               coord: 168..202
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                               coord: 309..338
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                               coord: 431..447
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                               coord: 218..472
None      No IPR available  PANTHER      PTHR31805:SF16  FORMIN-LIKE PROTEIN (DUF1421)                     coord: 1..461
None      No IPR available  PANTHER      PTHR31805       RECEPTOR-LIKE KINASE, PUTATIVE (DUF1421)-RELATED  coord: 1..461
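
Several of the mobidb-lite disorder predictions listed above overlap or abut one another (for example 168..190 and 168..202). As an illustration only, the Python sketch below merges them into non-redundant regions. The coordinates are copied from the table; the function name and the choice to treat coordinates as inclusive residue positions and to join adjacent ranges are assumptions, not part of the InterPro analysis.

```python
# Disorder-prediction coordinates copied from the MOBIDB_LITE rows above.
DISORDER = [
    (168, 190), (232, 308), (339, 362), (168, 202),
    (309, 338), (431, 447), (218, 472),
]


def merge_intervals(intervals):
    """Merge overlapping or adjacent inclusive ranges (hypothetical helper)."""
    merged = []
    for start, end in sorted(intervals):
        # "+ 1" joins ranges that touch, e.g. 232..308 followed by 309..338.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(pair) for pair in merged]


regions = merge_intervals(DISORDER)
# Inclusive coordinates, so each region spans end - start + 1 residues.
covered = sum(end - start + 1 for start, end in regions)
print(regions, covered)
```

Under these assumptions the seven rows collapse to two regions, 168..202 and 218..472.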

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10005305.1  HG10005305.1  mRNA