HG10004105 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10004105
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Myb_DNA-bind_3 domain-containing protein
Location: Chr08: 13741872 .. 13743960 (-)
RNA-Seq Expression: HG10004105
Synteny: HG10004105
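The Location value above packs the chromosome, the start and end coordinates, and the strand into one string. As a minimal sketch, it could be split apart as follows. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of the database, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'Chr: start .. end (strand)' value."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a location string such as 'Chr08: 13741872 .. 13743960 (-)'."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Chr08: 13741872 .. 13743960 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the span is 2089 bp; the "(-)" strand means the feature is annotated on the reverse strand of Chr08.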
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTACCGATGAACTTTTTATACATTCTTCTTATACAGGAATGGAATCGTATGGCCTGCTAGGGCAAAGAAGAGATGTAAAGCACAAAGGAAGGAATGTTGTTTGGTCAGTTGCAATGGACAAGTGCCTTATTGAAGCTCTTGCTATTCAGGCCAGAAATGGGAATAAAATTGACAGATGCTTTAATGAAAATGCATATACAGCTGCATGTATTGCTGTCAATAGTCATTTTAACTTGAACTTGAACAACCAGAAAGTTATCAATCGCCTAAAGACAATTAAGAAGAGGTACAAAGTAATCAAGGATATTCTTTGTCGAGATGGGTTTCGGTGGAATCCGACTTCAAAGATGATTGAGTGTGACAGTGAAGACCTTTGGAAGAGATATGTGGCAGTGAGTAGATGTTTCTCTCATCACTTCTTCAAATCTAAACATCACAACTATTATGGAATTGTTCATGTAGTATAACGATATTCAATTCGTGGAATAGGCACACCCCGATGCGAGAGGAATCAGAGGGAAGCCAATAGAGATGTATGATGAACTAAACATTGTTTGTGGCAATTATCAGGCCCCGAGTAGATGGGCGAAGATGAAGGATGGAAACCATTCACTGCAGGTCAGGAATTTTGAGGAAGAGTCTGCATCATTTCACTCCCCAAGCTCTGAAGATCTTAGTGAAACAGATGATACAGAGTCATATACTGGACCATCTGAATATGCAGAACTTCCCAATGGCAGTCAGGACCCTCTACCAAACAACCCGACGAGACAACATCCGAAGAGACCACGGGCATCTGAGGCTCTCCAAGATGCAATGCTGGCTGTGGCATCCAGTATTCGTCGTCTGGCCGATGCAATGGAACTGAGTAAACACTCGATAGATGCCAATGAACTGTTAGAAGCCGTAATGGAGGTTGATGGTTTGGAGGAGGCTAAACAGATGTATGCCTTCGAATATTTGAACGCCGATCCGGTGAAAGCCCGAGCGTTCTTGACATATAACGCTCGAATGAGGAAGATCTATTTGTTTCGCCAGTTTTGGTGGTGGAAGTAATCATCAGCTGTTGTTTGATTCCTGTGTACCCGATGTGAGATTATATTGAGCTTGTTTTATGATTATTTGCACATTATTAGAACCAGATCTGATTGGTACTATGAATGTAAATGTATAGATAATTCAATTGAAGAGATGCAGACATTGCAGAAATTTTTATTGCTTCCAATTCTTGTTGACTCAAATCCTTTCCTTGTGATGCTTCAAAATAGTGTAGGTTGGATATTCCAAGTTGTTTTAAACCCCCCTTGTTTGGTTATGAAGTGGTTAAGTTAAAAAAATTGTTAAATTACAAATTTAGTTCCTATGGTTAAGAGAAAGTTATAATTAGTCCTAATGGCTTGAAAAAACCTCATAAATGGTTCTTATGGTATGATAAAATTTTCATAAATAGTACCCACTTTTTTATAAGAGTTTTATTAAACTATAGAAACTAAATTATAATTATAAACCATATGAAACTAAAGATTATTTTTGTCCAAACTTGACCAATGTGCAATTTAACCATAAAAAATTCAATCCATAGGTTTGTGTCTAGCTAGGTCAATGAATTTTGAGAAGTAATTAGGTAAGGGACCTAGTTAGAGATATATGAGGTTATTATTGGGATAATTAGATAGACAGTTAGTTACTAAATATTACTATAAATAGAAGGATGATTCTGATCCCTTTAGAATGTACTAAGAAACTGTATGAAGTTAAACACAAAGTTAAAATGTTAATATCCTACTTAACACAAATCATTTGGTGTCATTTATTGTGCATTGAGAAGCATAAACGTCATGCAGGGGACTAATGGAATTAATTTACCGCTTGAAATGAACGATAATAGTAAATGTAACAATAGGTTAAATGGTAATAATTACATTACATGTTAGCCAATGGAAGGGTAAGGAGGAAGAGAAGAAGAAGGAAGAAGAGGCAGGGAGGAAAAGACGTC
GTGCTGGCGAAGGTCCCGACAGCACAAAGGAGGAGGAGGAGAAAAAGACCAAACGTGCATTAGTGATCATATCCAGTGACTCGGAGTAG

mRNA sequence

ATGACTACCGATGAACTTTTTATACATTCTTCTTATACAGGAATGGAATCGTATGGCCTGCTAGGGCAAAGAAGAGATGTAAAGCACAAAGGAAGGAATGTTGTTTGGTCAGTTGCAATGGACAAGTGCCTTATTGAAGCTCTTGCTATTCAGGCCAGAAATGGGAATAAAATTGACAGATGCTTTAATGAAAATGCATATACAGCTGCATGTATTGCTGTCAATAGTCATTTTAACTTGAACTTGAACAACCAGAAAGTTATCAATCGCCTAAAGACAATTAAGAAGAGGTACAAAGTAATCAAGGATATTCTTTGTCGAGATGGGTTTCGGTGGAATCCGACTTCAAAGATGATTGAGTGTGACAGTGAAGACCTTTGGAAGAGATATGTGGCAGCACACCCCGATGCGAGAGGAATCAGAGGGAAGCCAATAGAGATGTATGATGAACTAAACATTGTTTGTGGCAATTATCAGGCCCCGAGTAGATGGGCGAAGATGAAGGATGGAAACCATTCACTGCAGGTCAGGAATTTTGAGGAAGAGTCTGCATCATTTCACTCCCCAAGCTCTGAAGATCTTAGTGAAACAGATGATACAGAGTCATATACTGGACCATCTGAATATGCAGAACTTCCCAATGGCAGTCAGGACCCTCTACCAAACAACCCGACGAGACAACATCCGAAGAGACCACGGGCATCTGAGGCTCTCCAAGATGCAATGCTGGCTGTGGCATCCAGTATTCGTCGTCTGGCCGATGCAATGGAACTGAGTAAACACTCGATAGATGCCAATGAACTGTTAGAAGCCGTAATGGAGGTTGATGGTTTGGAGGAGGCTAAACAGATGTATGCCTTCGAATATTTGAACGCCGATCCGGTGAAAGCCCGAGCGTTCTTGACATATAACGCTCGAATGAGGAAGATCTATTTGTTTCGCCAGTTTTGGTGGTGGAACCAATGGAAGGGTAAGGAGGAAGAGAAGAAGAAGGAAGAAGAGGCAGGGAGGAAAAGACGTCGTGCTGGCGAAGGTCCCGACAGCACAAAGGAGGAGGAGGAGAAAAAGACCAAACGTGCATTAGTGATCATATCCAGTGACTCGGAGTAG

Coding sequence (CDS)

ATGACTACCGATGAACTTTTTATACATTCTTCTTATACAGGAATGGAATCGTATGGCCTGCTAGGGCAAAGAAGAGATGTAAAGCACAAAGGAAGGAATGTTGTTTGGTCAGTTGCAATGGACAAGTGCCTTATTGAAGCTCTTGCTATTCAGGCCAGAAATGGGAATAAAATTGACAGATGCTTTAATGAAAATGCATATACAGCTGCATGTATTGCTGTCAATAGTCATTTTAACTTGAACTTGAACAACCAGAAAGTTATCAATCGCCTAAAGACAATTAAGAAGAGGTACAAAGTAATCAAGGATATTCTTTGTCGAGATGGGTTTCGGTGGAATCCGACTTCAAAGATGATTGAGTGTGACAGTGAAGACCTTTGGAAGAGATATGTGGCAGCACACCCCGATGCGAGAGGAATCAGAGGGAAGCCAATAGAGATGTATGATGAACTAAACATTGTTTGTGGCAATTATCAGGCCCCGAGTAGATGGGCGAAGATGAAGGATGGAAACCATTCACTGCAGGTCAGGAATTTTGAGGAAGAGTCTGCATCATTTCACTCCCCAAGCTCTGAAGATCTTAGTGAAACAGATGATACAGAGTCATATACTGGACCATCTGAATATGCAGAACTTCCCAATGGCAGTCAGGACCCTCTACCAAACAACCCGACGAGACAACATCCGAAGAGACCACGGGCATCTGAGGCTCTCCAAGATGCAATGCTGGCTGTGGCATCCAGTATTCGTCGTCTGGCCGATGCAATGGAACTGAGTAAACACTCGATAGATGCCAATGAACTGTTAGAAGCCGTAATGGAGGTTGATGGTTTGGAGGAGGCTAAACAGATGTATGCCTTCGAATATTTGAACGCCGATCCGGTGAAAGCCCGAGCGTTCTTGACATATAACGCTCGAATGAGGAAGATCTATTTGTTTCGCCAGTTTTGGTGGTGGAACCAATGGAAGGGTAAGGAGGAAGAGAAGAAGAAGGAAGAAGAGGCAGGGAGGAAAAGACGTCGTGCTGGCGAAGGTCCCGACAGCACAAAGGAGGAGGAGGAGAAAAAGACCAAACGTGCATTAGTGATCATATCCAGTGACTCGGAGTAG

Protein sequence

MTTDELFIHSSYTGMESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAVNSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAHPDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDLSETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLADAMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFRQFWWWNQWKGKEEEKKKEEEAGRKRRRAGEGPDSTKEEEEKKTKRALVIISSDSE
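The protein sequence above is the translation of the coding sequence (CDS). As a hedged illustration, the sketch below translates a DNA string with the standard genetic code (NCBI translation table 1). That this table applies to this nuclear gene is an assumption, and `translate` is a hypothetical helper rather than a tool used by the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached; do not emit it
        protein.append(aa)
    return "".join(protein)


# First 27 nt and last 12 nt of the CDS listed above.
print(translate("ATGACTACCGATGAACTTTTTATACAT"))
print(translate("GACTCGGAGTAG"))
```

The two fragments translate to the first nine residues (MTTDELFIH) and the last three residues (DSE) of the protein sequence, with the final TAG read as the stop codon.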
Homology
BLAST of HG10004105 vs. NCBI nr
Match: XP_038885642.1 (uncharacterized protein LOC120075957 [Benincasa hispida] >XP_038885643.1 uncharacterized protein LOC120075957 [Benincasa hispida] >XP_038885644.1 uncharacterized protein LOC120075957 [Benincasa hispida] >XP_038885645.1 uncharacterized protein LOC120075957 [Benincasa hispida] >XP_038885646.1 uncharacterized protein LOC120075957 [Benincasa hispida])

HSP 1 Score: 617.5 bits (1591), Expect = 7.7e-173
Identity = 304/305 (99.67%), Positives = 304/305 (99.67%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV
Sbjct: 5   MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 64

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH
Sbjct: 65  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 124

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL
Sbjct: 125 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 184

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD
Sbjct: 185 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 244

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDA ELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 245 AMELSKHSIDAKELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 304

Query: 315 QFWWW 320
           QFWWW
Sbjct: 305 QFWWW 309
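Each HSP block reports a bit score, a raw score, an E-value, and identity and positive counts over the aligned length. A minimal sketch of pulling those numbers out of the two header lines is shown below. `parse_hsp_stats` is a hypothetical helper, and the regular expressions assume the exact line layout used on this page, including the "Postives" spelling that appears in some exports.

```python
import re


def parse_hsp_stats(score_line: str, identity_line: str) -> dict:
    """Extract scores and percentages from the two HSP header lines."""
    score = re.search(
        r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)",
        score_line,
    )
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", identity_line)
    # Accept both 'Positives' and the misspelt 'Postives'.
    pos = re.search(r"Pos[a-z]*tives\s*=\s*(\d+)/(\d+)", identity_line)
    if not (score and ident and pos):
        raise ValueError("HSP header lines not in the expected layout")
    aligned = int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(score.group(3)),
        "aligned_length": aligned,
        "identity_pct": round(100 * int(ident.group(1)) / aligned, 2),
        "positives_pct": round(100 * int(pos.group(1)) / int(pos.group(2)), 2),
    }


stats = parse_hsp_stats(
    "HSP 1 Score: 617.5 bits (1591), Expect = 7.7e-173",
    "Identity = 304/305 (99.67%), Postives = 304/305 (99.67%), Query Frame = 0",
)
print(stats)
```

Recomputing the percentage from the counts (304/305) reproduces the 99.67% shown for this hit, which is a quick consistency check when the lines are copied by hand.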

BLAST of HG10004105 vs. NCBI nr
Match: XP_008456640.1 (PREDICTED: uncharacterized protein LOC103496536 isoform X1 [Cucumis melo] >XP_008456641.1 PREDICTED: uncharacterized protein LOC103496536 isoform X1 [Cucumis melo])

HSP 1 Score: 609.0 bits (1569), Expect = 2.7e-170
Identity = 300/305 (98.36%), Positives = 303/305 (99.34%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV
Sbjct: 5   MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 64

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH
Sbjct: 65  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 124

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARGIRGKPIEMYDELNIVCGNYQAPS+WAKMKDGN SLQVRNFEEESASFHSPSSEDL
Sbjct: 125 PDARGIRGKPIEMYDELNIVCGNYQAPSQWAKMKDGNRSLQVRNFEEESASFHSPSSEDL 184

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETD+TESYTGPSEYAELPNGSQDPLPNNPTRQ PKRPR+SEALQDAMLAVASSIRRLAD
Sbjct: 185 SETDNTESYTGPSEYAELPNGSQDPLPNNPTRQQPKRPRSSEALQDAMLAVASSIRRLAD 244

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 245 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 304

Query: 315 QFWWW 320
           QFWWW
Sbjct: 305 QFWWW 309

BLAST of HG10004105 vs. NCBI nr
Match: XP_016901970.1 (PREDICTED: uncharacterized protein LOC103496536 isoform X2 [Cucumis melo] >KAA0031712.1 uncharacterized protein E6C27_scaffold139G004990 [Cucumis melo var. makuwa] >TYK30392.1 uncharacterized protein E5676_scaffold2254G00050 [Cucumis melo var. makuwa])

HSP 1 Score: 609.0 bits (1569), Expect = 2.7e-170
Identity = 300/305 (98.36%), Positives = 303/305 (99.34%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV
Sbjct: 1   MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 60

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH
Sbjct: 61  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 120

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARGIRGKPIEMYDELNIVCGNYQAPS+WAKMKDGN SLQVRNFEEESASFHSPSSEDL
Sbjct: 121 PDARGIRGKPIEMYDELNIVCGNYQAPSQWAKMKDGNRSLQVRNFEEESASFHSPSSEDL 180

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETD+TESYTGPSEYAELPNGSQDPLPNNPTRQ PKRPR+SEALQDAMLAVASSIRRLAD
Sbjct: 181 SETDNTESYTGPSEYAELPNGSQDPLPNNPTRQQPKRPRSSEALQDAMLAVASSIRRLAD 240

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 241 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 300

Query: 315 QFWWW 320
           QFWWW
Sbjct: 301 QFWWW 305

BLAST of HG10004105 vs. NCBI nr
Match: XP_004140924.1 (uncharacterized protein LOC101213668 [Cucumis sativus] >XP_031744107.1 uncharacterized protein LOC101213668 [Cucumis sativus] >KGN46070.1 hypothetical protein Csa_005033 [Cucumis sativus])

HSP 1 Score: 609.0 bits (1569), Expect = 2.7e-170
Identity = 299/305 (98.03%), Positives = 303/305 (99.34%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQ+RDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV
Sbjct: 5   MESYGLLGQKRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 64

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH
Sbjct: 65  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 124

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARGIRGKPIEMYDELNIVCGNYQAPS+WAKMKDGNH+LQVRNFEEESASFHSPSSEDL
Sbjct: 125 PDARGIRGKPIEMYDELNIVCGNYQAPSQWAKMKDGNHALQVRNFEEESASFHSPSSEDL 184

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETD+TESYTGP EYAELPNGSQDPLPNNPTRQ PKRPRASEALQDAMLAVASSIRRLAD
Sbjct: 185 SETDNTESYTGPCEYAELPNGSQDPLPNNPTRQQPKRPRASEALQDAMLAVASSIRRLAD 244

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 245 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 304

Query: 315 QFWWW 320
           QFWWW
Sbjct: 305 QFWWW 309

BLAST of HG10004105 vs. NCBI nr
Match: XP_023550438.1 (uncharacterized protein LOC111808583 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 600.5 bits (1547), Expect = 9.7e-168
Identity = 293/305 (96.07%), Positives = 300/305 (98.36%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALA+QARNGNKI+RCFNENAYTAAC+AV
Sbjct: 5   MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAVQARNGNKIERCFNENAYTAACVAV 64

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMI+CDSE+LWKRYVAAH
Sbjct: 65  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIDCDSEELWKRYVAAH 124

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARG+RGKPIEMYDELNIVCGNYQAPSRW KMKDGN  LQVRNF EESASFHSPSSEDL
Sbjct: 125 PDARGLRGKPIEMYDELNIVCGNYQAPSRWTKMKDGNRPLQVRNFVEESASFHSPSSEDL 184

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETDDTESYTGPSEYAELPNGSQDPLPN+P RQHPKRPRASEALQDAMLAVASSIRRLAD
Sbjct: 185 SETDDTESYTGPSEYAELPNGSQDPLPNSPQRQHPKRPRASEALQDAMLAVASSIRRLAD 244

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 245 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 304

Query: 315 QFWWW 320
           QFWWW
Sbjct: 305 QFWWW 309

BLAST of HG10004105 vs. ExPASy TrEMBL
Match: A0A1S3C4E4 (uncharacterized protein LOC103496536 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496536 PE=4 SV=1)

HSP 1 Score: 609.0 bits (1569), Expect = 1.3e-170
Identity = 300/305 (98.36%), Positives = 303/305 (99.34%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV
Sbjct: 5   MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 64

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH
Sbjct: 65  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 124

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARGIRGKPIEMYDELNIVCGNYQAPS+WAKMKDGN SLQVRNFEEESASFHSPSSEDL
Sbjct: 125 PDARGIRGKPIEMYDELNIVCGNYQAPSQWAKMKDGNRSLQVRNFEEESASFHSPSSEDL 184

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETD+TESYTGPSEYAELPNGSQDPLPNNPTRQ PKRPR+SEALQDAMLAVASSIRRLAD
Sbjct: 185 SETDNTESYTGPSEYAELPNGSQDPLPNNPTRQQPKRPRSSEALQDAMLAVASSIRRLAD 244

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 245 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 304

Query: 315 QFWWW 320
           QFWWW
Sbjct: 305 QFWWW 309

BLAST of HG10004105 vs. ExPASy TrEMBL
Match: A0A5A7SKV5 (Myb_DNA-bind_3 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2254G00050 PE=4 SV=1)

HSP 1 Score: 609.0 bits (1569), Expect = 1.3e-170
Identity = 300/305 (98.36%), Positives = 303/305 (99.34%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV
Sbjct: 1   MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 60

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH
Sbjct: 61  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 120

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARGIRGKPIEMYDELNIVCGNYQAPS+WAKMKDGN SLQVRNFEEESASFHSPSSEDL
Sbjct: 121 PDARGIRGKPIEMYDELNIVCGNYQAPSQWAKMKDGNRSLQVRNFEEESASFHSPSSEDL 180

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETD+TESYTGPSEYAELPNGSQDPLPNNPTRQ PKRPR+SEALQDAMLAVASSIRRLAD
Sbjct: 181 SETDNTESYTGPSEYAELPNGSQDPLPNNPTRQQPKRPRSSEALQDAMLAVASSIRRLAD 240

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 241 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 300

Query: 315 QFWWW 320
           QFWWW
Sbjct: 301 QFWWW 305

BLAST of HG10004105 vs. ExPASy TrEMBL
Match: A0A0A0K8L4 (Myb_DNA-bind_3 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G051470 PE=4 SV=1)

HSP 1 Score: 609.0 bits (1569), Expect = 1.3e-170
Identity = 299/305 (98.03%), Positives = 303/305 (99.34%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQ+RDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV
Sbjct: 5   MESYGLLGQKRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 64

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH
Sbjct: 65  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 124

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARGIRGKPIEMYDELNIVCGNYQAPS+WAKMKDGNH+LQVRNFEEESASFHSPSSEDL
Sbjct: 125 PDARGIRGKPIEMYDELNIVCGNYQAPSQWAKMKDGNHALQVRNFEEESASFHSPSSEDL 184

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETD+TESYTGP EYAELPNGSQDPLPNNPTRQ PKRPRASEALQDAMLAVASSIRRLAD
Sbjct: 185 SETDNTESYTGPCEYAELPNGSQDPLPNNPTRQQPKRPRASEALQDAMLAVASSIRRLAD 244

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 245 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 304

Query: 315 QFWWW 320
           QFWWW
Sbjct: 305 QFWWW 309

BLAST of HG10004105 vs. ExPASy TrEMBL
Match: A0A1S4E1W1 (uncharacterized protein LOC103496536 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496536 PE=4 SV=1)

HSP 1 Score: 609.0 bits (1569), Expect = 1.3e-170
Identity = 300/305 (98.36%), Positives = 303/305 (99.34%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV
Sbjct: 1   MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 60

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH
Sbjct: 61  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 120

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARGIRGKPIEMYDELNIVCGNYQAPS+WAKMKDGN SLQVRNFEEESASFHSPSSEDL
Sbjct: 121 PDARGIRGKPIEMYDELNIVCGNYQAPSQWAKMKDGNRSLQVRNFEEESASFHSPSSEDL 180

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETD+TESYTGPSEYAELPNGSQDPLPNNPTRQ PKRPR+SEALQDAMLAVASSIRRLAD
Sbjct: 181 SETDNTESYTGPSEYAELPNGSQDPLPNNPTRQQPKRPRSSEALQDAMLAVASSIRRLAD 240

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 241 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 300

Query: 315 QFWWW 320
           QFWWW
Sbjct: 301 QFWWW 305

BLAST of HG10004105 vs. ExPASy TrEMBL
Match: A0A6J1FGR7 (uncharacterized protein LOC111445529 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445529 PE=4 SV=1)

HSP 1 Score: 599.4 bits (1544), Expect = 1.0e-167
Identity = 292/305 (95.74%), Positives = 300/305 (98.36%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALA+QARNGNKI+RCFNENAYTAAC+AV
Sbjct: 5   MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAVQARNGNKIERCFNENAYTAACVAV 64

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMI+CDSE+LWKRYVAAH
Sbjct: 65  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIDCDSEELWKRYVAAH 124

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDARG+RGKPIEMYDELNIVCGNYQAPSRW KM+DGN  LQVRNF EESASFHSPSSEDL
Sbjct: 125 PDARGLRGKPIEMYDELNIVCGNYQAPSRWTKMRDGNRPLQVRNFVEESASFHSPSSEDL 184

Query: 195 SETDDTESYTGPSEYAELPNGSQDPLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLAD 254
           SETDDTESYTGPSEYAELPNGSQDPLPN+P RQHPKRPRASEALQDAMLAVASSIRRLAD
Sbjct: 185 SETDDTESYTGPSEYAELPNGSQDPLPNSPQRQHPKRPRASEALQDAMLAVASSIRRLAD 244

Query: 255 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 314
           AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR
Sbjct: 245 AMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFR 304

Query: 315 QFWWW 320
           QFWWW
Sbjct: 305 QFWWW 309

BLAST of HG10004105 vs. TAIR 10
Match: AT4G02550.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 399.8 bits (1026), Expect = 2.3e-111
Identity = 197/309 (63.75%), Positives = 247/309 (79.94%), Query Frame = 0

Query: 14  GMESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIA 73
           GM+ YG+  +R+++KHKGRNV+WSV MDKCLIEALA+QA+NGNK+D+CFN+ AYTAAC+A
Sbjct: 15  GMDQYGIPVERKEMKHKGRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVA 74

Query: 74  VNSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAA 133
           VN+ FNLNL +QK INRLKTIKKRY+V++DIL RDGF WN ++KMI+C+S++LW+RY+A 
Sbjct: 75  VNTRFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAV 134

Query: 134 HPDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMK--DGNHSLQVRNFEEESASFHSPSS 193
           +PDA+  RGK IEMY+EL  VCG+YQ P ++ K+K    +H   V+ FEE+S SF   SS
Sbjct: 135 NPDAKAFRGKQIEMYEELRTVCGDYQTPGKYNKVKKESSHHLNDVKQFEEDSVSFPLGSS 194

Query: 194 EDLSETDDTESYTGPSEYAELPNGSQD-PLPNNPTRQHPKRPRASEALQDAMLAVASSIR 253
           E+ S+TD TESY G SEY  +   SQD P P +P R+  KR R S+  Q+AML VASSIR
Sbjct: 195 EEHSDTDGTESYAGASEY--MHEESQDLPPPRDPLRRPSKRSRNSDPCQEAMLVVASSIR 254

Query: 254 RLADAMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKI 313
           RLADA+  SK  I+  ELL+AVME+D LEEAKQMYAFEYLN DPVKARAF+ YN RMRK+
Sbjct: 255 RLADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPVKARAFMAYNNRMRKM 314

Query: 314 YLFRQFWWW 320
           +LFRQFWWW
Sbjct: 315 FLFRQFWWW 321

BLAST of HG10004105 vs. TAIR 10
Match: AT4G02550.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 397.9 bits (1021), Expect = 8.9e-111
Identity = 196/308 (63.64%), Positives = 246/308 (79.87%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           M+ YG+  +R+++KHKGRNV+WSV MDKCLIEALA+QA+NGNK+D+CFN+ AYTAAC+AV
Sbjct: 1   MDQYGIPVERKEMKHKGRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAV 60

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           N+ FNLNL +QK INRLKTIKKRY+V++DIL RDGF WN ++KMI+C+S++LW+RY+A +
Sbjct: 61  NTRFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVN 120

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMK--DGNHSLQVRNFEEESASFHSPSSE 194
           PDA+  RGK IEMY+EL  VCG+YQ P ++ K+K    +H   V+ FEE+S SF   SSE
Sbjct: 121 PDAKAFRGKQIEMYEELRTVCGDYQTPGKYNKVKKESSHHLNDVKQFEEDSVSFPLGSSE 180

Query: 195 DLSETDDTESYTGPSEYAELPNGSQD-PLPNNPTRQHPKRPRASEALQDAMLAVASSIRR 254
           + S+TD TESY G SEY  +   SQD P P +P R+  KR R S+  Q+AML VASSIRR
Sbjct: 181 EHSDTDGTESYAGASEY--MHEESQDLPPPRDPLRRPSKRSRNSDPCQEAMLVVASSIRR 240

Query: 255 LADAMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIY 314
           LADA+  SK  I+  ELL+AVME+D LEEAKQMYAFEYLN DPVKARAF+ YN RMRK++
Sbjct: 241 LADAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPVKARAFMAYNNRMRKMF 300

Query: 315 LFRQFWWW 320
           LFRQFWWW
Sbjct: 301 LFRQFWWW 306

BLAST of HG10004105 vs. TAIR 10
Match: AT4G02550.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink). )

HSP 1 Score: 370.9 bits (951), Expect = 1.2e-102
Identity = 186/306 (60.78%), Positives = 230/306 (75.16%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           M+ YG+  +R+++KHKGRNV+WSV MDKCLIEALA+QA+NGNK+D+CFN+ AYTAAC+AV
Sbjct: 1   MDQYGIPVERKEMKHKGRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAV 60

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           N+ FNLNL +QK INRLKTIKKRY+V++DIL RDGF WN ++KMI+C+S++LW+RY+A +
Sbjct: 61  NTRFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVN 120

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDA+  RGK IEMY+EL  VCG+YQ P                            SSE+ 
Sbjct: 121 PDAKAFRGKQIEMYEELRTVCGDYQTPG---------------------------SSEEH 180

Query: 195 SETDDTESYTGPSEYAELPNGSQD-PLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLA 254
           S+TD TESY G SEY  +   SQD P P +P R+  KR R S+  Q+AML VASSIRRLA
Sbjct: 181 SDTDGTESYAGASEY--MHEESQDLPPPRDPLRRPSKRSRNSDPCQEAMLVVASSIRRLA 240

Query: 255 DAMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLF 314
           DA+  SK  I+  ELL+AVME+D LEEAKQMYAFEYLN DPVKARAF+ YN RMRK++LF
Sbjct: 241 DAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPVKARAFMAYNNRMRKMFLF 277

Query: 315 RQFWWW 320
           RQFWWW
Sbjct: 301 RQFWWW 277

BLAST of HG10004105 vs. TAIR 10
Match: AT4G02550.4 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2). )

HSP 1 Score: 370.9 bits (951), Expect = 1.2e-102
Identity = 186/306 (60.78%), Positives = 230/306 (75.16%), Query Frame = 0

Query: 15  MESYGLLGQRRDVKHKGRNVVWSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAV 74
           M+ YG+  +R+++KHKGRNV+WSV MDKCLIEALA+QA+NGNK+D+CFN+ AYTAAC+AV
Sbjct: 1   MDQYGIPVERKEMKHKGRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAV 60

Query: 75  NSHFNLNLNNQKVINRLKTIKKRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAH 134
           N+ FNLNL +QK INRLKTIKKRY+V++DIL RDGF WN ++KMI+C+S++LW+RY+A +
Sbjct: 61  NTRFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVN 120

Query: 135 PDARGIRGKPIEMYDELNIVCGNYQAPSRWAKMKDGNHSLQVRNFEEESASFHSPSSEDL 194
           PDA+  RGK IEMY+EL  VCG+YQ P                            SSE+ 
Sbjct: 121 PDAKAFRGKQIEMYEELRTVCGDYQTPG---------------------------SSEEH 180

Query: 195 SETDDTESYTGPSEYAELPNGSQD-PLPNNPTRQHPKRPRASEALQDAMLAVASSIRRLA 254
           S+TD TESY G SEY  +   SQD P P +P R+  KR R S+  Q+AML VASSIRRLA
Sbjct: 181 SDTDGTESYAGASEY--MHEESQDLPPPRDPLRRPSKRSRNSDPCQEAMLVVASSIRRLA 240

Query: 255 DAMELSKHSIDANELLEAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLF 314
           DA+  SK  I+  ELL+AVME+D LEEAKQMYAFEYLN DPVKARAF+ YN RMRK++LF
Sbjct: 241 DAVVQSKTLINTEELLKAVMEIDELEEAKQMYAFEYLNGDPVKARAFMAYNNRMRKMFLF 277

Query: 315 RQFWWW 320
           RQFWWW
Sbjct: 301 RQFWWW 277

BLAST of HG10004105 vs. TAIR 10
Match: AT4G02210.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 99.0 bits (245), Expect = 8.5e-21
Identity = 80/286 (27.97%), Positives = 136/286 (47.55%), Query Frame = 0

Query: 36  WSVAMDKCLIEALAIQARNGNKIDRCFNENAYTAACIAVNSHFNLNLNNQKVINRLKTIK 95
           W   MD+  I+ +  QAR GN+I+  F + A+T      N+ F  N +   + NR K+++
Sbjct: 186 WHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKSLR 245

Query: 96  KRYKVIKDILCRDGFRWNPTSKMIECDSEDLWKRYVAAHPDARGIRGKPIEMYDELNIVC 155
           +++  IK IL  DGF W+   +M+  D+ ++W+ Y+ AH DAR    +PI  Y +L ++C
Sbjct: 246 RQFNAIKSILRSDGFAWDNERQMVTADN-NVWQDYIKAHRDARQFMTRPIPYYKDLCVLC 305

Query: 156 GNYQAPSRWAKMKDGNHSLQVRNF--EEESASFHSPSSEDLS---ETDDTESYTGPSEYA 215
           G+       + +++    + +  F  E E   F S  + DLS   E +D+ S        
Sbjct: 306 GD-------SGIEENECFVAMDWFDPETEFQEFKSSGTTDLSISAEEEDSNSLLFD---- 365

Query: 216 ELPNGSQDPLPNNPTRQ-HPKRPRASEALQDAMLAVASSIRRLADAMELSKHSIDANELL 275
             P   +D L N  T   +PK+PR  E    ++     +I+ L D  +  +  +DA +LL
Sbjct: 366 --PKNKRDQLANTDTSPINPKKPRVDETQTMSIEDTVEAIQALPDMDD--ELILDACDLL 425

Query: 276 EAVMEVDGLEEAKQMYAFEYLNADPVKARAFLTYNARMRKIYLFRQ 316
           E                      D +KA+ FL  + ++RK +L R+
Sbjct: 426 E----------------------DKLKAKTFLALDVKLRKKWLLRK 433

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038885642.1 | 7.7e-173 | 99.67 | uncharacterized protein LOC120075957 [Benincasa hispida] >XP_038885643.1 unchara...
XP_008456640.1 | 2.7e-170 | 98.36 | PREDICTED: uncharacterized protein LOC103496536 isoform X1 [Cucumis melo] >XP_00...
XP_016901970.1 | 2.7e-170 | 98.36 | PREDICTED: uncharacterized protein LOC103496536 isoform X2 [Cucumis melo] >KAA00...
XP_004140924.1 | 2.7e-170 | 98.03 | uncharacterized protein LOC101213668 [Cucumis sativus] >XP_031744107.1 uncharact...
XP_023550438.1 | 9.7e-168 | 96.07 | uncharacterized protein LOC111808583 isoform X1 [Cucurbita pepo subsp. pepo]
Match Name | E-value | Identity (%) | Description
A0A1S3C4E4 | 1.3e-170 | 98.36 | uncharacterized protein LOC103496536 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7SKV5 | 1.3e-170 | 98.36 | Myb_DNA-bind_3 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 ...
A0A0A0K8L4 | 1.3e-170 | 98.03 | Myb_DNA-bind_3 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G051...
A0A1S4E1W1 | 1.3e-170 | 98.36 | uncharacterized protein LOC103496536 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1FGR7 | 1.0e-167 | 95.74 | uncharacterized protein LOC111445529 isoform X1 OS=Cucurbita moschata OX=3662 GN...
Match Name | E-value | Identity (%) | Description
AT4G02550.3 | 2.3e-111 | 63.75 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G02550.1 | 8.9e-111 | 63.64 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G02550.2 | 1.2e-102 | 60.78 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G02550.4 | 1.2e-102 | 60.78 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G02210.1 | 8.5e-21 | 27.97 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR024752 | Myb/SANT-like domain | PFAM | PF12776 | Myb_DNA-bind_3 | coord: 36..130; e-value: 1.3E-22; score: 80.6
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 180..235
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 325..369
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 200..224
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 325..362
None | No IPR available | PANTHER | PTHR46929 | EXPRESSED PROTEIN | coord: 28..316
None | No IPR available | PANTHER | PTHR46929:SF18 | MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN | coord: 28..316
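
The "coord: start..end" values in the InterPro annotations give residue ranges on the protein sequence. As a rough sketch, a range can be parsed and used to cut the matching region out of the protein. `parse_coord` and `slice_region` are hypothetical helpers, and the slicing assumes the coordinates are 1-based and inclusive, which is the usual InterProScan convention but is not stated on this page.

```python
import re


def parse_coord(text: str) -> tuple[int, int]:
    """Parse a value such as 'coord: 36..130' into (start, end)."""
    match = re.search(r"coord:\s*(\d+)\s*\.\.\s*(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if start < 1 or end < start:
        raise ValueError(f"invalid range {start}..{end}")
    return start, end


def slice_region(protein: str, start: int, end: int) -> str:
    """Return residues start..end, ASSUMING 1-based inclusive coordinates."""
    if end > len(protein):
        raise ValueError("range extends past the end of the sequence")
    return protein[start - 1:end]


# First 40 residues of the protein sequence listed on this page.
prefix = "MTTDELFIHSSYTGMESYGLLGQRRDVKHKGRNVVWSVAM"
start, end = parse_coord("coord: 36..130")
print(start, end, end - start + 1)
print(slice_region(prefix, 36, 40))
```

With the full protein sequence in place of `prefix`, `slice_region(protein, start, end)` would return the 95-residue span annotated as the Myb/SANT-like domain (PF12776); residue 36 is the W at which the AT4G02210.1 alignment above also begins.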

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10004105.1 | HG10004105.1 | mRNA