HG10020477 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10020477
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUPF0430 protein CG31712 isoform X1
LocationChr04: 32129598 .. 32139507 (+)
RNA-Seq ExpressionHG10020477
SyntenyHG10020477
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAAATGAAAAGAAGAAGAGAAAAAACAAGAAAAAGAAGAACAAACCGATTAGAACCTCAGAAGATGAAATGATTGTATCAGAATCGACTTCTGTGGATGATACTCACATGAGAAATGGACAGAATGATCAAAATCCAATTTCTGACACAGTAGATCTTTCATACCAGCATAGCAGTGGAACAAAAGATGTATGTTTGTCTTTAAATTCAATAATGGTTCATCACCTATCTATATAATTGGGTGGTGTGATTCATCGCTAGCTGCTGGATGTAGTGGATGACATGAACGTGTGCAGTCCCTTCTATTTATTCATATATCTTATTTTATAAACACGAATAAGCTCTATCTTTGAATCTGATGCATACCATCTTACTAAAGACCAGAAAGCAACCCTGAAAAAATTGATGCCCTTAGCTCTGTCTTCTCACCCAAGGTCCCCGTAGATCAAAATAAATAAAGATACGAAATCCATGCAATCTAAATTCATTACTAATATTTGAACTACACTTAACATGGTTTTCCGTCTTAGAAGAGATAATCGAAGATTTGAGGAAGGAAAGCCTTTTGCTTGCACATAAAAAGGCGATGTCATAATTGCACAAATTCCACCGATGACAATCGGGTGCTTTCTTTCCCCTATTAATGCATTTTCTATTTCTTTGCAGGCTAAGCTGGAGGATACAATTAAGCATTTACATGAGGAAAACAATGTACATATACAAAAAATGGTAATCAATTTAAGCAAACATTCAGACCCCATTTGATAACCATTTGATTTTTTGTTTTTGAAAATCAAGTTTATAAATACTACTTCTACCTATATGTTCATTTGTTTTATTATCCATTTCTGCTAATGTTTTAAAAAACCAAACCAAGTTTGAAAACTAACTAAAGTAACCTTTAAAAACTTGCTTTTGTTTTGAGACTGATTAAGAATTCAAGTGTATCATTAAGAAATGTGAAAACCGTGGTGCAAAAATTTGGAGGAAAAAAGTATAGATTTCAAAATAAGAAAACTGAAAACGAAATCGTTTTTAAACGAGGCCTCAAGTAACTGATCTATGTGCTTGTGAATTGTGTCTTTTCAAATAATTTTTTCTTGCAGGCTGACCTAGAGTTGAAGCTTGTGGAATGCGAGGGTGAAAAACACTCATGGCTTCAAAAAGAGGTGATTTTTTTAAAAGTATCTTTTTTCTATGCACTAATCTAATCATTTAAATTTTAAAGTGGCCATATTTAATTTTCTAAAAGCAGGAAGCACATTGAAGTACACTAAATTTCTATACTAAAGTGTGAAGTGCAAAAAGAGCAAGCAAGGACTTCTTTTTTATTGTCAAGCGCATAAATTGAAAATAAAAATAAGAATATCTTATATGGAAAACCATAGTATTAAAAATAAAAACAAAGAAAAGTAAAAGAAATGTTAAGAGCAGGATTTTAGAGAACATTGCATATTTTGTTAACAAATATCTATATCTATACTATATTAAAAGGGGCTATGAGGGGAGAAATTTTAATTCCCCTCTTTGCCTTTAAATTTTTTGATAATGTCCATTATACCCTTATAGTCACACCCCTTCACTATATGTAGCTTTCATGTCTTCGCCAGTAACTGTCCATGCATCGCTTCAATAAAAATCAATATAAGCTATAAGGGGAGAATTTTTAACTCCCCTCTTTGCCCTTAAATTTTTTCATAATGTCTATTTTACCCTTATAGTTATACCCCTTCATTATACGTAGCTTTCATGCCTTAGCCAATAATTATCCACGCATCGCTTGAACAAAAATCAATAAAACAAAAAAATTGAACCCCAAAATCTTCAAGCATAAACCTTGCATGCTCACACGTAATCAAAATAGCATTTTTTTGCAAATAAAAATTGAAGATGGAGTCGGAGAATCTTAAGTGCTTTATGAGGTGAAATCAACTATGGAAGAAAGAAGATCACAAGAAGTTTTAAGGACAGGGCCATGGAAATTTCCCTATCCTTTCTTCCAATCTTCAAAGAGTAACAATTTGAGAGAAACAATCGCCGAAAACTGAGAAAAAAATATGGAGAGAAATTCAAGCAAGAAGGAGACGAGGGTGATGGTGGATAGACATGGTTATGGTGGAAGATGCTTTTAGGAATTTAGAGGTTCCTTTGGTTGGAGGTCAATTTCTCTCTTTTTTAAGTAACCAACAAAGAAATCCATCATTCACAACCTTTCTTTGATATCACCCTAAAATTAGGGTTAAAGAGCTCTCATGTAGTTGATTCAATACCATGTCTATAGATACATAAAATTCAGTAAAATCTATCTAATATTGCCCATAATACATGGCAATCAAATAAAATAAAAATAGAAAATAATTTTAAGAAATTATATTAATCCTAAAATCTAGGAAAGGAATAATATGGAACCTCTTTGTCCTTCTTTTTTTCAATAATGGTGAGTTTTTATTATCCAATTTGTTAATTCATTGGGTTAGTTATAGGTTGACACTCTTTTCTTAATTTAACATTTCATCGTTCAAACATTTTGCTTAATGTCTTACTTCATTTTACGTCTTTGATTACTTTTAAATTTCATGTCCAGTATTTGTATTTCACCCTAGAAAATTTTGCACCCAACACATTTCATATTCATTATGACCAAATAAAACATATATAGTATTTTAGCCATTATCTCTTTCATTACTCTTATATTTGGTGTTAGACATTTACTATTTAAATTAAATTAAAAGTATATTCATTTTATAATTTATGTTCTACAGATTTTAAATTCTAACTAGTAAGAAAAATTTGAGACTCTGATGTGGCATCCTGGATGTCTGCTTTTTTAGCATCGGTCATGCCCCCATACAACATTCATTTTGCATTACAGTTTTGTTTTTTATTTTTTAATGCATCTGTTGGTTTCTTGTTCTATCTATTTACCTCTTTAGAGATTGGGGTTCCATTGTTTTCAATCTGCAGGCAGCGTTAGTGGATAAAATTACAAGCTTACAAGAGGACAAGACTGCCTTGGATTTAGAAGGGGTAATATTTTTATGCTACATAGTTCCTTCCTCTACAAAGCGATTTAATCATTACTTTTTTCAGCTTTCCTCGTCGATGTGCACAATTATTTGTTCTATTAATTTTGCACTAAATTTCCTCCTTTTGTATTATCAAGGCCAGGTTATTGCACACAATTGAGCATCTAGAGAGAGATAAAGCTTCATTAATTCTTAATGAGGTAATCGATATATTGTTGATGCTTGATAGAGACATGGTGTGAATTCTCCTTTTTTTTTTATTTGTAACCCTTTTATGTGGGAAAAAATACATTTTTTCGAGTAAGGGTACATGTTTCTTAATAGTATTACTTATGTATGCATAGAAGAGAGGGAGGATAGGTGATGTAATAGGATTTAGGATTCCTTTAATCCAACCAGCTCTTGGTTTTGATTTTCTTGAATTATTTCTTTCTTGAACTCTTGCCATTATTCTGAAAGCTGATCCATGTTTTGAGTCATCTTTCCTAACAAACGTTGGATTTCTTTTACTCCATTCTTCATTTCCTCCAATACTTCAGGATTGATATAGGACTTGGGTGTTCTTGAATAAGTTTTAGTCTCATAGTTGTTGTAGGGCCGATTCTTGGGTTCTCTTGAATATGTTTCAACCTTTCTTCATTCTCCATAACTAAGATAATGAGTTTTGACCCTAAACGTATTAGTTCTACTATAAAAACCCATTCTTGGATGATTCTTCAAATTTCTTCTTGCTCTTCTTGAATTTAAAGGGTTCCCCCTGATTAGGTGTGTGTTGTTCGGCAATGAGCACAACTTACGAAGCTCCCTCTTGCGATCGTCGTTGTCAAGAGTTTCTCAGTCTTCCATCGGCGGCAGTCAACGTTGGCCTAGGTGGCTGCTCTGATACCAAATTGATGTAGCCTAAAGTAGGGATGGCAAAATTCCCCGCAGGTACCTGCCCCGATCGGGGCGGGAAATCTCTAGTTTGACTGGGGATGGGGTCAAACTGGGGACACAAAACGGGTCGCTGAACTGGGGACAAGGCAGGGACGGGGAGGGTATCCCGACCCTGTCCCCGACTATTAACTTTTTTAATATATGTGTGTGTGTGTGTGTTTGTTAAATTTTTTAAAACTTTTTTTTAAAAAATTTTCATTTCATAATATAATTAATTTAACTTGTTATTAAACTTACAATAAAAAATGTTAAAATATTTTTTTTTTATTAATTTAAATAAAGAAAACAATAATAATACTTATTTGTTAGGGCTGTTTTCAAATATAGAAAAAATATCAAACTATTTACAAATATAGAAAATTTTTACTGTCTATTAGTGGTAGATCGCAATAGAATTCTATCGCTCATAGAATTCTATGCAGTCTATCGCTGTTAGACAATGAAATTTTTCTATATTTGTAAATAGTTTGATATTTTTTCTGTTTATAATAATTTTCCTATTTGGTTACCTTAAATATTTACAATTTTAACATAAATATCAATTTTTAATTGATTTTAAATAAAAAATAATTAAAAATAAAAATAAAAAACGAGAATTTTTTCCCTGCGGGGATCTAATCCCCGAATGGGGATTCCCCGACCCCGACTCCCATTCCCCAAAACGGGGAATGTGGTGGGTACGGAGATTATAAATCCCCACGAGGACGGGGATGGGGAGTGCATCCCTGGCCCCGCCCCGCCCCGCCCCGCCCCATTGCGAACCCTAGCCTAAAGAGATAAAAGGAGAGAAATCTTCAAGAATCCAAACAAAATCTTTATTATTATGTGAGGCCCCCTCTTACATACATACAGTAGGAGGGGTAGAAGGGGAAATTCACCCATTTAGTTAATTTTTCTGTTAGTGGGCGAAATCATATATATGTATTTTGTGTTACATTTTGGTTATGTATTTTTTAAGTTAAGACTTTGGTAGTCAATTGGGAGAGAACCACCCTCTCGAATAGGTGGAGAGTCTGTAATCTTGTTTTAATAAATATCAAATCATAGCTTCCTATAGGAAGTGTTACATAGTATTTCTTTTGAAACAGAAATAAGTATTTTTCATTTAATTAATGAAATGAGTCTAATGCTCAAAGTACAATGAAACAAACAAAATACAACAAAGAAAATGGAGCAACAAGAACAAAATGAAAGATAAAGGAGACCAATCCTTAATACAAGTACAACACCCAATAGACAAAAAATTCCAGGCTGCCAATCTTCTAGCAAAGTACAAGACTTGAGACATGAATGCCATAAAGTTTTGAAAGTAGAGGAAACTTGAAAAGGAGCTCAAAACCAGACTGGCTTGCCTACTACCAACCCTAGAGATCTAGCCAGCCAAACATAAAACACCAAAACCACCGACCAATAATAATAACACAAATGGAGGAGAGCACAAGAATTCGAAAAATAAAATAATGCATTGAAGACTTGCACCACATATCCAAAATACTATAAAAACTCCATAGCTGAAATTGACCAAAATAAGATCTCCAATCATAACACTATATGAACAACCACAAACCAAACAAAACAACCTTACAAGTAGTGAAAGGCCTCCTAACCAAACATTTCATCTTTCAATGAGTTCAATTTCTGGACAAAACCACCAAATTATGCTTGAATTGAGGGAGATTGAGGAGGAATTGCTTCCATCTGAAAACCACATTTTTCAATCAAAATAGTGAACTTGGCAGGAATTATGGGAGACAATTTTGCTTGGACATTTTCACCCTTCTCAAACAAAGTATCAAAACCATTTGCAAAGCAAACCTCTAGAGTCTCCTCAACAATATCAACTTGTACTAGAGAATCCATAATATCCTCACTGCTTAAACTAGCATCTGAATCACCATTAGAATCGTAACCAGAAGGAGTGTCTTTAGGAGATGGATGACTATAAACTCTTTTTACAAAAGTAATATTTGATCCAACAAAATCTTTATTATAAATGAAAGGTTGTTTAATACAAGAGAGGGGATTCTCTATCTATAGAGAATAAAATGCAAGAAAGAATCCTAAAAGGATAACTAAAAGAAAACTAATTTCTATGTAAAAAAGGAATTAATTAAGGAATTAAACTAAGTTATACTAAATCCTATTACATCAATAGGAGTTGGTGCAAAAGTGTTAATAGGAATATGATAGATACAACTCTATTGATTTATATTATAGAATATTACAAATATGACGGTGGAGATTCAAACCTCAACCTACGAGGGAGGTTACTGTCGCCTTGACCCATAATCATTTAAGTTATGTTAAAGTTATTAATTGTTCATGCATTACCACTTGTGGTTGCTTGTTGGCCAATTTCCTGATGTTGCTTGCCTTTCGTCTGAGTCGTGTGTTGCCTTGAAAACACTGTTTGAGTGTATTGATGATTGAGCATCGAGCATTGCCATTGACTATAGACTATAGTTGAAGCTTCAAGCCCAAAGTTAACAACGAGGCAAGTGCAAGGAATTAGGAAGGAGATTTGGTGACATGAGATTTGGGAGATCTATTCCAAACATTGGTAGGAGCTACAACTTATAAGAAATCTTGCCCATTGTTTCCAAAATGGAGGAGGCTGATGGAATCTACTTGCCAACTTCTTCAAATATCTTGGTTGTAAATTGTAAGTCGAGAATTATCGAGTTGCGTGGTTTAAATTCCATGTCTGATGTCCCTTCTATCATCTATGTTAATCTACAAATGTGACTTTTTTTTTTTTGATATCCGTGAGTGTTCGGGCCAGCTTACGTGCACCTTGACTAATTTCACGGGATAACCCGCCTGACCGTACAACATTTGGGTGTCAAGGAAACTCGTAGGAAATTAATTCCCTGGGTAAGTGATCACCATGGATTGAACCTATGACCTCTACCATGGATTGAACCTATGACCTCTTAGCCATTTATTGAGACTATGTCTCCTTTTTTACCATTAGGCTAACCCATGATGGTTTCGGCAGATGTGACTTTTATCAGATAATATGTCTATTTACCTCCTTCAATTGACTATCAGGATGGGGTAATAGTGATAGTGACACTATGTTCAAAGTATTCTCGTAGGTTCTAATTAGATCATCCCAAAATAAACTCATCGACACCTTGTCTCTACTTGAAGTTGGCAATTTGTGGATGCCATATGATTTCACAATTCCTCAGTCAATCCATCCACTGCTATTTTGAATTTATTTGTACAACAAATATGGGGGTGGGAATTTGAATAATCAATTCACCGCTATTAGGATAGTGTATCCTATAGCGGAATCCATCTAAGGTCATGCCTTGCCGATGTAATTTGAGAATGAGGAGTGGTTAAATCCGACAGACCTACTAAACACACTTATATTAGGAGAAAGAAGAGTAGAGAAGTGGAGACAGAAAAGAAATTAGTTAGTTAGAGGCAGCACGGGGAGGGAATATTTTGTGTGTGTGGGTGACAATGGCATTAGGGCCCCATTTGGGTATTTTTTTAACAAGTGGTATAGATAATAGTTGATGTATTGGGAAAAGAAATTGTTTGTTGTGCAGTAGCTTCTGGGAGATGCTCATGCCTCTTGTAATGCCCGAGAGTTGTCTCTTTCCTCTCACTGTGAGCTATTCAAGAGATCAGTAAACTCTAATTATCAACAATCATCATATCTCAACCTCGGCGGTGATGTGAACTGCTTGTGTTAGCAAACATTACAACTCCTCATGTCTTGTTAGTACAAAATAAGCTGAAAGGCTTGAGCTTAGGAGTAACCAACCATCATACTATAATGGCACACTTATTACAAGAGGGATGCCGGGGGTTGATCCCCTAGCCCAGTAAACATTCTTTTATACTGGAATTGACCCAGTTCAACTTTAAAATGTGGGAAAGCATCCCTCGCCTGACCCATGGCCTTTTTCATGAGAGAAAGCAAAAATCATTATCCATATATGCTTGTTGTTTGTGGACAAATACTATAACCCATCCCCTCAAGCGTTTGGGTTAAGTGAAATATCAGTCAGTTTCTTTAATAGATCCAGCTAATGACTCCTTGTTGGCCTGGTGATTATTTGCAAATTCTTCTCTAGCGTCCAACTTCTTATGTGTTTAAATGCTCAATGAACATTTAGGGACTGGGGAAATTTGGTTTAAAAGGTTTATTATATGCAAAATGATTGATGCTAGATTAGTGATTCATCAACAAGTTAATTAAAATTTTAAGGTCTACAAATATGGTTTTTAGTCTTTTATTAACTAGGGAGGAGAAGGAAAAAATTGTGGACAGGATTTAGCTTTATTGTATATGTTATTTTGGATCCACAATGGGCTCTCATTGTGTGCTGTCATTATAGGGGGATTATATTCTTACCGAAGTTGGTTTTTTCATTCACAGAAATCAAGCAGGGAGACGATAATTGATAAGAACAAGGACATCTCTAGGTTGCAGGCACGGGTATGCTGCTTCATTAACTTAAGTAAACCACTAATAAATTGCATAAAGTAAACACGAGTTTTATCTCTTTCAGTTGCTTTCTGGTTGTGCAAACAGCCACCAACTTTGATTGTGATGTCACCATGTTTATTGGAAGAATTAGTAACAAGTTGAATAGGGTCTGGAGGATGTTGGTGGCTTTAACTTGAAACTTATCATACAGGACCATTTATTTACTTATGGGTACTTGTTCATTCACGAGCTGCGAGCATAGGACACTACTTTTTGTTCTAATGAGGAATTGAAACCAAAGTTTGACAGGTTGTGGAGTTGGAAAAACAAAGACATGATCTGTTGCAAGAAAACAAACAACTGACGGAAAATGTTGCTGATTATCAGTCAAAACTTCTGAACCTTGAGAGGAAAATATCTTCCACTTACACACACTCTTCAGATCGAGTCACAAAGGTAAACGTCTTATCCTCACCTATTAAACGGTTAAACTTCACAAGTTTATTTTTGCATTGAGAATAAAGTCCCTTAGAAGAACTTCGCAAGATCTGCTAAAACCAAGCTGTAGACAATTCTCTTTTGGGAGATAACTTTTGGTCTCCTTTAAGATGATGGTTTTTTTGTTCTTTTCCCTGAATTATAGTGCACGAATAATTAATTACTCCTTGTATACCTCATGTCAATGTGTACGAGATGAGCAGGAGATGTTGAGTTCACAAGTTGATGCAGCTTGTATTCTGGTTGATAAATTGATTACAGAAAATGCAGAGCTTATTGGGAAGGTAATAGCACTGCTGTAATATCCGTAGTGCACATCTGTGATACTTCCTTGTGCCTAGTTTCATTTCGGAAATTTTTTAGTTTGATCTGAACTGACCTTGTATTGAAAGAGAGCATAATTGCTGAACTCCTGATATCATGTTTTATGATATAGTGATGTTATATTTAGATGCAGAGTAGCTGGATAAAAATCCATTTCTTACAGATGTTATGAAATTATGGTCATTGCTTTATCTGGAGGTTAACAACTCACGCTTTTTATTCTTAGGTGAATGAGTTATTTGTTGAGCTTCAAAGAGTTACAAAGACTGAGCTATCTCCAGGCGTGGAGCCTGACCAGACGGCTGAAGCTACTGACACCGCCACTTTCAACGATCCCAAGCCTCCCTTGATTCTGAATAGCATCACATCAAGTAAAAGTTTGGACGCATTAAAATCTGTTCCAATCCACAACCATAGCATTAGTAATGATTTTGTGGAGCTGGACAACGATTTTCTGGCTCCAAAATCCTCAATGCCTCTGGCGGCAGGAGAAATCGAACAAATTCCGTTGCATGAAATTGAAGATAGGAACAGGGACCGAGAACTGCCAGCTACAGAGAGCGATGAGAAGGATGTGCTGCTTTCAGATGCTCCTTTGATTGGGGCTCCTTATCGGTTGATGTCATTTATGGCCAAATACGTAAGCGGCGCTGACCTGGTTGGCAAAAGCTGA

mRNA sequence

ATGGAAAATGAAAAGAAGAAGAGAAAAAACAAGAAAAAGAAGAACAAACCGATTAGAACCTCAGAAGATGAAATGATTGTATCAGAATCGACTTCTGTGGATGATACTCACATGAGAAATGGACAGAATGATCAAAATCCAATTTCTGACACAGTAGATCTTTCATACCAGCATAGCAGTGGAACAAAAGATGCTAAGCTGGAGGATACAATTAAGCATTTACATGAGGAAAACAATGTACATATACAAAAAATGGCTGACCTAGAGTTGAAGCTTGTGGAATGCGAGGGTGAAAAACACTCATGGCTTCAAAAAGAGGCAGCGTTAGTGGATAAAATTACAAGCTTACAAGAGGACAAGACTGCCTTGGATTTAGAAGGGAAATCAAGCAGGGAGACGATAATTGATAAGAACAAGGACATCTCTAGGTTGCAGGCACGGGTTGTGGAGTTGGAAAAACAAAGACATGATCTGTTGCAAGAAAACAAACAACTGACGGAAAATGTTGCTGATTATCAGTCAAAACTTCTGAACCTTGAGAGGAAAATATCTTCCACTTACACACACTCTTCAGATCGAGTCACAAAGGAGATGTTGAGTTCACAAGTTGATGCAGCTTGTATTCTGGTTGATAAATTGATTACAGAAAATGCAGAGCTTATTGGGAAGGTGAATGAGTTATTTGTTGAGCTTCAAAGAGTTACAAAGACTGAGCTATCTCCAGGCGTGGAGCCTGACCAGACGGCTGAAGCTACTGACACCGCCACTTTCAACGATCCCAAGCCTCCCTTGATTCTGAATAGCATCACATCAAGTAAAAGTTTGGACGCATTAAAATCTGTTCCAATCCACAACCATAGCATTAGTAATGATTTTGTGGAGCTGGACAACGATTTTCTGGCTCCAAAATCCTCAATGCCTCTGGCGGCAGGAGAAATCGAACAAATTCCGTTGCATGAAATTGAAGATAGGAACAGGGACCGAGAACTGCCAGCTACAGAGAGCGATGAGAAGGATGTGCTGCTTTCAGATGCTCCTTTGATTGGGGCTCCTTATCGGTTGATGTCATTTATGGCCAAATACGTAAGCGGCGCTGACCTGGTTGGCAAAAGCTGA

Coding sequence (CDS)

ATGGAAAATGAAAAGAAGAAGAGAAAAAACAAGAAAAAGAAGAACAAACCGATTAGAACCTCAGAAGATGAAATGATTGTATCAGAATCGACTTCTGTGGATGATACTCACATGAGAAATGGACAGAATGATCAAAATCCAATTTCTGACACAGTAGATCTTTCATACCAGCATAGCAGTGGAACAAAAGATGCTAAGCTGGAGGATACAATTAAGCATTTACATGAGGAAAACAATGTACATATACAAAAAATGGCTGACCTAGAGTTGAAGCTTGTGGAATGCGAGGGTGAAAAACACTCATGGCTTCAAAAAGAGGCAGCGTTAGTGGATAAAATTACAAGCTTACAAGAGGACAAGACTGCCTTGGATTTAGAAGGGAAATCAAGCAGGGAGACGATAATTGATAAGAACAAGGACATCTCTAGGTTGCAGGCACGGGTTGTGGAGTTGGAAAAACAAAGACATGATCTGTTGCAAGAAAACAAACAACTGACGGAAAATGTTGCTGATTATCAGTCAAAACTTCTGAACCTTGAGAGGAAAATATCTTCCACTTACACACACTCTTCAGATCGAGTCACAAAGGAGATGTTGAGTTCACAAGTTGATGCAGCTTGTATTCTGGTTGATAAATTGATTACAGAAAATGCAGAGCTTATTGGGAAGGTGAATGAGTTATTTGTTGAGCTTCAAAGAGTTACAAAGACTGAGCTATCTCCAGGCGTGGAGCCTGACCAGACGGCTGAAGCTACTGACACCGCCACTTTCAACGATCCCAAGCCTCCCTTGATTCTGAATAGCATCACATCAAGTAAAAGTTTGGACGCATTAAAATCTGTTCCAATCCACAACCATAGCATTAGTAATGATTTTGTGGAGCTGGACAACGATTTTCTGGCTCCAAAATCCTCAATGCCTCTGGCGGCAGGAGAAATCGAACAAATTCCGTTGCATGAAATTGAAGATAGGAACAGGGACCGAGAACTGCCAGCTACAGAGAGCGATGAGAAGGATGTGCTGCTTTCAGATGCTCCTTTGATTGGGGCTCCTTATCGGTTGATGTCATTTATGGCCAAATACGTAAGCGGCGCTGACCTGGTTGGCAAAAGCTGA

Protein sequence

MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSSGTKDAKLEDTIKHLHEENNVHIQKMADLELKLVECEGEKHSWLQKEAALVDKITSLQEDKTALDLEGKSSRETIIDKNKDISRLQARVVELEKQRHDLLQENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEMLSSQVDAACILVDKLITENAELIGKVNELFVELQRVTKTELSPGVEPDQTAEATDTATFNDPKPPLILNSITSSKSLDALKSVPIHNHSISNDFVELDNDFLAPKSSMPLAAGEIEQIPLHEIEDRNRDRELPATESDEKDVLLSDAPLIGAPYRLMSFMAKYVSGADLVGKS
Homology
BLAST of HG10020477 vs. NCBI nr
Match: XP_038906167.1 (myosin-6 [Benincasa hispida] >XP_038906168.1 myosin-6 [Benincasa hispida] >XP_038906169.1 myosin-6 [Benincasa hispida] >XP_038906170.1 myosin-6 [Benincasa hispida])

HSP 1 Score: 591.3 bits (1523), Expect = 5.9e-165
Identity = 330/393 (83.97%), Postives = 348/393 (88.55%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           M+NEKKKRKNKKKK + IRTSEDE+IVSESTSVDDTHMRNG NDQNPISD VDLSYQH S
Sbjct: 1   MDNEKKKRKNKKKKKQQIRTSEDELIVSESTSVDDTHMRNGLNDQNPISDAVDLSYQHIS 60

Query: 61  GTKDAKLEDTIKHLHEENNVHIQKMADLELKLVECEGEKHSWLQKEAALVDKITSLQEDK 120
           GTKDAK EDTIKHLHEENN+H Q+MADLELKLVE E EKHSWLQKEAAL+DKI SLQEDK
Sbjct: 61  GTKDAK-EDTIKHLHEENNIHTQRMADLELKLVEYESEKHSWLQKEAALMDKIRSLQEDK 120

Query: 121 TALDLEG---------------------KSSRETIIDKNKDISRLQARVVELEKQRHDLL 180
           TALDLEG                     KSS ETI+DKNKDISRLQA+VVELE+QR DLL
Sbjct: 121 TALDLEGARLLDTIELLERDKASLILNEKSSWETIVDKNKDISRLQAQVVELEEQRRDLL 180

Query: 181 QENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEMLSSQVDAACILVDKLITENAE 240
           QENKQLTENVADYQSKLLNLERK+SSTY HSS RVTKEMLSSQVDAA ILVDKLITENAE
Sbjct: 181 QENKQLTENVADYQSKLLNLERKLSSTYMHSSGRVTKEMLSSQVDAARILVDKLITENAE 240

Query: 241 LIGKVNELFVELQRVTKTELSPGVEPDQTA-EATDTATFNDPKPPLILNSITSSKSLDAL 300
           LIGKVN LFVELQRVTKTELS GVEPDQ A EAT TATFNDP+PPLILNS+TSSKSLDAL
Sbjct: 241 LIGKVNGLFVELQRVTKTELSSGVEPDQMAKEATGTATFNDPEPPLILNSVTSSKSLDAL 300

Query: 301 KSVPIHNHSISNDFVELDNDFLAPKSSMPLAAGEIEQIPLHEIEDRNRDRELPATESDEK 360
           +SVPIHNHSI +DFV+LDNDFLA KSSMP+AAGEIEQ PLHEIEDRNRDRELPAT SDE+
Sbjct: 301 ESVPIHNHSIGSDFVDLDNDFLASKSSMPMAAGEIEQTPLHEIEDRNRDRELPATGSDEQ 360

Query: 361 DVLLSDAPLIGAPYRLMSFMAKYVSGADLVGKS 372
           DVLLSDAPLIGAPYRL+SFMAKYVSGADLVGKS
Sbjct: 361 DVLLSDAPLIGAPYRLISFMAKYVSGADLVGKS 392

BLAST of HG10020477 vs. NCBI nr
Match: XP_008442312.1 (PREDICTED: uncharacterized protein LOC103486222 [Cucumis melo] >XP_008442313.1 PREDICTED: uncharacterized protein LOC103486222 [Cucumis melo])

HSP 1 Score: 567.8 bits (1462), Expect = 7.0e-158
Identity = 313/393 (79.64%), Postives = 341/393 (86.77%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           M+N KKKRKNKKKKNK IRTSEDEM+VSESTSVDDTH RN QNDQNPISDT+ LSYQ SS
Sbjct: 1   MDNGKKKRKNKKKKNKQIRTSEDEMVVSESTSVDDTHPRNRQNDQNPISDTLLLSYQQSS 60

Query: 61  GTKDAKLEDTIKHLHEENNVHIQKMADLELKLVECEGEKHSWLQKEAALVDKITSLQEDK 120
             KDAKL+DTIKHLHEENN+HIQ+MADLELKLVECEGEKHSWLQKE AL+DKI +LQEDK
Sbjct: 61  AKKDAKLDDTIKHLHEENNIHIQRMADLELKLVECEGEKHSWLQKEEALMDKIRNLQEDK 120

Query: 121 TALDLEG---------------------KSSRETIIDKNKDISRLQARVVELEKQRHDLL 180
           T+LDLEG                     KSS+ETI+DKNKDISRLQA+VVELE+QR DLL
Sbjct: 121 TSLDLEGARLLNTIKLLERDKASLILDEKSSKETIVDKNKDISRLQAQVVELEEQRCDLL 180

Query: 181 QENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEMLSSQVDAACILVDKLITENAE 240
            ENK+LTE VADYQSKLLNLERKISSTY HSSDRVTKE+L+SQVDAA ILVD+LITENAE
Sbjct: 181 HENKELTEKVADYQSKLLNLERKISSTYIHSSDRVTKEILNSQVDAARILVDRLITENAE 240

Query: 241 LIGKVNELFVELQRVTKTELSPGVEPDQTA-EATDTATFNDPKPPLILNSITSSKSLDAL 300
           LIGKVNELFVELQRVTKTELS GVEPDQ A EATDT TFNDP+PPLILNS+T  KS DAL
Sbjct: 241 LIGKVNELFVELQRVTKTELSSGVEPDQMAKEATDTTTFNDPEPPLILNSVTCGKSSDAL 300

Query: 301 KSVPIHNHSISNDFVELDNDFLAPKSSMPLAAGEIEQIPLHEIEDRNRDRELPATESDEK 360
            SVPIH+HSI  DFV+LD+D+LA KSSM +A GEIEQIPL + +DRNR+RELPATE DEK
Sbjct: 301 NSVPIHSHSIGGDFVDLDSDYLASKSSMRMATGEIEQIPLPQFDDRNRNRELPATEIDEK 360

Query: 361 DVLLSDAPLIGAPYRLMSFMAKYVSGADLVGKS 372
           DVLLSDAPLIGAPYRL+SFMAKYVSGADLVGKS
Sbjct: 361 DVLLSDAPLIGAPYRLISFMAKYVSGADLVGKS 393

BLAST of HG10020477 vs. NCBI nr
Match: XP_004137671.1 (uncharacterized protein LOC101221440 [Cucumis sativus] >KGN58680.1 hypothetical protein Csa_002486 [Cucumis sativus])

HSP 1 Score: 558.9 bits (1439), Expect = 3.3e-155
Identity = 309/393 (78.63%), Postives = 341/393 (86.77%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           M+ EKKKRKNKKKKNK IRTSEDEM+VSESTSVDDTH RN QNDQNPISDT+ +SYQHSS
Sbjct: 1   MDKEKKKRKNKKKKNKQIRTSEDEMVVSESTSVDDTHPRNRQNDQNPISDTL-ISYQHSS 60

Query: 61  GTKDAKLEDTIKHLHEENNVHIQKMADLELKLVECEGEKHSWLQKEAALVDKITSLQEDK 120
           GTKDAKL+DTIKHLHEENN+HI++MADL+LKLVECEGEK+SWLQKE AL+DKI +LQEDK
Sbjct: 61  GTKDAKLDDTIKHLHEENNIHIKRMADLDLKLVECEGEKYSWLQKEEALMDKIRNLQEDK 120

Query: 121 TALDLEG---------------------KSSRETIIDKNKDISRLQARVVELEKQRHDLL 180
           TALDLEG                     KSSRETI+DKNKDISRLQA+VVELE+Q+ DLL
Sbjct: 121 TALDLEGARLLNIIKLLERDKASLILDEKSSRETIVDKNKDISRLQAQVVELEEQKRDLL 180

Query: 181 QENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEMLSSQVDAACILVDKLITENAE 240
            ENKQLT  VADYQSKLLNLERKISSTY HSSDRVTKE+L+SQVDAA ILVDKLITENAE
Sbjct: 181 HENKQLTGKVADYQSKLLNLERKISSTYIHSSDRVTKEILNSQVDAARILVDKLITENAE 240

Query: 241 LIGKVNELFVELQRVTKTELSPGVEPDQTA-EATDTATFNDPKPPLILNSITSSKSLDAL 300
           LIGKVNELFVELQRVTKTEL  GV PDQ A EATDT TFN+ +PP+ILNS+TS KSLDAL
Sbjct: 241 LIGKVNELFVELQRVTKTELPSGVVPDQMATEATDTTTFNESEPPVILNSVTSGKSLDAL 300

Query: 301 KSVPIHNHSISNDFVELDNDFLAPKSSMPLAAGEIEQIPLHEIEDRNRDRELPATESDEK 360
           KSV IH+HSI  DFV+L +DF+A ++SMP+AAGEIEQI LH+ ED+N  RELPATE DEK
Sbjct: 301 KSVSIHSHSIGGDFVDLGSDFMASEASMPMAAGEIEQIQLHQFEDQNGTRELPATEIDEK 360

Query: 361 DVLLSDAPLIGAPYRLMSFMAKYVSGADLVGKS 372
           DVLLSDAPLIGAPYRL+SFMAKYVSGADLVGKS
Sbjct: 361 DVLLSDAPLIGAPYRLISFMAKYVSGADLVGKS 392

BLAST of HG10020477 vs. NCBI nr
Match: XP_023540633.1 (uncharacterized protein LOC111800938 [Cucurbita pepo subsp. pepo] >XP_023540634.1 uncharacterized protein LOC111800938 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 521.5 bits (1342), Expect = 5.8e-144
Identity = 300/416 (72.12%), Postives = 329/416 (79.09%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           MENEKKKRKNKKKKNK IRTSEDE+I SESTSVDDTH  NGQNDQNPIS TVD S QHS 
Sbjct: 1   MENEKKKRKNKKKKNKQIRTSEDEIIASESTSVDDTHTTNGQNDQNPISGTVDSSCQHSR 60

Query: 61  GTKD---------------------AKLEDTIKHLHEENNVHIQKMADLELKLVECEGEK 120
           GTKD                     AKLEDTIK LHEENNVH+QK+ADLELKLVE EGEK
Sbjct: 61  GTKDAVSEETNEHLRKESLLLAHDKAKLEDTIKRLHEENNVHMQKLADLELKLVEFEGEK 120

Query: 121 HSWLQKEAALVDKITSLQEDKTALDLEG---------------------KSSRETIIDKN 180
           HSWL+KE  LVDKI  LQEDKTALDLEG                      SS E I+DKN
Sbjct: 121 HSWLRKEETLVDKIRRLQEDKTALDLEGARLLHTIEQLERDKASLIFNENSSTEMIVDKN 180

Query: 181 KDISRLQARVVELEKQRHDLLQENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEM 240
           KDISRL A+VVELE+QR DLLQENKQLTENVADYQSK+  LERKISST+THSSDRVTKEM
Sbjct: 181 KDISRLHAQVVELEEQRRDLLQENKQLTENVADYQSKITILERKISSTHTHSSDRVTKEM 240

Query: 241 LSSQVDAACILVDKLITENAELIGKVNELFVELQRVTKTELSPGVEPDQTAE-ATDTATF 300
           LSSQVDAA ILVDKLITENAELIGKVNEL+VELQRVTK E++ G+EPDQ  E ATDTATF
Sbjct: 241 LSSQVDAARILVDKLITENAELIGKVNELYVELQRVTKAEVNSGMEPDQMVEAATDTATF 300

Query: 301 NDPKPPLILNSITSSKSLDALKSVPIHNHSISNDFVELDNDFLAPKSS--MPLAAGEIEQ 360
           N+P+PPLI N +TSSKSLDAL+SVPIHNHS+ ++ +++D+D L   +S  +P+  GEIEQ
Sbjct: 301 NEPEPPLIHNMVTSSKSLDALESVPIHNHSVGDNIMDMDHDLLLSPTSLILPMEEGEIEQ 360

Query: 361 IPLHEIEDRNRDRELPATESDEKDVLLSDAPLIGAPYRLMSFMAKYVSGADLVGKS 372
           IP  E EDRNR+REL   ESDEKDVLLSDAPLIGAPYRL+SFMAKYVSGADLVGKS
Sbjct: 361 IPPQENEDRNRNRELSGAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADLVGKS 416

BLAST of HG10020477 vs. NCBI nr
Match: XP_022971477.1 (UPF0430 protein CG31712 isoform X1 [Cucurbita maxima] >XP_022971478.1 UPF0430 protein CG31712 isoform X1 [Cucurbita maxima])

HSP 1 Score: 518.5 bits (1334), Expect = 4.9e-143
Identity = 299/415 (72.05%), Postives = 326/415 (78.55%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           MENEKKKRKNKKKKNK IRTSED+ I SESTSVDDTH  NGQNDQNPIS TVD SYQHS 
Sbjct: 1   MENEKKKRKNKKKKNKQIRTSEDDTIASESTSVDDTHTTNGQNDQNPISGTVDPSYQHSR 60

Query: 61  GTKD---------------------AKLEDTIKHLHEENNVHIQKMADLELKLVECEGEK 120
            TKD                     AKLEDTIK LHEENN+H+QK+ADLELKLVE EGEK
Sbjct: 61  ETKDAVSEETNEHLRKESHLLAHDKAKLEDTIKRLHEENNLHMQKLADLELKLVEFEGEK 120

Query: 121 HSWLQKEAALVDKITSLQEDKTALDLEG---------------------KSSRETIIDKN 180
           HSWL+KE  LVDKI  LQEDKTALDLEG                      SS E I+DKN
Sbjct: 121 HSWLRKEETLVDKIRRLQEDKTALDLEGARLLHTIEQLERDKASLIFNENSSTEMIVDKN 180

Query: 181 KDISRLQARVVELEKQRHDLLQENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEM 240
           KDISRLQA+VVELE+QR DLLQENKQLTENVADY+SK+  LERKISST+TH SDRVTKEM
Sbjct: 181 KDISRLQAQVVELEEQRRDLLQENKQLTENVADYRSKITILERKISSTHTHFSDRVTKEM 240

Query: 241 LSSQVDAACILVDKLITENAELIGKVNELFVELQRVTKTELSPGVEPDQTAE-ATDTATF 300
           LSSQVDAA ILVDKLITENAELIGKVN L+VELQRVTK E++ G+EPDQ  E ATDTATF
Sbjct: 241 LSSQVDAARILVDKLITENAELIGKVNALYVELQRVTKAEVNSGMEPDQMVESATDTATF 300

Query: 301 NDPKPPLILNSITSSKSLDALKSVPIHNHSISNDFVELDNDFLAPKSS--MPLAAGEIEQ 360
           NDPKPPLI N +TSSKSLDAL+SVPIHNHS+ ++ V++DND L   +S  +P+  GEIEQ
Sbjct: 301 NDPKPPLIHNMVTSSKSLDALESVPIHNHSVGDNVVDMDNDLLLSPTSLILPMEEGEIEQ 360

Query: 361 IPLHEIEDRNRDRELPATESDEKDVLLSDAPLIGAPYRLMSFMAKYVSGADLVGK 371
           IP  E EDRNR+REL   ESDEKDVLLSDAPLIGAPYRL+SFMAKYVSGADLVGK
Sbjct: 361 IPPQENEDRNRNRELSGAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADLVGK 415

BLAST of HG10020477 vs. ExPASy TrEMBL
Match: A0A1S3B4X8 (uncharacterized protein LOC103486222 OS=Cucumis melo OX=3656 GN=LOC103486222 PE=4 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 3.4e-158
Identity = 313/393 (79.64%), Postives = 341/393 (86.77%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           M+N KKKRKNKKKKNK IRTSEDEM+VSESTSVDDTH RN QNDQNPISDT+ LSYQ SS
Sbjct: 1   MDNGKKKRKNKKKKNKQIRTSEDEMVVSESTSVDDTHPRNRQNDQNPISDTLLLSYQQSS 60

Query: 61  GTKDAKLEDTIKHLHEENNVHIQKMADLELKLVECEGEKHSWLQKEAALVDKITSLQEDK 120
             KDAKL+DTIKHLHEENN+HIQ+MADLELKLVECEGEKHSWLQKE AL+DKI +LQEDK
Sbjct: 61  AKKDAKLDDTIKHLHEENNIHIQRMADLELKLVECEGEKHSWLQKEEALMDKIRNLQEDK 120

Query: 121 TALDLEG---------------------KSSRETIIDKNKDISRLQARVVELEKQRHDLL 180
           T+LDLEG                     KSS+ETI+DKNKDISRLQA+VVELE+QR DLL
Sbjct: 121 TSLDLEGARLLNTIKLLERDKASLILDEKSSKETIVDKNKDISRLQAQVVELEEQRCDLL 180

Query: 181 QENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEMLSSQVDAACILVDKLITENAE 240
            ENK+LTE VADYQSKLLNLERKISSTY HSSDRVTKE+L+SQVDAA ILVD+LITENAE
Sbjct: 181 HENKELTEKVADYQSKLLNLERKISSTYIHSSDRVTKEILNSQVDAARILVDRLITENAE 240

Query: 241 LIGKVNELFVELQRVTKTELSPGVEPDQTA-EATDTATFNDPKPPLILNSITSSKSLDAL 300
           LIGKVNELFVELQRVTKTELS GVEPDQ A EATDT TFNDP+PPLILNS+T  KS DAL
Sbjct: 241 LIGKVNELFVELQRVTKTELSSGVEPDQMAKEATDTTTFNDPEPPLILNSVTCGKSSDAL 300

Query: 301 KSVPIHNHSISNDFVELDNDFLAPKSSMPLAAGEIEQIPLHEIEDRNRDRELPATESDEK 360
            SVPIH+HSI  DFV+LD+D+LA KSSM +A GEIEQIPL + +DRNR+RELPATE DEK
Sbjct: 301 NSVPIHSHSIGGDFVDLDSDYLASKSSMRMATGEIEQIPLPQFDDRNRNRELPATEIDEK 360

Query: 361 DVLLSDAPLIGAPYRLMSFMAKYVSGADLVGKS 372
           DVLLSDAPLIGAPYRL+SFMAKYVSGADLVGKS
Sbjct: 361 DVLLSDAPLIGAPYRLISFMAKYVSGADLVGKS 393

BLAST of HG10020477 vs. ExPASy TrEMBL
Match: A0A0A0L9P2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G727980 PE=4 SV=1)

HSP 1 Score: 558.9 bits (1439), Expect = 1.6e-155
Identity = 309/393 (78.63%), Postives = 341/393 (86.77%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           M+ EKKKRKNKKKKNK IRTSEDEM+VSESTSVDDTH RN QNDQNPISDT+ +SYQHSS
Sbjct: 1   MDKEKKKRKNKKKKNKQIRTSEDEMVVSESTSVDDTHPRNRQNDQNPISDTL-ISYQHSS 60

Query: 61  GTKDAKLEDTIKHLHEENNVHIQKMADLELKLVECEGEKHSWLQKEAALVDKITSLQEDK 120
           GTKDAKL+DTIKHLHEENN+HI++MADL+LKLVECEGEK+SWLQKE AL+DKI +LQEDK
Sbjct: 61  GTKDAKLDDTIKHLHEENNIHIKRMADLDLKLVECEGEKYSWLQKEEALMDKIRNLQEDK 120

Query: 121 TALDLEG---------------------KSSRETIIDKNKDISRLQARVVELEKQRHDLL 180
           TALDLEG                     KSSRETI+DKNKDISRLQA+VVELE+Q+ DLL
Sbjct: 121 TALDLEGARLLNIIKLLERDKASLILDEKSSRETIVDKNKDISRLQAQVVELEEQKRDLL 180

Query: 181 QENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEMLSSQVDAACILVDKLITENAE 240
            ENKQLT  VADYQSKLLNLERKISSTY HSSDRVTKE+L+SQVDAA ILVDKLITENAE
Sbjct: 181 HENKQLTGKVADYQSKLLNLERKISSTYIHSSDRVTKEILNSQVDAARILVDKLITENAE 240

Query: 241 LIGKVNELFVELQRVTKTELSPGVEPDQTA-EATDTATFNDPKPPLILNSITSSKSLDAL 300
           LIGKVNELFVELQRVTKTEL  GV PDQ A EATDT TFN+ +PP+ILNS+TS KSLDAL
Sbjct: 241 LIGKVNELFVELQRVTKTELPSGVVPDQMATEATDTTTFNESEPPVILNSVTSGKSLDAL 300

Query: 301 KSVPIHNHSISNDFVELDNDFLAPKSSMPLAAGEIEQIPLHEIEDRNRDRELPATESDEK 360
           KSV IH+HSI  DFV+L +DF+A ++SMP+AAGEIEQI LH+ ED+N  RELPATE DEK
Sbjct: 301 KSVSIHSHSIGGDFVDLGSDFMASEASMPMAAGEIEQIQLHQFEDQNGTRELPATEIDEK 360

Query: 361 DVLLSDAPLIGAPYRLMSFMAKYVSGADLVGKS 372
           DVLLSDAPLIGAPYRL+SFMAKYVSGADLVGKS
Sbjct: 361 DVLLSDAPLIGAPYRLISFMAKYVSGADLVGKS 392

BLAST of HG10020477 vs. ExPASy TrEMBL
Match: A0A6J1I220 (UPF0430 protein CG31712 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470185 PE=4 SV=1)

HSP 1 Score: 518.5 bits (1334), Expect = 2.4e-143
Identity = 299/415 (72.05%), Postives = 326/415 (78.55%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           MENEKKKRKNKKKKNK IRTSED+ I SESTSVDDTH  NGQNDQNPIS TVD SYQHS 
Sbjct: 1   MENEKKKRKNKKKKNKQIRTSEDDTIASESTSVDDTHTTNGQNDQNPISGTVDPSYQHSR 60

Query: 61  GTKD---------------------AKLEDTIKHLHEENNVHIQKMADLELKLVECEGEK 120
            TKD                     AKLEDTIK LHEENN+H+QK+ADLELKLVE EGEK
Sbjct: 61  ETKDAVSEETNEHLRKESHLLAHDKAKLEDTIKRLHEENNLHMQKLADLELKLVEFEGEK 120

Query: 121 HSWLQKEAALVDKITSLQEDKTALDLEG---------------------KSSRETIIDKN 180
           HSWL+KE  LVDKI  LQEDKTALDLEG                      SS E I+DKN
Sbjct: 121 HSWLRKEETLVDKIRRLQEDKTALDLEGARLLHTIEQLERDKASLIFNENSSTEMIVDKN 180

Query: 181 KDISRLQARVVELEKQRHDLLQENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEM 240
           KDISRLQA+VVELE+QR DLLQENKQLTENVADY+SK+  LERKISST+TH SDRVTKEM
Sbjct: 181 KDISRLQAQVVELEEQRRDLLQENKQLTENVADYRSKITILERKISSTHTHFSDRVTKEM 240

Query: 241 LSSQVDAACILVDKLITENAELIGKVNELFVELQRVTKTELSPGVEPDQTAE-ATDTATF 300
           LSSQVDAA ILVDKLITENAELIGKVN L+VELQRVTK E++ G+EPDQ  E ATDTATF
Sbjct: 241 LSSQVDAARILVDKLITENAELIGKVNALYVELQRVTKAEVNSGMEPDQMVESATDTATF 300

Query: 301 NDPKPPLILNSITSSKSLDALKSVPIHNHSISNDFVELDNDFLAPKSS--MPLAAGEIEQ 360
           NDPKPPLI N +TSSKSLDAL+SVPIHNHS+ ++ V++DND L   +S  +P+  GEIEQ
Sbjct: 301 NDPKPPLIHNMVTSSKSLDALESVPIHNHSVGDNVVDMDNDLLLSPTSLILPMEEGEIEQ 360

Query: 361 IPLHEIEDRNRDRELPATESDEKDVLLSDAPLIGAPYRLMSFMAKYVSGADLVGK 371
           IP  E EDRNR+REL   ESDEKDVLLSDAPLIGAPYRL+SFMAKYVSGADLVGK
Sbjct: 361 IPPQENEDRNRNRELSGAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADLVGK 415

BLAST of HG10020477 vs. ExPASy TrEMBL
Match: A0A6J1FUD8 (uncharacterized protein LOC111446955 OS=Cucurbita moschata OX=3662 GN=LOC111446955 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 4.0e-143
Identity = 299/416 (71.88%), Postives = 327/416 (78.61%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           MENEKKKRKNKKKKNK IRTSED+ I SESTSVDDTH  NGQNDQNPIS TVD SYQ S 
Sbjct: 1   MENEKKKRKNKKKKNKQIRTSEDDTIASESTSVDDTHTTNGQNDQNPISGTVDPSYQQSR 60

Query: 61  GTKD---------------------AKLEDTIKHLHEENNVHIQKMADLELKLVECEGEK 120
           GTKD                     AKLEDTIK LHEENNVH+QK+ADLELKLVE EGEK
Sbjct: 61  GTKDAVSEETNEHLRKESLLLAHDKAKLEDTIKRLHEENNVHMQKLADLELKLVEFEGEK 120

Query: 121 HSWLQKEAALVDKITSLQEDKTALDLEG---------------------KSSRETIIDKN 180
           HSWL+KE  LVDKI  LQEDKTALDLEG                      SS E I+DKN
Sbjct: 121 HSWLRKEETLVDKIRRLQEDKTALDLEGARLLHTIEQLERDKASLIFNENSSTEMIVDKN 180

Query: 181 KDISRLQARVVELEKQRHDLLQENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEM 240
           KDISRLQA+VV LE+QR DLLQENKQLTENVADYQSK+  LERKISST+THSSDRVTKEM
Sbjct: 181 KDISRLQAQVVALEEQRRDLLQENKQLTENVADYQSKITILERKISSTHTHSSDRVTKEM 240

Query: 241 LSSQVDAACILVDKLITENAELIGKVNELFVELQRVTKTELSPGVEPDQTAEA-TDTATF 300
           LSSQVDAA ILVDKLITENAELIGKVNEL+VELQRVTK E++ G+EPDQ  EA T+TATF
Sbjct: 241 LSSQVDAARILVDKLITENAELIGKVNELYVELQRVTKAEVNSGMEPDQMVEATTNTATF 300

Query: 301 NDPKPPLILNSITSSKSLDALKSVPIHNHSISNDFVELDNDFLAPKSS--MPLAAGEIEQ 360
           N+P+PPLI N +TSSKSLDAL+SVPIHNHS+  + V++DND L   +S  +P+  GEIEQ
Sbjct: 301 NEPEPPLIHNMVTSSKSLDALESVPIHNHSVGYNIVDMDNDLLLSPTSLILPMEEGEIEQ 360

Query: 361 IPLHEIEDRNRDRELPATESDEKDVLLSDAPLIGAPYRLMSFMAKYVSGADLVGKS 372
           IP  E EDRNR+REL   ESDE+DVLLSDAPLIGAPYRL+SFMAKYVSGADLVGKS
Sbjct: 361 IPPQENEDRNRNRELSGAESDEEDVLLSDAPLIGAPYRLISFMAKYVSGADLVGKS 416

BLAST of HG10020477 vs. ExPASy TrEMBL
Match: A0A6J1I8P0 (UPF0430 protein CG31712 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111470185 PE=4 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 6.4e-141
Identity = 289/394 (73.35%), Postives = 319/394 (80.96%), Query Frame = 0

Query: 1   MENEKKKRKNKKKKNKPIRTSEDEMIVSESTSVDDTHMRNGQNDQNPISDTVDLSYQHSS 60
           MENEKKKRKNKKKKNK IRTSED+ I SESTSVDDTH  NGQNDQNPIS TVD SYQHS 
Sbjct: 1   MENEKKKRKNKKKKNKQIRTSEDDTIASESTSVDDTHTTNGQNDQNPISGTVDPSYQHSR 60

Query: 61  GTKD---------------------AKLEDTIKHLHEENNVHIQKMADLELKLVECEGEK 120
            TKD                     AKLEDTIK LHEENN+H+QK+ADLELKLVE EGEK
Sbjct: 61  ETKDAVSEETNEHLRKESHLLAHDKAKLEDTIKRLHEENNLHMQKLADLELKLVEFEGEK 120

Query: 121 HSWLQKEAALVDKITSLQEDKTALDLEGKSSRETIIDKNKDISRLQARVVELEKQRHDLL 180
           HSWL+KEA L+  I  L+ DK +L     SS E I+DKNKDISRLQA+VVELE+QR DLL
Sbjct: 121 HSWLRKEARLLHTIEQLERDKASLIFNENSSTEMIVDKNKDISRLQAQVVELEEQRRDLL 180

Query: 181 QENKQLTENVADYQSKLLNLERKISSTYTHSSDRVTKEMLSSQVDAACILVDKLITENAE 240
           QENKQLTENVADY+SK+  LERKISST+TH SDRVTKEMLSSQVDAA ILVDKLITENAE
Sbjct: 181 QENKQLTENVADYRSKITILERKISSTHTHFSDRVTKEMLSSQVDAARILVDKLITENAE 240

Query: 241 LIGKVNELFVELQRVTKTELSPGVEPDQTAE-ATDTATFNDPKPPLILNSITSSKSLDAL 300
           LIGKVN L+VELQRVTK E++ G+EPDQ  E ATDTATFNDPKPPLI N +TSSKSLDAL
Sbjct: 241 LIGKVNALYVELQRVTKAEVNSGMEPDQMVESATDTATFNDPKPPLIHNMVTSSKSLDAL 300

Query: 301 KSVPIHNHSISNDFVELDNDFLAPKSS--MPLAAGEIEQIPLHEIEDRNRDRELPATESD 360
           +SVPIHNHS+ ++ V++DND L   +S  +P+  GEIEQIP  E EDRNR+REL   ESD
Sbjct: 301 ESVPIHNHSVGDNVVDMDNDLLLSPTSLILPMEEGEIEQIPPQENEDRNRNRELSGAESD 360

Query: 361 EKDVLLSDAPLIGAPYRLMSFMAKYVSGADLVGK 371
           EKDVLLSDAPLIGAPYRL+SFMAKYVSGADLVGK
Sbjct: 361 EKDVLLSDAPLIGAPYRLISFMAKYVSGADLVGK 394

BLAST of HG10020477 vs. TAIR 10
Match: AT2G38580.1 (Mitochondrial ATP synthase D chain-related protein )

HSP 1 Score: 147.1 bits (370), Expect = 2.7e-35
Identity = 109/314 (34.71%), Postives = 176/314 (56.05%), Query Frame = 0

Query: 64  DAKLEDTIKHLHEENNVHIQKMADLELKLVECEGEKHSWLQKEAALVDKITSLQEDKTAL 123
           + KLE+ +     +N++ +++M+  E ++ +   E+ ++ QKEA+L  K+  LQ D+ +L
Sbjct: 207 EEKLEERLVQYKNKNDMLLREMSSTEAQMRQLLDERSTFTQKEASLEKKVQQLQHDEESL 266

Query: 124 DLEGKSSRETIIDKNKDISRLQARVVELEKQRHDLLQENKQLTENVADYQSKLLNLERKI 183
             E KSSRE I   N +I+RL+A+V ELEK + +LL++N+ L E +++ Q +  N     
Sbjct: 267 VAEEKSSREMISSLNNEIARLRAQVTELEKSKSNLLEQNQSLKETISNLQVQHEN----- 326

Query: 184 SSTYTHSSDRVTKEMLSSQVDAACILVDKLITENAELIGKVNELFVELQRVTKTELSPGV 243
              +  ++   ++E L+SQ++AAC LV+KLITENA+L+ KVNEL ++L +      S   
Sbjct: 327 ---HDSNAKGASEEELNSQIEAACTLVEKLITENADLVEKVNELCIKLNQ------SQHA 386

Query: 244 EPDQTAEATDTATFNDPKPPLILNSITSSKSLDALKSVPIHNHSISNDFVELDN--DFLA 303
            P+  A                   I   KS ++L+ +PIH     ++ + +DN  D   
Sbjct: 387 SPESLA-------------------IEVEKS-ESLEEIPIH-----DELIRIDNSRDMDT 446

Query: 304 PKSSMPLAAGEIEQ-IPLHEIEDRNRDRELPATESDEKD----VLLSDAPLIGAPYRLMS 363
                  + GEIE+ +PL    +   D E     + E +    V L+DAPLIGAP+RL+S
Sbjct: 447 ASIKRNFSEGEIEETVPLSLNANGEVDVESQVAVAGEDEINAGVPLADAPLIGAPFRLVS 481

Query: 364 FMAKYVSGADLVGK 371
           F+A+YVSGADL  K
Sbjct: 507 FVARYVSGADLAAK 481

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038906167.15.9e-16583.97myosin-6 [Benincasa hispida] >XP_038906168.1 myosin-6 [Benincasa hispida] >XP_03... [more]
XP_008442312.17.0e-15879.64PREDICTED: uncharacterized protein LOC103486222 [Cucumis melo] >XP_008442313.1 P... [more]
XP_004137671.13.3e-15578.63uncharacterized protein LOC101221440 [Cucumis sativus] >KGN58680.1 hypothetical ... [more]
XP_023540633.15.8e-14472.12uncharacterized protein LOC111800938 [Cucurbita pepo subsp. pepo] >XP_023540634.... [more]
XP_022971477.14.9e-14372.05UPF0430 protein CG31712 isoform X1 [Cucurbita maxima] >XP_022971478.1 UPF0430 pr... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A1S3B4X83.4e-15879.64uncharacterized protein LOC103486222 OS=Cucumis melo OX=3656 GN=LOC103486222 PE=... [more]
A0A0A0L9P21.6e-15578.63Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G727980 PE=4 SV=1[more]
A0A6J1I2202.4e-14372.05UPF0430 protein CG31712 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470185 P... [more]
A0A6J1FUD84.0e-14371.88uncharacterized protein LOC111446955 OS=Cucurbita moschata OX=3662 GN=LOC1114469... [more]
A0A6J1I8P06.4e-14173.35UPF0430 protein CG31712 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111470185 P... [more]
Match NameE-valueIdentityDescription
AT2G38580.12.7e-3534.71Mitochondrial ATP synthase D chain-related protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 134..182
NoneNo IPR availableCOILSCoilCoilcoord: 71..91
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 247..262
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..52
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 24..52
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 240..262

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10020477.1HG10020477.1mRNA