Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAAGAAGGTGGACGTGCCCATGTTGGGTTTGTTTCTTGTTGTTCTCCTGGCAGCAATGACATTTGAGCCCATCTCATCTTTGCCATCTACTATTCCTGCATTCCTTTGGTCCCCTCACCATCGCCACAGGTATGCTGCTTGATTGTTTGTGTTTCTTTTATGTGATATGTGCCATTTTTTGTGTACCAACTCATTTCAAATGAACAATAAACACATCCCAACATATCTACTCAGGATTGTTGGGATCTTCCTTTTCATTGTTGGATCCATTGTCTTTTAACTTTTTGGCCTTTCCTTAAATTACTTTTTGGAAAATTTTCGTCTTGTTATTTAAACTGTTCTTCCTGGATTTGCCAATTGAAGGTGTGGGATTTTCCGCTTGAGTGATATGAGGGTTCATATAAGAGCTCAATAGCCTGTAAAATTAATGTTCACATAATACTACAGGCAGTTTGGCACAACTTGCATAGTATAAACTCTACATAGATGAAGCTGTAGTGAATTTTTTGCCTTTCTCCCTGCTTAGCTGATTAAGATTTATTGTTATTATTTTAAATTTTAGATTCCGCATTGTTTTTTTCTTTCAGTCTATATGGTTAGTTTCCCCATTATCATCCTTCATTTTATTCTATAGGAATGAGTTCTCCTATTTGGTGGGGGGGGGGGGGGGAGGGACCACTTATTGAACAACTGTTCTAAACGTGTTACAATAAACTCTATATCATTGCTCTTTGTTCGATTTAAAAATTGTACTGTATGGCTGCAGGTTTTCTAACAACATTCTAGAAAAATATGTTGATTATCAGACCATTTCCCCAGAGGAGCTGGCAAAGTCTGTTCTGTATGAAGGGGGCTGGTCAAAAATTCTGGTGTGTTTGTAACATTTCAAATCTTGTTCTAGAGTTCTCTGTCATTAGTTTTTAAGAATGTCCTCCATGCGCAATTTTTATTTTTTTAAAAAAGCTACATGTTTTGCTTTTTGGTATGTCCTCTTTTTTAGTTTGTCTCCCTTTCTTTTTGGGCTTGGTTTTGGACACCCTTTTATTCTTTCATTTTGTTTAATCTCAATGTAAATTTGGTTATTCGTAAAAAAATTCTTTTCTCTTGATTTCTTTTCTCATATTTATATATTATTAACATTATGAATTGTTTAATTCTTTAGTGCACGGGAAAGGAAGTAGCGCAGCATGTGGATCTTGCAATAATCTTTGTTGGTTCAGAGGTAAGAATATATGGAGTTTGAGTACCACTTTTCGCTGTGTAATTCTTTCAGTTGTTGTGAATTTGTTTGGTCTCTCACATGGTTTATTTCATACTTCATTGTGTTTTTCCTTATAGCCAAAAATCATCCCTTGTTTATGAAGTTTGGATTTGGGATTATTGTTTGAGGTTAGAACATTTAAAAGACCTAGTTTTGTAATATACTAGATGAGTTATCCTAAAGATGCGTTTGATGATTCCAGCATTTTTATAAAATAGAAAACCAACTTTTAACGTGTTCACCGATTGACGTTGACCCAAAAGCTTAAGGTGATAGGTTATGGTAAATTTAATTATATCAACACTTTAACACTCTCCCTCACTCGTGGGCTTTGAAATTTGTACAAAACCCAACAAGTGGAAATCAATATTAGTTGGGGAGCAAATGACATTACGAGGGTAGGGGTTTGAACACAAGACCTCCCCGCTGCAGAAGAAACCAAAATAACCCAGAGTCCTTCTCCCAAGAGTGGAACAATCTCTTCCAACAAGACGAAGAGAGATGCAACATCCGTAGCTTTCCCATTGGTCAAAGGGCGACGAAACCCAAATGAAAGAGAAACTGAGCTCCTTGATCAGATCAAAAAATCAGAAACCACCCTATTCTTAAAGGAAGACAAATAATAGATATAAAGAAACACATAGAAAAGGGGATTAGATCTCACCCAATGATCTTCCCACATTTCCTTCCCTTCCACCACTACACAATGAACTAAGTGAGAGAAAGAAGGGAACTCA
AAAGAAATATCTTTCCTTAAATTTCGGTAAGCGCTTTATCCCCACCCGACAACCATTCAAAAGGATGAGGACCAAATTTACTCACGATAATCCTGTTCCAAAGAGAGTTGGGTTCAAGGGGAAAATGCCACAACTATTTGGCTAACAAAGCTTTGTGGCTTTTTGGAATACATTGAAGTTATTAGTCTTTCCCAAGGCAACCAACCAAGTAAAGAACTAGGTTTTCTTATAGATTTTAACCTCCACAGGGTAGAGAAGAGAGAGCATACGAAGGAACAAGATGAGAAGGGGACTAAGAAGGAAACTGAATGAATAAACAAGAATCTACAAGTATATGGCAAACAATATTTTATGTTTTCCTTTGTTATGTTGTTTTGGTTATGTTTTTGGAGCTTTTATTATGTTTTTGGAGCTTTTATGTCTCTAAGGTTTGCTTTTTCCTTTTTGCCTCTAATTTCTTCTATCAATCATTTTTAACTTAAGAGGAGGTTTACATCTTACCTTCAGAAGGGAAGGACTGGAGCTAAAATTTCTCCTGTTTCTTTTAGAAAAAATTCTTATATACACACCCCTCTTTGTCAATTGCCAAAGACTATTATAATATTCTAAGACAGTATCGGGTATACACCATGTTAATTGTTATGTGTTATTGAAAAATGAATGCAAAGCTTGAAAATAGTAACTATGAATCTTTGAAATCCACGTTCTGCTTGCTTCTCTCTCTCTCTCTTTTTAATAAATAATAATAATTTATTTATTTATTCATACTCAAATTATGATGCTAGTTCAAAACTTTCTTTTATGTCAATGTGCCACAAATAGTGTTTCTACTAAAGGATAATAATTTGGTGCTTTGTTATGTTGGTTTCTTGATCAACATTATGTTCGTTCACGGTGTGACAAGTCTTTTGCTTTTTATAGTTGCAGTCAGATTTCATGTTGAGCAGGCACGTGGATCCAAATCTTATGGACTTACTTAAGGTTTGGAAAAGCAATTTGTTTTATAAATTCTAAAACTGCATATCTCGAGTTTCAACTCTAAATTTAGGATCAATTTTCACAGGTCTCTTTCTCAAGGTCTAACTTCTCTATGGCATTTCCCTATGTGGCTGCACCAGAAAGGGGTGCAATAGAAAAGTTATTGATCTCAGAGTTCAAAAAATCATGTGGGCATGACCTCAGAATCAGCACTAGCGCTTTCCAGGAGTTGTCCTCTGTTGAGGATGAATCATTCCAGAAGCTTCAACTGCTGCCACATTCGATTAATGTAAATCTGTCTTTATATGACTAATGTCATCTGTTTTCTTATCAGAACATTAATGATATCCTTATTTATGTTTTTTTCATTAAACGAGTAGGATTATATGGTTTCAAGAATGGAAAAGAAGCCAAAGGGAGAGACAGATTTGGTCGTTTTCTCTCATGGAGATTTCAGTTCTCCTCAAGAAGGAAATCTATGGACTTCTGAAAGTATGTTGACTCATTTTATTTTGTCTTCAGATGGTCATTAGGACAAGCAACATGATTGATGTTATGGTATAAAATTCTTATGTTGAATTTTTGAAATTATTATAGGCAAAACTTTGTTGGAGATCATGACTTCTGCGGAGCATGTTGGGGCAAAATATGAAATTCTCTATATATCAGATCCATTTAGGTCCATTCGCCATTCTTATGTGGAGCTGGGAAGATTTATGGCTGAAGGTTCCTCTGGAAATGGATCAGCTAAATCAGAAAATCTTTGTGATGAAGTCTGCCAAATTAAATCATCTCTTCTCGAGGGCCTCTTTGTTGTGAGTCACTAA
mRNA sequence
ATGAAGAAGGTGGACGTGCCCATGTTGGGTTTGTTTCTTGTTGTTCTCCTGGCAGCAATGACATTTGAGCCCATCTCATCTTTGCCATCTACTATTCCTGCATTCCTTTGGTCCCCTCACCATCGCCACAGGTTTTCTAACAACATTCTAGAAAAATATGTTGATTATCAGACCATTTCCCCAGAGGAGCTGGCAAAGTCTGTTCTGTATGAAGGGGGCTGGTCAAAAATTCTGTGCACGGGAAAGGAAGTAGCGCAGCATGTGGATCTTGCAATAATCTTTGTTGGTTCAGAGTCAGATTTCATGTTGAGCAGGCACGTGGATCCAAATCTTATGGACTTACTTAAGGTCTCTTTCTCAAGGTCTAACTTCTCTATGGCATTTCCCTATGTGGCTGCACCAGAAAGGGGTGCAATAGAAAAGTTATTGATCTCAGAGTTCAAAAAATCATGTGGGCATGACCTCAGAATCAGCACTAGCGCTTTCCAGGAGTTGTCCTCTGTTGAGGATGAATCATTCCAGAAGCTTCAACTGCTGCCACATTCGATTAATGATTATATGGTTTCAAGAATGGAAAAGAAGCCAAAGGGAGAGACAGATTTGGTCGTTTTCTCTCATGGAGATTTCAGTTCTCCTCAAGAAGGAAATCTATGGACTTCTGAAAGCAAAACTTTGTTGGAGATCATGACTTCTGCGGAGCATGTTGGGGCAAAATATGAAATTCTCTATATATCAGATCCATTTAGGTCCATTCGCCATTCTTATGTGGAGCTGGGAAGATTTATGGCTGAAGGTTCCTCTGGAAATGGATCAGCTAAATCAGAAAATCTTTGTGATGAAGTCTGCCAAATTAAATCATCTCTTCTCGAGGGCCTCTTTGTTGTGAGTCACTAA
Coding sequence (CDS)
ATGAAGAAGGTGGACGTGCCCATGTTGGGTTTGTTTCTTGTTGTTCTCCTGGCAGCAATGACATTTGAGCCCATCTCATCTTTGCCATCTACTATTCCTGCATTCCTTTGGTCCCCTCACCATCGCCACAGGTTTTCTAACAACATTCTAGAAAAATATGTTGATTATCAGACCATTTCCCCAGAGGAGCTGGCAAAGTCTGTTCTGTATGAAGGGGGCTGGTCAAAAATTCTGTGCACGGGAAAGGAAGTAGCGCAGCATGTGGATCTTGCAATAATCTTTGTTGGTTCAGAGTCAGATTTCATGTTGAGCAGGCACGTGGATCCAAATCTTATGGACTTACTTAAGGTCTCTTTCTCAAGGTCTAACTTCTCTATGGCATTTCCCTATGTGGCTGCACCAGAAAGGGGTGCAATAGAAAAGTTATTGATCTCAGAGTTCAAAAAATCATGTGGGCATGACCTCAGAATCAGCACTAGCGCTTTCCAGGAGTTGTCCTCTGTTGAGGATGAATCATTCCAGAAGCTTCAACTGCTGCCACATTCGATTAATGATTATATGGTTTCAAGAATGGAAAAGAAGCCAAAGGGAGAGACAGATTTGGTCGTTTTCTCTCATGGAGATTTCAGTTCTCCTCAAGAAGGAAATCTATGGACTTCTGAAAGCAAAACTTTGTTGGAGATCATGACTTCTGCGGAGCATGTTGGGGCAAAATATGAAATTCTCTATATATCAGATCCATTTAGGTCCATTCGCCATTCTTATGTGGAGCTGGGAAGATTTATGGCTGAAGGTTCCTCTGGAAATGGATCAGCTAAATCAGAAAATCTTTGTGATGAAGTCTGCCAAATTAAATCATCTCTTCTCGAGGGCCTCTTTGTTGTGAGTCACTAA
Protein sequence
MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTISPEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSESDFMLSRHVDPNLMDLLKVSFSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQLLPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAKYEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFVVSH
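The protein sequence above should be the standard-code translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code; the names `translate`, `cds_start`, and `cds_end` are illustrative and not part of the source page. For brevity it translates only the first 30 and last 15 nucleotides of the CDS, copied from the listing above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown above.
cds_start = "ATGAAGAAGGTGGACGTGCCCATGTTGGGT"
cds_end = "GTTGTGAGTCACTAA"

print(translate(cds_start))  # MKKVDVPMLG
print(translate(cds_end))    # VVSH
```

Both results agree with the start (`MKKVDVPMLG`) and end (`VVSH`) of the protein sequence listed above.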
Homology
BLAST of HG10004066 vs. NCBI nr
Match:
XP_038885229.1 (uncharacterized protein LOC120075692 [Benincasa hispida] >XP_038885230.1 uncharacterized protein LOC120075692 [Benincasa hispida] >XP_038885231.1 uncharacterized protein LOC120075692 [Benincasa hispida] >XP_038885232.1 uncharacterized protein LOC120075692 [Benincasa hispida])
HSP 1 Score: 540.0 bits (1390), Expect = 1.3e-149
Identity = 272/296 (91.89%), Positives = 282/296 (95.27%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MKKVDVPMLGL LVV LAA TFEPISSLPST+PAFLWSPHHRHRFSNNI +KYVDYQTIS
Sbjct: 1 MKKVDVPMLGLSLVVFLAAATFEPISSLPSTVPAFLWSPHHRHRFSNNIEDKYVDYQTIS 60
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE--SDFMLSRHVDPNLMDLLKVS 120
P+ELAKSVLYEGGWSKILC GKEV QHVDLAIIF+GSE SDFMLSR VDPNLMDLLKVS
Sbjct: 61 PQELAKSVLYEGGWSKILCMGKEVEQHVDLAIIFIGSELQSDFMLSRQVDPNLMDLLKVS 120
Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRIS SAFQELSSVEDESFQKL +
Sbjct: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISNSAFQELSSVEDESFQKLPM 180
Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
L HSINDYMVSRMEKKP+GET+LVVFSHGDFSSP+EGN WTSESKTLLEIMTSAEHVGAK
Sbjct: 181 LAHSINDYMVSRMEKKPEGETELVVFSHGDFSSPKEGNPWTSESKTLLEIMTSAEHVGAK 240
Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
YEILYISDPFRSIRHSYVELGRFMAEGS+GNGSAKSE+ CDEVCQIKSSLLEGLFV
Sbjct: 241 YEILYISDPFRSIRHSYVELGRFMAEGSAGNGSAKSEDFCDEVCQIKSSLLEGLFV 296
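The percentages in the HSP header are the match counts divided by the alignment length. A small sketch of that arithmetic, using the figures reported for the hit above; the helper name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage string with two decimals."""
    return f"{100 * matches / aligned_length:.2f}"


# Figures from the HSP header of the alignment above.
print(percent(272, 296))  # identity  -> 91.89
print(percent(282, 296))  # positives -> 95.27
```

The alignment length (296) includes the two gap columns shown as `--` in the query row, which is why it can differ between hits against the same query.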
BLAST of HG10004066 vs. NCBI nr
Match:
XP_008456729.1 (PREDICTED: uncharacterized protein LOC103496586 isoform X2 [Cucumis melo])
HSP 1 Score: 527.3 bits (1357), Expect = 8.4e-146
Identity = 264/294 (89.80%), Positives = 278/294 (94.56%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MK+ VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 1 MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 60
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSESDFMLSRHVDPNLMDLLKVSFS 120
P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS+SDF SRHVDPNLM+LLKVSFS
Sbjct: 61 PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKSDFTSSRHVDPNLMNLLKVSFS 120
Query: 121 RSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQLLP 180
RSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL LLP
Sbjct: 121 RSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPLLP 180
Query: 181 HSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAKYE 240
HSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAKYE
Sbjct: 181 HSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAKYE 240
Query: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 294
BLAST of HG10004066 vs. NCBI nr
Match:
XP_011656616.1 (uncharacterized protein LOC101220040 isoform X2 [Cucumis sativus])
HSP 1 Score: 526.6 bits (1355), Expect = 1.4e-145
Identity = 264/294 (89.80%), Positives = 277/294 (94.22%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MK+ DVP LGLFLVVLLAA TFEPISSLPSTIPAFLWSPH RH FSNNILEKYVDYQTIS
Sbjct: 1 MKQADVPTLGLFLVVLLAAATFEPISSLPSTIPAFLWSPHQRHGFSNNILEKYVDYQTIS 60
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSESDFMLSRHVDPNLMDLLKVSFS 120
P+ELAKSVL EGGWS++LCTGKEV QHVDLAIIFVGSESDF SRHVDPNLMDLLKVSFS
Sbjct: 61 PQELAKSVLNEGGWSQLLCTGKEVKQHVDLAIIFVGSESDFTSSRHVDPNLMDLLKVSFS 120
Query: 121 RSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQLLP 180
RSNFSMAFPYVAAPE+GA+EKLLISEFK+SCGHDLRIS+SAFQELSSVEDESFQKL LLP
Sbjct: 121 RSNFSMAFPYVAAPEKGAVEKLLISEFKQSCGHDLRISSSAFQELSSVEDESFQKLSLLP 180
Query: 181 HSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAKYE 240
HSINDYMVSRME K +GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAKYE
Sbjct: 181 HSINDYMVSRMENKREGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAKYE 240
Query: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
ILYISDPFRSIRHSYVELGRFMAEGSS N SAKSE+ CDEVCQIKSSLLEGLFV
Sbjct: 241 ILYISDPFRSIRHSYVELGRFMAEGSSVNESAKSESFCDEVCQIKSSLLEGLFV 294
BLAST of HG10004066 vs. NCBI nr
Match:
XP_008456728.1 (PREDICTED: uncharacterized protein LOC103496586 isoform X1 [Cucumis melo])
HSP 1 Score: 522.7 bits (1345), Expect = 2.1e-144
Identity = 264/296 (89.19%), Positives = 278/296 (93.92%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MK+ VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 1 MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 60
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGS--ESDFMLSRHVDPNLMDLLKVS 120
P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS +SDF SRHVDPNLM+LLKVS
Sbjct: 61 PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKLQSDFTSSRHVDPNLMNLLKVS 120
Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
FSRSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL L
Sbjct: 121 FSRSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPL 180
Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
LPHSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 181 LPHSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 240
Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 296
BLAST of HG10004066 vs. NCBI nr
Match:
TYK04490.1 (uncharacterized protein E5676_scaffold409G001040 [Cucumis melo var. makuwa])
HSP 1 Score: 522.7 bits (1345), Expect = 2.1e-144
Identity = 264/296 (89.19%), Positives = 278/296 (93.92%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MK+ VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 7 MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 66
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGS--ESDFMLSRHVDPNLMDLLKVS 120
P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS +SDF SRHVDPNLM+LLKVS
Sbjct: 67 PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKLQSDFTSSRHVDPNLMNLLKVS 126
Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
FSRSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL L
Sbjct: 127 FSRSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPL 186
Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
LPHSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 187 LPHSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 246
Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 247 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 302
BLAST of HG10004066 vs. ExPASy TrEMBL
Match:
A0A1S3C4L5 (uncharacterized protein LOC103496586 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496586 PE=4 SV=1)
HSP 1 Score: 527.3 bits (1357), Expect = 4.1e-146
Identity = 264/294 (89.80%), Positives = 278/294 (94.56%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MK+ VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 1 MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 60
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSESDFMLSRHVDPNLMDLLKVSFS 120
P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS+SDF SRHVDPNLM+LLKVSFS
Sbjct: 61 PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKSDFTSSRHVDPNLMNLLKVSFS 120
Query: 121 RSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQLLP 180
RSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL LLP
Sbjct: 121 RSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPLLP 180
Query: 181 HSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAKYE 240
HSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAKYE
Sbjct: 181 HSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAKYE 240
Query: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 241 ILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 294
BLAST of HG10004066 vs. ExPASy TrEMBL
Match:
A0A5D3BZE4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold409G001040 PE=4 SV=1)
HSP 1 Score: 522.7 bits (1345), Expect = 1.0e-144
Identity = 264/296 (89.19%), Positives = 278/296 (93.92%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MK+ VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 7 MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 66
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGS--ESDFMLSRHVDPNLMDLLKVS 120
P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS +SDF SRHVDPNLM+LLKVS
Sbjct: 67 PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKLQSDFTSSRHVDPNLMNLLKVS 126
Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
FSRSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL L
Sbjct: 127 FSRSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPL 186
Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
LPHSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 187 LPHSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 246
Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 247 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 302
BLAST of HG10004066 vs. ExPASy TrEMBL
Match:
A0A1S3C3X2 (uncharacterized protein LOC103496586 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496586 PE=4 SV=1)
HSP 1 Score: 522.7 bits (1345), Expect = 1.0e-144
Identity = 264/296 (89.19%), Positives = 278/296 (93.92%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MK+ VP L LFLVVLLAA TF+PISSLPSTIPAFLWSPHHRH FSNNILEKYVDYQTIS
Sbjct: 1 MKQAGVPTLDLFLVVLLAAATFKPISSLPSTIPAFLWSPHHRHGFSNNILEKYVDYQTIS 60
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGS--ESDFMLSRHVDPNLMDLLKVS 120
P+ELAKSVL EGGWS++LCTGKEV Q VDLAIIFVGS +SDF SRHVDPNLM+LLKVS
Sbjct: 61 PQELAKSVLNEGGWSQLLCTGKEVKQPVDLAIIFVGSKLQSDFTSSRHVDPNLMNLLKVS 120
Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
FSRSNFSMAFPYVAAPERGA+EKLLISEFK+SCGHDLRIS SAFQELSSVEDESFQKL L
Sbjct: 121 FSRSNFSMAFPYVAAPERGAVEKLLISEFKQSCGHDLRISNSAFQELSSVEDESFQKLPL 180
Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
LPHSINDYMVSRME KP+GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 181 LPHSINDYMVSRMENKPEGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 240
Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSE+LCDEVCQIKSSLLEGLFV
Sbjct: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSESLCDEVCQIKSSLLEGLFV 296
BLAST of HG10004066 vs. ExPASy TrEMBL
Match:
A0A0A0KB93 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G053330 PE=4 SV=1)
HSP 1 Score: 521.5 bits (1342), Expect = 2.2e-144
Identity = 264/296 (89.19%), Positives = 277/296 (93.58%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MK+ DVP LGLFLVVLLAA TFEPISSLPSTIPAFLWSPH RH FSNNILEKYVDYQTIS
Sbjct: 75 MKQADVPTLGLFLVVLLAAATFEPISSLPSTIPAFLWSPHQRHGFSNNILEKYVDYQTIS 134
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE--SDFMLSRHVDPNLMDLLKVS 120
P+ELAKSVL EGGWS++LCTGKEV QHVDLAIIFVGSE SDF SRHVDPNLMDLLKVS
Sbjct: 135 PQELAKSVLNEGGWSQLLCTGKEVKQHVDLAIIFVGSELQSDFTSSRHVDPNLMDLLKVS 194
Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
FSRSNFSMAFPYVAAPE+GA+EKLLISEFK+SCGHDLRIS+SAFQELSSVEDESFQKL L
Sbjct: 195 FSRSNFSMAFPYVAAPEKGAVEKLLISEFKQSCGHDLRISSSAFQELSSVEDESFQKLSL 254
Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
LPHSINDYMVSRME K +GET+LV+FSHGDFSSP+EGN WTSESKTL EIMTSAEHVGAK
Sbjct: 255 LPHSINDYMVSRMENKREGETELVIFSHGDFSSPEEGNPWTSESKTLSEIMTSAEHVGAK 314
Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
YEILYISDPFRSIRHSYVELGRFMAEGSS N SAKSE+ CDEVCQIKSSLLEGLFV
Sbjct: 315 YEILYISDPFRSIRHSYVELGRFMAEGSSVNESAKSESFCDEVCQIKSSLLEGLFV 370
BLAST of HG10004066 vs. ExPASy TrEMBL
Match:
A0A6J1H415 (uncharacterized protein LOC111459798 OS=Cucurbita moschata OX=3662 GN=LOC111459798 PE=4 SV=1)
HSP 1 Score: 475.7 bits (1223), Expect = 1.4e-130
Identity = 241/296 (81.42%), Positives = 258/296 (87.16%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MKKVDVP LGL LVVLL A TFEP SSLPST+PAFLWSPHH H FSNN++EK VDYQTIS
Sbjct: 1 MKKVDVPKLGLLLVVLLVAATFEPSSSLPSTVPAFLWSPHHHHGFSNNMIEKSVDYQTIS 60
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE--SDFMLSRHVDPNLMDLLKVS 120
P+ELAKSVLYEGGWSK LC+ K V QHVDLAI+FVGSE SDFMLSRHVDPNL DLLKVS
Sbjct: 61 PQELAKSVLYEGGWSKFLCSRKNVEQHVDLAIVFVGSELQSDFMLSRHVDPNLKDLLKVS 120
Query: 121 FSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQL 180
FSRSNFS+AFPYVAAPE G IE LISEFKKSCGHDL IS SAF EL S+EDESFQ+L
Sbjct: 121 FSRSNFSLAFPYVAAPESGTIENSLISEFKKSCGHDLGISNSAFHELCSIEDESFQRLP- 180
Query: 181 LPHSINDYMVSRMEKKPKGETDLVVFSHGDFSSPQEGNLWTSESKTLLEIMTSAEHVGAK 240
L HSINDYMVSRMEKKPKGETDLVVF HG +SP+E N W SESK LLEIMTSAEHVG+K
Sbjct: 181 LQHSINDYMVSRMEKKPKGETDLVVFCHGGSNSPKEVNSWASESKALLEIMTSAEHVGSK 240
Query: 241 YEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
YEILY+SDPFRSIRH+ ++L RF+AEGSSGNGS KS N CDEVCQIKSSLLEGLFV
Sbjct: 241 YEILYVSDPFRSIRHTSMKLERFLAEGSSGNGSTKSANFCDEVCQIKSSLLEGLFV 295
BLAST of HG10004066 vs. TAIR 10
Match:
AT3G13410.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endoplasmic reticulum; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G55546.1); Has 49 Blast hits to 49 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 48; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )
HSP 1 Score: 222.2 bits (565), Expect = 5.4e-58
Identity = 129/298 (43.29%), Positives = 189/298 (63.42%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
MKK+ + + L LV L A FE + P+T+PAFLWSPH + +N L++ V+YQ +S
Sbjct: 1 MKKIQIGAVAL-LVFLSVASLFEIGLASPNTVPAFLWSPHLQS--ANGELDEAVNYQVMS 60
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE---SDFMLSRHVDPNLMDLLKV 120
++L SV +GGWS LC+ K++ Q VD+A++F+G E SD R+ DP L++ L
Sbjct: 61 AKDLVGSVFTQGGWSNFLCSEKKLEQPVDVALVFIGRELLSSDVSSKRNSDPALVNTLNN 120
Query: 121 SFSRSNFSMAFPYVAAPERGAIEKLLISEFKKSCGHDLRISTSAFQELSSVEDESFQKLQ 180
F+ SNFS+AFPY+AAPE +E LL+S K++C +++ +S F + VED + QKL
Sbjct: 121 LFTASNFSLAFPYIAAPEEERMENLLLSGLKEACPNNVGVSNIVFSDSCFVEDGTIQKLS 180
Query: 181 LLPHSINDYMVSRMEKKPKGETDLVVF-SHGDFSSPQEGNLWTSESKTLLEIMTSAEHVG 240
L S D++++R E + +GETDLVV S G S+ Q G SE ++ LE+++S E G
Sbjct: 181 DL-QSFKDHLLARRETRKEGETDLVVLCSEGSESNSQAGQS-HSERESFLELVSSVEQSG 240
Query: 241 AKYEILYISDPFRSIRHSYVELGRFMAEGSSGNGSAKSENLCDEVCQIKSSLLEGLFV 295
+KY LY+SDP+ SY L RF+AE + GN + + CDE+C+ KSSLLEG+ V
Sbjct: 241 SKYTALYVSDPYWYT--SYKTLQRFLAETAKGNSTPEIATGCDELCKFKSSLLEGILV 291
BLAST of HG10004066 vs. TAIR 10
Match:
AT1G55546.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G13410.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 77.0 bits (188), Expect = 2.8e-14
Identity = 47/119 (39.50%), Positives = 72/119 (60.50%), Query Frame = 0
Query: 1 MKKVDVPMLGLFLVVLLAAMTFEPISSLPSTIPAFLWSPHHRHRFSNNILEKYVDYQTIS 60
+ K+ + LVVL A + + PST+PAFLWSPH +++N E V+YQ +S
Sbjct: 15 LMKLAINYYQYLLVVLEFASLVDFGLASPSTVPAFLWSPH--LQYANG--ETDVNYQVMS 74
Query: 61 PEELAKSVLYEGGWSKILCTGKEVAQHVDLAIIFVGSE---SDFMLSRHVDPNLMDLLK 117
++L SV GGWS LC+ K++ Q VD+A++F+G E SD +++ DP L++ LK
Sbjct: 75 AKDLVDSVFTLGGWSNFLCSEKKLQQPVDVALVFIGRELLSSDVSSNQNSDPVLVNTLK 129
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038885229.1 | 1.3e-149 | 91.89 | uncharacterized protein LOC120075692 [Benincasa hispida] >XP_038885230.1 unchara...
XP_008456729.1 | 8.4e-146 | 89.80 | PREDICTED: uncharacterized protein LOC103496586 isoform X2 [Cucumis melo]
XP_011656616.1 | 1.4e-145 | 89.80 | uncharacterized protein LOC101220040 isoform X2 [Cucumis sativus]
XP_008456728.1 | 2.1e-144 | 89.19 | PREDICTED: uncharacterized protein LOC103496586 isoform X1 [Cucumis melo]
TYK04490.1 | 2.1e-144 | 89.19 | uncharacterized protein E5676_scaffold409G001040 [Cucumis melo var. makuwa]
Match Name | E-value | Identity | Description
A0A1S3C4L5 | 4.1e-146 | 89.80 | uncharacterized protein LOC103496586 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3BZE4 | 1.0e-144 | 89.19 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3C3X2 | 1.0e-144 | 89.19 | uncharacterized protein LOC103496586 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0KB93 | 2.2e-144 | 89.19 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G053330 PE=4 SV=1
A0A6J1H415 | 1.4e-130 | 81.42 | uncharacterized protein LOC111459798 OS=Cucurbita moschata OX=3662 GN=LOC1114597...
Match Name | E-value | Identity | Description
AT3G13410.1 | 5.4e-58 | 43.29 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G55546.1 | 2.8e-14 | 39.50 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
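The summary tables above are pipe-delimited, so a row can be split into typed fields for filtering or sorting. This is a rough sketch only; the function `parse_hit_row` and the dictionary keys are hypothetical, and it assumes the first four fields are always match name, E-value, percent identity, and description.

```python
def parse_hit_row(row):
    """Split one pipe-delimited summary row into a dictionary of typed fields."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # e.g. "8.4e-146" parses as a normal float
        "identity": float(identity),  # percent identity
        "description": description,
    }


row = ("XP_008456729.1 | 8.4e-146 | 89.80 | PREDICTED: uncharacterized "
       "protein LOC103496586 isoform X2 [Cucumis melo]")
hit = parse_hit_row(row)
print(hit["name"], hit["evalue"], hit["identity"])
```

Any extra trailing fields in a row are ignored, since only the first four are unpacked.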