Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAGAGCAAAGCCCTAAACCAAAGCCATCGGAAATCCAGAACTTTCCGCCGCCGAAGTCCACCACCGGCCGGAGCATGTCGACGCCAAGGTCGGCGAGCGGCGGCGGGGGTAGCCGGAGGGAGACGCCGGATTTCCACAGCACAGCGGCGAAACTGGAGAGGGCGAAGGAGGTGTATAGAGCGTACGAAGGGCATGGAGAAAGGCCGACCATTGTGGAGATTGTGGGATGGTGTTTCTATGAACTTTGCTCGTTGTTTGTATTGACGTTGTTGATTCCGGTTGTTTTTCCGTTGATTATCAGCCAGATTAGTGGAACTCTGACGGAACCGCCTCAGGGATGGTTTAAGAGCTTTATGGGCTTCGATTGCCCTCCTAGAGAAATGCAACTGTAAGTCTCGTCTGTTTTCTTTCTCGTTTCAGTTGACAGTTGGTCGTGCCACGTGGATGAGCCTATAAATTTTTACATTTCAGTGCCATTTTTTAAAAGTTAAAATATCGATTTTCTTAAAAAAAAAGAAGTTAAATTTGTTTAATTTTAGTCCATGTACTATCAAATGTGTAATTTTAGTCCTTTTACTTTCAGTCAATCTTAAATTTAGTCCTTCCAAATTAACTTTTATTGAAATTGATTAAATAATCATAATAATTTTCATGCAAAGATACACACTATGTGAATATGTTTTCAAATTTTATAGTGAAAATACTAATGAAAAATAAAAAGTAATGAAAAAATTTACAATAAACTAGTTGTAGGGACTAAATTTAAGATTTATGAAAAATACAGGGACTAAAATTGGACAATTGAAAGTATGGGGACTAAAATTAAACAAATTTGAAAGTATGAAGACCAAAATGGTATTTTAACCAAAAAAATATCATTTTGAATAAAAAAAATCACTGATAGAAAAAATGTCAAACTATTTATAGAAAATAGCAAAAAAAAACTCATAAACATTGATGTACTTCTATTAGCATTTATCAGTGATAGACTTCTATCATTTCTATCACTAATATATTTAGCTTAAATTTCGATGAGTAAATTAATCTTAGAAAATTATATATTATTTTAGTCCATATACTTTAACTTTTGGTTCATTTTAGTCATAAACTTTCAAATTCTTCGTTTGGGTCTCTATACTTTCAAAGCATCCACTTTAGTCCATATATTTTAGAAAAGTAACCATCTTAGTCATTCATTTTCATTTTTAAGAGACAAAAAATCTCAATTTTTAATAGATACGTAGACTAAAACGAATCAAAATTAAAAGTAAAAGGACTAAACCGAATATTTGAAATTAAATATATCAAAATGAACAAATGTTTAAAATAAAGAGACCAAAATAAAAAATTTAAAAATACAAACTCTAAAATAAAATAAAAAATATAGAAACTAAAGTTGTTATTTAAAAATATATATTTATTGACAAAAAAAAAGTACACGTGGGTTGTCTAATTGGTTGGATGCAGGTACCAAAGCCTAACAGAACACACAATAAAGGTATCGAGCACCCAATTCTCACCATTAACATGGACCTCAATCTCATGGGCTTTGGGTTTGTTTCTGGCCGGCCCAATCCTCGCCTTCGCTTCCTTCCACCTCGATTACGGCTTCAATCAACACCTAATCACTCTCGCCGCCGTCGCTGCCGGAGCTCTGTCGTGTCTCCCGACAGGCCTCTTCAAAACGGTCAAGATTTTTCCTGTTTACATTATTTTAATCGTCATTGCTCACTCTGTGGCCTTCACCTCTCACACGCGCCACCTCGGTCTCATGCTCCGTGGCCTCACCGGACCCATCCTCCATGAGCCCAAATTCTCCCAAAGAAGAACCGGATCTGGTCTCATTTCCTCCTGCTCCTCCGCCGTCGGCGGTCTCGGTTCCGCCGCTCTCTCCGCCTTCACTTACCACATGCTTCGACGGTTGGTATTCTAACTAGCACTCTCTTCATATTAATCGAGTTAATGTGTAAATTTAGAATTAGAATTTAATTTATTGGTTTAATAAAATGGGTTGTTTTAAAAATTCGAAAAAATATCAAATTATTTAAAGAATAATCACTAATTTTTCTATATTGGTAAATAATTTGATATTTTTTTAATTTATAATAATTTTTCTAATAAAATCTTCATTAATAATATTAAAAACTAATTCATGAGATTTTATCGAATTATAGAAACTGAATTTTAATTATAATTTATATAAACTAAATTCTAAATTTGTAATTTAGTCTACATAATACTGTACTGTAAATTTTATAATATTTGGCTATCAAATAAGCTGTAAATTTTGAAATGTTTTATTTTTATTAGGTTAATTTATAAGTTTAGTCTTTGAACTATGAAATTTTGTCTATTTGATTCAAAAATTTTGAGAATAGAAGGGTTGAATAATAAATTGGTTCCTAAACTTTCAAAATTAAATATTTTAAAAGACTAAATTTGTATTTTATTTATTTATTTATTACATAGTTCAAAAATTTTGAGAATAGAAGAGTTGAATAATGAATTGGTTCCTAAACTTTTAAAATTAAACATTTTAAAGGACTAAATTTGTATTTTATTTATTTATTATTGGACTCTTAATTTATTTTAATGAAATACTTGCGAATTTTAGACTTGTTTTAAAACATACTTTTTTTCATTAAAATAAAAATTCAAGTGACAACAACTAAAAAAAATATAAGCACACGAGATATTGTTAATTTATTTCTATAATCAACTTTCATACATACTACTACTAGGTCAATGTTAAATATGTCTGTATGGTTTTTTTCAACACCCTAACTAATCATTTAGAGGATTGAACTTCTAACCTTTAAAAAAGAAAAACATGTATGAGTTAAACTTACTTTAACAATGTCACACCAATCTTAAAATATTTTGTTTTAATTTAAGACATAATTGACTAAATTATATAATATTCTTCAATTCCACTCTTATTTAAAAAAAAATACTCTTATTATTTACAATGTTGCAATGCTAGCACTGAATTTTCATAAATATTTTCAAATTATCTTTAGAGTAAAATTTATTTAAAATTTTAGATAAAATTGAGACTTAAAATGATGTGATAATAATCTAAAATAATAGGGTAATCTAATTTTTAGATTAGGTCACGATGACACCATTCCAATATCTTGTGAAAAGAAAAAGAATCTGTAAGACTTTATGCTACCATGATAGTTTTGATACGGTTACGAAAAATATTAATAAAATGCACTCGAGAGCTTAGGAATATCTTTTTTAAAAATATTTTTATTATTATTTTCTGTAACTGTGTATGCTTTTGGTTGTTAGTGACAAGCAAGTGCAAGAAGGAGACGACAATCACTTCCTTAACCTATGGATCGTCACGATCTTCGCCGGCCTTCAATGGCTTCTCGGAATTTTTCATGTCTTCCTCACAAATCGATCAATCTCTGTAACAATCCCTTCCGATTCAGAGCTTCACATTCTTTCAATTTTCAAAGATCCTCACGCAATCGGCACCATAATCTCCGGCGGATTCCTCTCTTCCTTCACCACAATCTCCATCTTCACCGCCGTTTTACTCTTCCTAATCGGTCAAATCTGTTTCAAACCAGTCTTGATTCTCTATCTATGGCTAATCTACTTCCTCGTCCCTCTAATTTCTCTTCCATTACTCCATCAATTTCAGATCCGAATCAAAGCGGATGCCTCAAAAATGCTGATCCTAGGGTTCATCTTGTCCGCCGCCACTTCCGCCACCTGTTTCTACTTCCACACCGGCGAGTGGCGGCGGCGCGTGGTGTTCGTCTTCGCCGTTCTTCAAGGCACGGCGGCGGCGCTTCTTCATGCGTACGGAAGAGTTTTGGTGCTTGATTGCTCGCCGGCGGGAAAGGAAGGTGCGATTTCGATGTGGTTTTCATGGATAAGAGCGATCGGTGGTTGCGTTGGATTTACGGTCGCGGCGGTGGTTCCGGCGAGGTTGCAGGTTTCTTCCGGTGTGGCATTTTGCTGCGCCGTTGTCGGAGGAATGGTGTTGATTTTTGGTAATGTTACTGATTACGGCGGCGCTGTGGCGGCGGGGCATGTGAAAGATGACAGTGAAAAGGGATCGCCGGTGATTGGATTGGAGTCGCGGAGTGAGAGTAAAGAGCTTGAGTCGCCGTGA
mRNA sequence
ATGGCAGAGCAAAGCCCTAAACCAAAGCCATCGGAAATCCAGAACTTTCCGCCGCCGAAGTCCACCACCGGCCGGAGCATGTCGACGCCAAGGTCGGCGAGCGGCGGCGGGGGTAGCCGGAGGGAGACGCCGGATTTCCACAGCACAGCGGCGAAACTGGAGAGGGCGAAGGAGGTGTATAGAGCGTACGAAGGGCATGGAGAAAGGCCGACCATTGTGGAGATTGTGGGATGGTGTTTCTATGAACTTTGCTCGTTGTTTGTATTGACGTTGTTGATTCCGGTTGTTTTTCCGTTGATTATCAGCCAGATTAGTGGAACTCTGACGGAACCGCCTCAGGGATGGTTTAAGAGCTTTATGGGCTTCGATTGCCCTCCTAGAGAAATGCAACTGTACCAAAGCCTAACAGAACACACAATAAAGGTATCGAGCACCCAATTCTCACCATTAACATGGACCTCAATCTCATGGGCTTTGGGTTTGTTTCTGGCCGGCCCAATCCTCGCCTTCGCTTCCTTCCACCTCGATTACGGCTTCAATCAACACCTAATCACTCTCGCCGCCGTCGCTGCCGGAGCTCTGTCGTGTCTCCCGACAGGCCTCTTCAAAACGGTCAAGATTTTTCCTGTTTACATTATTTTAATCGTCATTGCTCACTCTGTGGCCTTCACCTCTCACACGCGCCACCTCGGTCTCATGCTCCGTGGCCTCACCGGACCCATCCTCCATGAGCCCAAATTCTCCCAAAGAAGAACCGGATCTGGTCTCATTTCCTCCTGCTCCTCCGCCGTCGGCGGTCTCGGTTCCGCCGCTCTCTCCGCCTTCACTTACCACATGCTTCGACGTGACAAGCAAGTGCAAGAAGGAGACGACAATCACTTCCTTAACCTATGGATCGTCACGATCTTCGCCGGCCTTCAATGGCTTCTCGGAATTTTTCATGTCTTCCTCACAAATCGATCAATCTCTGTAACAATCCCTTCCGATTCAGAGCTTCACATTCTTTCAATTTTCAAAGATCCTCACGCAATCGGCACCATAATCTCCGGCGGATTCCTCTCTTCCTTCACCACAATCTCCATCTTCACCGCCGTTTTACTCTTCCTAATCGGTCAAATCTGTTTCAAACCAGTCTTGATTCTCTATCTATGGCTAATCTACTTCCTCGTCCCTCTAATTTCTCTTCCATTACTCCATCAATTTCAGATCCGAATCAAAGCGGATGCCTCAAAAATGCTGATCCTAGGGTTCATCTTGTCCGCCGCCACTTCCGCCACCTGTTTCTACTTCCACACCGGCGAGTGGCGGCGGCGCGTGGTGTTCGTCTTCGCCGTTCTTCAAGGCACGGCGGCGGCGCTTCTTCATGCGTACGGAAGAGTTTTGGTGCTTGATTGCTCGCCGGCGGGAAAGGAAGGTGCGATTTCGATGTGGTTTTCATGGATAAGAGCGATCGGTGGTTGCGTTGGATTTACGGTCGCGGCGGTGGTTCCGGCGAGGTTGCAGGTTTCTTCCGGTGTGGCATTTTGCTGCGCCGTTGTCGGAGGAATGGTGTTGATTTTTGGTAATGTTACTGATTACGGCGGCGCTGTGGCGGCGGGGCATGTGAAAGATGACAGTGAAAAGGGATCGCCGGTGATTGGATTGGAGTCGCGGAGTGAGAGTAAAGAGCTTGAGTCGCCGTGA
Coding sequence (CDS)
ATGGCAGAGCAAAGCCCTAAACCAAAGCCATCGGAAATCCAGAACTTTCCGCCGCCGAAGTCCACCACCGGCCGGAGCATGTCGACGCCAAGGTCGGCGAGCGGCGGCGGGGGTAGCCGGAGGGAGACGCCGGATTTCCACAGCACAGCGGCGAAACTGGAGAGGGCGAAGGAGGTGTATAGAGCGTACGAAGGGCATGGAGAAAGGCCGACCATTGTGGAGATTGTGGGATGGTGTTTCTATGAACTTTGCTCGTTGTTTGTATTGACGTTGTTGATTCCGGTTGTTTTTCCGTTGATTATCAGCCAGATTAGTGGAACTCTGACGGAACCGCCTCAGGGATGGTTTAAGAGCTTTATGGGCTTCGATTGCCCTCCTAGAGAAATGCAACTGTACCAAAGCCTAACAGAACACACAATAAAGGTATCGAGCACCCAATTCTCACCATTAACATGGACCTCAATCTCATGGGCTTTGGGTTTGTTTCTGGCCGGCCCAATCCTCGCCTTCGCTTCCTTCCACCTCGATTACGGCTTCAATCAACACCTAATCACTCTCGCCGCCGTCGCTGCCGGAGCTCTGTCGTGTCTCCCGACAGGCCTCTTCAAAACGGTCAAGATTTTTCCTGTTTACATTATTTTAATCGTCATTGCTCACTCTGTGGCCTTCACCTCTCACACGCGCCACCTCGGTCTCATGCTCCGTGGCCTCACCGGACCCATCCTCCATGAGCCCAAATTCTCCCAAAGAAGAACCGGATCTGGTCTCATTTCCTCCTGCTCCTCCGCCGTCGGCGGTCTCGGTTCCGCCGCTCTCTCCGCCTTCACTTACCACATGCTTCGACGTGACAAGCAAGTGCAAGAAGGAGACGACAATCACTTCCTTAACCTATGGATCGTCACGATCTTCGCCGGCCTTCAATGGCTTCTCGGAATTTTTCATGTCTTCCTCACAAATCGATCAATCTCTGTAACAATCCCTTCCGATTCAGAGCTTCACATTCTTTCAATTTTCAAAGATCCTCACGCAATCGGCACCATAATCTCCGGCGGATTCCTCTCTTCCTTCACCACAATCTCCATCTTCACCGCCGTTTTACTCTTCCTAATCGGTCAAATCTGTTTCAAACCAGTCTTGATTCTCTATCTATGGCTAATCTACTTCCTCGTCCCTCTAATTTCTCTTCCATTACTCCATCAATTTCAGATCCGAATCAAAGCGGATGCCTCAAAAATGCTGATCCTAGGGTTCATCTTGTCCGCCGCCACTTCCGCCACCTGTTTCTACTTCCACACCGGCGAGTGGCGGCGGCGCGTGGTGTTCGTCTTCGCCGTTCTTCAAGGCACGGCGGCGGCGCTTCTTCATGCGTACGGAAGAGTTTTGGTGCTTGATTGCTCGCCGGCGGGAAAGGAAGGTGCGATTTCGATGTGGTTTTCATGGATAAGAGCGATCGGTGGTTGCGTTGGATTTACGGTCGCGGCGGTGGTTCCGGCGAGGTTGCAGGTTTCTTCCGGTGTGGCATTTTGCTGCGCCGTTGTCGGAGGAATGGTGTTGATTTTTGGTAATGTTACTGATTACGGCGGCGCTGTGGCGGCGGGGCATGTGAAAGATGACAGTGAAAAGGGATCGCCGGTGATTGGATTGGAGTCGCGGAGTGAGAGTAAAGAGCTTGAGTCGCCGTGA
Protein sequence
MAEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSASGGGGSRRETPDFHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFMGFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGPILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIVTIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTISIFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILSAATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSWIRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSEKGSPVIGLESRSESKELESP
Homology
BLAST of HG10002973 vs. NCBI nr
Match:
XP_008448612.1 (PREDICTED: uncharacterized protein LOC103490734 [Cucumis melo] >KAA0053002.1 uncharacterized protein E6C27_scaffold344G001140 [Cucumis melo var. makuwa] >TYK11458.1 uncharacterized protein E5676_scaffold139G001150 [Cucumis melo var. makuwa])
HSP 1 Score: 960.3 bits (2481), Expect = 7.3e-276
Identity = 496/560 (88.57%), Postives = 528/560 (94.29%), Query Frame = 0
Query: 2 AEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSAS-GGGGSRRETPDFHSTAAKLERAKEVY 61
AEQSP+PK SEIQN PP KST+GRS+STPRSA+ GGGGSRRETPDFHSTAAKLERAKEVY
Sbjct: 3 AEQSPRPKQSEIQNLPPSKSTSGRSVSTPRSANGGGGGSRRETPDFHSTAAKLERAKEVY 62
Query: 62 RAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFM 121
+AYEGHGERPTIVEIVGWCFYELCS FVLTLLIPVVFPLIISQISGT T PPQGWFKSFM
Sbjct: 63 KAYEGHGERPTIVEIVGWCFYELCSFFVLTLLIPVVFPLIISQISGTPTAPPQGWFKSFM 122
Query: 122 GFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFN 181
GFDCP REMQLYQSLTE TIKVS+ +FSPL WTSISWA+GL LAGPILA ASFHLDYGFN
Sbjct: 123 GFDCPLREMQLYQSLTEQTIKVSNAEFSPLIWTSISWAMGLVLAGPILAAASFHLDYGFN 182
Query: 182 QHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGP 241
QHLITLAAVAAGAL+CLPTGLFKTVKIFP+YI+LIVIAHSVAFTSHTRHLGLMLRGLTGP
Sbjct: 183 QHLITLAAVAAGALTCLPTGLFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLTGP 242
Query: 242 ILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIV 301
I+H+ KFS RR GSG ISS S+AVGG+G++ +SAFTYHMLRRDKQVQEG DNHFLNLWIV
Sbjct: 243 IVHKAKFSLRRIGSGQISSWSAAVGGVGASVISAFTYHMLRRDKQVQEGVDNHFLNLWIV 302
Query: 302 TIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTIS 361
TIFAGL+WL+GIFHVFLTNRSIS++IPS+SELHILSIFK P+AI T+ISGGFLSSF TIS
Sbjct: 303 TIFAGLKWLIGIFHVFLTNRSISISIPSNSELHILSIFKYPYAIATVISGGFLSSFATIS 362
Query: 362 IFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 421
IFTAVLLFLIGQICFKPVLILYL LIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS
Sbjct: 363 IFTAVLLFLIGQICFKPVLILYLLLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 422
Query: 422 AATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSW 481
AATSATCFYFH WRR +VFVFAVLQGTAAA+LHAYGR LVLDCSPAGKE AISMWFSW
Sbjct: 423 AATSATCFYFHAYTWRRHLVFVFAVLQGTAAAVLHAYGRALVLDCSPAGKESAISMWFSW 482
Query: 482 IRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSE 541
+R+IGGCVGFTVAAVVPARLQVSSGV FCCAVVGG+VLIFGNVTDY GAVAAGHV+DDSE
Sbjct: 483 MRSIGGCVGFTVAAVVPARLQVSSGVVFCCAVVGGVVLIFGNVTDYDGAVAAGHVRDDSE 542
Query: 542 KGSPVIGLESRSESKELESP 561
KGSPVIGL+SRSESKELESP
Sbjct: 543 KGSPVIGLDSRSESKELESP 562
BLAST of HG10002973 vs. NCBI nr
Match:
XP_022965332.1 (uncharacterized protein LOC111465229 isoform X1 [Cucurbita maxima])
HSP 1 Score: 931.4 bits (2406), Expect = 3.6e-267
Identity = 473/560 (84.46%), Postives = 516/560 (92.14%), Query Frame = 0
Query: 1 MAEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSASGGGGSRRETPDFHSTAAKLERAKEVY 60
MAEQSP+PK SEIQN PPP+S +GR ST R SG GGSR++TPDFHS AAKLERAKEVY
Sbjct: 1 MAEQSPRPKSSEIQNAPPPRSGSGRITSTTR--SGSGGSRKDTPDFHSMAAKLERAKEVY 60
Query: 61 RAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFM 120
RAYEGHGE+P+I+E+ GWCFYELCSL VLT+LIPVVFPLIISQISG EPPQGWF+SFM
Sbjct: 61 RAYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAAMEPPQGWFQSFM 120
Query: 121 GFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFN 180
GFDCPP EMQLYQ LT+HTIK+S T+FSPL WTSISWALGL +AGPILAFASFHLDYGFN
Sbjct: 121 GFDCPPGEMQLYQILTDHTIKISGTRFSPLIWTSISWALGLIIAGPILAFASFHLDYGFN 180
Query: 181 QHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGP 240
QHLI + AVAAGALSCLPTG+F+TVKIFP+YI+LIVIAHSVAFTSHTRHLGLMLRGL GP
Sbjct: 181 QHLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGP 240
Query: 241 ILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIV 300
+ + KF+QRRTGSGLISSCS+AVGGLG+AA+SAFTYHMLRR++Q +EGDDNHFL+LWIV
Sbjct: 241 TVLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRRNRQEKEGDDNHFLSLWIV 300
Query: 301 TIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTIS 360
TIF GL+WLLGIFHVFLTNRS+SVTIPSDSELH+L+IFK PHAIGT+IS GFLSSFTTI+
Sbjct: 301 TIFGGLKWLLGIFHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIA 360
Query: 361 IFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 420
IF AV LFLIGQICFKPVLILYLWLIYFL+PLISLPLLHQFQIRIKADASKM ILGFILS
Sbjct: 361 IFIAVSLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILS 420
Query: 421 AATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSW 480
A TSA CFYFH WR VVFVFA LQGTAAALLH YGRVLVLDCSPAGKE AISMWFSW
Sbjct: 421 AVTSAICFYFHNDAWRLPVVFVFAALQGTAAALLHTYGRVLVLDCSPAGKEAAISMWFSW 480
Query: 481 IRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSE 540
+RAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGG+VLI+GN+TDYGGAV+AGHVK+DSE
Sbjct: 481 MRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGVVLIYGNITDYGGAVSAGHVKNDSE 540
Query: 541 KGSPVIGLESRSESKELESP 561
KGSPVIGLESRS SKELESP
Sbjct: 541 KGSPVIGLESRSVSKELESP 558
BLAST of HG10002973 vs. NCBI nr
Match:
XP_022965333.1 (uncharacterized protein LOC111465229 isoform X2 [Cucurbita maxima])
HSP 1 Score: 924.9 bits (2389), Expect = 3.4e-265
Identity = 473/560 (84.46%), Postives = 514/560 (91.79%), Query Frame = 0
Query: 1 MAEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSASGGGGSRRETPDFHSTAAKLERAKEVY 60
MAEQSP+PK SEIQN PPP+S +GR ST R SG GGSR++TPDFHS AAKLERAKEVY
Sbjct: 1 MAEQSPRPKSSEIQNAPPPRSGSGRITSTTR--SGSGGSRKDTPDFHSMAAKLERAKEVY 60
Query: 61 RAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFM 120
RAYEGHGE+P+I+E+ GWCFYELCSL VLT+LIPVVFPLIISQISG EPPQGWF+SFM
Sbjct: 61 RAYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAAMEPPQGWFQSFM 120
Query: 121 GFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFN 180
GFDCPP EMQLYQ LT+HTIK+S T+FSPL WTSISWALGL +AGPILAFASFHLDYGFN
Sbjct: 121 GFDCPPGEMQLYQILTDHTIKISGTRFSPLIWTSISWALGLIIAGPILAFASFHLDYGFN 180
Query: 181 QHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGP 240
QHLI + AVAAGALSCLPTG+F+TVKIFP+YI+LIVIAHSVAFTSHTRHLGLMLRGL GP
Sbjct: 181 QHLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGP 240
Query: 241 ILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIV 300
+ + KF+QRRTGSGLISSCS+AVGGLG+AA+SAFTYHMLRR Q +EGDDNHFL+LWIV
Sbjct: 241 TVLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRR--QEKEGDDNHFLSLWIV 300
Query: 301 TIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTIS 360
TIF GL+WLLGIFHVFLTNRS+SVTIPSDSELH+L+IFK PHAIGT+IS GFLSSFTTI+
Sbjct: 301 TIFGGLKWLLGIFHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIA 360
Query: 361 IFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 420
IF AV LFLIGQICFKPVLILYLWLIYFL+PLISLPLLHQFQIRIKADASKM ILGFILS
Sbjct: 361 IFIAVSLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILS 420
Query: 421 AATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSW 480
A TSA CFYFH WR VVFVFA LQGTAAALLH YGRVLVLDCSPAGKE AISMWFSW
Sbjct: 421 AVTSAICFYFHNDAWRLPVVFVFAALQGTAAALLHTYGRVLVLDCSPAGKEAAISMWFSW 480
Query: 481 IRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSE 540
+RAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGG+VLI+GN+TDYGGAV+AGHVK+DSE
Sbjct: 481 MRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGVVLIYGNITDYGGAVSAGHVKNDSE 540
Query: 541 KGSPVIGLESRSESKELESP 561
KGSPVIGLESRS SKELESP
Sbjct: 541 KGSPVIGLESRSVSKELESP 556
BLAST of HG10002973 vs. NCBI nr
Match:
KAG6577745.1 (hypothetical protein SDJN03_25319, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 924.5 bits (2388), Expect = 4.4e-265
Identity = 470/560 (83.93%), Postives = 513/560 (91.61%), Query Frame = 0
Query: 1 MAEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSASGGGGSRRETPDFHSTAAKLERAKEVY 60
MAEQSP+PK SEIQ+ PPP+S +GR ST R SG GGSR++TPDFHS AAKLERAKEVY
Sbjct: 1 MAEQSPRPKSSEIQSAPPPRSGSGRITSTTR--SGSGGSRKDTPDFHSMAAKLERAKEVY 60
Query: 61 RAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFM 120
RAYEGHGE+P+I+E+ GWCFYELCSL VLT+LIPVVFPLIISQISG TEPPQGWFKS M
Sbjct: 61 RAYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAATEPPQGWFKSVM 120
Query: 121 GFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFN 180
GFDC P EMQLYQ LTEHTIKVS T+FSPL WTSISWALGL LAGPIL FASFHLDYGFN
Sbjct: 121 GFDCAPGEMQLYQILTEHTIKVSGTRFSPLIWTSISWALGLILAGPILVFASFHLDYGFN 180
Query: 181 QHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGP 240
QHLI + AVAAGALSCLPTG+F+TVKIFP+YI+LIVIAHSVAFTSHTRHLGLMLRGL GP
Sbjct: 181 QHLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGP 240
Query: 241 ILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIV 300
+ + KF+QRRTGSGLISSCS+AVGGLG+AA+SAFTYHMLRR++Q +EGD+NHFL+LWIV
Sbjct: 241 TVLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRRNRQEKEGDENHFLSLWIV 300
Query: 301 TIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTIS 360
TIF GL+WLLG+ HVFLTNRS+SVTIPSDSELH+L+IFK PHAIGT+IS GFLSSFTTI+
Sbjct: 301 TIFGGLKWLLGVVHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIA 360
Query: 361 IFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 420
+F AV LFLIGQICFKP LILYLWLIYFL+PLISLPLLHQFQIRIKADASKM ILGFILS
Sbjct: 361 VFIAVSLFLIGQICFKPALILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILS 420
Query: 421 AATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSW 480
A TSA CFYFH WRR VVFVFA LQGTAAALLH+YGRVLVLDCSPAGKE AISMWFSW
Sbjct: 421 AVTSAICFYFHNDAWRRPVVFVFAALQGTAAALLHSYGRVLVLDCSPAGKEAAISMWFSW 480
Query: 481 IRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSE 540
+RAIGGCVGFTVAAVVP RLQVSSGVAFCCAVVGG+VLI+GNVTDYGGAVAAGHVK+DSE
Sbjct: 481 MRAIGGCVGFTVAAVVPTRLQVSSGVAFCCAVVGGVVLIYGNVTDYGGAVAAGHVKNDSE 540
Query: 541 KGSPVIGLESRSESKELESP 561
KGSPV+GLESRS SKELESP
Sbjct: 541 KGSPVVGLESRSVSKELESP 558
BLAST of HG10002973 vs. NCBI nr
Match:
KAG7015784.1 (hypothetical protein SDJN02_23422, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 923.7 bits (2386), Expect = 7.6e-265
Identity = 471/560 (84.11%), Postives = 513/560 (91.61%), Query Frame = 0
Query: 1 MAEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSASGGGGSRRETPDFHSTAAKLERAKEVY 60
MAEQSP+PK SEIQ+ PPP+S +GR ST R SG GGSR++TPDFHS AAKLERAKEVY
Sbjct: 1 MAEQSPRPKSSEIQSAPPPRSGSGRITSTTR--SGSGGSRKDTPDFHSMAAKLERAKEVY 60
Query: 61 RAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFM 120
RAYEGHGE+P+I+E+ GWCFYELCSL VLT+LIPVVFPLIISQISG TEPPQGWFKS M
Sbjct: 61 RAYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAATEPPQGWFKSVM 120
Query: 121 GFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFN 180
GFDC P EMQLYQ LTEHTIKVS T+FSPL WTSISWALGL LAGPIL FASFHLDYGFN
Sbjct: 121 GFDCVPGEMQLYQILTEHTIKVSGTRFSPLIWTSISWALGLILAGPILVFASFHLDYGFN 180
Query: 181 QHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGP 240
QHLI + AVAAGALSCLPTG+F+TVKIFP+YI+LIVIAHSVAFTSHTRHLGLMLRGL GP
Sbjct: 181 QHLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGP 240
Query: 241 ILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIV 300
+ + KF+QRRTGSGLISSCS+AVGGLG+AA+SAFTYHMLRR++Q +EGD+NHFL+LWIV
Sbjct: 241 TVLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRRNRQEKEGDENHFLSLWIV 300
Query: 301 TIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTIS 360
TIF GL+WLLGI HVFLTNRS+SVTIPSDSELH+L+IFK PHAIGT+IS GFLSSFTTI+
Sbjct: 301 TIFGGLKWLLGIVHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIA 360
Query: 361 IFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 420
+F AV LFLIGQICFKP LILYLWLIYFL+PLISLPLLHQFQIRIKADASKM ILGFILS
Sbjct: 361 VFIAVSLFLIGQICFKPALILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILS 420
Query: 421 AATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSW 480
A TSA CFYFH WRR VVFVFA LQGTAAALLH+YGRVLVLDCSPAGKE AISMWFSW
Sbjct: 421 AVTSAICFYFHNDAWRRPVVFVFASLQGTAAALLHSYGRVLVLDCSPAGKEAAISMWFSW 480
Query: 481 IRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSE 540
+RAIGGCVGFTVAAVVP RLQVSSGVAFCCAVVGG+VLI+GNVTDYGGAVAAGHVK+DSE
Sbjct: 481 MRAIGGCVGFTVAAVVPTRLQVSSGVAFCCAVVGGVVLIYGNVTDYGGAVAAGHVKNDSE 540
Query: 541 KGSPVIGLESRSESKELESP 561
KGSPV+GLESRS SKELESP
Sbjct: 541 KGSPVVGLESRSVSKELESP 558
BLAST of HG10002973 vs. ExPASy TrEMBL
Match:
A0A5D3CJT7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001150 PE=4 SV=1)
HSP 1 Score: 960.3 bits (2481), Expect = 3.5e-276
Identity = 496/560 (88.57%), Postives = 528/560 (94.29%), Query Frame = 0
Query: 2 AEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSAS-GGGGSRRETPDFHSTAAKLERAKEVY 61
AEQSP+PK SEIQN PP KST+GRS+STPRSA+ GGGGSRRETPDFHSTAAKLERAKEVY
Sbjct: 3 AEQSPRPKQSEIQNLPPSKSTSGRSVSTPRSANGGGGGSRRETPDFHSTAAKLERAKEVY 62
Query: 62 RAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFM 121
+AYEGHGERPTIVEIVGWCFYELCS FVLTLLIPVVFPLIISQISGT T PPQGWFKSFM
Sbjct: 63 KAYEGHGERPTIVEIVGWCFYELCSFFVLTLLIPVVFPLIISQISGTPTAPPQGWFKSFM 122
Query: 122 GFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFN 181
GFDCP REMQLYQSLTE TIKVS+ +FSPL WTSISWA+GL LAGPILA ASFHLDYGFN
Sbjct: 123 GFDCPLREMQLYQSLTEQTIKVSNAEFSPLIWTSISWAMGLVLAGPILAAASFHLDYGFN 182
Query: 182 QHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGP 241
QHLITLAAVAAGAL+CLPTGLFKTVKIFP+YI+LIVIAHSVAFTSHTRHLGLMLRGLTGP
Sbjct: 183 QHLITLAAVAAGALTCLPTGLFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLTGP 242
Query: 242 ILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIV 301
I+H+ KFS RR GSG ISS S+AVGG+G++ +SAFTYHMLRRDKQVQEG DNHFLNLWIV
Sbjct: 243 IVHKAKFSLRRIGSGQISSWSAAVGGVGASVISAFTYHMLRRDKQVQEGVDNHFLNLWIV 302
Query: 302 TIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTIS 361
TIFAGL+WL+GIFHVFLTNRSIS++IPS+SELHILSIFK P+AI T+ISGGFLSSF TIS
Sbjct: 303 TIFAGLKWLIGIFHVFLTNRSISISIPSNSELHILSIFKYPYAIATVISGGFLSSFATIS 362
Query: 362 IFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 421
IFTAVLLFLIGQICFKPVLILYL LIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS
Sbjct: 363 IFTAVLLFLIGQICFKPVLILYLLLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 422
Query: 422 AATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSW 481
AATSATCFYFH WRR +VFVFAVLQGTAAA+LHAYGR LVLDCSPAGKE AISMWFSW
Sbjct: 423 AATSATCFYFHAYTWRRHLVFVFAVLQGTAAAVLHAYGRALVLDCSPAGKESAISMWFSW 482
Query: 482 IRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSE 541
+R+IGGCVGFTVAAVVPARLQVSSGV FCCAVVGG+VLIFGNVTDY GAVAAGHV+DDSE
Sbjct: 483 MRSIGGCVGFTVAAVVPARLQVSSGVVFCCAVVGGVVLIFGNVTDYDGAVAAGHVRDDSE 542
Query: 542 KGSPVIGLESRSESKELESP 561
KGSPVIGL+SRSESKELESP
Sbjct: 543 KGSPVIGLDSRSESKELESP 562
BLAST of HG10002973 vs. ExPASy TrEMBL
Match:
A0A1S3BK45 (uncharacterized protein LOC103490734 OS=Cucumis melo OX=3656 GN=LOC103490734 PE=4 SV=1)
HSP 1 Score: 960.3 bits (2481), Expect = 3.5e-276
Identity = 496/560 (88.57%), Postives = 528/560 (94.29%), Query Frame = 0
Query: 2 AEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSAS-GGGGSRRETPDFHSTAAKLERAKEVY 61
AEQSP+PK SEIQN PP KST+GRS+STPRSA+ GGGGSRRETPDFHSTAAKLERAKEVY
Sbjct: 3 AEQSPRPKQSEIQNLPPSKSTSGRSVSTPRSANGGGGGSRRETPDFHSTAAKLERAKEVY 62
Query: 62 RAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFM 121
+AYEGHGERPTIVEIVGWCFYELCS FVLTLLIPVVFPLIISQISGT T PPQGWFKSFM
Sbjct: 63 KAYEGHGERPTIVEIVGWCFYELCSFFVLTLLIPVVFPLIISQISGTPTAPPQGWFKSFM 122
Query: 122 GFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFN 181
GFDCP REMQLYQSLTE TIKVS+ +FSPL WTSISWA+GL LAGPILA ASFHLDYGFN
Sbjct: 123 GFDCPLREMQLYQSLTEQTIKVSNAEFSPLIWTSISWAMGLVLAGPILAAASFHLDYGFN 182
Query: 182 QHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGP 241
QHLITLAAVAAGAL+CLPTGLFKTVKIFP+YI+LIVIAHSVAFTSHTRHLGLMLRGLTGP
Sbjct: 183 QHLITLAAVAAGALTCLPTGLFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLTGP 242
Query: 242 ILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIV 301
I+H+ KFS RR GSG ISS S+AVGG+G++ +SAFTYHMLRRDKQVQEG DNHFLNLWIV
Sbjct: 243 IVHKAKFSLRRIGSGQISSWSAAVGGVGASVISAFTYHMLRRDKQVQEGVDNHFLNLWIV 302
Query: 302 TIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTIS 361
TIFAGL+WL+GIFHVFLTNRSIS++IPS+SELHILSIFK P+AI T+ISGGFLSSF TIS
Sbjct: 303 TIFAGLKWLIGIFHVFLTNRSISISIPSNSELHILSIFKYPYAIATVISGGFLSSFATIS 362
Query: 362 IFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 421
IFTAVLLFLIGQICFKPVLILYL LIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS
Sbjct: 363 IFTAVLLFLIGQICFKPVLILYLLLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 422
Query: 422 AATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSW 481
AATSATCFYFH WRR +VFVFAVLQGTAAA+LHAYGR LVLDCSPAGKE AISMWFSW
Sbjct: 423 AATSATCFYFHAYTWRRHLVFVFAVLQGTAAAVLHAYGRALVLDCSPAGKESAISMWFSW 482
Query: 482 IRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSE 541
+R+IGGCVGFTVAAVVPARLQVSSGV FCCAVVGG+VLIFGNVTDY GAVAAGHV+DDSE
Sbjct: 483 MRSIGGCVGFTVAAVVPARLQVSSGVVFCCAVVGGVVLIFGNVTDYDGAVAAGHVRDDSE 542
Query: 542 KGSPVIGLESRSESKELESP 561
KGSPVIGL+SRSESKELESP
Sbjct: 543 KGSPVIGLDSRSESKELESP 562
BLAST of HG10002973 vs. ExPASy TrEMBL
Match:
A0A6J1HK19 (uncharacterized protein LOC111465229 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465229 PE=4 SV=1)
HSP 1 Score: 931.4 bits (2406), Expect = 1.8e-267
Identity = 473/560 (84.46%), Postives = 516/560 (92.14%), Query Frame = 0
Query: 1 MAEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSASGGGGSRRETPDFHSTAAKLERAKEVY 60
MAEQSP+PK SEIQN PPP+S +GR ST R SG GGSR++TPDFHS AAKLERAKEVY
Sbjct: 1 MAEQSPRPKSSEIQNAPPPRSGSGRITSTTR--SGSGGSRKDTPDFHSMAAKLERAKEVY 60
Query: 61 RAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFM 120
RAYEGHGE+P+I+E+ GWCFYELCSL VLT+LIPVVFPLIISQISG EPPQGWF+SFM
Sbjct: 61 RAYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAAMEPPQGWFQSFM 120
Query: 121 GFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFN 180
GFDCPP EMQLYQ LT+HTIK+S T+FSPL WTSISWALGL +AGPILAFASFHLDYGFN
Sbjct: 121 GFDCPPGEMQLYQILTDHTIKISGTRFSPLIWTSISWALGLIIAGPILAFASFHLDYGFN 180
Query: 181 QHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGP 240
QHLI + AVAAGALSCLPTG+F+TVKIFP+YI+LIVIAHSVAFTSHTRHLGLMLRGL GP
Sbjct: 181 QHLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGP 240
Query: 241 ILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIV 300
+ + KF+QRRTGSGLISSCS+AVGGLG+AA+SAFTYHMLRR++Q +EGDDNHFL+LWIV
Sbjct: 241 TVLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRRNRQEKEGDDNHFLSLWIV 300
Query: 301 TIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTIS 360
TIF GL+WLLGIFHVFLTNRS+SVTIPSDSELH+L+IFK PHAIGT+IS GFLSSFTTI+
Sbjct: 301 TIFGGLKWLLGIFHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIA 360
Query: 361 IFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 420
IF AV LFLIGQICFKPVLILYLWLIYFL+PLISLPLLHQFQIRIKADASKM ILGFILS
Sbjct: 361 IFIAVSLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILS 420
Query: 421 AATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSW 480
A TSA CFYFH WR VVFVFA LQGTAAALLH YGRVLVLDCSPAGKE AISMWFSW
Sbjct: 421 AVTSAICFYFHNDAWRLPVVFVFAALQGTAAALLHTYGRVLVLDCSPAGKEAAISMWFSW 480
Query: 481 IRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSE 540
+RAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGG+VLI+GN+TDYGGAV+AGHVK+DSE
Sbjct: 481 MRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGVVLIYGNITDYGGAVSAGHVKNDSE 540
Query: 541 KGSPVIGLESRSESKELESP 561
KGSPVIGLESRS SKELESP
Sbjct: 541 KGSPVIGLESRSVSKELESP 558
BLAST of HG10002973 vs. ExPASy TrEMBL
Match:
A0A6J1HNK9 (uncharacterized protein LOC111465229 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465229 PE=4 SV=1)
HSP 1 Score: 924.9 bits (2389), Expect = 1.6e-265
Identity = 473/560 (84.46%), Postives = 514/560 (91.79%), Query Frame = 0
Query: 1 MAEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSASGGGGSRRETPDFHSTAAKLERAKEVY 60
MAEQSP+PK SEIQN PPP+S +GR ST R SG GGSR++TPDFHS AAKLERAKEVY
Sbjct: 1 MAEQSPRPKSSEIQNAPPPRSGSGRITSTTR--SGSGGSRKDTPDFHSMAAKLERAKEVY 60
Query: 61 RAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKSFM 120
RAYEGHGE+P+I+E+ GWCFYELCSL VLT+LIPVVFPLIISQISG EPPQGWF+SFM
Sbjct: 61 RAYEGHGEKPSIMEMAGWCFYELCSLSVLTVLIPVVFPLIISQISGAAMEPPQGWFQSFM 120
Query: 121 GFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYGFN 180
GFDCPP EMQLYQ LT+HTIK+S T+FSPL WTSISWALGL +AGPILAFASFHLDYGFN
Sbjct: 121 GFDCPPGEMQLYQILTDHTIKISGTRFSPLIWTSISWALGLIIAGPILAFASFHLDYGFN 180
Query: 181 QHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLTGP 240
QHLI + AVAAGALSCLPTG+F+TVKIFP+YI+LIVIAHSVAFTSHTRHLGLMLRGL GP
Sbjct: 181 QHLIAVGAVAAGALSCLPTGVFRTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLVGP 240
Query: 241 ILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLWIV 300
+ + KF+QRRTGSGLISSCS+AVGGLG+AA+SAFTYHMLRR Q +EGDDNHFL+LWIV
Sbjct: 241 TVLKAKFAQRRTGSGLISSCSTAVGGLGAAAISAFTYHMLRR--QEKEGDDNHFLSLWIV 300
Query: 301 TIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTTIS 360
TIF GL+WLLGIFHVFLTNRS+SVTIPSDSELH+L+IFK PHAIGT+IS GFLSSFTTI+
Sbjct: 301 TIFGGLKWLLGIFHVFLTNRSVSVTIPSDSELHLLTIFKYPHAIGTVISAGFLSSFTTIA 360
Query: 361 IFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFILS 420
IF AV LFLIGQICFKPVLILYLWLIYFL+PLISLPLLHQFQIRIKADASKM ILGFILS
Sbjct: 361 IFIAVSLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQFQIRIKADASKMQILGFILS 420
Query: 421 AATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWFSW 480
A TSA CFYFH WR VVFVFA LQGTAAALLH YGRVLVLDCSPAGKE AISMWFSW
Sbjct: 421 AVTSAICFYFHNDAWRLPVVFVFAALQGTAAALLHTYGRVLVLDCSPAGKEAAISMWFSW 480
Query: 481 IRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDDSE 540
+RAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGG+VLI+GN+TDYGGAV+AGHVK+DSE
Sbjct: 481 MRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGVVLIYGNITDYGGAVSAGHVKNDSE 540
Query: 541 KGSPVIGLESRSESKELESP 561
KGSPVIGLESRS SKELESP
Sbjct: 541 KGSPVIGLESRSVSKELESP 556
BLAST of HG10002973 vs. ExPASy TrEMBL
Match:
A0A0A0L1Q8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G006650 PE=4 SV=1)
HSP 1 Score: 914.8 bits (2363), Expect = 1.7e-262
Identity = 477/562 (84.88%), Postives = 509/562 (90.57%), Query Frame = 0
Query: 1 MAEQSPKPKPSEIQNFPPPKSTTGRSMSTPRSAS--GGGGSRRETPDFHSTAAKLERAKE 60
M EQSP+PK SEI N PPPKST+ RS+STPRSA+ GGGGSRRETPDFHSTAAKLERAKE
Sbjct: 1 MTEQSPRPKQSEIHNLPPPKSTSARSVSTPRSATSGGGGGSRRETPDFHSTAAKLERAKE 60
Query: 61 VYRAYEGHGERPTIVEIVGWCFYELCSLFVLTLLIPVVFPLIISQISGTLTEPPQGWFKS 120
VYRAYEGHGERPTI EI+GWCFYELCS FVL LLIPVVFPLIISQISG T PPQGWFKS
Sbjct: 61 VYRAYEGHGERPTIAEILGWCFYELCSFFVLALLIPVVFPLIISQISGPPTAPPQGWFKS 120
Query: 121 FMGFDCPPREMQLYQSLTEHTIKVSSTQFSPLTWTSISWALGLFLAGPILAFASFHLDYG 180
F GFDC REMQLYQSLTE TI VS+ QFSPL WTSISWA+GL LAGPILA ASFHLDYG
Sbjct: 121 FRGFDCSSREMQLYQSLTEQTINVSNAQFSPLIWTSISWAVGLVLAGPILAVASFHLDYG 180
Query: 181 FNQHLITLAAVAAGALSCLPTGLFKTVKIFPVYIILIVIAHSVAFTSHTRHLGLMLRGLT 240
F+Q+LITLAAVAAGAL+CLPTG FKTVKIFP+YIILIVIAHSVA TSHTRHLGLMLRGLT
Sbjct: 181 FHQYLITLAAVAAGALTCLPTGFFKTVKIFPLYIILIVIAHSVASTSHTRHLGLMLRGLT 240
Query: 241 GPILHEPKFSQRRTGSGLISSCSSAVGGLGSAALSAFTYHMLRRDKQVQEGDDNHFLNLW 300
GPI+H+ KFS R GSG ISS S+ VGG+G+AA+SAFTYHMLR DKQVQ G D+HFLNLW
Sbjct: 241 GPIIHKAKFSLRIIGSGQISSWSAGVGGVGAAAISAFTYHMLRSDKQVQ-GIDSHFLNLW 300
Query: 301 IVTIFAGLQWLLGIFHVFLTNRSISVTIPSDSELHILSIFKDPHAIGTIISGGFLSSFTT 360
IVTIFAGL+WL+GIFHVFLTNRSISV+IPSDSE+HILSIFK PHAI T+ISGGFLSSF T
Sbjct: 301 IVTIFAGLKWLIGIFHVFLTNRSISVSIPSDSEIHILSIFKYPHAIATVISGGFLSSFAT 360
Query: 361 ISIFTAVLLFLIGQICFKPVLILYLWLIYFLVPLISLPLLHQFQIRIKADASKMLILGFI 420
ISIFT+VLLFLI QICFKPVLI YL LIYFLVPLISLPLLHQ QIRIKADASKMLILGFI
Sbjct: 361 ISIFTSVLLFLISQICFKPVLIFYLLLIYFLVPLISLPLLHQLQIRIKADASKMLILGFI 420
Query: 421 LSAATSATCFYFHTGEWRRRVVFVFAVLQGTAAALLHAYGRVLVLDCSPAGKEGAISMWF 480
LSAATSATCFYFH W+R +VFVFAVLQGTAAA+LHAYGR LV+ CSPAGKE AISMWF
Sbjct: 421 LSAATSATCFYFHAYAWQRHLVFVFAVLQGTAAAVLHAYGRALVVHCSPAGKESAISMWF 480
Query: 481 SWIRAIGGCVGFTVAAVVPARLQVSSGVAFCCAVVGGMVLIFGNVTDYGGAVAAGHVKDD 540
SW+RAIGGCVGFTVAAVVP LQVSSGV FCCAVVGGM+LIFGNVTDY GAVAAGHV+DD
Sbjct: 481 SWMRAIGGCVGFTVAAVVPTMLQVSSGVVFCCAVVGGMLLIFGNVTDYDGAVAAGHVRDD 540
Query: 541 SEKGSPVIGLESRSESKELESP 561
SEKGSPV GL+SRSESKELESP
Sbjct: 541 SEKGSPVFGLDSRSESKELESP 561
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_008448612.1 | 7.3e-276 | 88.57 | PREDICTED: uncharacterized protein LOC103490734 [Cucumis melo] >KAA0053002.1 unc... | [more] |
XP_022965332.1 | 3.6e-267 | 84.46 | uncharacterized protein LOC111465229 isoform X1 [Cucurbita maxima] | [more] |
XP_022965333.1 | 3.4e-265 | 84.46 | uncharacterized protein LOC111465229 isoform X2 [Cucurbita maxima] | [more] |
KAG6577745.1 | 4.4e-265 | 83.93 | hypothetical protein SDJN03_25319, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7015784.1 | 7.6e-265 | 84.11 | hypothetical protein SDJN02_23422, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5D3CJT7 | 3.5e-276 | 88.57 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3BK45 | 3.5e-276 | 88.57 | uncharacterized protein LOC103490734 OS=Cucumis melo OX=3656 GN=LOC103490734 PE=... | [more] |
A0A6J1HK19 | 1.8e-267 | 84.46 | uncharacterized protein LOC111465229 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1HNK9 | 1.6e-265 | 84.46 | uncharacterized protein LOC111465229 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A0A0L1Q8 | 1.7e-262 | 84.88 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G006650 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |