Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
CAAAATCAAAGTCGGATACCCCGCAGGATTTGAAATAGTTTTCCCGTCTTGTTTTCATCCGAAATTTTGAAAACTCATCACTTTCCTATCGTTCTTCTAAGTCCCAACCCCAGAAGAGTGAGAATCCACCAGTACTCTTCGCCCAAACTGCTTCAGTTGGTAGAATGTCTTCCGAGGATCAAAAGCTGATTAAGAAAACCAAAGTGGAGGTAGACGATGCGGACGACGGAATAAGCCTGGGTACCCTTTTGCAAGAAAAAAGGAAGAAACTCCTAAATGTGGGTCCTAAATCTATCTCAAAGCCGAAGAAGGAAGAAGTTCAGGGAGAAGATGGAGTGGGAAAATCCCCAAATATAAATTGTAGGTCTGCTTCTAAGGGCACCAAGGTTAAAAAAGAAGAGCGTTTCAACTCTGTTGATGATGATTTTGACGAGAAGCCAGCCAAAAAGAGCTCTGCAGCAAAACGTGATATGGTTCGTATATCTTTCTGTGTTCTATGCTCCGATTTTCTTCAACTGTTTCGATTAATGAGCGACTGAAAGTAGAGTAATCGATTTAGGACAATGTAAGGGAGTAGAATATCCCTGGTGGTTGTAAGAAATGTGTTACTCTAGCGTGGAAGCTACAAGGTTTTTAGTGCAGAATTTTGGGTTCTTAATGTGATAATCTCTTCTTCATATACACTGAAAGCTGTTAACATATGTGTTTTCTGAATGGGCTGGGGAGATAGGAACTGAAGAAGAAGAAGAAAGTGAACGAGGAGAAGAGCAAAAGCGCCAAGGGGGAGCTGGAGAGCCAGAAAAAAGAGAAGAAGGAGAAGAAGGTATATGATTTGCCCGGTCAGAAGCGAGATCCCCCAGAAGAGGTATTAAAGAATGATTTGATAGGTATCAGTTTTATGTTTGGGAGTTCTTTAGCTGATGATCTTCATTTGCTCTTGGTTATTCAGAGAGACCCGCTGAGGATCTTTTACGAAACGCTCCACAAGCAAGTTCCCCACAGTGAAATGGCAGAGTTTTGGTAAGAAAACTGCTCTAATTGTTATTTCTTTAGTATTATCAAGAGTAGTTTTGATACCTTGAAGACGTTATGTTAACAAAGATGTTTATGTATGGGCTAAGTTTTCTGAGAACTCTCTAGTAGAACGTGTTTACTGGTTTCCATTTTCATGCCGATGGCTAGTCAGTATGGAACATAACTAGTTGTGTTCATGTTACTTCATGCCATGAACACAACTAGTTACATTGGAGAAACTCATATGTCGAATTCCGCGTTTGATTCAGATGGTTGAATGATGTCTGTTATGTTGCCCGACATAAGATCTTTAACATTCTGTATACTGCAAAGTAGACACTCAGGAGACGTTTATCAACAGGGATTTAGCAGTGTTAGAATGGAGGAGTGACTAATTCAGGAGATGTTTATTGTTTGTACATAACCATCTCATAAATGGGGAAGTGCACAAACGTGTTAGACAGGAATAAAGACTAGATTAAGATGCCATAAGCATGACACAACTCAATGTGCAGTCTGCAGTCCATTCTACCTCATCCTCCTTGGAAGTTTTCTTGTGGTGTTCTGCATGAACTGAGAACAACTAGAAAAGTGGGGCAATTACTCTCTAATTTGCTATTTCATTTGTTACATCTTTAATGAGTCCACGATCTCCCTGCATTTATACTTTGCAGCATCTGATCATCTTGTGTATGTGAGATTCCACATTGGTTGGAGAGAAGAACGAATCATTCCTTTATAAGGGTGTGAAAACCTCTCCCTAGTAGACAATTTGCTAGCGGTGAGCTTCGACTATTACAAATGGTACCAAAATTAGACATGAGGGCTTGGGCTATTACTAATGGTACCAGAATTAGACAGGAGACATAGTGCCTGTGAGGACGCTGGGCCCTCAAGAAGTGTGGATTGTGAGATCCCATATTGGTTGGAGGAGAGAACGAAACATTCCTTATAAAGGTGTGAAAACCTTCCACTAGGCGGAGTGT
CAGTGAGGACGTTGGGCCCTCAATGAGGTTGGATTTTGAGATTCCACATTGGTTGTAGAGTAGAACAAAGCATTCTTTATAAGAGTGTGGAAACCTTTCCCTAGTAGACTAAAACCTTGAGGGAAGTCCAAAAGGGAAAGTCCAAAGAGGACAATATCTGCTAGCGGTGAGTTTGAGTTGTTACTATGTACGAGCATAAGTTAATCATGTTTCAAATTTTCGTTTTCTAAAAGGTAGTAATGGAATGAGATTGAGTTTGTTGTTCATCACTACTTAATATTGTGGAGTTTGGTCTGCAGGATGATGGAGTCTGGTTTACTATCCAAAGAAGAAGCAAAGAAAGTTTTTGACAAGAAGCAGAAGAAGGCACCACTGCAAAAGTTGAGTTCCCCAGGGAAGACTGTGACTGCAGTAAAGAGCGTCACGAAGACCACCATTGTCAAGAAAACTGTCCAATCTCCTCCAGTTTCTTCAAATAAGAAGACGATGACAGTGGGCTCCAAAGTTGTGGCAAAACAGTCAACGAAGCGCAATTTGAAAGACGAAAGCTCCGAAGATGAATCAGACGATGACTTCATAATCAGTCGAAGCATGAAAAAGAAACCAAGAGCAGCCTAATTCAGGTGTAAATGGACAAATCCTTCTATTCAGAAACTGATTTAAATCTCTGATTAGTTGTTCCAAAACATTGCATCAATGTTATAATGAGATTGATACTCCTTTGGTCGTTTCTCCCTTGTGTGCGATGTGCTCATGTTTAT
mRNA sequence
CAAAATCAAAGTCGGATACCCCGCAGGATTTGAAATAGTTTTCCCGTCTTGTTTTCATCCGAAATTTTGAAAACTCATCACTTTCCTATCGTTCTTCTAAGTCCCAACCCCAGAAGAGTGAGAATCCACCAGTACTCTTCGCCCAAACTGCTTCAGTTGGTAGAATGTCTTCCGAGGATCAAAAGCTGATTAAGAAAACCAAAGTGGAGGTAGACGATGCGGACGACGGAATAAGCCTGGGTACCCTTTTGCAAGAAAAAAGGAAGAAACTCCTAAATGTGGGTCCTAAATCTATCTCAAAGCCGAAGAAGGAAGAAGTTCAGGGAGAAGATGGAGTGGGAAAATCCCCAAATATAAATTGTAGGTCTGCTTCTAAGGGCACCAAGGTTAAAAAAGAAGAGCGTTTCAACTCTGTTGATGATGATTTTGACGAGAAGCCAGCCAAAAAGAGCTCTGCAGCAAAACGTGATATGGAACTGAAGAAGAAGAAGAAAGTGAACGAGGAGAAGAGCAAAAGCGCCAAGGGGGAGCTGGAGAGCCAGAAAAAAGAGAAGAAGGAGAAGAAGGTATATGATTTGCCCGGTCAGAAGCGAGATCCCCCAGAAGAGAGAGACCCGCTGAGGATCTTTTACGAAACGCTCCACAAGCAAGTTCCCCACAGTGAAATGGCAGAGTTTTGGATGATGGAGTCTGGTTTACTATCCAAAGAAGAAGCAAAGAAAGTTTTTGACAAGAAGCAGAAGAAGGCACCACTGCAAAAGTTGAGTTCCCCAGGGAAGACTGTGACTGCAGTAAAGAGCGTCACGAAGACCACCATTGTCAAGAAAACTGTCCAATCTCCTCCAGTTTCTTCAAATAAGAAGACGATGACAGTGGGCTCCAAAGTTGTGGCAAAACAGTCAACGAAGCGCAATTTGAAAGACGAAAGCTCCGAAGATGAATCAGACGATGACTTCATAATCAGTCGAAGCATGAAAAAGAAACCAAGAGCAGCCTAATTCAGGTGTAAATGGACAAATCCTTCTATTCAGAAACTGATTTAAATCTCTGATTAGTTGTTCCAAAACATTGCATCAATGTTATAATGAGATTGATACTCCTTTGGTCGTTTCTCCCTTGTGTGCGATGTGCTCATGTTTAT
Coding sequence (CDS)
ATGTCTTCCGAGGATCAAAAGCTGATTAAGAAAACCAAAGTGGAGGTAGACGATGCGGACGACGGAATAAGCCTGGGTACCCTTTTGCAAGAAAAAAGGAAGAAACTCCTAAATGTGGGTCCTAAATCTATCTCAAAGCCGAAGAAGGAAGAAGTTCAGGGAGAAGATGGAGTGGGAAAATCCCCAAATATAAATTGTAGGTCTGCTTCTAAGGGCACCAAGGTTAAAAAAGAAGAGCGTTTCAACTCTGTTGATGATGATTTTGACGAGAAGCCAGCCAAAAAGAGCTCTGCAGCAAAACGTGATATGGAACTGAAGAAGAAGAAGAAAGTGAACGAGGAGAAGAGCAAAAGCGCCAAGGGGGAGCTGGAGAGCCAGAAAAAAGAGAAGAAGGAGAAGAAGGTATATGATTTGCCCGGTCAGAAGCGAGATCCCCCAGAAGAGAGAGACCCGCTGAGGATCTTTTACGAAACGCTCCACAAGCAAGTTCCCCACAGTGAAATGGCAGAGTTTTGGATGATGGAGTCTGGTTTACTATCCAAAGAAGAAGCAAAGAAAGTTTTTGACAAGAAGCAGAAGAAGGCACCACTGCAAAAGTTGAGTTCCCCAGGGAAGACTGTGACTGCAGTAAAGAGCGTCACGAAGACCACCATTGTCAAGAAAACTGTCCAATCTCCTCCAGTTTCTTCAAATAAGAAGACGATGACAGTGGGCTCCAAAGTTGTGGCAAAACAGTCAACGAAGCGCAATTTGAAAGACGAAAGCTCCGAAGATGAATCAGACGATGACTTCATAATCAGTCGAAGCATGAAAAAGAAACCAAGAGCAGCCTAA
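The CDS listed here appears to be a contiguous stretch of the mRNA above, so the untranslated regions named in the legend can be recovered by locating the CDS inside the mRNA. The following is a minimal sketch of that idea, not part of the database record; the function name `split_transcript` and the toy sequences are illustrative assumptions, and the full sequences from this page can be pasted in their place.

```python
def split_transcript(mrna: str, cds: str) -> tuple[str, str, str]:
    """Split an mRNA into (5' UTR, CDS, 3' UTR) by locating the CDS in it."""
    start = mrna.find(cds)  # index of the first occurrence, or -1
    if start == -1:
        raise ValueError("CDS is not a contiguous substring of the mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]


# Toy sequences for illustration only (hypothetical, much shorter than the record).
five_utr, coding, three_utr = split_transcript("GGTAGAATGTCTTAAATTC", "ATGTCTTAA")
print(len(five_utr), len(coding), len(three_utr))
```

Using the first occurrence is a simplification: it assumes the CDS string does not also occur earlier in the transcript by chance, which is a safe assumption only for sequences of realistic length.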
Protein sequence
MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGKSPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAKGELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLSKEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSKVVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA
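The protein sequence follows from the CDS by reading codons in frame 0 with the standard genetic code and stopping at the first stop codon. The sketch below shows that translation step under those assumptions; the name `translate` is hypothetical and the input is only the first ten codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGTCTTCCGAGGATCAAAAGCTGATTAAG"))  # MSSEDQKLIK
```

The sketch raises `KeyError` on ambiguous bases such as `N`; a production pipeline would normally use an established library instead.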
Homology
BLAST of CmaCh01G019750 vs. ExPASy TrEMBL
Match:
A0A6J1J474 (protein PXR1-like OS=Cucurbita maxima OX=3661 GN=LOC111481191 PE=4 SV=1)
HSP 1 Score: 497.7 bits (1280), Expect = 3.2e-137
Identity = 277/277 (100.00%), Positives = 277/277 (100.00%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK
Sbjct: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK
Sbjct: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
Query: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS
Sbjct: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
Query: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK
Sbjct: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
Query: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA
Sbjct: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 277
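The "Identity = x/y (z%)" figure reported for each HSP is the number of aligned columns with identical residues divided by the alignment length, gaps included. A minimal sketch of that arithmetic follows; the function name `identity_stats` and the toy alignment are assumptions for illustration. The Positives count is not reproduced here because it also depends on the substitution matrix used for scoring.

```python
def identity_stats(query: str, sbjct: str) -> tuple[int, int, float]:
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have equal length")
    # A column counts as identical when both residues match and neither is a gap.
    matches = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    length = len(query)
    return matches, length, round(100.0 * matches / length, 2)


# Toy alignment (hypothetical): one gap and one substitution in eight columns.
print(identity_stats("MSSEDQKL", "MSS-DQKP"))  # (6, 8, 75.0)
```

The same rounding reproduces the percentages on this page, for example 221/278 gives 79.50.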
BLAST of CmaCh01G019750 vs. ExPASy TrEMBL
Match:
A0A6J1FIR2 (protein PXR1-like OS=Cucurbita moschata OX=3662 GN=LOC111446106 PE=4 SV=1)
HSP 1 Score: 484.2 bits (1245), Expect = 3.7e-133
Identity = 270/277 (97.47%), Positives = 272/277 (98.19%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSSEDQKLIKKTKVEVD+ADDGISLG LLQEKRKKL NVGPKSISKPKKEEVQGEDGVGK
Sbjct: 1 MSSEDQKLIKKTKVEVDEADDGISLGALLQEKRKKLQNVGPKSISKPKKEEVQGEDGVGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
SPNINC SASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK
Sbjct: 61 SPNINCGSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
Query: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS
Sbjct: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
Query: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
KEEAKKVF+KKQKKAPLQKLSSPGKTVT VKSVTKTTIVK TVQSPPVSSNKKTMTVGSK
Sbjct: 181 KEEAKKVFEKKQKKAPLQKLSSPGKTVTTVKSVTKTTIVKNTVQSPPVSSNKKTMTVGSK 240
Query: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA
Sbjct: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 277
BLAST of CmaCh01G019750 vs. ExPASy TrEMBL
Match:
A0A6J1CXC1 (transcriptional regulator ATRX homolog OS=Momordica charantia OX=3673 GN=LOC111015025 PE=4 SV=1)
HSP 1 Score: 382.1 bits (980), Expect = 2.0e-102
Identity = 221/278 (79.50%), Positives = 245/278 (88.13%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSSEDQK IKK+KVE+D+ +DG+SLG++ Q K+KKL N G K +S KKEE+QGEDGVGK
Sbjct: 1 MSSEDQKPIKKSKVELDEMEDGMSLGSIFQAKKKKLSNGGSKPLSNLKKEELQGEDGVGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKV-NEEKSKSA 120
SP + S KGTKVKKEERFNS DDD+DE P+KKSSAAKRDME KKKKKV EEKSK++
Sbjct: 61 SPKMGSGSVPKGTKVKKEERFNSSDDDYDETPSKKSSAAKRDMEPKKKKKVKEEEKSKTS 120
Query: 121 KGELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLL 180
K + ESQKKE++EKKVYDLPGQKRDPPEERDPLRIFYETLHKQ+PHSEMA+FWMMESGLL
Sbjct: 121 KAKQESQKKERREKKVYDLPGQKRDPPEERDPLRIFYETLHKQLPHSEMAQFWMMESGLL 180
Query: 181 SKEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGS 240
SKEEAKKVF+KKQKKAPLQKLSSP K VTAVKSVTKT IVKKTVQS P+SSNKKT TV S
Sbjct: 181 SKEEAKKVFEKKQKKAPLQKLSSPVKAVTAVKSVTKTAIVKKTVQSSPLSSNKKT-TVDS 240
Query: 241 KVVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
KV+ KQS KR KDESSEDESDDDFIISRS+KKKPRAA
Sbjct: 241 KVMTKQSKKRKSKDESSEDESDDDFIISRSVKKKPRAA 277
BLAST of CmaCh01G019750 vs. ExPASy TrEMBL
Match:
A0A5D3E5S8 (Golgin subfamily A member 6-like protein 22 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G008030 PE=4 SV=1)
HSP 1 Score: 377.1 bits (967), Expect = 6.3e-101
Identity = 215/277 (77.62%), Positives = 243/277 (87.73%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSS+DQK +KK K E+DD+DDG+SLG LLQEKRKKLLNVG K +SKPKKEE+ G DG+GK
Sbjct: 1 MSSQDQKPLKKAKPELDDSDDGMSLGALLQEKRKKLLNVGSKLLSKPKKEELLGVDGLGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
SP I+ SA KG+KVKKEERFNSV+DDFDEKPAKKSSAAKRD ELKKKKKV EE+ +
Sbjct: 61 SPKIDSGSAPKGSKVKKEERFNSVNDDFDEKPAKKSSAAKRDTELKKKKKVKEEEKSKSS 120
Query: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
ELES KKE+K+KKVYDLPGQKRDPPEERDPLRIFYE+LHKQ+PHSEMA+FWMMESGLLS
Sbjct: 121 KELESLKKERKQKKVYDLPGQKRDPPEERDPLRIFYESLHKQLPHSEMAQFWMMESGLLS 180
Query: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
KEEAK+VF+KKQKKAPLQKLSSP KTV+AVKSVTKT +VKKTVQS P+SSN +T V SK
Sbjct: 181 KEEAKEVFEKKQKKAPLQKLSSPMKTVSAVKSVTKTAVVKKTVQSSPLSSN-RTTKVDSK 240
Query: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
VV K S KR KD+SSED+SDDDF IS+S+KKK RAA
Sbjct: 241 VVMKSSKKRKSKDDSSEDDSDDDFFISQSIKKKARAA 276
BLAST of CmaCh01G019750 vs. ExPASy TrEMBL
Match:
A0A1S3CRI7 (uncharacterized protein LOC103503875 OS=Cucumis melo OX=3656 GN=LOC103503875 PE=4 SV=1)
HSP 1 Score: 377.1 bits (967), Expect = 6.3e-101
Identity = 215/277 (77.62%), Positives = 243/277 (87.73%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSS+DQK +KK K E+DD+DDG+SLG LLQEKRKKLLNVG K +SKPKKEE+ G DG+GK
Sbjct: 1 MSSQDQKPLKKAKPELDDSDDGMSLGALLQEKRKKLLNVGSKLLSKPKKEELLGVDGLGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
SP I+ SA KG+KVKKEERFNSV+DDFDEKPAKKSSAAKRD ELKKKKKV EE+ +
Sbjct: 61 SPKIDSGSAPKGSKVKKEERFNSVNDDFDEKPAKKSSAAKRDTELKKKKKVKEEEKSKSS 120
Query: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
ELES KKE+K+KKVYDLPGQKRDPPEERDPLRIFYE+LHKQ+PHSEMA+FWMMESGLLS
Sbjct: 121 KELESLKKERKQKKVYDLPGQKRDPPEERDPLRIFYESLHKQLPHSEMAQFWMMESGLLS 180
Query: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
KEEAK+VF+KKQKKAPLQKLSSP KTV+AVKSVTKT +VKKTVQS P+SSN +T V SK
Sbjct: 181 KEEAKEVFEKKQKKAPLQKLSSPMKTVSAVKSVTKTAVVKKTVQSSPLSSN-RTTKVDSK 240
Query: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
VV K S KR KD+SSED+SDDDF IS+S+KKK RAA
Sbjct: 241 VVMKSSKKRKSKDDSSEDDSDDDFFISQSIKKKARAA 276
BLAST of CmaCh01G019750 vs. NCBI nr
Match:
XP_022982324.1 (protein PXR1-like [Cucurbita maxima])
HSP 1 Score: 497.7 bits (1280), Expect = 6.6e-137
Identity = 277/277 (100.00%), Positives = 277/277 (100.00%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK
Sbjct: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK
Sbjct: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
Query: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS
Sbjct: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
Query: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK
Sbjct: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
Query: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA
Sbjct: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 277
BLAST of CmaCh01G019750 vs. NCBI nr
Match:
KAG6608598.1 (hypothetical protein SDJN03_01940, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 487.6 bits (1254), Expect = 6.9e-134
Identity = 272/277 (98.19%), Positives = 274/277 (98.92%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSSEDQKLIKKTKVEVD+ADDGISLG LLQEKRKKL NVGPKSISKPKKEEVQGEDGVGK
Sbjct: 1 MSSEDQKLIKKTKVEVDEADDGISLGALLQEKRKKLQNVGPKSISKPKKEEVQGEDGVGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
SPNINC SASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK
Sbjct: 61 SPNINCGSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
Query: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS
Sbjct: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
Query: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
KEEAKKVF+KKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK
Sbjct: 181 KEEAKKVFEKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
Query: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA
Sbjct: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 277
BLAST of CmaCh01G019750 vs. NCBI nr
Match:
XP_023525196.1 (protein PXR1-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 484.6 bits (1246), Expect = 5.8e-133
Identity = 270/277 (97.47%), Positives = 273/277 (98.56%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSSEDQKLIKKTKVE D+ADDGISLG LLQEKRKKLLNVGPKSISKPKKEEVQGEDG GK
Sbjct: 1 MSSEDQKLIKKTKVEEDEADDGISLGALLQEKRKKLLNVGPKSISKPKKEEVQGEDGAGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
SPNINC SASKGTKV+KEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK
Sbjct: 61 SPNINCGSASKGTKVQKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
Query: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS
Sbjct: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
Query: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
KEEAKKVF+KKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK
Sbjct: 181 KEEAKKVFEKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
Query: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA
Sbjct: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 277
BLAST of CmaCh01G019750 vs. NCBI nr
Match:
XP_022940536.1 (protein PXR1-like [Cucurbita moschata])
HSP 1 Score: 484.2 bits (1245), Expect = 7.6e-133
Identity = 270/277 (97.47%), Positives = 272/277 (98.19%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSSEDQKLIKKTKVEVD+ADDGISLG LLQEKRKKL NVGPKSISKPKKEEVQGEDGVGK
Sbjct: 1 MSSEDQKLIKKTKVEVDEADDGISLGALLQEKRKKLQNVGPKSISKPKKEEVQGEDGVGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
SPNINC SASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK
Sbjct: 61 SPNINCGSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
Query: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS
Sbjct: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
Query: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
KEEAKKVF+KKQKKAPLQKLSSPGKTVT VKSVTKTTIVK TVQSPPVSSNKKTMTVGSK
Sbjct: 181 KEEAKKVFEKKQKKAPLQKLSSPGKTVTTVKSVTKTTIVKNTVQSPPVSSNKKTMTVGSK 240
Query: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA
Sbjct: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 277
BLAST of CmaCh01G019750 vs. NCBI nr
Match:
KAG7037915.1 (hypothetical protein SDJN02_01546, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 440.3 bits (1131), Expect = 1.3e-119
Identity = 252/277 (90.97%), Positives = 257/277 (92.78%), Query Frame = 0
Query: 1 MSSEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGK 60
MSSEDQKLIKKTKVEVD+ADDGISLG LLQEKRKKL NVGPKSISKPKKEEVQGE+GVGK
Sbjct: 1 MSSEDQKLIKKTKVEVDEADDGISLGALLQEKRKKLQNVGPKSISKPKKEEVQGEEGVGK 60
Query: 61 SPNINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
SPNINC SASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK
Sbjct: 61 SPNINCGSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAK 120
Query: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLS 180
GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPH
Sbjct: 121 GELESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPH--------------- 180
Query: 181 KEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
+AKKVF+KKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK
Sbjct: 181 --KAKKVFEKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKKTMTVGSK 240
Query: 241 VVAKQSTKRNLKDESSEDESDDDFIISRSMKKKPRAA 278
VVAKQSTKRNLKDESS+DESDDDFIISRSMKKKPRAA
Sbjct: 241 VVAKQSTKRNLKDESSDDESDDDFIISRSMKKKPRAA 260
BLAST of CmaCh01G019750 vs. TAIR 10
Match:
AT1G19990.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: cultured cell; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G11600.1); Has 11256 Blast hits to 7192 proteins in 541 species: Archae - 6; Bacteria - 629; Metazoa - 4714; Fungi - 936; Plants - 545; Viruses - 34; Other Eukaryotes - 4392 (source: NCBI BLink). )
HSP 1 Score: 125.6 bits (314), Expect = 6.4e-29
Identity = 114/274 (41.61%), Positives = 161/274 (58.76%), Query Frame = 0
Query: 3 SEDQKLIKKTKVEVDDADDGISLGTLLQEKRKKLLNVGPKSISKPKKEEVQGEDGVGKSP 62
SED +K K++ + +D SL + ++K N G K K KKEE +D K P
Sbjct: 4 SED---VKAMKMKEEAEEDNKSLSSFAKKKPTNGNNAGSK---KLKKEENDDDDDDNK-P 63
Query: 63 NINCRSASKGTKVKKEERFNSVDDDFDEKPAKKSSAAKRDMELKKKKKVNEEKSKSAKGE 122
+ S S+ VKK+E +D D ++KP K +++ + + K+ K K E
Sbjct: 64 IKSSVSGSRAKPVKKKE---EIDKDDEKKPVSKRNSS---VGVSKENK---------KPE 123
Query: 123 LESQKKEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWMMESGLLSKE 182
E + K+K+E+KVYDLPGQKR+ P+ERDPLRIFYE+L+KQ+P S+MA+ W+MESGLL E
Sbjct: 124 KEEEVKKKRERKVYDLPGQKREQPDERDPLRIFYESLYKQIPTSDMAQIWLMESGLLPAE 183
Query: 183 EAKKVFDKKQKKAPLQKLSSPGKTVTAV-KSVTKT-TIVKKTVQSPP--VSSNKKTMTVG 242
+AKKV +KK +K KLSSP K+ + +S +K+ T+ KK VQ P SNKK
Sbjct: 184 KAKKVLEKKLQKG--GKLSSPVKSAASTPRSNSKSVTVKKKEVQKSPSEALSNKK----- 243
Query: 243 SKVVAKQSTKRNLKDESSEDESDDDFIISRSMKK 273
K + T + K S +D+SDDDF+ SR KK
Sbjct: 244 -KGNDSKPTTKKRKKNSDDDDSDDDFLASRVSKK 247
BLAST of CmaCh01G019750 vs. TAIR 10
Match:
AT5G11600.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G19990.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 98.2 bits (243), Expect = 1.1e-20
Identity = 68/159 (42.77%), Positives = 99/159 (62.26%), Query Frame = 0
Query: 115 KSKSAKGELESQK-KEKKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQVPHSEMAEFWM 174
K+K+ + K K K+EKKVY L GQK DPPEER+PLRIFYE+L KQ+P SEMAEFW+
Sbjct: 100 KAKTTSATVSLVKGKAKREKKVYSLAGQKFDPPEEREPLRIFYESLSKQIPGSEMAEFWL 159
Query: 175 MESGLLSKEEAKKVFDKKQKKAPLQKLSSPGKTVTAVKSVTKTTIVKKTVQSPPVSSNKK 234
ME G+LS E+AK+ F+KKQ+K ++ +P K+ T S SS +
Sbjct: 160 MEHGMLSPEKAKRAFEKKQRKMKQIRMGTPSKSA-------------PTFSSKAESSQRT 219
Query: 235 TMTVGSKVVAKQSTKRNLKDESSEDESDDDFIISRSMKK 273
+ + + + A++ K+ + D+ +D+ DDDFI+S +K
Sbjct: 220 SASKNNGLDARK--KKKVVDD--DDDDDDDFILSHKRRK 241
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1J474 | 3.2e-137 | 100.00 | protein PXR1-like OS=Cucurbita maxima OX=3661 GN=LOC111481191 PE=4 SV=1
A0A6J1FIR2 | 3.7e-133 | 97.47 | protein PXR1-like OS=Cucurbita moschata OX=3662 GN=LOC111446106 PE=4 SV=1
A0A6J1CXC1 | 2.0e-102 | 79.50 | transcriptional regulator ATRX homolog OS=Momordica charantia OX=3673 GN=LOC1110...
A0A5D3E5S8 | 6.3e-101 | 77.62 | Golgin subfamily A member 6-like protein 22 OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CRI7 | 6.3e-101 | 77.62 | uncharacterized protein LOC103503875 OS=Cucumis melo OX=3656 GN=LOC103503875 PE=...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022982324.1 | 6.6e-137 | 100.00 | protein PXR1-like [Cucurbita maxima]
KAG6608598.1 | 6.9e-134 | 98.19 | hypothetical protein SDJN03_01940, partial [Cucurbita argyrosperma subsp. sorori...
XP_023525196.1 | 5.8e-133 | 97.47 | protein PXR1-like [Cucurbita pepo subsp. pepo]
XP_022940536.1 | 7.6e-133 | 97.47 | protein PXR1-like [Cucurbita moschata]
KAG7037915.1 | 1.3e-119 | 90.97 | hypothetical protein SDJN02_01546, partial [Cucurbita argyrosperma subsp. argyro...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G19990.1 | 6.4e-29 | 41.61 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G11600.1 | 1.1e-20 | 42.77 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
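For downstream filtering, the pipe-delimited hit rows can be read into records with numeric E-values and identities. The following is a rough sketch that assumes the first four fields are always match name, E-value, identity and description, and that descriptions contain no `|` character; `parse_hit` is a hypothetical helper, and any trailing link columns, if present, are ignored.

```python
def parse_hit(row: str) -> dict:
    """Parse one 'name | e-value | identity | description' row into a record."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # scientific notation such as 6.4e-29
        "identity": float(identity),  # percent identity
        "description": description,
    }


# Rows copied from the TAIR 10 hits, descriptions shortened for the example.
rows = [
    "AT5G11600.1 | 1.1e-20 | 42.77 | unknown protein",
    "AT1G19990.1 | 6.4e-29 | 41.61 | unknown protein",
]
hits = sorted((parse_hit(r) for r in rows), key=lambda h: h["evalue"])
print([h["name"] for h in hits])  # ['AT1G19990.1', 'AT5G11600.1']
```

Sorting by E-value rather than identity keeps the most significant match first, which matches the ordering used in the tables on this page.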