Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GCCTACTAAACCAAACCATAAAAAAGCCTACAGCGTGAGCTCAGCGATCACTATAATTCATACAATTTCACCGATAAATCATATCCACACGTCTCAATCGCCCTGCCATCAACCGTTTGTGAGCTTGGAAAATCACAAAGAATCAGCAAGAAATGGAGTCTCTGATTTTGATTCCAGCAAGTAAACACGTTCCAATCACAAATTCTTCTCTATTTTGCTCCGGTTTCTCCATTCCATTCACTTCTAGAAAGCGCGCCATCAATTTCCAACCATGTTGTCCACAGATTCTTCATTGCAATGGCCTTCGACATGCGCAAGCAACTCCGAGGTCTTATATTGGAAATGGACCGCCGTATGGTGAGGAAGATGATGATCCTGAACTACAGGTTGAAAGCCTCAGGGTTCCTGATGATTGGTCGGTTCCTTCCAAGGCATTGGAGGTATTGTTTTCCGTTCTTCGTTTTTGTTGTTGATTTTCGATCTGCATTTCTGTACTTGTTCGATTAGGTTACTTGTCCAACTTCTTTTTAGTGATTTGAACGTAGTAATTGGTTTTGTTCTTGATGATGGCGCATTCTTGATGTTTTTGAATGATGAATGCGATGTGATGACATTGTTATCTTCATGAAATTTAAGCAGAAATTAGACATTCTCCTTCGAACGTCGTGTTGCTAACTAAATTCAGTGTGAAGACCTCAGAGTTGGGTTTTTTAAACTGGCCCGAATTTGATGTTTACTCTGTATTAGCCTACGGAAAGAATATTCTAGAGCCACCACACAATGTTCTTGTCGACACAAATTTGTTCTGCCACTAGAGTATGTTGGGCATGACCTTTGATGCCTTTTCCCAATGAATAATTTATTATAGAAAAAGAAAATTGGGTCTGGATGTAAAATGGATTTTGATTTATGAGCTGGGAAAGATTTTCAAGTTAGTACCAATAAAATTAGGCTCCAAAGTCAAATCTAGGTTGCCACTATTTTACATTTGTTTGAATGATCCAAGACAGTTTTGATCATTCTGCTTGAGTAGTTTACGAGAAGGAGATTGGAAATCCTTGAAATTAGTCTTGGTATATCCTATTAGATCTGCTGGTTTCTTCTATATAATGGCTTAGAATTAAATCTATTTGTCCAATTATCTGGTGGAAGTTCTTCTCGATGCTTATTTCTCCAAGCTTGCAGAAAAGTTATAGCTTGGATAAGTTTGAGTGATCAGATTTACTGTCTCAGTGTAGCTCTTTTCACTAATCTGGCAAATGGCTGTCCTTGTCCAATTGATGGTTAAAATTTGATTTTTATTATACTTTTTTTTTAAATAGCTTTAGTTTCATCATGTTTACTCTTTCGGTGGTTGAAACCTTTAAGGGTTTGGGCTTTCGTCCGTATAGTTACATGGAACGAAGTTCCGGTGACATGTTTGGTCCTCTTTGTCAGATTATTGTGTTTGTGTTGCATCTTAGTGCTTTGGAGTCGTCTTTTTCTGTCACTGGACCAGTGAGCATTCTCCATCGGGAAAGCCTTCAAAAGGAGTTTTTTTCTTCTTCTTTTTTTTTTTTTTGCTGTTTCGAACCTCGTTTCAGTCTTCGATCATTGGGCTACAGACTTTTTATGAGTTCTTAATTGGTTGTATTTGGCTGTAATGACTATTTGTTTTTGTTTCATTTGTGTTTTTATTGTATTTTGAGCATTAGTCTGTTTCCATTTTATCAGTAAAAAATCTGACTAATCAGTTGAGATATTTTTCAGGAATCAGAATGGCTTAGGGCTACCTTGCACAAATGGTTAGATGACGAGTACTGTCCAGAGGAAACTAATGTAGAGATAAGCAAGGTTGCAGCAAATTCATATTACAATTCTTTGTTAAAGAAGAGGACAGACCTAGGCGATATTTTGTTGAATATGGCAAGAGAATTGGAATCTATTTCCTATAAGGAAAGTTTTCATGGTGCATTCTCTTCTGCCAATGCAGCAGTGAACTTAATTGCTCAAAGAATAGA
GCTGTCTTAACTGTTTCCTCAACCACTTCAATTTCCAAAGTAACTAACTAATTTGTCTCCTTGACTTTTCTTTCTTACGAGTAACTCATCTGTTGTCAGATCTATATTTCTGGACAAACAATTCATGTATTGCTCTAATTCTTCACTGTCAAAGCAACTTTCAGCGACTAAGTATGGATGTTCCAAACACCAGATCTTGCACCATGCCTTTTGCCAGGTTAGCTCTAGGTAAAAGTAGAGTTTCTTCTATTGCATTTTATAGTTGTTTGACCTCAAATTATGTCTAAGAAAAGACGGCACTTCCAGAAGGGCTTTCAAGTTTGTGAGTCAAAGACAAAGCTATAATTTTAGAGGGAAGTGCTATAAAGCATAAGCTAGGGTGTCTATTTGGTCCTTTAATGATACTTGTGTAGGTTTTTCAAGTCCAGACAAAATGTACTACATATGGGAAGTTGGAGGAAGACTTGGATCATATTCTCTGAAGTCGTGAGTTTGCGAATTCCATTTGGGATTACTTCTTTTAGACATTTGGTTTCTTGCTAGCTCAACACAGGGACACCAGCATCATGATTGGATCATTCCTCCTCCATCCTCTCTTTTGTGGGAAGCGTCACTTTTTGTGGCCTGCCCAAATATGTGCTGTTTTACTGGTTCTTTGAGAGAGCATAATAACAAAGTGTTTAGAGGGTTGGAGAGGGATCCTAGTGATGTTTAGTCCTTGTCATATTCTCTGTCTCCTTTTGGGCATCAATTTTGAAGACCTTTTGTAACTATTTTATAGGAACTGTCTCGTATAGTTGGAGTCCCTTTCTTCAGCAAGCTTTCCCTTGTTTCATTAAAAAAAGCCAATCATTGGGGCTTTTGTAATTACAGCAGCTGACTTAGGTTGGAGTATTTTTTCCTGACCCTCCTAGGCAGAAGAGCTCTAATTGCTTTGGCCTTTCTTTCTCCAATGAAAGTTTGTTTCTTACATATATGTATATAGCTTCTGTTCTGAAATAATACAAATGAAGACAGAATATGAAGAACAAACGCTAGAGCAAGCATGCCAATTATACATGCTACTCATGCAATTCTGAAGTAAATCACTGGACGAACAAGGAGGACGATCAAATCTACATTCTACTCATGCAAGTGTGGAGCGAATCCCTGGATGAAAAAAGGGAGGAGGACGATACGAGCTTAACGAATCAATACGCAGGTATCAGCATTTTTACTTATTGTACAACACTACAACTAAGTATCAAAATCTAAGCTTGATTCAGACATCCTTGGAATTCTCCAGTGGCCTTAACAGTCTCATTTAGGAGGAAATTGTGATTATGTTCATGCTCTATAGTTGTTCTAGTGAGTTTATAAACATAATGAGGCTCCTTAATTTTTCTAATATGGGATTCTTAACATTATGCAATAACTTTCCTCGTAATCAATTTTTTTTAATTCAG
mRNA sequence
GCCTACTAAACCAAACCATAAAAAAGCCTACAGCGTGAGCTCAGCGATCACTATAATTCATACAATTTCACCGATAAATCATATCCACACGTCTCAATCGCCCTGCCATCAACCGTTTGTGAGCTTGGAAAATCACAAAGAATCAGCAAGAAATGGAGTCTCTGATTTTGATTCCAGCAAGTAAACACGTTCCAATCACAAATTCTTCTCTATTTTGCTCCGGTTTCTCCATTCCATTCACTTCTAGAAAGCGCGCCATCAATTTCCAACCATGTTGTCCACAGATTCTTCATTGCAATGGCCTTCGACATGCGCAAGCAACTCCGAGGTCTTATATTGGAAATGGACCGCCGTATGGTGAGGAAGATGATGATCCTGAACTACAGGTTGAAAGCCTCAGGGTTCCTGATGATTGGTCGGTTCCTTCCAAGGCATTGGAGGAATCAGAATGGCTTAGGGCTACCTTGCACAAATGGTTAGATGACGAGTACTGTCCAGAGGAAACTAATGTAGAGATAAGCAAGGTTGCAGCAAATTCATATTACAATTCTTTGTTAAAGAAGAGGACAGACCTAGGCGATATTTTGTTGAATATGGCAAGAGAATTGGAATCTATTTCCTATAAGGAAAGTTTTCATGGTGCATTCTCTTCTGCCAATGCAGCAGTGAACTTAATTGCTCAAAGAATAGAGCTGTCTTAACTGTTTCCTCAACCACTTCAATTTCCAAACGACTAAGTATGGATGTTCCAAACACCAGATCTTGCACCATGCCTTTTGCCAGACAGAATATGAAGAACAAACGCTAGAGCAAGCATGCCAATTATACATGCTACTCATGCAATTCTGAAGTAAATCACTGGACGAACAAGGAGGACGATCAAATCTACATTCTACTCATGCAAGTGTGGAGCGAATCCCTGGATGAAAAAAGGGAGGAGGACGATACGAGCTTAACGAATCAATACGCAGGTATCAGCATTTTTACTTATTGTACAACACTACAACTAAGTATCAAAATCTAAGCTTGATTCAGACATCCTTGGAATTCTCCAGTGGCCTTAACAGTCTCATTTAGGAGGAAATTGTGATTATGTTCATGCTCTATAGTTGTTCTAGTGAGTTTATAAACATAATGAGGCTCCTTAATTTTTCTAATATGGGATTCTTAACATTATGCAATAACTTTCCTCGTAATCAATTTTTTTTAATTCAG
Coding sequence (CDS)
ATGGAGTCTCTGATTTTGATTCCAGCAAGTAAACACGTTCCAATCACAAATTCTTCTCTATTTTGCTCCGGTTTCTCCATTCCATTCACTTCTAGAAAGCGCGCCATCAATTTCCAACCATGTTGTCCACAGATTCTTCATTGCAATGGCCTTCGACATGCGCAAGCAACTCCGAGGTCTTATATTGGAAATGGACCGCCGTATGGTGAGGAAGATGATGATCCTGAACTACAGGTTGAAAGCCTCAGGGTTCCTGATGATTGGTCGGTTCCTTCCAAGGCATTGGAGGAATCAGAATGGCTTAGGGCTACCTTGCACAAATGGTTAGATGACGAGTACTGTCCAGAGGAAACTAATGTAGAGATAAGCAAGGTTGCAGCAAATTCATATTACAATTCTTTGTTAAAGAAGAGGACAGACCTAGGCGATATTTTGTTGAATATGGCAAGAGAATTGGAATCTATTTCCTATAAGGAAAGTTTTCATGGTGCATTCTCTTCTGCCAATGCAGCAGTGAACTTAATTGCTCAAAGAATAGAGCTGTCTTAA
Protein sequence
MESLILIPASKHVPITNSSLFCSGFSIPFTSRKRAINFQPCCPQILHCNGLRHAQATPRSYIGNGPPYGEEDDDPELQVESLRVPDDWSVPSKALEESEWLRATLHKWLDDEYCPEETNVEISKVAANSYYNSLLKKRTDLGDILLNMARELESISYKESFHGAFSSANAAVNLIAQRIELS
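The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first 30 and last 12 nucleotides of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAGTCTCTGATTTTGATTCCAGCAAGT"  # first 30 nt of the CDS above
cds_end = "GAGCTGTCTTAA"                      # last 12 nt of the CDS above

print(translate(cds_start))  # MESLILIPAS
print(translate(cds_end))    # ELS
```

The two outputs match the first ten and last three residues of the protein sequence; the final TAA codon is a stop and is not translated.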
Homology
BLAST of Bhi09G000804 vs. TAIR 10
Match:
AT4G03150.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 168.7 bits (426), Expect = 4.3e-42
Identity = 84/155 (54.19%), Positives = 113/155 (72.90%), Query Frame = 0
Query: 26 SIPFTSRKRAINFQPCCPQILHCNGLRHAQATPRSYIGNGPPYGEEDDDPELQVESLRVP 85
S P T R+R+++F L+ +Q++ E ++D E V++L +P
Sbjct: 37 SFPATIRRRSVSFPITSAPKFPSLKLQKSQSS----------IHEGEEDSESAVQALTIP 96
Query: 86 DDWSVPSKALEESEWLRATLHKWLDDEYCPEETNVEISKVAANSYYNSLLKKRTDLGDIL 145
++W +PS+A+EESEWLR TLHKWLDDEYCPE TNVEIS+VAA SYY+SLL+K +D+G+IL
Sbjct: 97 EEWLLPSRAIEESEWLRVTLHKWLDDEYCPEPTNVEISEVAAKSYYSSLLEKESDMGEIL 156
Query: 146 LNMARELESISYKESFHGAFSSANAAVNLIAQRIE 181
L MA++L SISY+ESFHGAF+SANAA+NLI RIE
Sbjct: 157 LKMAQDLTSISYQESFHGAFTSANAAINLIVDRIE 181
BLAST of Bhi09G000804 vs. ExPASy TrEMBL
Match:
A0A6J1D9Y8 (uncharacterized protein LOC111018405 OS=Momordica charantia OX=3673 GN=LOC111018405 PE=4 SV=1)
HSP 1 Score: 319.7 bits (818), Expect = 7.9e-84
Identity = 155/182 (85.16%), Positives = 167/182 (91.76%), Query Frame = 0
Query: 1 MESLILIPASKHVPITNSSLFCSGFSIPFTSRKRAINFQPCCPQILHCNGLRHAQATPRS 60
MESLILIPAS+HV ++NSS FCS F +PFT+ KRAINF+ CPQILH NG+RHAQ TPRS
Sbjct: 1 MESLILIPASRHVSVSNSSGFCSDFYVPFTTGKRAINFRASCPQILHWNGIRHAQTTPRS 60
Query: 61 YIGNGPPYGEEDDDPELQVESLRVPDDWSVPSKALEESEWLRATLHKWLDDEYCPEETNV 120
Y GPPYGEEDDDPE+QVESLRVPDDWSVP+KALEESEWLR TLHKWLDDEYCPEETNV
Sbjct: 61 YTRYGPPYGEEDDDPEVQVESLRVPDDWSVPTKALEESEWLRVTLHKWLDDEYCPEETNV 120
Query: 121 EISKVAANSYYNSLLKKRTDLGDILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE 180
EIS+VAANSYYNSLL+KRTDLGDILL MARELESISYKESFHGAF SANAAVNLIAQ+IE
Sbjct: 121 EISRVAANSYYNSLLEKRTDLGDILLKMARELESISYKESFHGAFXSANAAVNLIAQKIE 180
Query: 181 LS 183
LS
Sbjct: 181 LS 182
BLAST of Bhi09G000804 vs. ExPASy TrEMBL
Match:
A0A1S3BS97 (uncharacterized protein LOC103492727 OS=Cucumis melo OX=3656 GN=LOC103492727 PE=4 SV=1)
HSP 1 Score: 313.5 bits (802), Expect = 5.7e-82
Identity = 158/182 (86.81%), Positives = 162/182 (89.01%), Query Frame = 0
Query: 1 MESLILIPASKHVPITNSSLFCSGFSIPFTSRKRAINFQPCCPQILHCNGLRHAQATPRS 60
MESLILIP SKHV ITNSS F FS F+SRKRAINF CPQ LH NG HAQ T RS
Sbjct: 1 MESLILIPPSKHVSITNSSAFSFHFSSQFSSRKRAINFPQFCPQTLHSNGRWHAQTTSRS 60
Query: 61 YIGNGPPYGEEDDDPELQVESLRVPDDWSVPSKALEESEWLRATLHKWLDDEYCPEETNV 120
YI NGPPYGEEDDDPEL+VESLRVPD+WSVPSKALEESEWLR TLHKWLDDEYCPEETNV
Sbjct: 61 YIRNGPPYGEEDDDPELEVESLRVPDEWSVPSKALEESEWLRVTLHKWLDDEYCPEETNV 120
Query: 121 EISKVAANSYYNSLLKKRTDLGDILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE 180
EISKVAANSYYNSLLKK TDLG+ILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE
Sbjct: 121 EISKVAANSYYNSLLKKTTDLGEILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE 180
Query: 181 LS 183
LS
Sbjct: 181 LS 182
BLAST of Bhi09G000804 vs. ExPASy TrEMBL
Match:
A0A5D3D403 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold411G002040 PE=4 SV=1)
HSP 1 Score: 313.5 bits (802), Expect = 5.7e-82
Identity = 158/182 (86.81%), Positives = 162/182 (89.01%), Query Frame = 0
Query: 1 MESLILIPASKHVPITNSSLFCSGFSIPFTSRKRAINFQPCCPQILHCNGLRHAQATPRS 60
MESLILIP SKHV ITNSS F FS F+SRKRAINF CPQ LH NG HAQ T RS
Sbjct: 1 MESLILIPPSKHVSITNSSAFSFHFSSQFSSRKRAINFPQFCPQTLHSNGRWHAQTTSRS 60
Query: 61 YIGNGPPYGEEDDDPELQVESLRVPDDWSVPSKALEESEWLRATLHKWLDDEYCPEETNV 120
YI NGPPYGEEDDDPEL+VESLRVPD+WSVPSKALEESEWLR TLHKWLDDEYCPEETNV
Sbjct: 61 YIRNGPPYGEEDDDPELEVESLRVPDEWSVPSKALEESEWLRVTLHKWLDDEYCPEETNV 120
Query: 121 EISKVAANSYYNSLLKKRTDLGDILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE 180
EISKVAANSYYNSLLKK TDLG+ILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE
Sbjct: 121 EISKVAANSYYNSLLKKTTDLGEILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE 180
Query: 181 LS 183
LS
Sbjct: 181 LS 182
BLAST of Bhi09G000804 vs. ExPASy TrEMBL
Match:
A0A0A0K5Q9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G390120 PE=4 SV=1)
HSP 1 Score: 305.8 bits (782), Expect = 1.2e-79
Identity = 153/182 (84.07%), Positives = 161/182 (88.46%), Query Frame = 0
Query: 1 MESLILIPASKHVPITNSSLFCSGFSIPFTSRKRAINFQPCCPQILHCNGLRHAQATPRS 60
MESLILIP +KHV ITNSS+ FS PF SRKRAI F CPQI+HCN HAQ T RS
Sbjct: 1 MESLILIPPTKHVSITNSSV----FSFPFISRKRAIKFPQFCPQIIHCNARLHAQTTSRS 60
Query: 61 YIGNGPPYGEEDDDPELQVESLRVPDDWSVPSKALEESEWLRATLHKWLDDEYCPEETNV 120
YI NGPPYGEEDDDPEL+VESLRVPD+WSVPSKALEESEWLR TLHKWLD+EYCPEETNV
Sbjct: 61 YIRNGPPYGEEDDDPELEVESLRVPDEWSVPSKALEESEWLRVTLHKWLDEEYCPEETNV 120
Query: 121 EISKVAANSYYNSLLKKRTDLGDILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE 180
+ISKVAA SYYNSLLKK TDLG+ILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE
Sbjct: 121 DISKVAAKSYYNSLLKKTTDLGEILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE 178
Query: 181 LS 183
LS
Sbjct: 181 LS 178
BLAST of Bhi09G000804 vs. ExPASy TrEMBL
Match:
A0A6J1H695 (uncharacterized protein LOC111460840 OS=Cucurbita moschata OX=3662 GN=LOC111460840 PE=4 SV=1)
HSP 1 Score: 291.6 bits (745), Expect = 2.3e-75
Identity = 147/182 (80.77%), Positives = 160/182 (87.91%), Query Frame = 0
Query: 1 MESLILIPASKHVPITNSSLFCSGFSIPFTSRKRAINFQPCCPQILHCNGLRHAQATPRS 60
MESLIL+P+S+HV ITNSS F S F + FTSRKRA+NF+ PQIL CN + + TPRS
Sbjct: 1 MESLILVPSSRHVSITNSSAFSSDFKVQFTSRKRAMNFRASSPQILPCNVVLE-RLTPRS 60
Query: 61 YIGNGPPYGEEDDDPELQVESLRVPDDWSVPSKALEESEWLRATLHKWLDDEYCPEETNV 120
YI GPPYG++DDDPE QVESLRVPD WSVPSKALEESEWLR TLHKWLD+EYCPEETNV
Sbjct: 61 YIRYGPPYGDQDDDPEEQVESLRVPDAWSVPSKALEESEWLRVTLHKWLDEEYCPEETNV 120
Query: 121 EISKVAANSYYNSLLKKRTDLGDILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE 180
EISKVAA SYYNSLL+KRT+L DILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE
Sbjct: 121 EISKVAAKSYYNSLLEKRTELADILLNMARELESISYKESFHGAFSSANAAVNLIAQRIE 180
Query: 181 LS 183
LS
Sbjct: 181 LS 181
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
AT4G03150.1 | 4.3e-42 | 54.19 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
Match Name | E-value | Identity | Description
A0A6J1D9Y8 | 7.9e-84 | 85.16 | uncharacterized protein LOC111018405 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A1S3BS97 | 5.7e-82 | 86.81 | uncharacterized protein LOC103492727 OS=Cucumis melo OX=3656 GN=LOC103492727 PE=...
A0A5D3D403 | 5.7e-82 | 86.81 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0K5Q9 | 1.2e-79 | 84.07 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G390120 PE=4 SV=1
A0A6J1H695 | 2.3e-75 | 80.77 | uncharacterized protein LOC111460840 OS=Cucurbita moschata OX=3662 GN=LOC1114608...