Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGAAAACAATAATAGAAATGCCCCTCCACCGCAAGCTGCCCAAGAACCAAACGCCGCCTACATGGCACATGACTTGGACAGACCGATTAGATCATATGCGGCACCCAACCTCTACAACTTCAACCTAGGGATCAAGCCTGTAATGGTTCAAATGATTCAGAACGCCGGACAATTTGGCGGTCACCCTGGAGAAGATCCACACGAGCACATTAGGAGTTTCTACTTTATCTGTGCTTCCTTCCATATGCCAGGCATTTCACCTAAAGAACTGAGATTCGCACTCTTCCCATTAACTCTAAGGGACGAGGCGAAGAGGTGGGCCAACGCCTTGGAGGATGGCGAGGTGGGAACATGGGATCAATTGATAGAGAAATTTATGAAGAAATTATTCCCACCTCACGAAAATGCCAGAAGAAGGAAGGAGCTCATGAGCTTCCAGCAGAAAGATAAAGAGAACTATATGATGCGTGGAGTAGGTTCAAGAGGATGGTGAAAGCATGCCCCACAATGGCATTCTCGAATGCATATTGATGGAGGTCTTTTATTTTGGTTTGAACAAGGCTACACAACAGACTGCTGATGCTGTGTTTGTAGGTGGTATGTTAAAAAGCTCCTACAACCAGATTAAGGCGACGTTGGATACAATGCTAGCAATAACGAAGAATGGGATGAAGATGATTTCGGCAATCACCGAGGAGGACGAGGAAGAAGCGATGAAGGTCTGGATAAGAACGTCGTGGTGGCGTTGTAGGGACAAATGA
mRNA sequence
ATGGAAAACAATAATAGAAATGCCCCTCCACCGCAAGCTGCCCAAGAACCAAACGCCGCCTACATGGCACATGACTTGGACAGACCGATTAGATCATATGCGGCACCCAACCTCTACAACTTCAACCTAGGGATCAAGCCTGTAATGGTTCAAATGATTCAGAACGCCGGACAATTTGGCGGTCACCCTGGAGAAGATCCACACGAGCACATTAGGAGTTTCTACTTTATCTGTGCTTCCTTCCATATGCCAGGCATTTCACCTAAAGAACTGAGATTCGCACTCTTCCCATTAACTCTAAGGGACGAGGCGAAGAGGTGGGCCAACGCCTTGGAGGATGGCGAGGTGGGAACATGGGATCAATTGATAGAGAAATTTATGAAGAAATTATTCCCACCTCACGAAAATGCCAGAAGAAGGAAGGAGCTCATGAGCTTCCAGCAGAAAGATAAAGAGAACTATATGATGCGTGGAGCTACACAACAGACTGCTGATGCTGTGTTTGTAGGTGGTATGTTAAAAAGCTCCTACAACCAGATTAAGGCGACGTTGGATACAATGCTAGCAATAACGAAGAATGGGATGAAGATGATTTCGGCAATCACCGAGGAGGACGAGGAAGAAGCGATGAAGGTCTGGATAAGAACGTCGTGGTGGCGTTGTAGGGACAAATGA
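The gene sequence above is longer than the mRNA sequence because it still contains the intron. As a minimal sketch, assuming the gene differs from the mRNA by exactly one contiguous insertion, the intron can be located by matching the shared prefix and suffix of the two strings. The function name `locate_single_intron` and the toy sequences are illustrative and not part of this page.

```python
def locate_single_intron(gene: str, mrna: str) -> tuple[int, int]:
    """Return 0-based half-open (start, end) of the one segment of `gene`
    that is missing from `mrna`. Assumes a single intron and no other
    differences between the two sequences."""
    if len(gene) < len(mrna):
        raise ValueError("gene sequence is shorter than mRNA sequence")

    # Length of the common prefix.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1

    # Length of the common suffix, capped so it cannot overlap the prefix.
    max_suffix = len(mrna) - prefix
    suffix = 0
    while suffix < max_suffix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1

    if prefix + suffix != len(mrna):
        raise ValueError("difference is not explained by a single intron")
    return prefix, len(gene) - suffix


# Toy example (not the sequences on this page).
gene = "ATGGCC" + "GTAAGTTTCAG" + "GATTGA"
mrna = "ATGGCC" + "GATTGA"
start, end = locate_single_intron(gene, mrna)
print(start, end, gene[start:end])
```

When the base that follows the intron equals the intron's first base, the reported boundaries can be shifted by a position or two relative to the true GT...AG splice sites. Removing the reported segment still reproduces the mRNA exactly.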
Coding sequence (CDS)
ATGGAAAACAATAATAGAAATGCCCCTCCACCGCAAGCTGCCCAAGAACCAAACGCCGCCTACATGGCACATGACTTGGACAGACCGATTAGATCATATGCGGCACCCAACCTCTACAACTTCAACCTAGGGATCAAGCCTGTAATGGTTCAAATGATTCAGAACGCCGGACAATTTGGCGGTCACCCTGGAGAAGATCCACACGAGCACATTAGGAGTTTCTACTTTATCTGTGCTTCCTTCCATATGCCAGGCATTTCACCTAAAGAACTGAGATTCGCACTCTTCCCATTAACTCTAAGGGACGAGGCGAAGAGGTGGGCCAACGCCTTGGAGGATGGCGAGGTGGGAACATGGGATCAATTGATAGAGAAATTTATGAAGAAATTATTCCCACCTCACGAAAATGCCAGAAGAAGGAAGGAGCTCATGAGCTTCCAGCAGAAAGATAAAGAGAACTATATGATGCGTGGAGCTACACAACAGACTGCTGATGCTGTGTTTGTAGGTGGTATGTTAAAAAGCTCCTACAACCAGATTAAGGCGACGTTGGATACAATGCTAGCAATAACGAAGAATGGGATGAAGATGATTTCGGCAATCACCGAGGAGGACGAGGAAGAAGCGATGAAGGTCTGGATAAGAACGTCGTGGTGGCGTTGTAGGGACAAATGA
Protein sequence
MENNNRNAPPPQAAQEPNAAYMAHDLDRPIRSYAAPNLYNFNLGIKPVMVQMIQNAGQFGGHPGEDPHEHIRSFYFICASFHMPGISPKELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKLFPPHENARRRKELMSFQQKDKENYMMRGATQQTADAVFVGGMLKSSYNQIKATLDTMLAITKNGMKMISAITEEDEEEAMKVWIRTSWWRCRDK
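The protein sequence corresponds to the coding sequence read codon by codon up to the stop codon. The sketch below assumes the standard nuclear genetic code and reading frame 0. The `translate` helper is illustrative and is not the tool that generated this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons and last 5 codons of the CDS listed above.
print(translate("ATGGAAAACAATAATAGAAATGCCCCTCCACCGCAA"))
print(translate("TGTAGGGACAAATGA"))
```

Passing the full CDS string to `translate` should give a result that can be compared directly with the protein sequence listed above.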
Homology
BLAST of PI0017666 vs. ExPASy TrEMBL
Match:
U5CUI2 (Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMTR_s04947p00003620 PE=4 SV=1)
HSP 1 Score: 133.7 bits (335), Expect = 9.9e-28
Identity = 71/153 (46.41%), Positives = 87/153 (56.86%), Query Frame = 0
Query: 13 AAQEPNAAYMAHDLDRPIRSYAAPNLYNFNLGI------------KPVMVQMIQNAGQFG 72
A Q N +A D R IR YAAP N GI KPVM QM+Q GQF
Sbjct: 6 AQQIVNPIILADDRARAIREYAAPMFNELNPGIVRPEIQAPQFELKPVMFQMLQTVGQFS 65
Query: 73 GHPGEDPHEHIRSFYFICASFHMPGISPKELRFALFPLTLRDEAKRWANALEDGEVGTWD 132
G P EDPH H+RSF + SF + G+S + LR LFP +LRD A+ W N L V W+
Sbjct: 66 GMPTEDPHLHLRSFLEVSDSFKIQGVSEEVLRLKLFPFSLRDRARSWLNTLPPDSVTNWN 125
Query: 133 QLIEKFMKKLFPPHENARRRKELMSFQQKDKEN 154
L EKF++K FPP NA+ R E+MSFQQ + E+
Sbjct: 126 DLAEKFLRKYFPPTRNAKFRSEIMSFQQLEDES 158
BLAST of PI0017666 vs. ExPASy TrEMBL
Match:
A0A5A7V2U7 (Putative disease resistance RPP13-like protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold313G001950 PE=4 SV=1)
HSP 1 Score: 123.2 bits (308), Expect = 1.3e-24
Identity = 64/108 (59.26%), Positives = 75/108 (69.44%), Query Frame = 0
Query: 4 NNRNAPPPQAAQEPNAAYMAHDLDRPIRSYAAPNLYNFNLG-------------IKPVMV 63
NN+ P QAAQ N Y+A D DRPIRSYA+PNLY+FN G IK VM+
Sbjct: 5 NNQGIPANQAAQGANPTYLADDRDRPIRSYASPNLYDFNPGIAYPTFSENTRFEIKLVML 64
Query: 64 QMIQNAGQFGGHPGEDPHEHIRSFYFICASFHMPGISPKELRFALFPL 99
Q+IQNAGQF HP +DPH++IR+FY ICASFHMPGIS +ELRF F L
Sbjct: 65 QIIQNAGQFRRHPRKDPHDYIRNFYSICASFHMPGISVEELRFTPFRL 112
BLAST of PI0017666 vs. ExPASy TrEMBL
Match:
A0A6J1H7E4 (uncharacterized protein LOC111461168 OS=Cucurbita moschata OX=3662 GN=LOC111461168 PE=4 SV=1)
HSP 1 Score: 122.9 bits (307), Expect = 1.7e-24
Identity = 81/228 (35.53%), Positives = 107/228 (46.93%), Query Frame = 0
Query: 3 NNNRNAPPPQAAQE---PNAAYMAHDLDRPIRSYAAPNLYNFN------------LGIKP 62
N P A QE NA +A D +R IR+YA P + N +KP
Sbjct: 13 NQEFENPVMMANQERIIANAIQLADDRERAIRAYAHPAVDELNPCIIRPEMQATTFELKP 72
Query: 63 VMVQMIQNAGQFGGHPGEDPHEHIRSFYFICASFHMPGISPKELRFALFPLTLRDEAKRW 122
VM QM+Q GQF G P EDPH H++SF + SF G+ +R +LFP +LRD AK W
Sbjct: 73 VMFQMLQTIGQFHGLPSEDPHLHLKSFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSW 132
Query: 123 ANALEDGEVGTWDQLIEKFMKKLFPPHENARRRKELMSFQQKDKENY---------MMRG 182
N L + +W+ L EKF+ K FPP NAR R E+++FQQ + E M+R
Sbjct: 133 LNTLAPRTIDSWNSLAEKFLIKYFPPTRNARFRNEIVAFQQFEDETLSEAWERFKEMLRK 192
Query: 183 ---------------------ATQQTADAVFVGGMLKSSYNQIKATLD 186
AT+Q DA G ML +YN+ L+
Sbjct: 193 CPHHGLPHCIQMETFYNGLNIATKQVVDASANGAMLSKTYNEAYEILE 240
BLAST of PI0017666 vs. ExPASy TrEMBL
Match:
A0A6J1DW02 (uncharacterized protein LOC111024897 OS=Momordica charantia OX=3673 GN=LOC111024897 PE=4 SV=1)
HSP 1 Score: 122.5 bits (306), Expect = 2.3e-24
Identity = 62/149 (41.61%), Positives = 86/149 (57.72%), Query Frame = 0
Query: 16 EPNAAYMAHDLDRPIRSYAAPNLYNFNLGI------------KPVMVQMIQNAGQFGGHP 75
E N MA + D +R YAA NF+ GI KP+M QM+Q G FGG
Sbjct: 80 EFNYIQMADNRDVAMREYAATAFQNFDSGIVNPIPAHXNFELKPMMFQMLQTIGHFGGQE 139
Query: 76 GEDPHEHIRSFYFICASFHMPGISPKELRFALFPLTLRDEAKRWANALEDGEVGTWDQLI 135
EDPH+H++SF I +F +PGI+ LFP +L+D+A+ NA G + TW L+
Sbjct: 140 HEDPHDHLKSFIQIANAFRLPGITDDAXXLTLFPFSLKDQARXXLNAFPXGSITTWGSLV 199
Query: 136 EKFMKKLFPPHENARRRKELMSFQQKDKE 153
EKF+ K FPP +A R+E++SF+Q D+E
Sbjct: 200 EKFLTKFFPPTRHADIREEIISFRQYDRE 228
BLAST of PI0017666 vs. ExPASy TrEMBL
Match:
A0A6J1DY39 (uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025653 PE=4 SV=1)
HSP 1 Score: 122.1 bits (305), Expect = 3.0e-24
Identity = 59/148 (39.86%), Positives = 85/148 (57.43%), Query Frame = 0
Query: 17 PNAAYMAHDLDRPIRSYAAPNLYNFNLGI------------KPVMVQMIQNAGQFGGHPG 76
PN ++A DR +R YAA L + N + KP+M+QM+ N GQFGG
Sbjct: 7 PNPIHVADQRDRAMRDYAAXILEDLNSSVINSFPADAKFEFKPMMLQMLNNIGQFGGLEH 66
Query: 77 EDPHEHIRSFYFICASFHMPGISPKELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIE 136
EDP H++SF + +F +PGIS LR LFP ++ +A W NA + TW +++
Sbjct: 67 EDPRSHLKSFIKVANTFRLPGISDDALRLTLFPFSVSGQATAWLNAFPSDTITTWSDMVD 126
Query: 137 KFMKKLFPPHENARRRKELMSFQQKDKE 153
KF+ K FPP NA R+E++SF+QK+ E
Sbjct: 127 KFLVKYFPPTRNADVREEIISFRQKENE 154
BLAST of PI0017666 vs. NCBI nr
Match:
WP_217833177.1 (retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002])
HSP 1 Score: 169.9 bits (429), Expect = 2.6e-38
Identity = 90/224 (40.18%), Positives = 126/224 (56.25%), Query Frame = 0
Query: 5 NRNAPPPQAAQEPNAAYMAHDLDRPIRSYAAPNLYNFNLGI-------------KPVMVQ 64
N NA A + N +AHD +RP+R YA+PNLYNF GI KPVM+Q
Sbjct: 57 NANA-QEHANYDTNPIVLAHDRNRPMREYASPNLYNFAPGILQPTFEGNGRFEMKPVMLQ 116
Query: 65 MIQNAGQFGGHPGEDPHEHIRSFYFICASFHMPGISPKELRFALFPLTLRDEAKRWANAL 124
M+Q AGQFGG GEDPH H++SF IC++F M G+ +R LFP +LRDEA++WA +
Sbjct: 117 MLQAAGQFGGATGEDPHAHLKSFLEICSAFPMAGVPQDSIRLTLFPFSLRDEARQWAYSF 176
Query: 125 EDGEVGTWDQLIEKFMKKLFPPHENARRRKELMSFQQKDKENYM---------------- 184
E GE+ TW +++EKFM+K FPP +A+RR+++++F+QKD E +
Sbjct: 177 EPGEITTWGKMVEKFMQKYFPPTVSAKRRRDIVTFEQKDSETFSEAWTRFKRLVRNCPHN 236
Query: 185 --------------MRGATQQTADAVFVGGMLKSSYNQIKATLD 186
+ +Q ADA GG++ +Y Q K LD
Sbjct: 237 GIPNCVQMEIFYGGLNKTSQSMADASAAGGLMDKTYTQAKEILD 279
BLAST of PI0017666 vs. NCBI nr
Match:
XP_038880527.1 (uncharacterized protein LOC120072192 [Benincasa hispida])
HSP 1 Score: 152.5 bits (384), Expect = 4.2e-33
Identity = 84/219 (38.36%), Positives = 125/219 (57.08%), Query Frame = 0
Query: 22 MAHDLDRPIRSYAAPNLYNFNLGI------------KPVMVQMIQNAGQFGGHPGEDPHE 81
MA++ RP+R YA+P LY+F+ GI K VM+QM+Q A QFGG GEDPH
Sbjct: 1 MANNSTRPMREYASPVLYDFSPGIIYPMPDGTRFEMKSVMLQMLQTARQFGGSHGEDPHA 60
Query: 82 HIRSFYFICASFHMPGISPKELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKK 141
H++ F C F +P I+P+++R +LFP +LRD+AK+W ++LE E+ TW++L+EKFM+K
Sbjct: 61 HMKRFLETCNFFVIPRITPEKIRLSLFPNSLRDDAKQWVSSLEPEEITTWEKLVEKFMQK 120
Query: 142 LFPPHENARRRKELMSFQQKDKENYM------------------------------MRGA 199
FPP NARRR+E+M+F+Q+D E + A
Sbjct: 121 YFPPTTNARRRREIMNFEQEDIETLSVACERFNGLVKNCPNHALLLNIQMETFYGGLNRA 180
BLAST of PI0017666 vs. NCBI nr
Match:
XP_038887458.1 (uncharacterized protein LOC120077591 [Benincasa hispida])
HSP 1 Score: 147.5 bits (371), Expect = 1.4e-31
Identity = 80/167 (47.90%), Positives = 107/167 (64.07%), Query Frame = 0
Query: 2 ENNNRNAPP-PQAAQEP--NAAYMAHDLDRPIRSYAAPNLYNFNLG-------------I 61
+NNN AP Q P + ++A D + PIR+YAAPNLY+F+ G I
Sbjct: 37 DNNNNEAPEGNQGVNRPVQDPVFLAADHNIPIRNYAAPNLYDFSPGISRPIVEENARFEI 96
Query: 62 KPVMVQMIQNAGQFGGHPGEDPHEHIRSFYFICASFHMPGISPKELRFALFPLTLRDEAK 121
KPVMVQMIQN QF E+PH H+ F +C++F +PGI+P +R LFP TLRD+AK
Sbjct: 97 KPVMVQMIQNMRQFESLQCENPHAHLTIFVEMCSTFSIPGITPVGIRLYLFPYTLRDKAK 156
Query: 122 RWANALEDGEVGTWDQLIEKFMKKLFPPHENARRRKELMSFQQKDKE 153
RWA++LE E+ + DQL+E FMKK FPP N RRRK +++F++ D E
Sbjct: 157 RWAHSLEANEITSSDQLVEWFMKKFFPPAINTRRRKNVLNFEKMDNE 203
BLAST of PI0017666 vs. NCBI nr
Match:
XP_017233063.1 (PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus])
HSP 1 Score: 139.4 bits (350), Expect = 3.7e-29
Identity = 88/235 (37.45%), Positives = 111/235 (47.23%), Query Frame = 0
Query: 1 MENNNRNAPPPQAAQEPNAAYMAHDLDRPIRSYAAPNLYNFNLGI------------KPV 60
M++N N P P A++ D DR IR YAAP N GI KPV
Sbjct: 36 MDDNVNNGDIPIV---PRGAFIVDDKDRAIRQYAAPRFEELNSGIIRPNIQATQFELKPV 95
Query: 61 MVQMIQNAGQFGGHPGEDPHEHIRSFYFICASFHMPGISPKELRFALFPLTLRDEAKRWA 120
M QM+Q GQF G P EDPH H+R F I SF G+ LR LFP ++RD A+ W
Sbjct: 96 MFQMLQTIGQFSGMPTEDPHLHLRLFMEISDSFKFQGVPEDALRLKLFPYSVRDRARTWL 155
Query: 121 NALEDGEVGTWDQLIEKFMKKLFPPHENARRRKELMSFQQKDKENYM------------- 180
N+L G V TW+ L EKF+ K FPP+ NA+ R E+ SFQQ+D E+
Sbjct: 156 NSLPAGSVTTWNDLTEKFLSKYFPPNMNAKLRNEINSFQQQDDESLYDAWERFKELLRKC 215
Query: 181 -----------------MRGATQQTADAVFVGGMLKSSYNQIKATLDTMLAITKN 194
+ T+ DA G +L SYNQ L+T+ TKN
Sbjct: 216 PHHGILHCIQMETFYNGLNAQTKMVVDASANGALLSKSYNQAYEILETI--ATKN 265
BLAST of PI0017666 vs. NCBI nr
Match:
XP_030497803.1 (uncharacterized protein LOC115713460 [Cannabis sativa])
HSP 1 Score: 139.4 bits (350), Expect = 3.7e-29
Identity = 73/152 (48.03%), Positives = 87/152 (57.24%), Query Frame = 0
Query: 13 AAQEPNAAYMAHDLDRPIRSYAAPNLYNFNLGI------------KPVMVQMIQNAGQFG 72
A E N +A D R IR YAAP N GI KPVM QM+Q GQFG
Sbjct: 10 AHNEANPIALADDRTRAIREYAAPMFNELNPGIVRPEIQAPHFELKPVMFQMLQTVGQFG 69
Query: 73 GHPGEDPHEHIRSFYFICASFHMPGISPKELRFALFPLTLRDEAKRWANALEDGEVGTWD 132
G P EDPH HIRSF + SF + G+S + LR LFP +LRD A+ W N L V W+
Sbjct: 70 GSPTEDPHLHIRSFLEVSDSFKLQGVSEEALRLKLFPFSLRDRARAWLNTLPPDSVTNWN 129
Query: 133 QLIEKFMKKLFPPHENARRRKELMSFQQKDKE 153
L EKF++K FPP NA+ R E+MSFQQ + E
Sbjct: 130 DLAEKFLRKYFPPTRNAKFRSEIMSFQQSEDE 161
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
U5CUI2 | 9.9e-28 | 46.41 | Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMT...
A0A5A7V2U7 | 1.3e-24 | 59.26 | Putative disease resistance RPP13-like protein 1 OS=Cucumis melo var. makuwa OX=...
A0A6J1H7E4 | 1.7e-24 | 35.53 | uncharacterized protein LOC111461168 OS=Cucurbita moschata OX=3662 GN=LOC1114611...
A0A6J1DW02 | 2.3e-24 | 41.61 | uncharacterized protein LOC111024897 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J1DY39 | 3.0e-24 | 39.86 | uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025...
NCBI nr
Match Name | E-value | Identity (%) | Description
WP_217833177.1 | 2.6e-38 | 40.18 | retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70...
XP_038880527.1 | 4.2e-33 | 38.36 | uncharacterized protein LOC120072192 [Benincasa hispida]
XP_038887458.1 | 1.4e-31 | 47.90 | uncharacterized protein LOC120077591 [Benincasa hispida]
XP_017233063.1 | 3.7e-29 | 37.45 | PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]
XP_030497803.1 | 3.7e-29 | 48.03 | uncharacterized protein LOC115713460 [Cannabis sativa]