Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDS, exon, polypeptide
ATGCCCAAGGGGAGAACTGCAGGCTTGCACCCTTTTGATCCTGAGATTGAAAGGACCTATAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAGATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCAGCCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGTTAATAACTTTGAACTCAAAACAGGGCTAATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCCTAGCCACAGAAGATCCTAACTTTCATCTTAAAATTTTTTTGGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCTAATAGTATTCGCTTACGTCTCTTTCCTTTTTCCTTGCAGGGTAGAGCCAAAGAATAG
mRNA sequence
ATGCCCAAGGGGAGAACTGCAGGCTTGCACCCTTTTGATCCTGAGATTGAAAGGACCTATAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAGATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCAGCCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGTTAATAACTTTGAACTCAAAACAGGGCTAATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCCTAGCCACAGAAGATCCTAACTTTCATCTTAAAATTTTTTTGGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCTAATAGTATTCGCTTACGTCTCTTTCCTTTTTCCTTGCAGGGTAGAGCCAAAGAATAG
Coding sequence (CDS)
ATGCCCAAGGGGAGAACTGCAGGCTTGCACCCTTTTGATCCTGAGATTGAAAGGACCTATAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAGATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCAGCCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGTTAATAACTTTGAACTCAAAACAGGGCTAATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCCTAGCCACAGAAGATCCTAACTTTCATCTTAAAATTTTTTTGGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCTAATAGTATTCGCTTACGTCTCTTTCCTTTTTCCTTGCAGGGTAGAGCCAAAGAATAG
Protein sequence
MPKGRTAGLHPFDPEIERTYRRLLREGRAEPQEMADQELINNPKPIREYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGISANSIRLRLFPFSLQGRAKE
Homology
BLAST of Tan0009582 vs. NCBI nr
Match:
WP_217833153.1 (retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002])
HSP 1 Score: 156.8 bits (395), Expect = 1.3e-34
Identity = 81/133 (60.90%), Positives = 97/133 (72.93%), Query Frame = 0
Query: 1 MPKGRTAGLHPFDPEIERTYRRLLREGRAEPQEMADQELINNPKPIREYFQPVFNSEQAG 60
MP+ T L P DPEI+RTYRR LR + EMA++ PK IR+YFQP + Q G
Sbjct: 17 MPRDNT-NLLPLDPEIDRTYRRNLRALLNQTTEMAEE----IPKAIRDYFQPTLPASQPG 76
Query: 61 IVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGISANSIRL 120
I++ PINVNNFELK GLIQM RE AFRG EDP+ HL+ FLEICGT KMNG+S ++I+L
Sbjct: 77 IMNVPINVNNFELKPGLIQMARELAFRGRTNEDPHKHLRSFLEICGTVKMNGVSNDAIKL 136
Query: 121 RLFPFSLQGRAKE 134
RLFPFSLQ RAK+
Sbjct: 137 RLFPFSLQDRAKD 144
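The percentages in an HSP summary line are the identical and positive (conservative) column counts divided by the alignment length. The sketch below recomputes them from the summary line of the first hit; the regular expression and the `percentages` helper are hypothetical, and the pattern accepts either spelling of "Positives" that may appear in this report.

```python
import re

# Matches the count fields of an HSP summary line as formatted in this report.
HSP_SUMMARY = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def percentages(summary_line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) for one HSP summary line."""
    match = HSP_SUMMARY.search(summary_line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, id_len, positive, pos_len = map(int, match.groups())
    return round(100 * identical / id_len, 2), round(100 * positive / pos_len, 2)


line = "Identity = 81/133 (60.90%), Postives = 97/133 (72.93%), Query Frame = 0"
print(percentages(line))  # (60.9, 72.93)
```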
BLAST of Tan0009582 vs. NCBI nr
Match:
KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])
HSP 1 Score: 125.6 bits (314), Expect = 3.3e-25
Identity = 61/132 (46.21%), Positives = 87/132 (65.91%), Query Frame = 0
Query: 1 MPKGRTAGLHPFDPEIERTYRRLLREGRAEPQEMADQELINNPKPIREYFQPVFNSEQAG 60
M + R+ + P DPEIERT R L R + MA+++ P+ +++Y +PV N +
Sbjct: 1 MRRARSRDIIPVDPEIERTLRSLRRN---KILAMAEEDREVLPRTLKDYVRPVVNGNYSS 60
Query: 61 IVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGISANSIRL 120
I+ PIN NNFELK LI MV++ F G +DPN HL +FLEIC T K+NG++ ++IRL
Sbjct: 61 IMRQPINANNFELKPALISMVQQAQFSGSPLDDPNIHLAMFLEICDTVKINGVTEDTIRL 120
Query: 121 RLFPFSLQGRAK 133
RLFPFSL+ +A+
Sbjct: 121 RLFPFSLRDKAR 129
BLAST of Tan0009582 vs. NCBI nr
Match:
XP_022843226.1 (uncharacterized protein LOC111366761 [Olea europaea var. sylvestris])
HSP 1 Score: 122.1 bits (305), Expect = 3.6e-24
Identity = 66/137 (48.18%), Positives = 84/137 (61.31%), Query Frame = 0
Query: 1 MPKGRTAGLHPFDPEIERTYRRLLREGRAEPQEMADQEL-----INNPKPIREYFQPVFN 60
M + R L DPE ERT+R L R E + MA+Q++ N + IR+Y +PV N
Sbjct: 94 MRRARNLDLLHVDPEPERTFRILRGIQRNEREAMAEQDVRAANEDNQQRAIRDYIRPVVN 153
Query: 61 SEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGISA 120
+GI I NFELK GLI MV++ F G A EDPN HL FLEIC T KMNG++
Sbjct: 154 DNYSGIARPAIVAKNFELKPGLIDMVQQNQFGGAAVEDPNAHLGSFLEICDTVKMNGVTE 213
Query: 121 NSIRLRLFPFSLQGRAK 133
++IRLRLF FSL+ +AK
Sbjct: 214 DAIRLRLFSFSLRDKAK 230
BLAST of Tan0009582 vs. NCBI nr
Match:
XP_022157708.1 (uncharacterized protein LOC111024361 [Momordica charantia])
HSP 1 Score: 117.1 bits (292), Expect = 1.2e-22
Identity = 60/101 (59.41%), Positives = 70/101 (69.31%), Query Frame = 0
Query: 30 EPQEMADQELINNPKPIREYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGL 89
+P E E NN IR+Y QP F GI++ PIN NN ELK GLIQMVRE FRG
Sbjct: 12 QPMERPQLEQ-NNQMTIRDYCQPNF-PNHVGIINLPINANNSELKPGLIQMVRENTFRGN 71
Query: 90 ATEDPNFHLKIFLEICGTFKMNGISANSIRLRLFPFSLQGR 131
ATEDPN HL IFL++CGT KMNG+ ++IRLRLFP SLQ +
Sbjct: 72 ATEDPNNHLTIFLDVCGTVKMNGVIDDAIRLRLFPLSLQDK 110
BLAST of Tan0009582 vs. NCBI nr
Match:
XP_012833448.1 (PREDICTED: uncharacterized protein LOC105954320 [Erythranthe guttata])
HSP 1 Score: 115.5 bits (288), Expect = 3.4e-22
Identity = 68/151 (45.03%), Positives = 85/151 (56.29%), Query Frame = 0
Query: 5 RTAGLHPFDPEIERTYRRL----------LREGRAEPQEMA---DQEL---INNPKP--- 64
R + P+DPEIERT RRL + E + P+ +A DQE P+P
Sbjct: 355 RNLEIIPYDPEIERTLRRLRATRNQQSEPMAEDQGNPEHLAFEYDQEPGRDHEQPRPNLP 414
Query: 65 ----IREYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIF 124
+REY P N +GI I NNFELKTGLI MV F G AT DPN HL F
Sbjct: 415 PERTMREYRTPAMNENYSGIRKPTIAANNFELKTGLINMVMANQFSGAATADPNLHLANF 474
Query: 125 LEICGTFKMNGISANSIRLRLFPFSLQGRAK 133
LEIC T K+NG+S ++IRL+LF FS++ +AK
Sbjct: 475 LEICDTIKVNGVSDDAIRLKLFSFSVRDKAK 505
BLAST of Tan0009582 vs. ExPASy TrEMBL
Match:
A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)
HSP 1 Score: 117.1 bits (292), Expect = 5.7e-23
Identity = 60/101 (59.41%), Positives = 70/101 (69.31%), Query Frame = 0
Query: 30 EPQEMADQELINNPKPIREYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGL 89
+P E E NN IR+Y QP F GI++ PIN NN ELK GLIQMVRE FRG
Sbjct: 12 QPMERPQLEQ-NNQMTIRDYCQPNF-PNHVGIINLPINANNSELKPGLIQMVRENTFRGN 71
Query: 90 ATEDPNFHLKIFLEICGTFKMNGISANSIRLRLFPFSLQGR 131
ATEDPN HL IFL++CGT KMNG+ ++IRLRLFP SLQ +
Sbjct: 72 ATEDPNNHLTIFLDVCGTVKMNGVIDDAIRLRLFPLSLQDK 110
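The Momordica charantia protein appears in both searches with the same bit score (117.1) but different E-values (1.2e-22 against NCBI nr, 5.7e-23 against ExPASy TrEMBL). Under standard Karlin-Altschul statistics the expectation is E = m·n·2^(−S'), where S' is the bit score and m·n is the effective search space, so the E-value scales with database size. The sketch below inverts that relation. It is only a rough estimate: the reported scores are rounded, and the `search_space` helper is hypothetical.

```python
def search_space(bit_score: float, e_value: float) -> float:
    """Estimate the effective search space m*n from E = m*n * 2**(-bit_score)."""
    return e_value * 2.0 ** bit_score


# Same HSP (117.1 bits) reported against two databases in this report.
nr_space = search_space(117.1, 1.2e-22)      # vs. NCBI nr
trembl_space = search_space(117.1, 5.7e-23)  # vs. ExPASy TrEMBL

# Ratio of the two effective search spaces implied by the reported E-values.
print(f"{nr_space / trembl_space:.2f}")
```

On these rounded figures, the nr search space comes out roughly twice that of TrEMBL, which is why an identical alignment receives a slightly weaker E-value in the nr search.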
BLAST of Tan0009582 vs. ExPASy TrEMBL
Match:
A0A1U8Q202 (uncharacterized protein LOC109114208 OS=Nelumbo nucifera OX=4432 GN=LOC109114208 PE=4 SV=1)
HSP 1 Score: 114.0 bits (284), Expect = 4.8e-22
Identity = 60/126 (47.62%), Positives = 80/126 (63.49%), Query Frame = 0
Query: 11 PFDPEIERTY---RRLLREGRAEPQEMADQELINNPKPIREYFQPVFNSEQAGIVHAPIN 70
P+DPEIERT R R+ RAE +EMA+ P+ + +Y +P IV I+
Sbjct: 5 PYDPEIERTLCIRLRAARQVRAETEEMAE------PRTMMDYAKPTLTGAALSIVRPAIS 64
Query: 71 VNNFELKTGLIQMVREGA-FRGLATEDPNFHLKIFLEICGTFKMNGISANSIRLRLFPFS 130
NNFE+K +IQM++ F G+A EDPN H+ FLEIC TFK NG+S + +RLRLFPFS
Sbjct: 65 ANNFEIKPAIIQMIQNTVQFCGMANEDPNSHIANFLEICDTFKHNGVSDDVVRLRLFPFS 124
Query: 131 LQGRAK 133
L+ +AK
Sbjct: 125 LKDKAK 124
BLAST of Tan0009582 vs. ExPASy TrEMBL
Match:
A0A2H9ZY12 (Uncharacterized protein OS=Apostasia shenzhenica OX=1088818 GN=AXF42_Ash020512 PE=4 SV=1)
HSP 1 Score: 113.6 bits (283), Expect = 6.3e-22
Identity = 61/147 (41.50%), Positives = 84/147 (57.14%), Query Frame = 0
Query: 1 MPKGRTAGLHPFDPEIERTYRRLLREGRAEP-------------QEMADQELINNP-KPI 60
M + L PFDPEIERT ++ R+ + + +M DQ I + +
Sbjct: 1 MTRSSKKDLAPFDPEIERTIAKITRQLKEQEVRGQLNKVKKDLFTKMEDQPTIQEAGRAL 60
Query: 61 REYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGA-FRGLATEDPNFHLKIFLEIC 120
REY P N +V + NNFE+K LIQM+++ F GL ++DPN H+ FLEIC
Sbjct: 61 REYALPSINGANTSVVRPAVQANNFEIKPALIQMIQQSVQFYGLPSDDPNTHIANFLEIC 120
Query: 121 GTFKMNGISANSIRLRLFPFSLQGRAK 133
TFK NG+S ++IRLRLFPFSL+ +AK
Sbjct: 121 DTFKHNGVSDDAIRLRLFPFSLKDKAK 147
BLAST of Tan0009582 vs. ExPASy TrEMBL
Match:
A0A803PJA6 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 111.7 bits (278), Expect = 2.4e-21
Identity = 61/139 (43.88%), Positives = 81/139 (58.27%), Query Frame = 0
Query: 13 DPEIERTYRRLLREGRAEPQEMADQELINN-------------------PKPIREYFQPV 72
+PEIERT L++ +A+ + INN P+ +R+Y PV
Sbjct: 14 NPEIERT----LKQSKAKKKIDFTMAAINNNNSNGLNNNQSAPAAADAQPRAVRDYCLPV 73
Query: 73 FNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGI 132
N GI + I NNFELK LI MV++ F GLAT+DPN HL IFLE+C T KMNG+
Sbjct: 74 VNKNLTGIANPVIAANNFELKPALINMVQQNHFGGLATKDPNIHLAIFLEVCATMKMNGL 133
BLAST of Tan0009582 vs. ExPASy TrEMBL
Match:
A0A6P6W382 (uncharacterized protein LOC113729769 OS=Coffea arabica OX=13443 GN=LOC113729769 PE=4 SV=1)
HSP 1 Score: 110.9 bits (276), Expect = 4.1e-21
Identity = 61/138 (44.20%), Positives = 80/138 (57.97%), Query Frame = 0
Query: 11 PFDPEIERTYRRLLRE-GRAEPQEM------------ADQELINNP---KPIREYFQPVF 70
PFDPEIER RR R E QE+ ++E+ N + +R++ P
Sbjct: 10 PFDPEIERALRRQRRNTPHQEEQEIWQPIEEILIELPFEEEIAENEPNRRILRDFALPET 69
Query: 71 NSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGIS 130
Q I +N NNFE+K LIQMV++ + G ATEDPN HL FLEIC T K NG+S
Sbjct: 70 QGSQTSIARPMVNANNFEIKPSLIQMVQQSQYGGNATEDPNSHLSTFLEICDTIKFNGVS 129
Query: 131 ANSIRLRLFPFSLQGRAK 133
++I+LRLFPFSL+ +AK
Sbjct: 130 DDAIKLRLFPFSLKDKAK 147
The following BLAST results are available for this feature:
BLAST of Tan0009582 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
WP_217833153.1 | 1.3e-34 | 60.90 | retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70...
KAG7990634.1 | 3.3e-25 | 46.21 | hypothetical protein I3843_02G035100 [Carya illinoinensis]
XP_022843226.1 | 3.6e-24 | 48.18 | uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]
XP_022157708.1 | 1.2e-22 | 59.41 | uncharacterized protein LOC111024361 [Momordica charantia]
XP_012833448.1 | 3.4e-22 | 45.03 | PREDICTED: uncharacterized protein LOC105954320 [Erythranthe guttata]
BLAST of Tan0009582 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DU19 | 5.7e-23 | 59.41 | uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A1U8Q202 | 4.8e-22 | 47.62 | uncharacterized protein LOC109114208 OS=Nelumbo nucifera OX=4432 GN=LOC109114208...
A0A2H9ZY12 | 6.3e-22 | 41.50 | Uncharacterized protein OS=Apostasia shenzhenica OX=1088818 GN=AXF42_Ash020512 P...
A0A803PJA6 | 2.4e-21 | 43.88 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A6P6W382 | 4.1e-21 | 44.20 | uncharacterized protein LOC113729769 OS=Coffea arabica OX=13443 GN=LOC113729769 ...
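For downstream filtering, the pipe-delimited summary rows can be read into records. The sketch below assumes each data row carries at least four pipe-separated fields (match name, E-value, percent identity, description) and skips header or label lines; `parse_rows` is a hypothetical helper, not part of any BLAST or database tooling.

```python
def parse_rows(lines):
    """Parse pipe-delimited BLAST summary rows into a list of dictionaries."""
    rows = []
    for line in lines:
        fields = [field.strip() for field in line.split("|")]
        # Skip label lines (no pipes) and repeated column headers.
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        name, e_value, identity, description = fields[:4]
        rows.append({
            "name": name,
            "e_value": float(e_value),
            "identity": float(identity),
            "description": description,
        })
    return rows


table = [
    "Match Name | E-value | Identity | Description | |",
    "XP_022157708.1 | 1.2e-22 | 59.41 | uncharacterized protein LOC111024361 [Momordica charantia] | [more] |",
    "WP_217833153.1 | 1.3e-34 | 60.90 | retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70... | [more] |",
]

# Sort hits from most to least significant (smallest E-value first).
for hit in sorted(parse_rows(table), key=lambda row: row["e_value"]):
    print(hit["name"], hit["e_value"], hit["identity"])
```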