Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
GTCTGGTTTCACCGTTCAAATTCTCAACGGAGCCGTCGTCTTCTCATTTCTTTCCTTAATTCCTCCACTTTTCGATGGCGCGAAAACTCAGGCGTTCACCGCGGCCATTGCCTCGACGTACTGTCGACTATGCCTCTGATTATGATGCTTCTCCTTCTCCATCTCAGTCTCTCTATGCATCGAATGAAGACGACTATGACGCTTCTGAATCTATTAACTTCCAACCCACTGACCCCAAATCGAAAGCCCAAGAAATCAAGGGCTCCGATTTATTGACCTCTGCAGAATCCGCCTCCAACAGTCCATCGTATTTTCAGAGTCCAAACGCCGCCGAAACTCTATTTCCATACATCAACATTGCACCGTTGCCAGCTTTTCACGGCGGCGTCGATGAGTGTCCGGCGATGCATTTAAGCAGATTCGCCAAAGTCTGCCGTGCGAACAACGCGGCCTCCGTCGACATGATGATGAGGATCTTTCCGGTGACGTTAGAGGGTGAGGCTGCGCTTTGGTACGACTTGAACATTGAACCGTACCCTCCGATTTCTTGGGAAGAATTGAAGTCTTCGTTCTTGGACGCATATAATAAAATTGAATTGACTGATCAGTTGCGATCGGAGCTTATGACGATCAATCAACAGCAGGAGGAGAATGTACGTTCGTATTTTCTGAGGCTGCAGTTGATTTTGAAGAAATGGCCGACGGGGAACGAACTTTCCGATGGCTTGTTGAAAGCGATTTTCATGGATGGATTGAGAGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCCAGTTCACTGAACGAGGCATTGAGACTTGCATTTGGGTTCGAACAAGTGAGAAGCGTTCGTACATCTGGCAAAAAGCGCTTTCTGCAGTGTGGGTTTTGTGAGGGGCCGCATGAGGAACTGCTTTGTGAGGTTAGGGAGAGAATGAGACAGTTGTGGAAGAGCAGGGAAAAGAAGAAGGCTGTTGATCTGGCAGAGAGTGACGGCCGTGAAGCGGCGACGGCGACGGCGACGGCGGAGCTTGTGAGATCGGTTTCGGCGATAAGTAGAAATGAAGCGGGGGTTGATAAGGATGGTGGAGAGATGGTGGGTTTGAAGAAGAAGAGTCAGTGCCAGTGTTGGAAGCATCAGTGCGGGATGAAGAAATTGGATCGAAACCTTAGCATCGTATCAAGAAATTCTAAAGGCTAATTTCAAGACTATTTTGGGGGGCAAGTTGCTGATAGATTATGAATATTTGTGTGTGTGTTTAATTTAAGAGTAAATTCTTTTGTTTGTAGAAGAATCTTGAAATCTAACTATCACTGATTCATCTATTTGATGCATGAAGAGAAATTGTATTGCCTTTTCCTGCTTCCACTGTGGGATCAATGTAATAATATTCTTATCTGAAACAGCATCTTATGAACAA
mRNA sequence
GTCTGGTTTCACCGTTCAAATTCTCAACGGAGCCGTCGTCTTCTCATTTCTTTCCTTAATTCCTCCACTTTTCGATGGCGCGAAAACTCAGGCGTTCACCGCGGCCATTGCCTCGACGTACTGTCGACTATGCCTCTGATTATGATGCTTCTCCTTCTCCATCTCAGTCTCTCTATGCATCGAATGAAGACGACTATGACGCTTCTGAATCTATTAACTTCCAACCCACTGACCCCAAATCGAAAGCCCAAGAAATCAAGGGCTCCGATTTATTGACCTCTGCAGAATCCGCCTCCAACAGTCCATCGTATTTTCAGAGTCCAAACGCCGCCGAAACTCTATTTCCATACATCAACATTGCACCGTTGCCAGCTTTTCACGGCGGCGTCGATGAGTGTCCGGCGATGCATTTAAGCAGATTCGCCAAAGTCTGCCGTGCGAACAACGCGGCCTCCGTCGACATGATGATGAGGATCTTTCCGGTGACGTTAGAGGGTGAGGCTGCGCTTTGGTACGACTTGAACATTGAACCGTACCCTCCGATTTCTTGGGAAGAATTGAAGTCTTCGTTCTTGGACGCATATAATAAAATTGAATTGACTGATCAGTTGCGATCGGAGCTTATGACGATCAATCAACAGCAGGAGGAGAATGTACGTTCGTATTTTCTGAGGCTGCAGTTGATTTTGAAGAAATGGCCGACGGGGAACGAACTTTCCGATGGCTTGTTGAAAGCGATTTTCATGGATGGATTGAGAGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCCAGTTCACTGAACGAGGCATTGAGACTTGCATTTGGGTTCGAACAAGTGAGAAGCGTTCGTACATCTGGCAAAAAGCGCTTTCTGCAGTGTGGGTTTTGTGAGGGGCCGCATGAGGAACTGCTTTGTGAGGTTAGGGAGAGAATGAGACAGTTGTGGAAGAGCAGGGAAAAGAAGAAGGCTGTTGATCTGGCAGAGAGTGACGGCCGTGAAGCGGCGACGGCGACGGCGACGGCGGAGCTTGTGAGATCGGTTTCGGCGATAAGTAGAAATGAAGCGGGGGTTGATAAGGATGGTGGAGAGATGGTGGGTTTGAAGAAGAAGAGTCAGTGCCAGTGTTGGAAGCATCAGTGCGGGATGAAGAAATTGGATCGAAACCTTAGCATCGTATCAAGAAATTCTAAAGGCTAATTTCAAGACTATTTTGGGGGGCAAGTTGCTGATAGATTATGAATATTTGTGTGTGTGTTTAATTTAAGAGTAAATTCTTTTGTTTGTAGAAGAATCTTGAAATCTAACTATCACTGATTCATCTATTTGATGCATGAAGAGAAATTGTATTGCCTTTTCCTGCTTCCACTGTGGGATCAATGTAATAATATTCTTATCTGAAACAGCATCTTATGAACAA
Coding sequence (CDS)
ATGGCGCGAAAACTCAGGCGTTCACCGCGGCCATTGCCTCGACGTACTGTCGACTATGCCTCTGATTATGATGCTTCTCCTTCTCCATCTCAGTCTCTCTATGCATCGAATGAAGACGACTATGACGCTTCTGAATCTATTAACTTCCAACCCACTGACCCCAAATCGAAAGCCCAAGAAATCAAGGGCTCCGATTTATTGACCTCTGCAGAATCCGCCTCCAACAGTCCATCGTATTTTCAGAGTCCAAACGCCGCCGAAACTCTATTTCCATACATCAACATTGCACCGTTGCCAGCTTTTCACGGCGGCGTCGATGAGTGTCCGGCGATGCATTTAAGCAGATTCGCCAAAGTCTGCCGTGCGAACAACGCGGCCTCCGTCGACATGATGATGAGGATCTTTCCGGTGACGTTAGAGGGTGAGGCTGCGCTTTGGTACGACTTGAACATTGAACCGTACCCTCCGATTTCTTGGGAAGAATTGAAGTCTTCGTTCTTGGACGCATATAATAAAATTGAATTGACTGATCAGTTGCGATCGGAGCTTATGACGATCAATCAACAGCAGGAGGAGAATGTACGTTCGTATTTTCTGAGGCTGCAGTTGATTTTGAAGAAATGGCCGACGGGGAACGAACTTTCCGATGGCTTGTTGAAAGCGATTTTCATGGATGGATTGAGAGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCCAGTTCACTGAACGAGGCATTGAGACTTGCATTTGGGTTCGAACAAGTGAGAAGCGTTCGTACATCTGGCAAAAAGCGCTTTCTGCAGTGTGGGTTTTGTGAGGGGCCGCATGAGGAACTGCTTTGTGAGGTTAGGGAGAGAATGAGACAGTTGTGGAAGAGCAGGGAAAAGAAGAAGGCTGTTGATCTGGCAGAGAGTGACGGCCGTGAAGCGGCGACGGCGACGGCGACGGCGGAGCTTGTGAGATCGGTTTCGGCGATAAGTAGAAATGAAGCGGGGGTTGATAAGGATGGTGGAGAGATGGTGGGTTTGAAGAAGAAGAGTCAGTGCCAGTGTTGGAAGCATCAGTGCGGGATGAAGAAATTGGATCGAAACCTTAGCATCGTATCAAGAAATTCTAAAGGCTAA
Protein sequence
MARKLRRSPRPLPRRTVDYASDYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRNSKG
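The protein sequence above should correspond to a conceptual translation of the CDS. A minimal Python sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` is illustrative and not part of the source page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGCGCGAAAACTCAGG"))  # MARKLR
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would confirm that the two records agree.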
Homology
BLAST of Tan0012462 vs. NCBI nr
Match:
KAG6604769.1 (hypothetical protein SDJN03_02086, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 565.1 bits (1455), Expect = 4.6e-157
Identity = 297/374 (79.41%), Positives = 320/374 (85.56%), Query Frame = 0
Query: 1 MARKLRRSPRPLPRRTVDYASDYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQE 60
MA KLRRSP PL RR +YA+DYDA S SQSL ASNEDDYDASES NFQ + KSK+ E
Sbjct: 1 MAPKLRRSPPPLRRR--NYATDYDA--SLSQSLDASNEDDYDASESNNFQTSGHKSKSLE 60
Query: 61 IKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVC 120
I + ESA+NSP+ QSPNAA T+FPYINIAPLP FHGG DECPA HLSRFAKVC
Sbjct: 61 I-------NEESATNSPTNLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVC 120
Query: 121 RANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLR 180
RANNAASV++MMRIFPVTL+GEA LWYDLNIEPYPPISWEELKSSFLDAYNKIEL +QLR
Sbjct: 121 RANNAASVEIMMRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLR 180
Query: 181 SELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKP 240
SELMTI+Q+ EENVRSYFLRLQLILKKWP GNELSDG LKAIFMDGLREEFKEWMIPQKP
Sbjct: 181 SELMTISQRPEENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKP 240
Query: 241 SSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKA 300
SLNEALRLAFG EQV +RTSG KRFL+CGFCEG HEEL+CEVRERMR+LWKSREKK
Sbjct: 241 DSLNEALRLAFGLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKKNG 300
Query: 301 VDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMK 360
D+AES+G TAELVRSVSAISRNEA V KDGGEMVGLKKK QCQCWKHQCGMK
Sbjct: 301 GDMAESEGHN------TAELVRSVSAISRNEAEVGKDGGEMVGLKKKGQCQCWKHQCGMK 357
Query: 361 KLDRNLSIVSRNSK 375
KLDRNLS++S+ SK
Sbjct: 361 KLDRNLSMLSKTSK 357
BLAST of Tan0012462 vs. NCBI nr
Match:
EEF44287.1 (conserved hypothetical protein [Ricinus communis])
HSP 1 Score: 341.7 bits (875), Expect = 8.2e-90
Identity = 192/377 (50.93%), Positives = 251/377 (66.58%), Query Frame = 0
Query: 1 MARKLRRSPRPLPRRTVDYAS--DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKA 60
M RK + S R+++ ++S DY S SPSQS Y SN+DD + + QP +S
Sbjct: 1 MTRKAKNS-----RKSLQFSSRHDYSESTSPSQSPYDSNDDDDEIEDDDEEQPIISESVT 60
Query: 61 QEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAK 120
+ L S+ S SNS PN + YIN+APLP FHG +ECP HLSRF K
Sbjct: 61 NSLNADQL--SSSSYSNS-----QPNNS-----YINVAPLPVFHGNSNECPIAHLSRFVK 120
Query: 121 VCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQ 180
VCRANNA+S DMMMRIFPVTLE EAALWYDLNI+PYP +SW+E+ SFL+AY +I+L DQ
Sbjct: 121 VCRANNASSTDMMMRIFPVTLENEAALWYDLNIQPYPSLSWDEIMLSFLEAYQRIKLVDQ 180
Query: 181 LRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQ 240
LRS+LM +NQ +E+VRSYF+RLQ ILK+WP + LSD +LK IF+DGL FK+W+IP
Sbjct: 181 LRSDLMMLNQGSDESVRSYFMRLQWILKRWP-DHGLSDNMLKWIFIDGLMGNFKDWIIPH 240
Query: 241 KPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKK 300
KP+SLNEALRLAF FEQV+S+R + K++ ++CGFCEG HEE C VRE+MR+L+++ +KK
Sbjct: 241 KPNSLNEALRLAFSFEQVKSIRGT-KQKVVKCGFCEGSHEENCCVVREKMRELFRNSKKK 300
Query: 301 KAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKK--KSQCQCWKHQ 360
+ S+ EA A + + + G DK+ M+ K KS CQC KH
Sbjct: 301 MMIPKEASERSEAGNEMAENKDGKEGEEEEEVDVGDDKEEKRMLSSSKTGKSPCQCSKHH 358
Query: 361 CGMKKLDRNLSIVSRNS 374
C MKK +R+ S+ +RNS
Sbjct: 361 CWMKKFERSNSVTTRNS 358
BLAST of Tan0012462 vs. NCBI nr
Match:
KAF3973300.1 (hypothetical protein CMV_003263 [Castanea mollissima])
HSP 1 Score: 329.3 bits (843), Expect = 4.2e-86
Identity = 193/345 (55.94%), Positives = 243/345 (70.43%), Query Frame = 0
Query: 37 NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFP 96
N+D Y SES P D S + + + +L T+A S SN P Q P+ L
Sbjct: 63 NDDAYIGSESETNAPGDRFSSQLRDPDSQSVNLSTTAFPNSTSNFPKISQPPST--HLAS 122
Query: 97 YINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNI 156
YINIAP P FHG +ECP H+SRFAKVC ANN ++ DMMM IFPVTLE EAALWYDLNI
Sbjct: 123 YINIAPFPIFHGNPNECPVKHVSRFAKVCVANNVSTTDMMMSIFPVTLEDEAALWYDLNI 182
Query: 157 EPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTG 216
+PYP ++WEE+KSSFL AY+KI++ DQLRSELM INQ EE+VRSYFLRLQ ILK+WP
Sbjct: 183 DPYPSLTWEEIKSSFLHAYHKIQVVDQLRSELMMINQGDEESVRSYFLRLQWILKQWP-D 242
Query: 217 NELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCG 276
+ + DGLLK +F+DGLREEF++W+ PQKP SL+EALRLAF FEQV+S+R K+ L+CG
Sbjct: 243 HGIPDGLLKGVFIDGLREEFRDWIFPQKPDSLHEALRLAFAFEQVKSIRAVRKE--LKCG 302
Query: 277 FCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSV-----S 336
FC+G HEE CEVRERMR+LW+ S+EK++ V LA+S + ELVRSV S
Sbjct: 303 FCDGMHEERDCEVRERMRKLWRESKEKEEPVVLAKSTRSDDDLG---KELVRSVSIGASS 362
Query: 337 AISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVS 371
++ +N G ++GG M G KK+Q Q K+Q MKKL+RN S++S
Sbjct: 363 SVGKNNEG--EEGGFMDG--KKNQFQYRKYQRWMKKLERNNSLIS 395
BLAST of Tan0012462 vs. NCBI nr
Match:
KAF3973299.1 (hypothetical protein CMV_003263 [Castanea mollissima])
HSP 1 Score: 329.3 bits (843), Expect = 4.2e-86
Identity = 193/345 (55.94%), Positives = 243/345 (70.43%), Query Frame = 0
Query: 37 NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFP 96
N+D Y SES P D S + + + +L T+A S SN P Q P+ L
Sbjct: 63 NDDAYIGSESETNAPGDRFSSQLRDPDSQSVNLSTTAFPNSTSNFPKISQPPST--HLAS 122
Query: 97 YINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNI 156
YINIAP P FHG +ECP H+SRFAKVC ANN ++ DMMM IFPVTLE EAALWYDLNI
Sbjct: 123 YINIAPFPIFHGNPNECPVKHVSRFAKVCVANNVSTTDMMMSIFPVTLEDEAALWYDLNI 182
Query: 157 EPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTG 216
+PYP ++WEE+KSSFL AY+KI++ DQLRSELM INQ EE+VRSYFLRLQ ILK+WP
Sbjct: 183 DPYPSLTWEEIKSSFLHAYHKIQVVDQLRSELMMINQGDEESVRSYFLRLQWILKQWP-D 242
Query: 217 NELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCG 276
+ + DGLLK +F+DGLREEF++W+ PQKP SL+EALRLAF FEQV+S+R K+ L+CG
Sbjct: 243 HGIPDGLLKGVFIDGLREEFRDWIFPQKPDSLHEALRLAFAFEQVKSIRAVRKE--LKCG 302
Query: 277 FCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSV-----S 336
FC+G HEE CEVRERMR+LW+ S+EK++ V LA+S + ELVRSV S
Sbjct: 303 FCDGMHEERDCEVRERMRKLWRESKEKEEPVVLAKSTRSDDDLG---KELVRSVSIGASS 362
Query: 337 AISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVS 371
++ +N G ++GG M G KK+Q Q K+Q MKKL+RN S++S
Sbjct: 363 SVGKNNEG--EEGGFMDG--KKNQFQYRKYQRWMKKLERNNSLIS 395
BLAST of Tan0012462 vs. NCBI nr
Match:
EXB78111.1 (hypothetical protein L484_004813 [Morus notabilis])
HSP 1 Score: 328.2 bits (840), Expect = 9.4e-86
Identity = 199/394 (50.51%), Positives = 251/394 (63.71%), Query Frame = 0
Query: 6 RRSPRPLPRRTVDYASDYD-------ASPSPSQSL-YASNEDD--YDASESINFQPTDPK 65
RR+P P DY+S YD SP+ S N+DD DAS++ T+P
Sbjct: 19 RRTPTP-----QDYSSTYDDDYTTVVRSPNDSTEFDQPENDDDDNDDASDAPTDSATNPL 78
Query: 66 SKAQEIKGSDLLTSAE---SASNSPSYF--QSPNAAETLFPYINIAPLPAFHGGVDECPA 125
S Q S+ + + + SAS+SP Q P + Y+NIA P F GG +ECP
Sbjct: 79 SD-QFSSVSERINARKKSCSASHSPILHLPQQPVSQTGYNSYMNIAQFPIFRGGSEECPF 138
Query: 126 MHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAY 185
HLSRFAKVCRANN +S+DMMM+IFPVTLE EAALWYDLN+EPY +SWEE+KSSF AY
Sbjct: 139 AHLSRFAKVCRANNVSSIDMMMKIFPVTLEDEAALWYDLNVEPYEELSWEEIKSSFYHAY 198
Query: 186 NKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREE 245
KIELT+QLRS+LMTINQ E+VRSYFLRLQ ILKKWP + LSD LLK +F+DGLR +
Sbjct: 199 GKIELTEQLRSQLMTINQGDAESVRSYFLRLQWILKKWPE-HGLSDDLLKGVFVDGLRGD 258
Query: 246 FKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQ 305
F+EWM PQKP SLN+ALRLAF FEQV+S+R + ++CGFC G HEE CEVRERMR+
Sbjct: 259 FQEWMAPQKPGSLNKALRLAFCFEQVKSIRNVRRNASVKCGFCGGLHEERGCEVRERMRE 318
Query: 306 LWKSREKKKAVD--------LAESDGREAATATATAELVRSVSAISRNEAGVDKDG---- 365
LW K + + +S+G + + + RS + +N+ V++DG
Sbjct: 319 LWLKSNKDDGLGKGMLERNLIEKSEGVKELGRSVSMATSRSTCVVGKNDQ-VEEDGKEEE 378
Query: 366 GEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN 373
E+ KK+SQCQC KHQC K ++RN S VS N
Sbjct: 379 DELGSKKKRSQCQCGKHQCWKKNIERNNSTVSGN 404
BLAST of Tan0012462 vs. ExPASy TrEMBL
Match:
A0A7N2R9A7 (Retrotrans_gag domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)
HSP 1 Score: 344.4 bits (882), Expect = 6.2e-91
Identity = 197/341 (57.77%), Positives = 247/341 (72.43%), Query Frame = 0
Query: 37 NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFP 96
N+D Y +SES P D S + + + +L T+A S SN P Q P+ L
Sbjct: 62 NDDAYISSESETNAPGDRFSSQLRDPDSQSINLSTTAFPNSTSNFPKISQPPST--HLAS 121
Query: 97 YINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNI 156
Y+NIAP+P FHG +ECP H+SRFAKVC ANN ++ DMMMRIFPVTLE EAALWYDLNI
Sbjct: 122 YMNIAPIPIFHGNTNECPVKHVSRFAKVCVANNVSTTDMMMRIFPVTLEDEAALWYDLNI 181
Query: 157 EPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTG 216
EPYP ++WEE+KSSFL AY+KIE+ DQLRSELM INQ EE+VRSYFLRLQ ILK+WP
Sbjct: 182 EPYPSLTWEEIKSSFLHAYHKIEVVDQLRSELMMINQGDEESVRSYFLRLQWILKQWP-D 241
Query: 217 NELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCG 276
+ +SDGLLK +F+DGLREEF+ W+IPQKP SL+EALRLAFGFEQV+S+R K+ L+CG
Sbjct: 242 HGISDGLLKGVFIDGLREEFRGWIIPQKPDSLHEALRLAFGFEQVKSIRAVRKE--LKCG 301
Query: 277 FCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSVSAISRN 336
FC+G HEE CEVRERMR+LW+ S+EK++AV LA+S G + ELVRSVS + +
Sbjct: 302 FCDGMHEERDCEVRERMRKLWRESKEKEEAVVLAKSTGGDDELG---KELVRSVSIGASS 361
Query: 337 EAGVDKDGGEMVGLK-KKSQCQCWKHQCGMKKLDRNLSIVS 371
G + +G E + KK+Q Q K+Q MKKL+RN S++S
Sbjct: 362 SVGKNNEGEEAGFMDGKKNQFQYGKYQRWMKKLERNNSLIS 394
BLAST of Tan0012462 vs. ExPASy TrEMBL
Match:
B9RWN5 (Retrotrans_gag domain-containing protein OS=Ricinus communis OX=3988 GN=RCOM_1022950 PE=4 SV=1)
HSP 1 Score: 341.7 bits (875), Expect = 4.0e-90
Identity = 192/377 (50.93%), Positives = 251/377 (66.58%), Query Frame = 0
Query: 1 MARKLRRSPRPLPRRTVDYAS--DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKA 60
M RK + S R+++ ++S DY S SPSQS Y SN+DD + + QP +S
Sbjct: 1 MTRKAKNS-----RKSLQFSSRHDYSESTSPSQSPYDSNDDDDEIEDDDEEQPIISESVT 60
Query: 61 QEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAK 120
+ L S+ S SNS PN + YIN+APLP FHG +ECP HLSRF K
Sbjct: 61 NSLNADQL--SSSSYSNS-----QPNNS-----YINVAPLPVFHGNSNECPIAHLSRFVK 120
Query: 121 VCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQ 180
VCRANNA+S DMMMRIFPVTLE EAALWYDLNI+PYP +SW+E+ SFL+AY +I+L DQ
Sbjct: 121 VCRANNASSTDMMMRIFPVTLENEAALWYDLNIQPYPSLSWDEIMLSFLEAYQRIKLVDQ 180
Query: 181 LRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQ 240
LRS+LM +NQ +E+VRSYF+RLQ ILK+WP + LSD +LK IF+DGL FK+W+IP
Sbjct: 181 LRSDLMMLNQGSDESVRSYFMRLQWILKRWP-DHGLSDNMLKWIFIDGLMGNFKDWIIPH 240
Query: 241 KPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKK 300
KP+SLNEALRLAF FEQV+S+R + K++ ++CGFCEG HEE C VRE+MR+L+++ +KK
Sbjct: 241 KPNSLNEALRLAFSFEQVKSIRGT-KQKVVKCGFCEGSHEENCCVVREKMRELFRNSKKK 300
Query: 301 KAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKK--KSQCQCWKHQ 360
+ S+ EA A + + + G DK+ M+ K KS CQC KH
Sbjct: 301 MMIPKEASERSEAGNEMAENKDGKEGEEEEEVDVGDDKEEKRMLSSSKTGKSPCQCSKHH 358
Query: 361 CGMKKLDRNLSIVSRNS 374
C MKK +R+ S+ +RNS
Sbjct: 361 CWMKKFERSNSVTTRNS 358
BLAST of Tan0012462 vs. ExPASy TrEMBL
Match:
W9R9S0 (Retrotrans_gag domain-containing protein OS=Morus notabilis OX=981085 GN=L484_004813 PE=4 SV=1)
HSP 1 Score: 328.2 bits (840), Expect = 4.6e-86
Identity = 199/394 (50.51%), Positives = 251/394 (63.71%), Query Frame = 0
Query: 6 RRSPRPLPRRTVDYASDYD-------ASPSPSQSL-YASNEDD--YDASESINFQPTDPK 65
RR+P P DY+S YD SP+ S N+DD DAS++ T+P
Sbjct: 19 RRTPTP-----QDYSSTYDDDYTTVVRSPNDSTEFDQPENDDDDNDDASDAPTDSATNPL 78
Query: 66 SKAQEIKGSDLLTSAE---SASNSPSYF--QSPNAAETLFPYINIAPLPAFHGGVDECPA 125
S Q S+ + + + SAS+SP Q P + Y+NIA P F GG +ECP
Sbjct: 79 SD-QFSSVSERINARKKSCSASHSPILHLPQQPVSQTGYNSYMNIAQFPIFRGGSEECPF 138
Query: 126 MHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAY 185
HLSRFAKVCRANN +S+DMMM+IFPVTLE EAALWYDLN+EPY +SWEE+KSSF AY
Sbjct: 139 AHLSRFAKVCRANNVSSIDMMMKIFPVTLEDEAALWYDLNVEPYEELSWEEIKSSFYHAY 198
Query: 186 NKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREE 245
KIELT+QLRS+LMTINQ E+VRSYFLRLQ ILKKWP + LSD LLK +F+DGLR +
Sbjct: 199 GKIELTEQLRSQLMTINQGDAESVRSYFLRLQWILKKWPE-HGLSDDLLKGVFVDGLRGD 258
Query: 246 FKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQ 305
F+EWM PQKP SLN+ALRLAF FEQV+S+R + ++CGFC G HEE CEVRERMR+
Sbjct: 259 FQEWMAPQKPGSLNKALRLAFCFEQVKSIRNVRRNASVKCGFCGGLHEERGCEVRERMRE 318
Query: 306 LWKSREKKKAVD--------LAESDGREAATATATAELVRSVSAISRNEAGVDKDG---- 365
LW K + + +S+G + + + RS + +N+ V++DG
Sbjct: 319 LWLKSNKDDGLGKGMLERNLIEKSEGVKELGRSVSMATSRSTCVVGKNDQ-VEEDGKEEE 378
Query: 366 GEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN 373
E+ KK+SQCQC KHQC K ++RN S VS N
Sbjct: 379 DELGSKKKRSQCQCGKHQCWKKNIERNNSTVSGN 404
BLAST of Tan0012462 vs. ExPASy TrEMBL
Match:
A0A6J5UDI4 (Retrotrans_gag domain-containing protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS20487 PE=4 SV=1)
HSP 1 Score: 326.2 bits (835), Expect = 1.7e-85
Identity = 188/361 (52.08%), Positives = 242/361 (67.04%), Query Frame = 0
Query: 22 DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQ----EIKGSDLLTSAESASNSP 81
D D S SQS N+ Y ASES P+D S +Q + S+ + + S +
Sbjct: 13 DDDCSNEVSQS---QNQSIYLASESETNSPSDQFSSSQPPPESVSSSNQIKARASLPSKT 72
Query: 82 SYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRAN-NAASVDMMMRIFP 141
F P +T + YI IAPLP F GG +ECP HL+RFAK+CRAN + +VD+M+RIFP
Sbjct: 73 KNFSEPTTNQTPY-YIPIAPLPIFRGGSNECPVTHLTRFAKICRANFSCPTVDVMVRIFP 132
Query: 142 VTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRS 201
VTLE EAALWYDLNI+PYP +SWEE++S F AY++I DQLRSEL I Q ++E VRS
Sbjct: 133 VTLENEAALWYDLNIDPYPSLSWEEIRSLFFQAYDQI---DQLRSELTMIKQGRDETVRS 192
Query: 202 YFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQV 261
YFLRLQ ILK+WP + L D +LK +F+DGLR+EFK+W++ +KPSSLN+ALRLAFGFE+V
Sbjct: 193 YFLRLQWILKRWP-DHGLQDNVLKGVFIDGLRKEFKDWIVAEKPSSLNDALRLAFGFEKV 252
Query: 262 RSVR--TSGKKRFLQCGFCEGPHEELLCEVRERMRQLW-KSREKKKAVDLAESDGREAAT 321
+SVR T+ K++ ++CGFC G HEE CEVRERMR+LW KS+E+
Sbjct: 253 KSVRATTAAKEKAVECGFCGGGHEEKGCEVRERMRKLWVKSKEE---------------- 312
Query: 322 ATATAELVRSVSAI-SRNEAGVDK-DGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSR 373
LVR VS + R E GV++ + GE+V LKKK QCQCWKHQC KKL+R+ S+V
Sbjct: 313 -----GLVRMVSVVGKREEEGVEREEEGELVDLKKKGQCQCWKHQCWKKKLERSKSLVVT 344
BLAST of Tan0012462 vs. ExPASy TrEMBL
Match:
A5C7E6 (Retrotrans_gag domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_007470 PE=4 SV=1)
HSP 1 Score: 324.7 bits (831), Expect = 5.1e-85
Identity = 188/361 (52.08%), Positives = 241/361 (66.76%), Query Frame = 0
Query: 19 YASDYDASPSPSQSLYASNEDDYDA----SESINFQPTDPKSKAQEIKGSDLLTSAESAS 78
+ DY SPSQS Y +E++ D +++ + T+ + + + +S
Sbjct: 158 FYDDY-TEQSPSQSPYEFDEEEEDEQSXYTDNESASGTNAPGDQFSLPALESIPKGKSFR 217
Query: 79 NSPSYFQSPNAAETLF--PYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMM 138
S S S N+ YINIAPLP F G DECP HLSRF KVCRANN +SV+M+M
Sbjct: 218 PSSSLNSSSNSLNPFXQSSYINIAPLPIFRGSSDECPVTHLSRFTKVCRANNVSSVEMIM 277
Query: 139 RIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEE 198
RIFPVTL+GEAALWYDLNIEPY +SWEE+KSSFL AY++ LTD+LRSELM INQ EE
Sbjct: 278 RIFPVTLDGEAALWYDLNIEPYSSLSWEEIKSSFLQAYHRXGLTDELRSELMMINQGTEE 337
Query: 199 NVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFG 258
+VRSYFLRLQ ILK+WP + L DGLL+ IF+DGLR++F++W+IPQKPSSLNEALRLAF
Sbjct: 338 SVRSYFLRLQWILKRWP-DHGLPDGLLEGIFIDGLRKDFQDWIIPQKPSSLNEALRLAFA 397
Query: 259 FEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKAVDLAESDGREAA 318
+E+V+S+R +K +CGFC G H+E CE+RERMR LW + KK+ D + GR
Sbjct: 398 WEKVQSIRGGREK---ECGFCSGGHDEEGCEIRERMRXLW-VKSKKQTRDYS---GRIVN 457
Query: 319 TATATAELVR--SVSAISRNEAGVDKDGGE-MVGLKKKSQCQCWKHQCGMKKLDRNLSIV 371
E R SV SR+ +++G E +G KKKSQCQC KHQC KKL+RN S++
Sbjct: 458 DEDGEKEFERRVSVGGESRBVGKNEEEGEEGXMGWKKKSQCQCGKHQCWKKKLERNNSLL 509
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAG6604769.1 | 4.6e-157 | 79.41 | hypothetical protein SDJN03_02086, partial [Cucurbita argyrosperma subsp. sorori...
EEF44287.1 | 8.2e-90 | 50.93 | conserved hypothetical protein [Ricinus communis]
KAF3973300.1 | 4.2e-86 | 55.94 | hypothetical protein CMV_003263 [Castanea mollissima]
KAF3973299.1 | 4.2e-86 | 55.94 | hypothetical protein CMV_003263 [Castanea mollissima]
EXB78111.1 | 9.4e-86 | 50.51 | hypothetical protein L484_004813 [Morus notabilis]
Match Name | E-value | Identity | Description
A0A7N2R9A7 | 6.2e-91 | 57.77 | Retrotrans_gag domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1
B9RWN5 | 4.0e-90 | 50.93 | Retrotrans_gag domain-containing protein OS=Ricinus communis OX=3988 GN=RCOM_102...
W9R9S0 | 4.6e-86 | 50.51 | Retrotrans_gag domain-containing protein OS=Morus notabilis OX=981085 GN=L484_00...
A0A6J5UDI4 | 1.7e-85 | 52.08 | Retrotrans_gag domain-containing protein OS=Prunus armeniaca OX=36596 GN=CURHAP_...
A5C7E6 | 5.1e-85 | 52.08 | Retrotrans_gag domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_00...
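The Identity column above repeats the percentage from each hit's Identity line: identical aligned positions divided by alignment length. A small sketch of that arithmetic, using the figures reported for the first NCBI nr hit (297 identical and 320 positive positions over 374 aligned columns). The helper name `percent` is hypothetical.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format a match count as a percentage of the alignment length."""
    return f"{100 * matches / alignment_length:.2f}"


print(percent(297, 374))  # identity of KAG6604769.1: 79.41
print(percent(320, 374))  # positives of KAG6604769.1: 85.56
```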