Tan0012462 (gene) Snake gourd v1

Overview
NameTan0012462
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotrans_gag domain-containing protein
LocationLG05: 5208783 .. 5210204 (-)
RNA-Seq ExpressionTan0012462
SyntenyTan0012462
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTCTGGTTTCACCGTTCAAATTCTCAACGGAGCCGTCGTCTTCTCATTTCTTTCCTTAATTCCTCCACTTTTCGATGGCGCGAAAACTCAGGCGTTCACCGCGGCCATTGCCTCGACGTACTGTCGACTATGCCTCTGATTATGATGCTTCTCCTTCTCCATCTCAGTCTCTCTATGCATCGAATGAAGACGACTATGACGCTTCTGAATCTATTAACTTCCAACCCACTGACCCCAAATCGAAAGCCCAAGAAATCAAGGGCTCCGATTTATTGACCTCTGCAGAATCCGCCTCCAACAGTCCATCGTATTTTCAGAGTCCAAACGCCGCCGAAACTCTATTTCCATACATCAACATTGCACCGTTGCCAGCTTTTCACGGCGGCGTCGATGAGTGTCCGGCGATGCATTTAAGCAGATTCGCCAAAGTCTGCCGTGCGAACAACGCGGCCTCCGTCGACATGATGATGAGGATCTTTCCGGTGACGTTAGAGGGTGAGGCTGCGCTTTGGTACGACTTGAACATTGAACCGTACCCTCCGATTTCTTGGGAAGAATTGAAGTCTTCGTTCTTGGACGCATATAATAAAATTGAATTGACTGATCAGTTGCGATCGGAGCTTATGACGATCAATCAACAGCAGGAGGAGAATGTACGTTCGTATTTTCTGAGGCTGCAGTTGATTTTGAAGAAATGGCCGACGGGGAACGAACTTTCCGATGGCTTGTTGAAAGCGATTTTCATGGATGGATTGAGAGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCCAGTTCACTGAACGAGGCATTGAGACTTGCATTTGGGTTCGAACAAGTGAGAAGCGTTCGTACATCTGGCAAAAAGCGCTTTCTGCAGTGTGGGTTTTGTGAGGGGCCGCATGAGGAACTGCTTTGTGAGGTTAGGGAGAGAATGAGACAGTTGTGGAAGAGCAGGGAAAAGAAGAAGGCTGTTGATCTGGCAGAGAGTGACGGCCGTGAAGCGGCGACGGCGACGGCGACGGCGGAGCTTGTGAGATCGGTTTCGGCGATAAGTAGAAATGAAGCGGGGGTTGATAAGGATGGTGGAGAGATGGTGGGTTTGAAGAAGAAGAGTCAGTGCCAGTGTTGGAAGCATCAGTGCGGGATGAAGAAATTGGATCGAAACCTTAGCATCGTATCAAGAAATTCTAAAGGCTAATTTCAAGACTATTTTGGGGGGCAAGTTGCTGATAGATTATGAATATTTGTGTGTGTGTTTAATTTAAGAGTAAATTCTTTTGTTTGTAGAAGAATCTTGAAATCTAACTATCACTGATTCATCTATTTGATGCATGAAGAGAAATTGTATTGCCTTTTCCTGCTTCCACTGTGGGATCAATGTAATAATATTCTTATCTGAAACAGCATCTTATGAACAA

mRNA sequence

GTCTGGTTTCACCGTTCAAATTCTCAACGGAGCCGTCGTCTTCTCATTTCTTTCCTTAATTCCTCCACTTTTCGATGGCGCGAAAACTCAGGCGTTCACCGCGGCCATTGCCTCGACGTACTGTCGACTATGCCTCTGATTATGATGCTTCTCCTTCTCCATCTCAGTCTCTCTATGCATCGAATGAAGACGACTATGACGCTTCTGAATCTATTAACTTCCAACCCACTGACCCCAAATCGAAAGCCCAAGAAATCAAGGGCTCCGATTTATTGACCTCTGCAGAATCCGCCTCCAACAGTCCATCGTATTTTCAGAGTCCAAACGCCGCCGAAACTCTATTTCCATACATCAACATTGCACCGTTGCCAGCTTTTCACGGCGGCGTCGATGAGTGTCCGGCGATGCATTTAAGCAGATTCGCCAAAGTCTGCCGTGCGAACAACGCGGCCTCCGTCGACATGATGATGAGGATCTTTCCGGTGACGTTAGAGGGTGAGGCTGCGCTTTGGTACGACTTGAACATTGAACCGTACCCTCCGATTTCTTGGGAAGAATTGAAGTCTTCGTTCTTGGACGCATATAATAAAATTGAATTGACTGATCAGTTGCGATCGGAGCTTATGACGATCAATCAACAGCAGGAGGAGAATGTACGTTCGTATTTTCTGAGGCTGCAGTTGATTTTGAAGAAATGGCCGACGGGGAACGAACTTTCCGATGGCTTGTTGAAAGCGATTTTCATGGATGGATTGAGAGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCCAGTTCACTGAACGAGGCATTGAGACTTGCATTTGGGTTCGAACAAGTGAGAAGCGTTCGTACATCTGGCAAAAAGCGCTTTCTGCAGTGTGGGTTTTGTGAGGGGCCGCATGAGGAACTGCTTTGTGAGGTTAGGGAGAGAATGAGACAGTTGTGGAAGAGCAGGGAAAAGAAGAAGGCTGTTGATCTGGCAGAGAGTGACGGCCGTGAAGCGGCGACGGCGACGGCGACGGCGGAGCTTGTGAGATCGGTTTCGGCGATAAGTAGAAATGAAGCGGGGGTTGATAAGGATGGTGGAGAGATGGTGGGTTTGAAGAAGAAGAGTCAGTGCCAGTGTTGGAAGCATCAGTGCGGGATGAAGAAATTGGATCGAAACCTTAGCATCGTATCAAGAAATTCTAAAGGCTAATTTCAAGACTATTTTGGGGGGCAAGTTGCTGATAGATTATGAATATTTGTGTGTGTGTTTAATTTAAGAGTAAATTCTTTTGTTTGTAGAAGAATCTTGAAATCTAACTATCACTGATTCATCTATTTGATGCATGAAGAGAAATTGTATTGCCTTTTCCTGCTTCCACTGTGGGATCAATGTAATAATATTCTTATCTGAAACAGCATCTTATGAACAA

Coding sequence (CDS)

ATGGCGCGAAAACTCAGGCGTTCACCGCGGCCATTGCCTCGACGTACTGTCGACTATGCCTCTGATTATGATGCTTCTCCTTCTCCATCTCAGTCTCTCTATGCATCGAATGAAGACGACTATGACGCTTCTGAATCTATTAACTTCCAACCCACTGACCCCAAATCGAAAGCCCAAGAAATCAAGGGCTCCGATTTATTGACCTCTGCAGAATCCGCCTCCAACAGTCCATCGTATTTTCAGAGTCCAAACGCCGCCGAAACTCTATTTCCATACATCAACATTGCACCGTTGCCAGCTTTTCACGGCGGCGTCGATGAGTGTCCGGCGATGCATTTAAGCAGATTCGCCAAAGTCTGCCGTGCGAACAACGCGGCCTCCGTCGACATGATGATGAGGATCTTTCCGGTGACGTTAGAGGGTGAGGCTGCGCTTTGGTACGACTTGAACATTGAACCGTACCCTCCGATTTCTTGGGAAGAATTGAAGTCTTCGTTCTTGGACGCATATAATAAAATTGAATTGACTGATCAGTTGCGATCGGAGCTTATGACGATCAATCAACAGCAGGAGGAGAATGTACGTTCGTATTTTCTGAGGCTGCAGTTGATTTTGAAGAAATGGCCGACGGGGAACGAACTTTCCGATGGCTTGTTGAAAGCGATTTTCATGGATGGATTGAGAGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCCAGTTCACTGAACGAGGCATTGAGACTTGCATTTGGGTTCGAACAAGTGAGAAGCGTTCGTACATCTGGCAAAAAGCGCTTTCTGCAGTGTGGGTTTTGTGAGGGGCCGCATGAGGAACTGCTTTGTGAGGTTAGGGAGAGAATGAGACAGTTGTGGAAGAGCAGGGAAAAGAAGAAGGCTGTTGATCTGGCAGAGAGTGACGGCCGTGAAGCGGCGACGGCGACGGCGACGGCGGAGCTTGTGAGATCGGTTTCGGCGATAAGTAGAAATGAAGCGGGGGTTGATAAGGATGGTGGAGAGATGGTGGGTTTGAAGAAGAAGAGTCAGTGCCAGTGTTGGAAGCATCAGTGCGGGATGAAGAAATTGGATCGAAACCTTAGCATCGTATCAAGAAATTCTAAAGGCTAA

Protein sequence

MARKLRRSPRPLPRRTVDYASDYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRNSKG
Homology
BLAST of Tan0012462 vs. NCBI nr
Match: KAG6604769.1 (hypothetical protein SDJN03_02086, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 565.1 bits (1455), Expect = 4.6e-157
Identity = 297/374 (79.41%), Postives = 320/374 (85.56%), Query Frame = 0

Query: 1   MARKLRRSPRPLPRRTVDYASDYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQE 60
           MA KLRRSP PL RR  +YA+DYDA  S SQSL ASNEDDYDASES NFQ +  KSK+ E
Sbjct: 1   MAPKLRRSPPPLRRR--NYATDYDA--SLSQSLDASNEDDYDASESNNFQTSGHKSKSLE 60

Query: 61  IKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVC 120
           I       + ESA+NSP+  QSPNAA T+FPYINIAPLP FHGG DECPA HLSRFAKVC
Sbjct: 61  I-------NEESATNSPTNLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVC 120

Query: 121 RANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLR 180
           RANNAASV++MMRIFPVTL+GEA LWYDLNIEPYPPISWEELKSSFLDAYNKIEL +QLR
Sbjct: 121 RANNAASVEIMMRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLR 180

Query: 181 SELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKP 240
           SELMTI+Q+ EENVRSYFLRLQLILKKWP GNELSDG LKAIFMDGLREEFKEWMIPQKP
Sbjct: 181 SELMTISQRPEENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKP 240

Query: 241 SSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKA 300
            SLNEALRLAFG EQV  +RTSG KRFL+CGFCEG HEEL+CEVRERMR+LWKSREKK  
Sbjct: 241 DSLNEALRLAFGLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKKNG 300

Query: 301 VDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMK 360
            D+AES+G        TAELVRSVSAISRNEA V KDGGEMVGLKKK QCQCWKHQCGMK
Sbjct: 301 GDMAESEGHN------TAELVRSVSAISRNEAEVGKDGGEMVGLKKKGQCQCWKHQCGMK 357

Query: 361 KLDRNLSIVSRNSK 375
           KLDRNLS++S+ SK
Sbjct: 361 KLDRNLSMLSKTSK 357

BLAST of Tan0012462 vs. NCBI nr
Match: EEF44287.1 (conserved hypothetical protein [Ricinus communis])

HSP 1 Score: 341.7 bits (875), Expect = 8.2e-90
Identity = 192/377 (50.93%), Postives = 251/377 (66.58%), Query Frame = 0

Query: 1   MARKLRRSPRPLPRRTVDYAS--DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKA 60
           M RK + S     R+++ ++S  DY  S SPSQS Y SN+DD +  +    QP   +S  
Sbjct: 1   MTRKAKNS-----RKSLQFSSRHDYSESTSPSQSPYDSNDDDDEIEDDDEEQPIISESVT 60

Query: 61  QEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAK 120
             +    L  S+ S SNS      PN +     YIN+APLP FHG  +ECP  HLSRF K
Sbjct: 61  NSLNADQL--SSSSYSNS-----QPNNS-----YINVAPLPVFHGNSNECPIAHLSRFVK 120

Query: 121 VCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQ 180
           VCRANNA+S DMMMRIFPVTLE EAALWYDLNI+PYP +SW+E+  SFL+AY +I+L DQ
Sbjct: 121 VCRANNASSTDMMMRIFPVTLENEAALWYDLNIQPYPSLSWDEIMLSFLEAYQRIKLVDQ 180

Query: 181 LRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQ 240
           LRS+LM +NQ  +E+VRSYF+RLQ ILK+WP  + LSD +LK IF+DGL   FK+W+IP 
Sbjct: 181 LRSDLMMLNQGSDESVRSYFMRLQWILKRWP-DHGLSDNMLKWIFIDGLMGNFKDWIIPH 240

Query: 241 KPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKK 300
           KP+SLNEALRLAF FEQV+S+R + K++ ++CGFCEG HEE  C VRE+MR+L+++ +KK
Sbjct: 241 KPNSLNEALRLAFSFEQVKSIRGT-KQKVVKCGFCEGSHEENCCVVREKMRELFRNSKKK 300

Query: 301 KAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKK--KSQCQCWKHQ 360
             +    S+  EA    A  +  +        + G DK+   M+   K  KS CQC KH 
Sbjct: 301 MMIPKEASERSEAGNEMAENKDGKEGEEEEEVDVGDDKEEKRMLSSSKTGKSPCQCSKHH 358

Query: 361 CGMKKLDRNLSIVSRNS 374
           C MKK +R+ S+ +RNS
Sbjct: 361 CWMKKFERSNSVTTRNS 358

BLAST of Tan0012462 vs. NCBI nr
Match: KAF3973300.1 (hypothetical protein CMV_003263 [Castanea mollissima])

HSP 1 Score: 329.3 bits (843), Expect = 4.2e-86
Identity = 193/345 (55.94%), Postives = 243/345 (70.43%), Query Frame = 0

Query: 37  NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFP 96
           N+D Y  SES    P D  S   +  + +  +L T+A   S SN P   Q P+    L  
Sbjct: 63  NDDAYIGSESETNAPGDRFSSQLRDPDSQSVNLSTTAFPNSTSNFPKISQPPST--HLAS 122

Query: 97  YINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNI 156
           YINIAP P FHG  +ECP  H+SRFAKVC ANN ++ DMMM IFPVTLE EAALWYDLNI
Sbjct: 123 YINIAPFPIFHGNPNECPVKHVSRFAKVCVANNVSTTDMMMSIFPVTLEDEAALWYDLNI 182

Query: 157 EPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTG 216
           +PYP ++WEE+KSSFL AY+KI++ DQLRSELM INQ  EE+VRSYFLRLQ ILK+WP  
Sbjct: 183 DPYPSLTWEEIKSSFLHAYHKIQVVDQLRSELMMINQGDEESVRSYFLRLQWILKQWP-D 242

Query: 217 NELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCG 276
           + + DGLLK +F+DGLREEF++W+ PQKP SL+EALRLAF FEQV+S+R   K+  L+CG
Sbjct: 243 HGIPDGLLKGVFIDGLREEFRDWIFPQKPDSLHEALRLAFAFEQVKSIRAVRKE--LKCG 302

Query: 277 FCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSV-----S 336
           FC+G HEE  CEVRERMR+LW+ S+EK++ V LA+S   +        ELVRSV     S
Sbjct: 303 FCDGMHEERDCEVRERMRKLWRESKEKEEPVVLAKSTRSDDDLG---KELVRSVSIGASS 362

Query: 337 AISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVS 371
           ++ +N  G  ++GG M G  KK+Q Q  K+Q  MKKL+RN S++S
Sbjct: 363 SVGKNNEG--EEGGFMDG--KKNQFQYRKYQRWMKKLERNNSLIS 395

BLAST of Tan0012462 vs. NCBI nr
Match: KAF3973299.1 (hypothetical protein CMV_003263 [Castanea mollissima])

HSP 1 Score: 329.3 bits (843), Expect = 4.2e-86
Identity = 193/345 (55.94%), Postives = 243/345 (70.43%), Query Frame = 0

Query: 37  NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFP 96
           N+D Y  SES    P D  S   +  + +  +L T+A   S SN P   Q P+    L  
Sbjct: 63  NDDAYIGSESETNAPGDRFSSQLRDPDSQSVNLSTTAFPNSTSNFPKISQPPST--HLAS 122

Query: 97  YINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNI 156
           YINIAP P FHG  +ECP  H+SRFAKVC ANN ++ DMMM IFPVTLE EAALWYDLNI
Sbjct: 123 YINIAPFPIFHGNPNECPVKHVSRFAKVCVANNVSTTDMMMSIFPVTLEDEAALWYDLNI 182

Query: 157 EPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTG 216
           +PYP ++WEE+KSSFL AY+KI++ DQLRSELM INQ  EE+VRSYFLRLQ ILK+WP  
Sbjct: 183 DPYPSLTWEEIKSSFLHAYHKIQVVDQLRSELMMINQGDEESVRSYFLRLQWILKQWP-D 242

Query: 217 NELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCG 276
           + + DGLLK +F+DGLREEF++W+ PQKP SL+EALRLAF FEQV+S+R   K+  L+CG
Sbjct: 243 HGIPDGLLKGVFIDGLREEFRDWIFPQKPDSLHEALRLAFAFEQVKSIRAVRKE--LKCG 302

Query: 277 FCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSV-----S 336
           FC+G HEE  CEVRERMR+LW+ S+EK++ V LA+S   +        ELVRSV     S
Sbjct: 303 FCDGMHEERDCEVRERMRKLWRESKEKEEPVVLAKSTRSDDDLG---KELVRSVSIGASS 362

Query: 337 AISRNEAGVDKDGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVS 371
           ++ +N  G  ++GG M G  KK+Q Q  K+Q  MKKL+RN S++S
Sbjct: 363 SVGKNNEG--EEGGFMDG--KKNQFQYRKYQRWMKKLERNNSLIS 395

BLAST of Tan0012462 vs. NCBI nr
Match: EXB78111.1 (hypothetical protein L484_004813 [Morus notabilis])

HSP 1 Score: 328.2 bits (840), Expect = 9.4e-86
Identity = 199/394 (50.51%), Postives = 251/394 (63.71%), Query Frame = 0

Query: 6   RRSPRPLPRRTVDYASDYD-------ASPSPSQSL-YASNEDD--YDASESINFQPTDPK 65
           RR+P P      DY+S YD        SP+ S       N+DD   DAS++     T+P 
Sbjct: 19  RRTPTP-----QDYSSTYDDDYTTVVRSPNDSTEFDQPENDDDDNDDASDAPTDSATNPL 78

Query: 66  SKAQEIKGSDLLTSAE---SASNSPSYF--QSPNAAETLFPYINIAPLPAFHGGVDECPA 125
           S  Q    S+ + + +   SAS+SP     Q P +      Y+NIA  P F GG +ECP 
Sbjct: 79  SD-QFSSVSERINARKKSCSASHSPILHLPQQPVSQTGYNSYMNIAQFPIFRGGSEECPF 138

Query: 126 MHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAY 185
            HLSRFAKVCRANN +S+DMMM+IFPVTLE EAALWYDLN+EPY  +SWEE+KSSF  AY
Sbjct: 139 AHLSRFAKVCRANNVSSIDMMMKIFPVTLEDEAALWYDLNVEPYEELSWEEIKSSFYHAY 198

Query: 186 NKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREE 245
            KIELT+QLRS+LMTINQ   E+VRSYFLRLQ ILKKWP  + LSD LLK +F+DGLR +
Sbjct: 199 GKIELTEQLRSQLMTINQGDAESVRSYFLRLQWILKKWPE-HGLSDDLLKGVFVDGLRGD 258

Query: 246 FKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQ 305
           F+EWM PQKP SLN+ALRLAF FEQV+S+R   +   ++CGFC G HEE  CEVRERMR+
Sbjct: 259 FQEWMAPQKPGSLNKALRLAFCFEQVKSIRNVRRNASVKCGFCGGLHEERGCEVRERMRE 318

Query: 306 LWKSREKKKAVD--------LAESDGREAATATATAELVRSVSAISRNEAGVDKDG---- 365
           LW    K   +         + +S+G +    + +    RS   + +N+  V++DG    
Sbjct: 319 LWLKSNKDDGLGKGMLERNLIEKSEGVKELGRSVSMATSRSTCVVGKNDQ-VEEDGKEEE 378

Query: 366 GEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN 373
            E+   KK+SQCQC KHQC  K ++RN S VS N
Sbjct: 379 DELGSKKKRSQCQCGKHQCWKKNIERNNSTVSGN 404

BLAST of Tan0012462 vs. ExPASy TrEMBL
Match: A0A7N2R9A7 (Retrotrans_gag domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 344.4 bits (882), Expect = 6.2e-91
Identity = 197/341 (57.77%), Postives = 247/341 (72.43%), Query Frame = 0

Query: 37  NEDDYDASESINFQPTDPKS---KAQEIKGSDLLTSA--ESASNSPSYFQSPNAAETLFP 96
           N+D Y +SES    P D  S   +  + +  +L T+A   S SN P   Q P+    L  
Sbjct: 62  NDDAYISSESETNAPGDRFSSQLRDPDSQSINLSTTAFPNSTSNFPKISQPPST--HLAS 121

Query: 97  YINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNI 156
           Y+NIAP+P FHG  +ECP  H+SRFAKVC ANN ++ DMMMRIFPVTLE EAALWYDLNI
Sbjct: 122 YMNIAPIPIFHGNTNECPVKHVSRFAKVCVANNVSTTDMMMRIFPVTLEDEAALWYDLNI 181

Query: 157 EPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTG 216
           EPYP ++WEE+KSSFL AY+KIE+ DQLRSELM INQ  EE+VRSYFLRLQ ILK+WP  
Sbjct: 182 EPYPSLTWEEIKSSFLHAYHKIEVVDQLRSELMMINQGDEESVRSYFLRLQWILKQWP-D 241

Query: 217 NELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCG 276
           + +SDGLLK +F+DGLREEF+ W+IPQKP SL+EALRLAFGFEQV+S+R   K+  L+CG
Sbjct: 242 HGISDGLLKGVFIDGLREEFRGWIIPQKPDSLHEALRLAFGFEQVKSIRAVRKE--LKCG 301

Query: 277 FCEGPHEELLCEVRERMRQLWK-SREKKKAVDLAESDGREAATATATAELVRSVSAISRN 336
           FC+G HEE  CEVRERMR+LW+ S+EK++AV LA+S G +        ELVRSVS  + +
Sbjct: 302 FCDGMHEERDCEVRERMRKLWRESKEKEEAVVLAKSTGGDDELG---KELVRSVSIGASS 361

Query: 337 EAGVDKDGGEMVGLK-KKSQCQCWKHQCGMKKLDRNLSIVS 371
             G + +G E   +  KK+Q Q  K+Q  MKKL+RN S++S
Sbjct: 362 SVGKNNEGEEAGFMDGKKNQFQYGKYQRWMKKLERNNSLIS 394

BLAST of Tan0012462 vs. ExPASy TrEMBL
Match: B9RWN5 (Retrotrans_gag domain-containing protein OS=Ricinus communis OX=3988 GN=RCOM_1022950 PE=4 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 4.0e-90
Identity = 192/377 (50.93%), Postives = 251/377 (66.58%), Query Frame = 0

Query: 1   MARKLRRSPRPLPRRTVDYAS--DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKA 60
           M RK + S     R+++ ++S  DY  S SPSQS Y SN+DD +  +    QP   +S  
Sbjct: 1   MTRKAKNS-----RKSLQFSSRHDYSESTSPSQSPYDSNDDDDEIEDDDEEQPIISESVT 60

Query: 61  QEIKGSDLLTSAESASNSPSYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAK 120
             +    L  S+ S SNS      PN +     YIN+APLP FHG  +ECP  HLSRF K
Sbjct: 61  NSLNADQL--SSSSYSNS-----QPNNS-----YINVAPLPVFHGNSNECPIAHLSRFVK 120

Query: 121 VCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQ 180
           VCRANNA+S DMMMRIFPVTLE EAALWYDLNI+PYP +SW+E+  SFL+AY +I+L DQ
Sbjct: 121 VCRANNASSTDMMMRIFPVTLENEAALWYDLNIQPYPSLSWDEIMLSFLEAYQRIKLVDQ 180

Query: 181 LRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQ 240
           LRS+LM +NQ  +E+VRSYF+RLQ ILK+WP  + LSD +LK IF+DGL   FK+W+IP 
Sbjct: 181 LRSDLMMLNQGSDESVRSYFMRLQWILKRWP-DHGLSDNMLKWIFIDGLMGNFKDWIIPH 240

Query: 241 KPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKK 300
           KP+SLNEALRLAF FEQV+S+R + K++ ++CGFCEG HEE  C VRE+MR+L+++ +KK
Sbjct: 241 KPNSLNEALRLAFSFEQVKSIRGT-KQKVVKCGFCEGSHEENCCVVREKMRELFRNSKKK 300

Query: 301 KAVDLAESDGREAATATATAELVRSVSAISRNEAGVDKDGGEMVGLKK--KSQCQCWKHQ 360
             +    S+  EA    A  +  +        + G DK+   M+   K  KS CQC KH 
Sbjct: 301 MMIPKEASERSEAGNEMAENKDGKEGEEEEEVDVGDDKEEKRMLSSSKTGKSPCQCSKHH 358

Query: 361 CGMKKLDRNLSIVSRNS 374
           C MKK +R+ S+ +RNS
Sbjct: 361 CWMKKFERSNSVTTRNS 358

BLAST of Tan0012462 vs. ExPASy TrEMBL
Match: W9R9S0 (Retrotrans_gag domain-containing protein OS=Morus notabilis OX=981085 GN=L484_004813 PE=4 SV=1)

HSP 1 Score: 328.2 bits (840), Expect = 4.6e-86
Identity = 199/394 (50.51%), Postives = 251/394 (63.71%), Query Frame = 0

Query: 6   RRSPRPLPRRTVDYASDYD-------ASPSPSQSL-YASNEDD--YDASESINFQPTDPK 65
           RR+P P      DY+S YD        SP+ S       N+DD   DAS++     T+P 
Sbjct: 19  RRTPTP-----QDYSSTYDDDYTTVVRSPNDSTEFDQPENDDDDNDDASDAPTDSATNPL 78

Query: 66  SKAQEIKGSDLLTSAE---SASNSPSYF--QSPNAAETLFPYINIAPLPAFHGGVDECPA 125
           S  Q    S+ + + +   SAS+SP     Q P +      Y+NIA  P F GG +ECP 
Sbjct: 79  SD-QFSSVSERINARKKSCSASHSPILHLPQQPVSQTGYNSYMNIAQFPIFRGGSEECPF 138

Query: 126 MHLSRFAKVCRANNAASVDMMMRIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAY 185
            HLSRFAKVCRANN +S+DMMM+IFPVTLE EAALWYDLN+EPY  +SWEE+KSSF  AY
Sbjct: 139 AHLSRFAKVCRANNVSSIDMMMKIFPVTLEDEAALWYDLNVEPYEELSWEEIKSSFYHAY 198

Query: 186 NKIELTDQLRSELMTINQQQEENVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREE 245
            KIELT+QLRS+LMTINQ   E+VRSYFLRLQ ILKKWP  + LSD LLK +F+DGLR +
Sbjct: 199 GKIELTEQLRSQLMTINQGDAESVRSYFLRLQWILKKWPE-HGLSDDLLKGVFVDGLRGD 258

Query: 246 FKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQ 305
           F+EWM PQKP SLN+ALRLAF FEQV+S+R   +   ++CGFC G HEE  CEVRERMR+
Sbjct: 259 FQEWMAPQKPGSLNKALRLAFCFEQVKSIRNVRRNASVKCGFCGGLHEERGCEVRERMRE 318

Query: 306 LWKSREKKKAVD--------LAESDGREAATATATAELVRSVSAISRNEAGVDKDG---- 365
           LW    K   +         + +S+G +    + +    RS   + +N+  V++DG    
Sbjct: 319 LWLKSNKDDGLGKGMLERNLIEKSEGVKELGRSVSMATSRSTCVVGKNDQ-VEEDGKEEE 378

Query: 366 GEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSRN 373
            E+   KK+SQCQC KHQC  K ++RN S VS N
Sbjct: 379 DELGSKKKRSQCQCGKHQCWKKNIERNNSTVSGN 404

BLAST of Tan0012462 vs. ExPASy TrEMBL
Match: A0A6J5UDI4 (Retrotrans_gag domain-containing protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS20487 PE=4 SV=1)

HSP 1 Score: 326.2 bits (835), Expect = 1.7e-85
Identity = 188/361 (52.08%), Postives = 242/361 (67.04%), Query Frame = 0

Query: 22  DYDASPSPSQSLYASNEDDYDASESINFQPTDPKSKAQ----EIKGSDLLTSAESASNSP 81
           D D S   SQS    N+  Y ASES    P+D  S +Q     +  S+ + +  S  +  
Sbjct: 13  DDDCSNEVSQS---QNQSIYLASESETNSPSDQFSSSQPPPESVSSSNQIKARASLPSKT 72

Query: 82  SYFQSPNAAETLFPYINIAPLPAFHGGVDECPAMHLSRFAKVCRAN-NAASVDMMMRIFP 141
             F  P   +T + YI IAPLP F GG +ECP  HL+RFAK+CRAN +  +VD+M+RIFP
Sbjct: 73  KNFSEPTTNQTPY-YIPIAPLPIFRGGSNECPVTHLTRFAKICRANFSCPTVDVMVRIFP 132

Query: 142 VTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEENVRS 201
           VTLE EAALWYDLNI+PYP +SWEE++S F  AY++I   DQLRSEL  I Q ++E VRS
Sbjct: 133 VTLENEAALWYDLNIDPYPSLSWEEIRSLFFQAYDQI---DQLRSELTMIKQGRDETVRS 192

Query: 202 YFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQV 261
           YFLRLQ ILK+WP  + L D +LK +F+DGLR+EFK+W++ +KPSSLN+ALRLAFGFE+V
Sbjct: 193 YFLRLQWILKRWP-DHGLQDNVLKGVFIDGLRKEFKDWIVAEKPSSLNDALRLAFGFEKV 252

Query: 262 RSVR--TSGKKRFLQCGFCEGPHEELLCEVRERMRQLW-KSREKKKAVDLAESDGREAAT 321
           +SVR  T+ K++ ++CGFC G HEE  CEVRERMR+LW KS+E+                
Sbjct: 253 KSVRATTAAKEKAVECGFCGGGHEEKGCEVRERMRKLWVKSKEE---------------- 312

Query: 322 ATATAELVRSVSAI-SRNEAGVDK-DGGEMVGLKKKSQCQCWKHQCGMKKLDRNLSIVSR 373
                 LVR VS +  R E GV++ + GE+V LKKK QCQCWKHQC  KKL+R+ S+V  
Sbjct: 313 -----GLVRMVSVVGKREEEGVEREEEGELVDLKKKGQCQCWKHQCWKKKLERSKSLVVT 344

BLAST of Tan0012462 vs. ExPASy TrEMBL
Match: A5C7E6 (Retrotrans_gag domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_007470 PE=4 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 5.1e-85
Identity = 188/361 (52.08%), Postives = 241/361 (66.76%), Query Frame = 0

Query: 19  YASDYDASPSPSQSLYASNEDDYDA----SESINFQPTDPKSKAQEIKGSDLLTSAESAS 78
           +  DY    SPSQS Y  +E++ D     +++ +   T+       +   + +   +S  
Sbjct: 158 FYDDY-TEQSPSQSPYEFDEEEEDEQSXYTDNESASGTNAPGDQFSLPALESIPKGKSFR 217

Query: 79  NSPSYFQSPNAAETLF--PYINIAPLPAFHGGVDECPAMHLSRFAKVCRANNAASVDMMM 138
            S S   S N+        YINIAPLP F G  DECP  HLSRF KVCRANN +SV+M+M
Sbjct: 218 PSSSLNSSSNSLNPFXQSSYINIAPLPIFRGSSDECPVTHLSRFTKVCRANNVSSVEMIM 277

Query: 139 RIFPVTLEGEAALWYDLNIEPYPPISWEELKSSFLDAYNKIELTDQLRSELMTINQQQEE 198
           RIFPVTL+GEAALWYDLNIEPY  +SWEE+KSSFL AY++  LTD+LRSELM INQ  EE
Sbjct: 278 RIFPVTLDGEAALWYDLNIEPYSSLSWEEIKSSFLQAYHRXGLTDELRSELMMINQGTEE 337

Query: 199 NVRSYFLRLQLILKKWPTGNELSDGLLKAIFMDGLREEFKEWMIPQKPSSLNEALRLAFG 258
           +VRSYFLRLQ ILK+WP  + L DGLL+ IF+DGLR++F++W+IPQKPSSLNEALRLAF 
Sbjct: 338 SVRSYFLRLQWILKRWP-DHGLPDGLLEGIFIDGLRKDFQDWIIPQKPSSLNEALRLAFA 397

Query: 259 FEQVRSVRTSGKKRFLQCGFCEGPHEELLCEVRERMRQLWKSREKKKAVDLAESDGREAA 318
           +E+V+S+R   +K   +CGFC G H+E  CE+RERMR LW  + KK+  D +   GR   
Sbjct: 398 WEKVQSIRGGREK---ECGFCSGGHDEEGCEIRERMRXLW-VKSKKQTRDYS---GRIVN 457

Query: 319 TATATAELVR--SVSAISRNEAGVDKDGGE-MVGLKKKSQCQCWKHQCGMKKLDRNLSIV 371
                 E  R  SV   SR+    +++G E  +G KKKSQCQC KHQC  KKL+RN S++
Sbjct: 458 DEDGEKEFERRVSVGGESRBVGKNEEEGEEGXMGWKKKSQCQCGKHQCWKKKLERNNSLL 509

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG6604769.14.6e-15779.41hypothetical protein SDJN03_02086, partial [Cucurbita argyrosperma subsp. sorori... [more]
EEF44287.18.2e-9050.93conserved hypothetical protein [Ricinus communis][more]
KAF3973300.14.2e-8655.94hypothetical protein CMV_003263 [Castanea mollissima][more]
KAF3973299.14.2e-8655.94hypothetical protein CMV_003263 [Castanea mollissima][more]
EXB78111.19.4e-8650.51hypothetical protein L484_004813 [Morus notabilis][more]
Match NameE-valueIdentityDescription
A0A7N2R9A76.2e-9157.77Retrotrans_gag domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1[more]
B9RWN54.0e-9050.93Retrotrans_gag domain-containing protein OS=Ricinus communis OX=3988 GN=RCOM_102... [more]
W9R9S04.6e-8650.51Retrotrans_gag domain-containing protein OS=Morus notabilis OX=981085 GN=L484_00... [more]
A0A6J5UDI41.7e-8552.08Retrotrans_gag domain-containing protein OS=Prunus armeniaca OX=36596 GN=CURHAP_... [more]
A5C7E65.1e-8552.08Retrotrans_gag domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_00... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 134..229
e-value: 1.9E-10
score: 40.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..59
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 23..59
NoneNo IPR availablePANTHERPTHR33223FAMILY NOT NAMEDcoord: 35..373

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0012462.1Tan0012462.1mRNA