Tan0013100 (gene) Snake gourd v1

Overview
Name: Tan0013100
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Ty3-gypsy retrotransposon protein
Location: LG02: 9307925 .. 9313008 (+)
RNA-Seq Expression: Tan0013100
Synteny: Tan0013100
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAATTGGGTAAAATAAATAAAACCCAAATCCAAATGATTAGGCCCAAATCCAATAAAGCCCAAGTTCAGGTTGTTGGGCCCAAAAGCCAGTAAAGCCCAAACCCACATACCTTGGAAAGTTCTATAAATAGAGAGTTTCCCCTTCATTCCAAAGGTTCAGAAAATTCACTCCCAAAAGGAATTCAGAGCAAAGTTCTCAAGTGCTGAAAACTCCCTGAAGTCCTTGGTGCTGAACGTTGCAAGACGAGTCTTCAAGTGCTGAACGTTGTAAGACAAGTCCCTAGGTGCGGAACACTACAAGTCCACAGGTGTTAAGCACTCCGCGAAGACCAAATATTCTTCCAAGTCTCCAACCTCGAGAACAAGCTCTTTAGCCCTCGAGTGACATACGTACAAGAGAGAGAATCAGAGGATCATACTTAGAGATTGTATTCCATACTCACAAAATTAATATAATACAAAGTTTATTACACGTGTCTCGTTGTTATTCGTTCGCTGAATTCACGTGTTTACAAATTGGCACGCCCGGTGGGACTATCTCTACCTCTCATCTCTTTCTCTTACAGTAAAACAATGGAATCAAAGAAGGTTGCATCAACTGCTGCTACTGCCGCAAGCAACACCTAATACAGGGCCAGTAACACGCAGTCGTTCTTGGTGGGTTGAGATAAAGGAGGACCGAACCCCTGATGCGATTGCAAACAAGATCGTCAAGTTGATTGAAGGATCCTCCAAGGATAGAGTGGTCGTCAAAGATAACCCGTTGTTTGACTAGTTTACCCCTGTTGTCGGTCAATCAAAGGAGGCATCGAATCAAGATGTGATGTCTGTGATGATGGCCGATGTGGAATCCGACGAAAGGATGACAGAGATGGAGAGAAAGATTAGTCTCCTGATGAAGGCGGTTGAAGAAAGGGATCTAGAGATTGCCTACTTGAAGAATCAAATGCAGAATCGCGAGACTGCTGAGTCAAGTCAAACCCCTGTTGCAAAGAAGAGTGACAAAGGGAAGATTGTTGTTCAAGAAGAGCAACCACCAAACTCGGCCTTGGTAGCTTCTCTGTCCGTCCAACAGCTACAGGACGTGATCATGAGCTCCATCAGAGCTCAATACGATGGACCCACCCAAAGTTCTTTCATGTACTCTAAGCCGTACACTAAGAGAATAGACAATCTGAGGATGCCCGTCGGATACCAACCTCCCAAATTCCAACAGTTCGATGGGAAAGGTAACCCGAAGCAACATATCACTCACTTTGTGGAAACATGCGAGATTGCTGGGACTCGCGGCGACCTTCTAGTCAAAGCGATGATTCGTTAAATGCTGCGAAAGGAAACGCCTTTGACTGGTATACCGACTTGGAACCTGAGACGATCGACAGTTGGGAACAGTTTGAAAGAGAGTTCCTCAATCGCTTCTATAGTACGAGGCGAACTGTCAGCATGATGGAGCTCACGAGCACGAAGCAGCGAAAGGGTGAGCCTGTCGTCGATTACATCAATCGTTGGAGACCTCTGAGTCTAGACTGCAAAGATCGACTCACCGAACTGTCTGCTGTAGAAATGTGTACTTAAGGCATGCATTGGGGGGTTGCTCTACATCTTACAAGGTATAAAACCTCGTAGTTTCGAAGAGTTAGCAACTCGAGCTCATGATATGGAGTTAAGCATTGCTGCTAGGGGAAACAAAGACTTGTTAGTCCCAGATGTGAAGAAGGAAAAGAAAGAAGTGAAGAGCACCGAAAAAACTTCGAAGGGTGCTACTACTAAAGAATCTATGGTCGTCAATACGACCCCGTTGAAATTTGTGTCTAAGGGAAAAGAAAAGAAAGTCGAGAAGTTCCAAGATGACGGCGCGAGGCGTCGTCTGACCTTGAAAGAGAGACAGGAAAAGACCTACCCTTTTCCAGATTCCGACATTCCAAATATGTTGGAGCAATTGCTGGAAAAGCAACTTATACCGCTGCCGGAATGCAAACAACCGGAAGAGTTAGGGA
AAGTGAACGATCCTAACTACTGCAAATATCACCGAGTAGTCAGCCATCCGGTGGAGAAGTGCTTCGTACTAAAGGAGTTGATCATAAGGCTGGCACGCGATAGAAAGATTGAGCTAGATCTAGACGAAGTAGCTCAAGTGAATCATGCGACTGTAACGAGTCATCCCAGAATTCAAACACCAACAAAGCGCGCCATGATGATAAGAGGAAGCCTAGTTCGGATTCGGATCCTTTGAACCTGTCATTGTGTGGATGAATGATGAACCCTCGAGTAAGAATTCTCAAGAGGGAGGCATCCAAAAGCAGTACATTCAAGAAACGAATAAGCGGACCGAAGATGAAAACAAAGGTTGGACTGTCGTGACTCGTCGCAAGAAGCGACAACAAAGTTACGCACAGAAGGAATCGCGACTATTCCGACACCATAAGAGAAAAAACAAGTCGCAAAAGAAGAAAAGAAAACAGGTCACAAAGAAGCCTGTTTACGCCATGAGGGAAGACGAAAACCTCTTCCGCCCACGACAACTGATAACTTTGGATGAATTCTTTCCAAAGAATTTCCTAAGTAAAGGCCAGGAGGAGGCGTTTGAGGTAGTTGCGTGTCACGTTACCGGTACGATTGAAGATCCTTCATGCTTGTATGAGACGACAACAGAGTTAGGAAACTCATCCTCATTTAGTATAGAGGACTTGTTGTCACTCCCCCAAGCAGCCAAGAGTGCTCTTATCGAGGCATTGATAGACTCTGACAATGCAAGTGCCCCAAGCTCTATGACACGCACATGCACGTCATGTTGCATGTCCATAAGTTTTACGGATGAGGACCTGTTGCTGGGTTCAAAGCCCCATAATAGACCTCTGTTTGTCTCAGGGTACATTCGAGAGCAAGGGGTCGGTCAGATCCTTATCGACGATAGATCAGCTGTTAACATAATGCCCAAGTCAACCATGAAGCAATTGGGCATCCTGGTAGAAGAGTTATCAAGCAGCAAACTTGTGATCCAAGGCTTCAACCAAGGAGGCCAGCGAGCTATTGGCATGAGACGTTTAGAGCTTATCATTGGGGACCTCAAGGCCGACACTCTGTTCCACGTCATAGACTCCAAGACTACCTATAAGTTGCTACTAGGTCGTCCTTGGATTCATGGAAATGGAGTTGTAACTTCTACGTTACACCAGTGCTTTAAGTTTTACCAAGATGGCGTCAAGAAGGTGGAGGCAGATACCAAGCCATTTTCAGAAGCTGAATCCCATTTTGCTGACGCGAAATTTTACATGAAGGGTGACGGTATAGGGGAAACAATACCAACAAAGATCCCTTTAATAAAGAGCGACAGTCGGCCTAAGGAGGTACCGCAGATTAACGTGAAAGAAGAAACTATCAAATGTACAAATGGGCCTGCTCTGAGAAACAACGAAGTCTCTACGAATTTCGCAAAGTCTGAAATTTTGAAAGATGAAGGAAGCACAACCTCTCCTGTTTTGCGTTACGTTCCTTTGTCTCGACGAAAGAAGGGTGAGTCACCATTTGCAGAGTGCACAAAAAGCCTGACTGTGGGTGAAATCGAATTTTGAAGGGAAGCTTCACAATGCCGCTCACGAAGATAACGAAGCAAGAGGTCAAGAAACTTGAAGATGACCGATTGGAAGCAAGTTTGCCTAAGAGTCGAACGAAAGATGGGTTTGACCTTAAGGCATATAAACTTCTATCAAAGGCAGGATATGACTTCACAACTCACACAGAGTTCAAAAGTCTGAAGATCTTCGATGAGAGACCTGAGCTCTCATTAACACAAAAAAAGCATTTAAAGGAAGGTTATAATATCCCTACGTCAAGGAAAGGACTGGGATATAAGTCTCCTGAGCCGGTCCGCATAATAAGAAAAGGGAAGGCAAAGGTGGCAGACGCAAACCACATAATAGTGGAAGAGGTAGACGATTCAGATGAAAAGGAAAACGTTACCCAAAGGACTTCTGTTTTTAGCCGCGTCGGGCCGTTG
GTGGCACGACCTTCAGCCCTCCAACGATTGAGCACCACTCAAGTAGAAGAAGAGTGGTCACCTCCTGTTTCTGGTTCCACTCGAACCTCAGCCCTCATGAGGATAAGGATGCCCACTGAAACAGAAGGAAGCGTTTTACTAGCCCTTACACCTGACTCCGTTCGACCTTCCGTCCGTTAGAGGCTAAGTGTGTCTGCTGGTGAAAGAGAAGGTATGACATCCACTCAAGTCGTAGTACAACCATCAGCATTACAAAGGTTAGGTGCGCCTGCGAAGGAAAATAAAAATGTACCTTCTACCTGAGACGTGATGCGTCGTTCAGCTTTTCAAAGGCTAAGTGTAACCACTTCAAAGAAAGAGGGGCCCTCCGTATCAGTTTTTGATAGGATTCACCACGATTGCCCAGTGAGAACCTCGAAGGATGACACCTTTGTAAAAATAAAAATGGACGAAGAAGTTCACAGTACTGTTCCTTCTCGGATGAAGAGGAAAACATTCGTTACGGTAAATACAGAAGGTTCATTGAAGGTAAAACGACGTGATGTCGTCATAACCAATCCCCGAGGAGGATTGAGCAGGGGAGGAGTGCGAGACCTCTTGCTACCATATTACTGATTGAAGAAGAAACTAAAGTGAATTTTCAGGATGAAGATGCGAGAAGAGGCGCCATCTTCGGCGAAGATGGTAGCCAATCAACGATGGATGAGCTTAAAGAGGTGAACCTCGGTACACCAGAGAAGCCACGCCCGACCTTCGTTAGCGCATCCCTCACTTGCGAGGAGGAAGGCGAATATATGAGTCTGCTCACGTCATAGAGAGATGTTTTTGCGTGGTCCTATAAAGAGATGCCAGGACTCGATCCAAAAGTAGCTGTCCACCATCTTTCCATCAAAGCAGAGTATCTCCCGGTCAAGCAAAGCGCAACGTCGTTTTTCGACGAGCTTATCCTCAAATTGAGATAGAGGTCAACAAATTAATCAAGCAGATTCACGCGTGAGGTGAAGTATCCAACTTGGATAGCCAACATTGTTCTGCCGGAAGAGAGAGCAGGCGACCAGCGTTTGCGTCGACTTTCGTGACCTGA

mRNA sequence

ATGGAATTGGGTAAAATAAATAAAACCCAAATCCAAATGATTAGGCCCAAATCCAATAAAGCCCAAGTTCAGGTGTTAAGCACTCCGCGAAGACCAAATATTCTTCCAATAAAACAATGGAATCAAAGAAGGTTGCATCAACTGCTGCTACTGCCGCAAGCAACACCTAATACAGGGCCAATAAAGGAGGACCGAACCCCTGATGCGATTGCAAACAAGATCGTCAAGTTGATTGAAGGATCCTCCAAGGATAGAGTGTTTACCCCTGTTGTCGGTCAATCAAAGGAGGCATCGAATCAAGATGTGATGTCTGTGATGATGGCCGATGTGGAATCCGACGAAAGGATGACAGAGATGGAGAGAAAGATTAGTCTCCTGATGAAGGCGGTTGAAGAAAGGGATCTAGAGATTGCCTACTTGAAGAATCAAATGCAGAATCGCGAGACTGCTGAGTCAAGTCAAACCCCTTTCGGATTCGGATCCTTTGAACCTGTCATTGTGTGGATGAATGATGAACCCTCGAGTAAGAATTCTCAAGAGGGAGGCATCCAAAAGCAGTACATTCAAGAAACGAATAAGCGGACCGAAGATGAAAACAAAGGTTGGACTGTCGTGACTCGTCGCAAGAAGCGACAACAAAGTTACGCACAGAAGGAATCGCGACTATTCCGACACCATAAGAGAAAAAACAAGTCGCAAAAGAAGAAAAGAAAACAGGTCACAAAGAAGCCTGTTTACGCCATGAGGGAAGACGAAAACCTCTTCCGCCCACGACAACTGATAACTTTGGATGAATTCTTTCCAAAGAATTTCCTAAGTAAAGGCCAGGAGGAGGCGTTTGAGGTAGTTGCGTGTCACGTTACCGGTACGATTGAAGATCCTTCATGCTTGTATGAGACGACAACAGAGTTAGGAAACTCATCCTCATTTAGTATAGAGGACTTGTTGTCACTCCCCCAAGCAGCCAAGAGTGCTCTTATCGAGGCATTGATAGACTCTGACAATGCAAAAGAAACTATCAAATGTACAAATGGGCCTGCTCTGAGAAACAACGAAGTCTCTACGAATTTCGCAAAGTCTGAAATTTTGAAAGATGAAGGAAGCACAACCTCTCCTGTTTTGCGTTACGTTCCTTTGTCTCGACGAAAGAAGGGTGAGTCACCATTTGCAGAGTGCACAAAAAGCCTGACTCTCTCATTAACACAAAAAAAGCATTTAAAGGAAGGTTATAATATCCCTACGTCAAGGAAAGGACTGGGATATAAGTCTCCTGAGCCGGTCCGCATAATAAGAAAAGGGAAGGCAAAGGTGGCAGACGCAAACCACATAATAGTGGAAGAGGTAGACGATTCAGATGAAAAGGAAAACGTTACCCAAAGGACTTCTGTTTTTAGCCGCGTCGGGCCGTTGGTGGCACGACCTTCAGCCCTCCAACGATTGAGCACCACTCAAGTAGAAGAAGAGTGGTCACCTCCTGTTTCTGGTTCCACTCGAACCTCAGCCCTCATGAGGATAAGGATGCCCACTGAAACAGAAGGAAGCGTTTTACTAGCCCTTACACCTGACTCCGTTCGACCTTCCGTCCTATCCAACTTGGATAGCCAACATTGTTCTGCCGGAAGAGAGAGCAGGCGACCAGCGTTTGCGTCGACTTTCGTGACCTGA

Coding sequence (CDS)

ATGGAATTGGGTAAAATAAATAAAACCCAAATCCAAATGATTAGGCCCAAATCCAATAAAGCCCAAGTTCAGGTGTTAAGCACTCCGCGAAGACCAAATATTCTTCCAATAAAACAATGGAATCAAAGAAGGTTGCATCAACTGCTGCTACTGCCGCAAGCAACACCTAATACAGGGCCAATAAAGGAGGACCGAACCCCTGATGCGATTGCAAACAAGATCGTCAAGTTGATTGAAGGATCCTCCAAGGATAGAGTGTTTACCCCTGTTGTCGGTCAATCAAAGGAGGCATCGAATCAAGATGTGATGTCTGTGATGATGGCCGATGTGGAATCCGACGAAAGGATGACAGAGATGGAGAGAAAGATTAGTCTCCTGATGAAGGCGGTTGAAGAAAGGGATCTAGAGATTGCCTACTTGAAGAATCAAATGCAGAATCGCGAGACTGCTGAGTCAAGTCAAACCCCTTTCGGATTCGGATCCTTTGAACCTGTCATTGTGTGGATGAATGATGAACCCTCGAGTAAGAATTCTCAAGAGGGAGGCATCCAAAAGCAGTACATTCAAGAAACGAATAAGCGGACCGAAGATGAAAACAAAGGTTGGACTGTCGTGACTCGTCGCAAGAAGCGACAACAAAGTTACGCACAGAAGGAATCGCGACTATTCCGACACCATAAGAGAAAAAACAAGTCGCAAAAGAAGAAAAGAAAACAGGTCACAAAGAAGCCTGTTTACGCCATGAGGGAAGACGAAAACCTCTTCCGCCCACGACAACTGATAACTTTGGATGAATTCTTTCCAAAGAATTTCCTAAGTAAAGGCCAGGAGGAGGCGTTTGAGGTAGTTGCGTGTCACGTTACCGGTACGATTGAAGATCCTTCATGCTTGTATGAGACGACAACAGAGTTAGGAAACTCATCCTCATTTAGTATAGAGGACTTGTTGTCACTCCCCCAAGCAGCCAAGAGTGCTCTTATCGAGGCATTGATAGACTCTGACAATGCAAAAGAAACTATCAAATGTACAAATGGGCCTGCTCTGAGAAACAACGAAGTCTCTACGAATTTCGCAAAGTCTGAAATTTTGAAAGATGAAGGAAGCACAACCTCTCCTGTTTTGCGTTACGTTCCTTTGTCTCGACGAAAGAAGGGTGAGTCACCATTTGCAGAGTGCACAAAAAGCCTGACTCTCTCATTAACACAAAAAAAGCATTTAAAGGAAGGTTATAATATCCCTACGTCAAGGAAAGGACTGGGATATAAGTCTCCTGAGCCGGTCCGCATAATAAGAAAAGGGAAGGCAAAGGTGGCAGACGCAAACCACATAATAGTGGAAGAGGTAGACGATTCAGATGAAAAGGAAAACGTTACCCAAAGGACTTCTGTTTTTAGCCGCGTCGGGCCGTTGGTGGCACGACCTTCAGCCCTCCAACGATTGAGCACCACTCAAGTAGAAGAAGAGTGGTCACCTCCTGTTTCTGGTTCCACTCGAACCTCAGCCCTCATGAGGATAAGGATGCCCACTGAAACAGAAGGAAGCGTTTTACTAGCCCTTACACCTGACTCCGTTCGACCTTCCGTCCTATCCAACTTGGATAGCCAACATTGTTCTGCCGGAAGAGAGAGCAGGCGACCAGCGTTTGCGTCGACTTTCGTGACCTGA

Protein sequence

MELGKINKTQIQMIRPKSNKAQVQVLSTPRRPNILPIKQWNQRRLHQLLLLPQATPNTGPIKEDRTPDAIANKIVKLIEGSSKDRVFTPVVGQSKEASNQDVMSVMMADVESDERMTEMERKISLLMKAVEERDLEIAYLKNQMQNRETAESSQTPFGFGSFEPVIVWMNDEPSSKNSQEGGIQKQYIQETNKRTEDENKGWTVVTRRKKRQQSYAQKESRLFRHHKRKNKSQKKKRKQVTKKPVYAMREDENLFRPRQLITLDEFFPKNFLSKGQEEAFEVVACHVTGTIEDPSCLYETTTELGNSSSFSIEDLLSLPQAAKSALIEALIDSDNAKETIKCTNGPALRNNEVSTNFAKSEILKDEGSTTSPVLRYVPLSRRKKGESPFAECTKSLTLSLTQKKHLKEGYNIPTSRKGLGYKSPEPVRIIRKGKAKVADANHIIVEEVDDSDEKENVTQRTSVFSRVGPLVARPSALQRLSTTQVEEEWSPPVSGSTRTSALMRIRMPTETEGSVLLALTPDSVRPSVLSNLDSQHCSAGRESRRPAFASTFVT
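The protein sequence above can be checked against the coding sequence by translating the CDS in frame 0. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the codon-table construction are illustrative and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G;
# the first base varies slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Assumes the input contains only A, C, G and T (hypothetical helper;
    ambiguous bases would raise KeyError).
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 15 codons of the CDS listed above.
print(translate("ATGGAATTGGGTAAAATAAATAAAACCCAAATCCAAATGATTAGG"))
```

Applied to the full CDS, the result should match the protein sequence listed above, from the leading `MELGK` to the trailing `FVT` before the terminal `TGA` stop codon.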
Homology
BLAST of Tan0013100 vs. NCBI nr
Match: TYK05005.1 (ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 178.7 bits (452), Expect = 1.4e-40
Identity = 169/572 (29.55%), Positives = 228/572 (39.86%), Query Frame = 0

Query: 159 FGSFEPVIVWMNDEPSSKNSQEGGIQKQYIQETNKRTEDENKGWTVVTRRKKRQQSYAQK 218
           FG+FEPV+V  + E + ++SQE   +++ I+E ++R       WT+VTRRKKR+ +  QK
Sbjct: 344 FGTFEPVVVRFHQEVAPEDSQE---KERLIEEDDER-------WTIVTRRKKRKSTPIQK 403

Query: 219 ESRLFRHHKRKNKSQKKKRKQVTKKPVYAMREDENLFRPRQLITLDEFFPKNFLSKGQEE 278
           E R +R+++R NK+QK K+K+ T+K     +ED++  R ++LITL +FFP  FL   Q+E
Sbjct: 404 EHRFYRNYRRWNKAQKNKKKKKTRKLKLMHKEDKDFPRTQRLITLADFFPTRFLGDHQDE 463

Query: 279 AFEVVACHVTGTIEDPSCLYETTTELGNS---SSFSIEDLLSLPQAAKSALIEALIDS-- 338
              VVACH     E+ S    +  E   S   S F+++DLLSLPQ  K+ LI AL++S  
Sbjct: 464 NPGVVACHAINATEEESIPLRSLEEEKVSKDLSRFNVDDLLSLPQEIKTILINALLNSAA 523

Query: 339 -----------------------------------------------------DN----- 398
                                                                DN     
Sbjct: 524 SSSSASTVTYESTPYCMSIDFSNDDLLLGSKLHNRPLYVSGYVREQRVDRILVDNGSAVN 583

Query: 399 ---------------------------------------------------------AKE 458
                                                                    AK 
Sbjct: 584 IMPKSTIRQLGILMKELSNSKLVIQGFNQGSQRVIDGVKKVEADSNPFSEAESHFADAKF 643

Query: 459 TIKCTNGPAL---------------------------------RNNEVSTNFAKSEILKD 506
            +K  + P +                                   +E ST+ AKS IL D
Sbjct: 644 YLKNDSSPEVVSVEVPLVNREDNLQLKSLASKEPHISIGTFHSEKSEASTSTAKSVILMD 703
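Each HSP header in this section reports identity and positives as a count over the alignment length, followed by the rounded percentage. The following is a minimal sketch of how those figures can be recomputed from a header line; the function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches e.g. "Identity = 169/572 (29.55%), Positives = 228/572 (39.86%)".
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return identity and positive percentages recomputed from the counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, length, positives, length_pos = map(int, match.groups())
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length_pos, 2),
    }


stats = parse_hsp_stats(
    "Identity = 169/572 (29.55%), Positives = 228/572 (39.86%), Query Frame = 0"
)
print(stats)
```

For the TYK05005.1 hit, 169 identical residues over a 572-column alignment gives 29.55 %, which agrees with the value in the header. The alignment length includes gap columns, which is why the percentage is low despite long stretches of conserved sequence.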

BLAST of Tan0013100 vs. NCBI nr
Match: KAA0061113.1 (ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] >TYK03782.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 173.7 bits (439), Expect = 4.4e-39
Identity = 156/503 (31.01%), Positives = 218/503 (43.34%), Query Frame = 0

Query: 165 VIVWMNDEPSSKNSQEGGIQKQYIQETNKRTEDENKGWTVVTRRKKRQQSYAQKESRLFR 224
           +I+ +  E   K   E  +  +  QE  +  E++++GWTVVTRRKKR+ +  QKESRL+ 
Sbjct: 444 LILRLAREKKIKLDLEEEVTPEDSQEKERLIEEDDEGWTVVTRRKKRKSTLIQKESRLYI 503

Query: 225 HHKRKNKSQKKKRKQVTKKPVYAMREDENLFRPRQLITLDEFFPKNFLSKGQEEAFEVVA 284
           +++R NK+QK K+K+ T+K      +D++  R ++++TL +FFP  FL   Q+E   VVA
Sbjct: 504 NYRRGNKTQKNKKKKKTRKLKLVHEKDKDFPRTQRVVTLADFFPTRFLGDHQDENPGVVA 563

Query: 285 CHVTGTIEDPSCLYETTTELGNSSSFSIEDLL---------------------------- 344
                +   P   YE+T     S  FS EDLL                            
Sbjct: 564 S--ASSSSAPIATYESTPYC-MSIDFSDEDLLLGSKLHNRPLYVSGYVQEQRVERILVDN 623

Query: 345 -----------------SLPQAAKSALI--------EALIDS------------------ 404
                             + + + S L+        + +ID+                  
Sbjct: 624 GSAVNIMPKSTMRQLGILMEELSNSKLVIQGFNQGSQRVIDAKFYLKNDGSPEVVSVEVP 683

Query: 405 -----DN-------AKETIKCTNGPALRNNEVSTNFAKSEILKDEGSTTSPVLRYVPLSR 464
                DN       ++E+ K T       +E ST+ AKS I+ DE ++  P+LRYVPLSR
Sbjct: 684 LVNKEDNLQLKSLASRESHKSTGTFHSGKSEASTSTAKSVIVMDEKTSNPPILRYVPLSR 743

Query: 465 RKKGESPFAECTKSL--------------------------------------------- 513
           RKKGESPF E  + L                                             
Sbjct: 744 RKKGESPFVEFPQGLKVGDIEVLKESFTTPLTKITKQEIKIDLIEASLPQRRTKDGFDPK 803

BLAST of Tan0013100 vs. NCBI nr
Match: KAA0061113.1 (ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] >TYK03782.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 79.0 bits (193), Expect = 1.5e-10
Identity = 46/98 (46.94%), Positives = 64/98 (65.31%), Query Frame = 0

Query: 93  QSKEASNQDVMSVMMADVESDERMTEMERKISLLMKAVEERDLEIAYLKNQMQNRETAES 152
           +SK+ ++ DVMSVMMAD+  +  M EMERKI+ LMKAVEERD EI  L+ QM+ RETAES
Sbjct: 72  KSKKETHPDVMSVMMADITPEAAMAEMERKINFLMKAVEERDHEIIALREQMRTRETAES 131

Query: 153 SQTPFGFGSFEPVIVWMNDEPSSKNSQEGGIQKQYIQE 191
           SQTP    + +   V   ++P  ++     +  Q +Q+
Sbjct: 132 SQTPIVKATDKGKNVVQENQPQQQSVSVASLSVQQLQD 169


HSP 2 Score: 172.6 bits (436), Expect = 9.8e-39
Identity = 189/709 (26.66%), Positives = 252/709 (35.54%), Query Frame = 0

Query: 120  ERKISL-LMKAVEERDLEIAYLKNQMQNRETAESSQTPFGFGSFEPVIVWMNDEPSSKNS 179
            E++I L L +  +    E+  +     +R   E  ++   FG+FEP++V    E S ++ 
Sbjct: 503  EKRIELDLEEVAQTNHAEVTIMSEASSSRLIFEQRKSLVQFGTFEPIVVQFFQEISYEDP 562

Query: 180  QEGGIQKQYIQETNKRTEDENKGWTVVTRRKKRQQSYAQKESRLFRHHKRKNKSQKKKRK 239
                      Q   +  E++++GW VVT RKKRQ    Q+ESR +++++R NK+QK K+K
Sbjct: 563  ----------QGEKRPIEEDDEGWIVVTHRKKRQSIPTQRESRSYQNYRRGNKTQKNKKK 622

Query: 240  QVTKKPVYAMREDENLFRPRQLITLDEFFPKNFLSKGQEEAFEVVACHVTGTIED---PS 299
            + T K      ED N  RP++L+TL +F PK+FL   Q+E  EVVACH   T E+   P 
Sbjct: 623  KKTHKLKLVHNEDMNFSRPQRLVTLADFLPKSFLCDHQDEDPEVVACHAINTTEEEIIPP 682

Query: 300  CLYETTTELGNSSSFSIEDLLSLPQAAKSALIEALIDS---------------------- 359
               E      + S F++EDLLSLPQ  K+ LI+AL++S                      
Sbjct: 683  RSLEGEGVSKDLSRFNVEDLLSLPQETKTILIDALLNSRASSSSTPTMTYESGSYCMSID 742

Query: 360  ------------------------DNAKETIKCTNGPAL--------------------- 419
                                    +   + I   NG A+                     
Sbjct: 743  FSDEDLLLGSKLHNRPLYVSGYVREQRVDRILIDNGSAVNIMPKSTMWQLGILMDELSNS 802

Query: 420  ------------------------------------------------------------ 479
                                                                        
Sbjct: 803  KLVIQGFNQGSQRAIGMIRLELIIGDLKASALFHVIDSRTTYKLLLGRPWIHGNGVVTST 862

Query: 480  ------------------------------------RNN--------------------- 539
                                                +NN                     
Sbjct: 863  LHQCFKFYQDGVKKVEADSNPFSEAESHFADAKFYSKNNNILEVLPAETPLTKGEDNSQL 922

Query: 540  --------------------EVSTNFAKSEILKDEGSTTSPVLRYVPLSRRKKGESPFAE 549
                                E  T+  K  ILKDE +  +PVLRYVPLSRRKKGESPF E
Sbjct: 923  KSLATTEPHESARTFNSGKGEAYTSSTKGMILKDENAANTPVLRYVPLSRRKKGESPFME 982

BLAST of Tan0013100 vs. NCBI nr
Match: XP_031735972.1 (uncharacterized protein LOC116401693 [Cucumis sativus])

HSP 1 Score: 172.6 bits (436), Expect = 9.8e-39
Identity = 189/709 (26.66%), Positives = 252/709 (35.54%), Query Frame = 0

Query: 120  ERKISL-LMKAVEERDLEIAYLKNQMQNRETAESSQTPFGFGSFEPVIVWMNDEPSSKNS 179
            E++I L L +  +    E+  +     +R   E  ++   FG+FEP++V    E S ++ 
Sbjct: 503  EKRIELDLEEVAQTNHAEVTIMSEASSSRLIFEQRKSLVQFGTFEPIVVQFFQEISYEDP 562

Query: 180  QEGGIQKQYIQETNKRTEDENKGWTVVTRRKKRQQSYAQKESRLFRHHKRKNKSQKKKRK 239
                      Q   +  E++++GW VVT RKKRQ    Q+ESR +++++R NK+QK K+K
Sbjct: 563  ----------QGEKRPIEEDDEGWIVVTHRKKRQSIPTQRESRSYQNYRRGNKTQKNKKK 622

Query: 240  QVTKKPVYAMREDENLFRPRQLITLDEFFPKNFLSKGQEEAFEVVACHVTGTIED---PS 299
            + T K      ED N  RP++L+TL +F PK+FL   Q+E  EVVACH   T E+   P 
Sbjct: 623  KKTHKLKLVHNEDMNFSRPQRLVTLADFLPKSFLCDHQDEDPEVVACHAINTTEEEIIPP 682

Query: 300  CLYETTTELGNSSSFSIEDLLSLPQAAKSALIEALIDS---------------------- 359
               E      + S F++EDLLSLPQ  K+ LI+AL++S                      
Sbjct: 683  RSLEGEGVSKDLSRFNVEDLLSLPQETKTILIDALLNSRASSSSTPTMTYESGSYCMSID 742

Query: 360  ------------------------DNAKETIKCTNGPAL--------------------- 419
                                    +   + I   NG A+                     
Sbjct: 743  FSDEDLLLGSKLHNRPLYVSGYVREQRVDRILIDNGSAVNIMPKSTMWQLGILMDELSNS 802

Query: 420  ------------------------------------------------------------ 479
                                                                        
Sbjct: 803  KLVIQGFNQGSQRAIGMIRLELIIGDLKASALFHVIDSRTTYKLLLGRPWIHGNGVVTST 862

Query: 480  ------------------------------------RNN--------------------- 539
                                                +NN                     
Sbjct: 863  LHQCFKFYQDGVKKVEADSNPFSEAESHFADAKFYSKNNNILEVLPAETPLTKGEDNSQL 922

Query: 540  --------------------EVSTNFAKSEILKDEGSTTSPVLRYVPLSRRKKGESPFAE 549
                                E  T+  K  ILKDE +  +PVLRYVPLSRRKKGESPF E
Sbjct: 923  KSLATTEPHESARTFNSGKGEAYTSSTKGMILKDENAANTPVLRYVPLSRRKKGESPFME 982

BLAST of Tan0013100 vs. NCBI nr
Match: XP_031740568.1 (uncharacterized protein LOC116403508 [Cucumis sativus])

HSP 1 Score: 172.6 bits (436), Expect = 9.8e-39
Identity = 189/709 (26.66%), Positives = 252/709 (35.54%), Query Frame = 0

Query: 120  ERKISL-LMKAVEERDLEIAYLKNQMQNRETAESSQTPFGFGSFEPVIVWMNDEPSSKNS 179
            E++I L L +  +    E+  +     +R   E  ++   FG+FEP++V    E S ++ 
Sbjct: 503  EKRIELDLEEVAQTNHAEVTIMSEASSSRLIFEQRKSLVQFGTFEPIVVQFFQEISYEDP 562

Query: 180  QEGGIQKQYIQETNKRTEDENKGWTVVTRRKKRQQSYAQKESRLFRHHKRKNKSQKKKRK 239
                      Q   +  E++++GW VVT RKKRQ    Q+ESR +++++R NK+QK K+K
Sbjct: 563  ----------QGEKRPIEEDDEGWIVVTHRKKRQSIPTQRESRSYQNYRRGNKTQKNKKK 622

Query: 240  QVTKKPVYAMREDENLFRPRQLITLDEFFPKNFLSKGQEEAFEVVACHVTGTIED---PS 299
            + T K      ED N  RP++L+TL +F PK+FL   Q+E  EVVACH   T E+   P 
Sbjct: 623  KKTHKLKLVHNEDMNFSRPQRLVTLADFLPKSFLCDHQDEDPEVVACHAINTTEEEIIPP 682

Query: 300  CLYETTTELGNSSSFSIEDLLSLPQAAKSALIEALIDS---------------------- 359
               E      + S F++EDLLSLPQ  K+ LI+AL++S                      
Sbjct: 683  RSLEGEGVSKDLSRFNVEDLLSLPQETKTILIDALLNSRASSSSTPTMTYESGSYCMSID 742

Query: 360  ------------------------DNAKETIKCTNGPAL--------------------- 419
                                    +   + I   NG A+                     
Sbjct: 743  FSDEDLLLGSKLHNRPLYVSGYVREQRVDRILIDNGSAVNIMPKSTMWQLGILMDELSNS 802

Query: 420  ------------------------------------------------------------ 479
                                                                        
Sbjct: 803  KLVIQGFNQGSQRAIGMIRLELIIGDLKASALFHVIDSRTTYKLLLGRPWIHGNGVVTST 862

Query: 480  ------------------------------------RNN--------------------- 539
                                                +NN                     
Sbjct: 863  LHQCFKFYQDGVKKVEADSNPFSEAESHFADAKFYSKNNNILEVLPAETPLTKGEDNSQL 922

Query: 540  --------------------EVSTNFAKSEILKDEGSTTSPVLRYVPLSRRKKGESPFAE 549
                                E  T+  K  ILKDE +  +PVLRYVPLSRRKKGESPF E
Sbjct: 923  KSLATTEPHESARTFNSGKGEAYTSSTKGMILKDENAANTPVLRYVPLSRRKKGESPFME 982

BLAST of Tan0013100 vs. ExPASy TrEMBL
Match: A0A5D3C0W6 (Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold143G002360 PE=4 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 6.6e-41
Identity = 169/572 (29.55%), Positives = 228/572 (39.86%), Query Frame = 0

Query: 159 FGSFEPVIVWMNDEPSSKNSQEGGIQKQYIQETNKRTEDENKGWTVVTRRKKRQQSYAQK 218
           FG+FEPV+V  + E + ++SQE   +++ I+E ++R       WT+VTRRKKR+ +  QK
Sbjct: 344 FGTFEPVVVRFHQEVAPEDSQE---KERLIEEDDER-------WTIVTRRKKRKSTPIQK 403

Query: 219 ESRLFRHHKRKNKSQKKKRKQVTKKPVYAMREDENLFRPRQLITLDEFFPKNFLSKGQEE 278
           E R +R+++R NK+QK K+K+ T+K     +ED++  R ++LITL +FFP  FL   Q+E
Sbjct: 404 EHRFYRNYRRWNKAQKNKKKKKTRKLKLMHKEDKDFPRTQRLITLADFFPTRFLGDHQDE 463

Query: 279 AFEVVACHVTGTIEDPSCLYETTTELGNS---SSFSIEDLLSLPQAAKSALIEALIDS-- 338
              VVACH     E+ S    +  E   S   S F+++DLLSLPQ  K+ LI AL++S  
Sbjct: 464 NPGVVACHAINATEEESIPLRSLEEEKVSKDLSRFNVDDLLSLPQEIKTILINALLNSAA 523

Query: 339 -----------------------------------------------------DN----- 398
                                                                DN     
Sbjct: 524 SSSSASTVTYESTPYCMSIDFSNDDLLLGSKLHNRPLYVSGYVREQRVDRILVDNGSAVN 583

Query: 399 ---------------------------------------------------------AKE 458
                                                                    AK 
Sbjct: 584 IMPKSTIRQLGILMKELSNSKLVIQGFNQGSQRVIDGVKKVEADSNPFSEAESHFADAKF 643

Query: 459 TIKCTNGPAL---------------------------------RNNEVSTNFAKSEILKD 506
            +K  + P +                                   +E ST+ AKS IL D
Sbjct: 644 YLKNDSSPEVVSVEVPLVNREDNLQLKSLASKEPHISIGTFHSEKSEASTSTAKSVILMD 703

BLAST of Tan0013100 vs. ExPASy TrEMBL
Match: A0A5D3BY54 (Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold863G001700 PE=4 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 2.1e-39
Identity = 156/503 (31.01%), Positives = 218/503 (43.34%), Query Frame = 0

Query: 165 VIVWMNDEPSSKNSQEGGIQKQYIQETNKRTEDENKGWTVVTRRKKRQQSYAQKESRLFR 224
           +I+ +  E   K   E  +  +  QE  +  E++++GWTVVTRRKKR+ +  QKESRL+ 
Sbjct: 444 LILRLAREKKIKLDLEEEVTPEDSQEKERLIEEDDEGWTVVTRRKKRKSTLIQKESRLYI 503

Query: 225 HHKRKNKSQKKKRKQVTKKPVYAMREDENLFRPRQLITLDEFFPKNFLSKGQEEAFEVVA 284
           +++R NK+QK K+K+ T+K      +D++  R ++++TL +FFP  FL   Q+E   VVA
Sbjct: 504 NYRRGNKTQKNKKKKKTRKLKLVHEKDKDFPRTQRVVTLADFFPTRFLGDHQDENPGVVA 563

Query: 285 CHVTGTIEDPSCLYETTTELGNSSSFSIEDLL---------------------------- 344
                +   P   YE+T     S  FS EDLL                            
Sbjct: 564 S--ASSSSAPIATYESTPYC-MSIDFSDEDLLLGSKLHNRPLYVSGYVQEQRVERILVDN 623

Query: 345 -----------------SLPQAAKSALI--------EALIDS------------------ 404
                             + + + S L+        + +ID+                  
Sbjct: 624 GSAVNIMPKSTMRQLGILMEELSNSKLVIQGFNQGSQRVIDAKFYLKNDGSPEVVSVEVP 683

Query: 405 -----DN-------AKETIKCTNGPALRNNEVSTNFAKSEILKDEGSTTSPVLRYVPLSR 464
                DN       ++E+ K T       +E ST+ AKS I+ DE ++  P+LRYVPLSR
Sbjct: 684 LVNKEDNLQLKSLASRESHKSTGTFHSGKSEASTSTAKSVIVMDEKTSNPPILRYVPLSR 743

Query: 465 RKKGESPFAECTKSL--------------------------------------------- 513
           RKKGESPF E  + L                                             
Sbjct: 744 RKKGESPFVEFPQGLKVGDIEVLKESFTTPLTKITKQEIKIDLIEASLPQRRTKDGFDPK 803

BLAST of Tan0013100 vs. ExPASy TrEMBL
Match: A0A5D3BY54 (Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold863G001700 PE=4 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 7.1e-11
Identity = 46/98 (46.94%), Positives = 64/98 (65.31%), Query Frame = 0

Query: 93  QSKEASNQDVMSVMMADVESDERMTEMERKISLLMKAVEERDLEIAYLKNQMQNRETAES 152
           +SK+ ++ DVMSVMMAD+  +  M EMERKI+ LMKAVEERD EI  L+ QM+ RETAES
Sbjct: 72  KSKKETHPDVMSVMMADITPEAAMAEMERKINFLMKAVEERDHEIIALREQMRTRETAES 131

Query: 153 SQTPFGFGSFEPVIVWMNDEPSSKNSQEGGIQKQYIQE 191
           SQTP    + +   V   ++P  ++     +  Q +Q+
Sbjct: 132 SQTPIVKATDKGKNVVQENQPQQQSVSVASLSVQQLQD 169


HSP 2 Score: 152.1 bits (383), Expect = 6.6e-33
Identity = 98/234 (41.88%), Positives = 120/234 (51.28%), Query Frame = 0

Query: 351 NEVSTNFAKSEILKDEGSTTSPVLRYVPLSRRKKGESPFAECTKSL-------------- 410
           +E STN AKS IL DE ++  P+LRYVPLSRRKKGESPF E  + L              
Sbjct: 204 SEASTNTAKSVILMDEKTSNPPILRYVPLSRRKKGESPFVESPQGLKRRTKDGFDPKAYK 263

Query: 411 ------------------------TLSLTQKKHLKEGYNIPTSRKGLGYKSPEPVRIIRK 470
                                    LS TQKK L+EG+ IP SRKGLGYKSPEP+RI RK
Sbjct: 264 LMAKAGYDFITHTEFKSLKIHEQPKLSSTQKKLLREGHVIPMSRKGLGYKSPEPIRITRK 323

Query: 471 GKAKVADANHIIVEEVDDSDEKENVTQRTSVFSRVGPLVARPSALQRLSTTQVEEEWSPP 530
           GK KV D NHI V+EVD  +EKE   QRTS F R+ P VAR    +RLS T+ + +    
Sbjct: 324 GKKKVVDNNHITVKEVDSMEEKEGDNQRTSTFDRISPHVARAPVFERLSMTEAKRKDHQS 383

Query: 531 VSGSTRTSALMRIRMPTETEGSVLLALTPDSVRPSVLSNLDSQHCSAGRESRRP 547
            S   R SA  R+ +  + E  +    T  + +PS    L        +  R P
Sbjct: 384 TSNLDRRSAFQRLTITFKEEKGI--CQTSMTTKPSAFERLSITKKKNAQTPRAP 435

BLAST of Tan0013100 vs. ExPASy TrEMBL
Match: A0A5A7TPR5 (RNase H domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold1639G00040 PE=4 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 1.9e-32
Identity = 94/208 (45.19%), Positives = 118/208 (56.73%), Query Frame = 0

Query: 349 RNNEVSTNFAKSEILKDEGSTTSPVLRYVPLSRRKKGESPFAECTKSL--------TLSL 408
           R  E ST+  KS IL DE ++   +LRYVPLSRR+KGESPF +  + L         LSL
Sbjct: 70  RKGEASTSTTKSMILMDEKTSNPLILRYVPLSRRQKGESPFVKFPQGLKVGDMKQPKLSL 129

Query: 409 TQKKHLKEGYNIPTSRKGLGYKSPEPVRIIRKGKAKVADANHIIVEEVDDSDEKENVTQR 468
            QKK L+EG+ IP SRKG GYKSPEP+ IIRK K KV D+NHI V EVD  +EKE  +QR
Sbjct: 130 IQKKLLREGHVIPVSRKGPGYKSPEPIHIIRKRKKKVIDSNHITVGEVDSMEEKEGGSQR 189

Query: 469 TSVFSRVGPLVARPSALQRLSTTQVEEEWSPPVSGSTRTSALMRIRMPTETEGSVLLALT 528
            S F ++ P V R +  +RLS T+ E +     S   + SAL R+ M  + E     AL 
Sbjct: 190 ISAFDQIRPHVVRTTVFERLSVTKTERKCHQSTSSLNKRSALQRLTMTFKKEKCTCQALR 249

Query: 529 PDSVRPSVLSNLDSQHCSAGRESRRPAF 549
             + RPS    L        +  R P F
Sbjct: 250 --ATRPSAFERLSVAKQKDAQTPRVPIF 275

BLAST of Tan0013100 vs. ExPASy TrEMBL
Match: A0A5D3DXC7 (Reverse transcriptase domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold289G00760 PE=4 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 4.3e-32
Identity = 98/244 (40.16%), Positives = 121/244 (49.59%), Query Frame = 0

Query: 351 NEVSTNFAKSEILKDEGSTTSPVLRYVPLSRRKKGESPFAECTKSL-------------- 410
           +E STN AKS IL DE ++  P+LRYVPLSRRKKGESPF E  + L              
Sbjct: 168 SEASTNTAKSVILMDEKTSNPPILRYVPLSRRKKGESPFVESPQGLKIDLTEASLPQRRT 227

Query: 411 ----------------------------------TLSLTQKKHLKEGYNIPTSRKGLGYK 470
                                              LS TQKK L+EG+ IP SRKGLGYK
Sbjct: 228 KDGFDPKAYKLMAKAGYDFTTHTDFKSLKIHEQPKLSSTQKKLLREGHVIPMSRKGLGYK 287

Query: 471 SPEPVRIIRKGKAKVADANHIIVEEVDDSDEKENVTQRTSVFSRVGPLVARPSALQRLST 530
           SPEP+RI RKGK KV D+NHI V+EVD  +EKE   QRTS F R+ P VAR    +RLS 
Sbjct: 288 SPEPIRITRKGKEKVVDSNHITVKEVDSMEEKEGDNQRTSTFDRISPHVARAPVFERLSM 347

Query: 531 TQVEEEWSPPVSGSTRTSALMRIRMPTETEGSVLLALTPDSVRPSVLSNLDSQHCSAGRE 547
           T+ E +     S   + SA  R+ +  + E  +    T  + +PS    L        + 
Sbjct: 348 TEAERKDHQSTSNLDQRSAFQRLTITFKEEKGI--CQTSMTTKPSAFERLSITKKKNAQT 407

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value   Identity (%)   Description
TYK05005.1       1.4e-40   29.55          ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]
KAA0061113.1     4.4e-39   31.01          ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] >TYK03782.1 ty3-gyp...
KAA0061113.1     1.5e-10   46.94          ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] >TYK03782.1 ty3-gyp...
XP_031735972.1   9.8e-39   26.66          uncharacterized protein LOC116401693 [Cucumis sativus]
XP_031740568.1   9.8e-39   26.66          uncharacterized protein LOC116403508 [Cucumis sativus]

ExPASy TrEMBL
Match Name       E-value   Identity (%)   Description
A0A5D3C0W6       6.6e-41   29.55          Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5D3BY54       2.1e-39   31.01          Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5D3BY54       7.1e-11   46.94          Ty3-gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5A7TPR5       1.9e-32   45.19          RNase H domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A5D3DXC7       4.3e-32   40.16          Reverse transcriptase domain-containing protein OS=Cucumis melo var. makuwa OX=1...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source        Source Term   Source Description    Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 188..243
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 190..205
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 221..242

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0013100.1   Tan0013100.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0090304       nucleic acid metabolic process