Tan0019863 (gene) Snake gourd v1

Overview
NameTan0019863
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
LocationLG06: 32938000 .. 32943078 (+)
RNA-Seq ExpressionTan0019863
SyntenyTan0019863
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGCCGAGGGAGGGAACGAAAAACAACGATGGAAGCTGTACTCTAAATTTGAGGCCTAAAGATCAGAAGTCGTCGGAAAGGGAGGTCGGAGAGGCGGCCGGCGACCAAGGAAAGAACCCAACGGGCAAAACTCGAAATTTGGAACCGGTGGATGGGACGTCACAGACCAAGGTCAGGGAGACAGGACAGGGCATGTATGTTGGAGAAGCCACGTGGAGAGGGGGGACCCACTCTGATGAGAATGGAGGGTTGGCTGGGGACCTCGATAATAACATAGGAGAAGTTTTCTTGGATTTGATTAGTGGGCCGAAAACTTGTGGGCTGGTCAATGGGCCAGAAAAGAAGTTGGGGCTGAATAGGGGTCTAGAAAAGGAAATGGTTGATAAGGGCTCATTTACCAATGTCAAACCCAATAAGCAAATCAAATGGATTGACGGGGAAGAAGGAAGAGTTTCGACCCATTCAATTCTAAACCCAGTAACGGAGGAAGGAGCAGACAACAGGAAGCCGATTACAGGTGGTCCATATCACGAACCAAGGAACCAAAAGGATAAAGAGAGAGACTTCGAGAGCTTGACCTATAAAGGAGAGGGGTCCAACCAGGTTCTCAGGGTTGAAATTGAGAGTCTTGGATCAGATTACTATCCAGCAGATACTTCACGAAAAGCAGAGAGAGTAGAAAATAAAGCCTCTAGGAATTGGAAAAGAGTTGCTAGAGAGAAGTTACATAGCCCGAAAATGAGAGATATAAAAGTCAGCCTGAGTGGAGACAAACATGAGCTGGAAGAGATGGAAATGGAGGGCCCAAACAAAAAACAGTGTAAAGAAACAAAAGATGGAAATTTGAGGAGATCGGCAGAGGCTAGGATGAACCAGCCCCGCCGGAAGCAATGATTGTCTTAAGCTGGAACGTCCGAGGATTTGGGGAATCCTCAGATGCTCCGTGCACTCCGGTACGAAGTGCAAAAACATTCCCCTGATATCGTGTTCTTAATCGAAACCATATGCGATAAGGGTTCGGCTGACAGAACTAAAGCCAACCTTCATTTTGATAATGCCTTTGAAGTTCCGAGAGTGGGCAGAAGTGGCGGGTTAATGTTGTTGTGGAAGGAGCCGACTCATCTTTCGATTATCTCTTTCTCTAAAGCGAACATTGATGTCATTATCAAAGACATTCAAGGGGATTGGAGATTCACTGGTTTCTATGGAGATCCTGCGGAGGAGAATAGAATGAATTCCTGGATGCTTTTGGATCGGTTTAGTAGTTTGTTCGATCTCCCTTGGCTGGTGGGGGGTGACTTCAACGAGCTGCTGTCAGAGGAAGAGAAATGGGGAGGGGCACCCAAAAGTAAGAAACTCATGAATAACTTTGCTGACTGTATTTTTAGATGTAACTTGGTTGATACAGGTTGTAGGGGCAATAAATTCACATGGAGAAAAAGCAGACATCACAATGCAACCAAGGAACGTCTTGACAGGTATTTCTTAAATCAAAGCATGTTGATTCGCACCACGAAGGTCAGTATCTCTCATCTTACTTTTCATGCTTCTGATCATAAGCCTATTCTTGCTCACCTCAAGTTTGATACAATTAATACTAGCCATAGGAGACGGAGACCTAATGCTCGTTTTGAGGAAAATTGGGTTGGTTGTGAGGAGGCTAGGGTGCTGATAAGAGGTCACTGGATGGAAAGTATAAGTAGGAGTCCTGCCGATCTGAAAGAGAAAATAACTTCCTGTATTCTTAAGTTGAAAACCTGGGATAGACAAAGATTAAAAGGGTCCTTAAAAAAGGCCATTCAAAGGAAAGAGGAGACTGTGGCTCGGTTACTAAACACCCCTTTTATCCCTAATCAGGACGAACGTACGAGGGTTGAGATCGAGCTAGAGGGGCTCCTGTTGGAGGAAGAAAAATATTGGAAGATTCGATCCAGAGAGGGTTGGTTAAAGTGGGGCGATAAAAACACGAAGTGGTTCCACCATAAAGCAAGTCAAAGGCAAAAAGGGAATCATATTGATAGACTTGTTGACCCTACCGGAAACTGGATTACTGATGAGGAGGAGATTGGAAAAGTGGCTAGTAATTACTTTAAAAATATTTTCCAGTCTTCCAATCCCTGTACCCAGGACATAGAGAGCGTTTTGGAAAGCTTGGAAAAGAAAATTACGGAGAATCATCGCAGAATCCTGGACAAACCTTTTGAGATCGATAAGGCTATTAGGGAGCTAAGCCCTTTGAAAGCCCCCGACCCTGATGGTTGCCATGCTAAGTTTTACCAAGAGTATTGGGAGATTGTGGGGGAAGATACTAAACGTGTTTGCCTGGAAATCTTGAACAATGGGGAGGCTTTGGATAGTATCAATAAAACGTTGCTTGTCCTTATCCCTAGGTGTAAGAATGCCAACCGAATGGAAGACTTCAGACCCATAAGCCTTTGCAATGTTCTCTATAAAATAGTAGCCAAAGCAATAGTTAACAGATTGAAGAGGATTATGGACGACATTATCTCTCCCTATCAGTCAGCCTTCATCCCGGGAAGGTTGATCTCGGATAATGTGATCATCGGGTTTGAATGTATCCAAGCTATTGCTTCTTCTAGGAATAGGAAACAAGGCCATGTGGCGCTCAAGCTAGATATGAGCAAAGCGTATGACAGGGTCGAATGGACCTTTCTAAGGAAAGTCATGAGCAAGTTGGGTTTTGAGGAAAGGTGGGTGAATCTCATAATGAATTGTGTGGAGTCTGTCTCGTTCTCTGTTCTCCTCAATGGGTGTCCTTTTGAAGAATTTCGACCGAATCATGGACTTAGACAAGGGGACCCCCTTTCCCCTTATCTGCTCCTATTCTGTGCAGAAGGCTTTACTTCCATCCTCAAAAGAAAGGAAGCCGAAGGAAAGTACAAAGGGATTAGAATCAATAGATTTTGCCCTTCCGTTTCTCACCTGTTTTTCGCAGATGATAGCCTACTTTTCTTCAGAGCTAGACATGAGGATTGTATATCCATCAAAGAAGCTTTGGAGTTATATGAGAAGACTTCAGGTTAAACAATAAATTTCAGAAAATCTGCCTTTATGACTAACAAATTTGTCAAAAGTGAGGTGGTTAGAGGGGTTGAGGATACTTTGGGGGTCAAAAGGGTGGACAATCTAGGCTATTACCTCGGTTGACCTTTTAAAAATGCGAGAAGTAAGGGCTCCTTGTTTAATAAAATCAAATTCAGAATTGGAAAAGTTCTCCAAGGGTGGAAAAGGAATCTCTTCTCTTTAAGCGGGAAAGAAACTCTTATCAAGTCCATAGCCCAGGCCATTCCCAACTATACTATGAGTTGCTTCAGACTCCCCAAGTTCATTTGTAAGGAAATAAACCAACTTTGCGCTAGGTTTTGGTGGGGAGAGGCTGGAGAGAAAAGAAAAACTCACTGGGTTAGTTGGGACAAGCTTTGCAATAGCAAGTTTCATGGTGGCATGGGGTTTCGTGAGTTAGAGTGTTTTAATCAAGCCATGTTGGCCAAGATAAGCTGGTGAATTGTTAGAAATCTTGAAAGTCTTATGGCTAGAGTGCTAAGAGGTCGATATTTTAAGGATGGAAATTTCTTAGAAGCCCATGAAGGAAAAAACCTCTCTCTCATCTAGAGAAGTATAATTTGGGGTCGGGAACTTTTCAAAAAAGGAATCCGTTGGAGAATAGGAGATGGAAGATACATAAGCATAGGAGAAGACCCCTGGATTGGGAGAGCGGGCCTTAGAAGACCCTTCAACATTTCTAACACTTTGGAGGGTAGACGCGTTTGCGATTTGATTGATACAGATGGCAAGTGGAATGAAGAAGCGGCCAGAAATGGAGTTTCTCCTCAGGACTATATTGATATTATGAATACTCCTCTTGGCCCAAGAGGGGCAAAGGATGTCATTATTTGGGGAGAGGATATGAAAGGGATTTTTTCGGTCAAAAGTGCATATCACTTGGCCAAAAGATACAGAATTGGCTCTTCCAGTTCCAAAGTGAGTGAAGGGAGTTCCAAGGAGCCCTGGAACAATCTCTGGAAAGCCAAGACCTTATCTAGAGCAAAACTCTGTGTGTGGAAAGTGTTAGGAGATATCATCCCTACCAAAATTAACATTATTAAAAGAGGTGTTGACATTAATCCCTTTTGCTGTTTTTGCAAGGTACACCGTGAGGACACAATCCATGTCATGTGGGGGTGCAAAATCGCCAAAAGAATCTGGATCAACTTTATCCCCGAGATGGAGACTTTGCTTTATACCTGTAAAAGAGAATGGACGCCTACGGATTGTTGGGACTGGATGATTTCAAACCTCAATGTGGAGGAGATAGAATCGACCATCATCATCATGTGGAACATTTGGAAAGCCAGGAACTTTATAAACACTGTTAAATATTTGTTTGTAAATGATGATCTGCTGAAAATTATCAGGGACATTTCACAGGCTAGGGAGGAGCTATGTTCGTTGGACCGACGAAACACAAATTTCCAAAAAGCAAGATTGGAGAGCTGTGAGAGTCATGGAGAATGGACCCCCCCTGAGGCAAACACGTGGAAACTCAATTGCGACGCCTCTTGGAATGATAATCTAGAGGTTGGTGGGATTGGATGGGTAATCCGTGACTCTTATGGGTCTTGGATTTGTGCAGGAGGGAAATAGGTCAAAGAGAAGTGGCCAATCAAACTGCTTAAAGCCAGAGCTATTCTGGATGGTCTTAAAGCGGTGACAAGTTGCGATGTCAATCACCGAAAGATGATGACATTGGAAACTGACTCAAGCGAAGTAGTAAAGAACATCAATGGGGAAGCGGAAGGCATGTCGGAATTGTATAACTTTGTTGAGGCGATTGCGAATGTGGAGAGTCGTTCTCTTATTGTTAAGTTTGTAAAGTGCCCTAGATCTAGCAATAGTGTAGCTCATAACATTGCTAGGGGGTGTGTATTCATGGTGATTTTCAGGGTCCTTTTAGCTCCCCTCTTGTTGAAGGAGCTTTTGAAGTTGTTTTTGGTGAGTACCCGTCTTGGGTCTCCAAGTTGTTGCCTGCGGGCTGTCTCCCCAACTCTTCTCTAG

mRNA sequence

ATGTTGCCGAGGGAGGGAACGAAAAACAACGATGGAAGCTGTACTCTAAATTTGAGGCCTAAAGATCAGAAGTCGTCGGAAAGGGAGGTCGGAGAGGCGGCCGGCGACCAAGGAAAGAACCCAACGGGCAAAACTCGAAATTTGGAACCGGTGGATGGGACGTCACAGACCAAGGTCAGGGAGACAGGACAGGGCATGTATGTTGGAGAAGCCACGTGGAGAGGGGGGACCCACTCTGATGAGAATGGAGGGTTGGCTGGGGACCTCGATAATAACATAGGAGAAGTTTTCTTGGATTTGATTAGTGGGCCGAAAACTTGTGGGCTGGTCAATGGGCCAGAAAAGAAGTTGGGGCTGAATAGGGGTCTAGAAAAGGAAATGGTTGATAAGGGCTCATTTACCAATGTCAAACCCAATAAGCAAATCAAATGGATTGACGGGGAAGAAGGAAGAGTTTCGACCCATTCAATTCTAAACCCAGTAACGGAGGAAGGAGCAGACAACAGGAAGCCGATTACAGGTGGTCCATATCACGAACCAAGGAACCAAAAGGATAAAGAGAGAGACTTCGAGAGCTTGACCTATAAAGGAGAGGGGTCCAACCAGGTTCTCAGGGTTGAAATTGAGAGTCTTGGATCAGATTACTATCCAGCAGATACTTCACGAAAAGCAGAGAGAGTAGAAAATAAAGCCTCTAGGAATTGGAAAAGAGTTGCTAGAGAGAAAGTGGGCAGAAGTGGCGGGTTAATGTTGTTGTGGAAGGAGCCGACTCATCTTTCGATTATCTCTTTCTCTAAAGCGAACATTGATGTCATTATCAAAGACATTCAAGGGGATTGGAGATTCACTGGTTTCTATGGAGATCCTGCGGAGGAGAATAGAATGAATTCCTGGATGCTTTTGGATCGGTTTAGTAGTTTGTTCGATCTCCCTTGGCTGGTGGGGGGTGACTTCAACGAGCTGCTGTCAGAGGAAGAGAAATGGGGAGGGGCACCCAAAAGTAAGAAACTCATGAATAACTTTGCTGACTGTATTTTTAGATGTAACTTGGTTGATACAGGTTGTAGGGGCAATAAATTCACATGGAGAAAAAGCAGACATCACAATGCAACCAAGGAACGTCTTGACAGGTATTTCTTAAATCAAAGCATGTTGATTCGCACCACGAAGGTCAGTATCTCTCATCTTACTTTTCATGCTTCTGATCATAAGCCTATTCTTGCTCACCTCAAGTTTGATACAATTAATACTAGCCATAGGAGACGGAGACCTAATGCTCGTTTTGAGGAAAATTGGGTTGGTTGTGAGGAGGCTAGGGTGCTGATAAGAGGTCACTGGATGGAAAGTATAAGTAGGAGTCCTGCCGATCTGAAAGAGAAAATAACTTCCTGTATTCTTAAGTTGAAAACCTGGGATAGACAAAGATTAAAAGGGTCCTTAAAAAAGGCCATTCAAAGGAAAGAGGAGACTTGGAATGAAGAAGCGGCCAGAAATGGAGTTTCTCCTCAGGACTATATTGATATTATGAATACTCCTCTTGGCCCAAGAGGGGCAAAGGATGTCATTATTTGGGGAGAGGATATGAAAGGGATTTTTTCGGTCAAAAGTGCATATCACTTGGCCAAAAGATACAGAATTGGCTCTTCCAGTTCCAAAGTGAGTGAAGGGAGTTCCAAGGAGCCCTGGAACAATCTCTGGAAAGCCAAGACCTTATCTAGAGCAAAACTCTGTGTGTGGAAAGTGTTAGGAGATATCATCCCTACCAAAATTAACATTATTAAAAGAGGTGTTGACATTAATCCCTTTTGCTGTTTTTGCAAGGTACACCGTGAGGACACAATCCATGTCATGTGGGGGTGCAAAATCGCCAAAAGAATCTGGATCAACTTTATCCCCGAGATGGAGACTTTGCTTTATACCTGTAAAAGAGAATGGACGCCTACGGATTGTTGGGACTGGATGATTTCAAACCTCAATGTGGAGGAGATAGAATCGACCATCATCATCATGTGGAACATTTGGAAAGCCAGGAACTTTATAAACACTGTTAAATATTTGTTTGTAAATGATGATCTGCTGAAAATTATCAGGGACATTTCACAGGCTAGGGAGGAGCTATGTTCGTTGGACCGACGAAACACAAATTTCCAAAAAGCAAGATTGGAGAGCTGTGAGAGTCATGGAGAATGGACCCCCCCTGAGGCAAACACGTGGAAACTCAATTGCGACGCCTCTTGGAATGATAATCTAGAGGTTGGTGGGATTGGATGGGTCAAAGAGAAGTGGCCAATCAAACTGCTTAAAGCCAGAGCTATTCTGGATGGTCTTAAAGCGGTGACAAGTTGCGATGTCAATCACCGAAAGATGATGACATTGGAAACTGACTCAAGCGAAGTAGTAAAGAACATCAATGGGGAAGCGGAAGGCATGTCGGAATTGTATAACTTTGTTGAGGCGATTGCGAATGTGGAGAGTCGTTCTCTTATTGTTAAGTTTGTAAAGTGCCCTAGATCTAGCAATAGTGTAGCTCATAACATTGCTAGGGGGTGTGTATTCATGGTGATTTTCAGGGTCCTTTTAGCTCCCCTCTTGTTGAAGGAGCTTTTGAAGTTGTTTTTGGTGAGTACCCGTCTTGGGTCTCCAAGTTGTTGCCTGCGGGCTGTCTCCCCAACTCTTCTCTAG

Coding sequence (CDS)

ATGTTGCCGAGGGAGGGAACGAAAAACAACGATGGAAGCTGTACTCTAAATTTGAGGCCTAAAGATCAGAAGTCGTCGGAAAGGGAGGTCGGAGAGGCGGCCGGCGACCAAGGAAAGAACCCAACGGGCAAAACTCGAAATTTGGAACCGGTGGATGGGACGTCACAGACCAAGGTCAGGGAGACAGGACAGGGCATGTATGTTGGAGAAGCCACGTGGAGAGGGGGGACCCACTCTGATGAGAATGGAGGGTTGGCTGGGGACCTCGATAATAACATAGGAGAAGTTTTCTTGGATTTGATTAGTGGGCCGAAAACTTGTGGGCTGGTCAATGGGCCAGAAAAGAAGTTGGGGCTGAATAGGGGTCTAGAAAAGGAAATGGTTGATAAGGGCTCATTTACCAATGTCAAACCCAATAAGCAAATCAAATGGATTGACGGGGAAGAAGGAAGAGTTTCGACCCATTCAATTCTAAACCCAGTAACGGAGGAAGGAGCAGACAACAGGAAGCCGATTACAGGTGGTCCATATCACGAACCAAGGAACCAAAAGGATAAAGAGAGAGACTTCGAGAGCTTGACCTATAAAGGAGAGGGGTCCAACCAGGTTCTCAGGGTTGAAATTGAGAGTCTTGGATCAGATTACTATCCAGCAGATACTTCACGAAAAGCAGAGAGAGTAGAAAATAAAGCCTCTAGGAATTGGAAAAGAGTTGCTAGAGAGAAAGTGGGCAGAAGTGGCGGGTTAATGTTGTTGTGGAAGGAGCCGACTCATCTTTCGATTATCTCTTTCTCTAAAGCGAACATTGATGTCATTATCAAAGACATTCAAGGGGATTGGAGATTCACTGGTTTCTATGGAGATCCTGCGGAGGAGAATAGAATGAATTCCTGGATGCTTTTGGATCGGTTTAGTAGTTTGTTCGATCTCCCTTGGCTGGTGGGGGGTGACTTCAACGAGCTGCTGTCAGAGGAAGAGAAATGGGGAGGGGCACCCAAAAGTAAGAAACTCATGAATAACTTTGCTGACTGTATTTTTAGATGTAACTTGGTTGATACAGGTTGTAGGGGCAATAAATTCACATGGAGAAAAAGCAGACATCACAATGCAACCAAGGAACGTCTTGACAGGTATTTCTTAAATCAAAGCATGTTGATTCGCACCACGAAGGTCAGTATCTCTCATCTTACTTTTCATGCTTCTGATCATAAGCCTATTCTTGCTCACCTCAAGTTTGATACAATTAATACTAGCCATAGGAGACGGAGACCTAATGCTCGTTTTGAGGAAAATTGGGTTGGTTGTGAGGAGGCTAGGGTGCTGATAAGAGGTCACTGGATGGAAAGTATAAGTAGGAGTCCTGCCGATCTGAAAGAGAAAATAACTTCCTGTATTCTTAAGTTGAAAACCTGGGATAGACAAAGATTAAAAGGGTCCTTAAAAAAGGCCATTCAAAGGAAAGAGGAGACTTGGAATGAAGAAGCGGCCAGAAATGGAGTTTCTCCTCAGGACTATATTGATATTATGAATACTCCTCTTGGCCCAAGAGGGGCAAAGGATGTCATTATTTGGGGAGAGGATATGAAAGGGATTTTTTCGGTCAAAAGTGCATATCACTTGGCCAAAAGATACAGAATTGGCTCTTCCAGTTCCAAAGTGAGTGAAGGGAGTTCCAAGGAGCCCTGGAACAATCTCTGGAAAGCCAAGACCTTATCTAGAGCAAAACTCTGTGTGTGGAAAGTGTTAGGAGATATCATCCCTACCAAAATTAACATTATTAAAAGAGGTGTTGACATTAATCCCTTTTGCTGTTTTTGCAAGGTACACCGTGAGGACACAATCCATGTCATGTGGGGGTGCAAAATCGCCAAAAGAATCTGGATCAACTTTATCCCCGAGATGGAGACTTTGCTTTATACCTGTAAAAGAGAATGGACGCCTACGGATTGTTGGGACTGGATGATTTCAAACCTCAATGTGGAGGAGATAGAATCGACCATCATCATCATGTGGAACATTTGGAAAGCCAGGAACTTTATAAACACTGTTAAATATTTGTTTGTAAATGATGATCTGCTGAAAATTATCAGGGACATTTCACAGGCTAGGGAGGAGCTATGTTCGTTGGACCGACGAAACACAAATTTCCAAAAAGCAAGATTGGAGAGCTGTGAGAGTCATGGAGAATGGACCCCCCCTGAGGCAAACACGTGGAAACTCAATTGCGACGCCTCTTGGAATGATAATCTAGAGGTTGGTGGGATTGGATGGGTCAAAGAGAAGTGGCCAATCAAACTGCTTAAAGCCAGAGCTATTCTGGATGGTCTTAAAGCGGTGACAAGTTGCGATGTCAATCACCGAAAGATGATGACATTGGAAACTGACTCAAGCGAAGTAGTAAAGAACATCAATGGGGAAGCGGAAGGCATGTCGGAATTGTATAACTTTGTTGAGGCGATTGCGAATGTGGAGAGTCGTTCTCTTATTGTTAAGTTTGTAAAGTGCCCTAGATCTAGCAATAGTGTAGCTCATAACATTGCTAGGGGGTGTGTATTCATGGTGATTTTCAGGGTCCTTTTAGCTCCCCTCTTGTTGAAGGAGCTTTTGAAGTTGTTTTTGGTGAGTACCCGTCTTGGGTCTCCAAGTTGTTGCCTGCGGGCTGTCTCCCCAACTCTTCTCTAG

Protein sequence

MLPREGTKNNDGSCTLNLRPKDQKSSEREVGEAAGDQGKNPTGKTRNLEPVDGTSQTKVRETGQGMYVGEATWRGGTHSDENGGLAGDLDNNIGEVFLDLISGPKTCGLVNGPEKKLGLNRGLEKEMVDKGSFTNVKPNKQIKWIDGEEGRVSTHSILNPVTEEGADNRKPITGGPYHEPRNQKDKERDFESLTYKGEGSNQVLRVEIESLGSDYYPADTSRKAERVENKASRNWKRVAREKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEARVLIRGHWMESISRSPADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIWINFIPEMETLLYTCKREWTPTDCWDWMISNLNVEEIESTIIIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARLESCESHGEWTPPEANTWKLNCDASWNDNLEVGGIGWVKEKWPIKLLKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIARGCVFMVIFRVLLAPLLLKELLKLFLVSTRLGSPSCCLRAVSPTLL
Homology
BLAST of Tan0019863 vs. NCBI nr
Match: KAF8408042.1 (hypothetical protein HHK36_007182 [Tetracentron sinense])

HSP 1 Score: 228.0 bits (580), Expect = 3.2e-55
Identity = 166/602 (27.57%), Postives = 255/602 (42.36%), Query Frame = 0

Query: 241 EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDI--QGDWRFTGFYGDPAEENRMNSW 300
           E VGRSGGL LLW++   +S+ S+SK +IDV++  +  +  WR TG YG P    +  +W
Sbjct: 60  ESVGRSGGLCLLWRQDLDISLQSYSKNHIDVVLGTVGEREVWRLTGMYGHPEAAKKWETW 119

Query: 301 MLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCRGN 360
            L+   S  + +PW+  GDFNE+   EEK G   K+   M  F + I  C+L+  G  GN
Sbjct: 120 ELIRYLSRSYSMPWVCFGDFNEITCAEEKSGRVEKAAWKMRKFKEAILDCHLIGLGFEGN 179

Query: 361 KFTW-RKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINT 420
            FTW  K       +ERLDR              ++ HL+ H SDH P+L  L FD    
Sbjct: 180 TFTWCNKRSGEGNVRERLDRAMATSDWCFLFPFTTVKHLSCHTSDHSPLL--LAFDKEAP 239

Query: 421 SHRRRRPNARFEENWVGCEEARVLIRGHW----MESISRSPADLKEKITSCILKLKTWDR 480
              R++ + RFE  W+   E   +I   W      S    P D K  +            
Sbjct: 240 KIARKKRSFRFEAMWIHSPECAEIIDSAWTGCHQVSAQVLPRDAKVSL------------ 299

Query: 481 QRLKGSLKKAIQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGIFS 540
                     I + ++TWN         P +   I + PL  R   D  +W    KG FS
Sbjct: 300 ---------LIDKDQKTWNHTLLMTVFMPHEAELISSIPLSERLPPDKRVWHFTSKG-FS 359

Query: 541 VKSAYHLAKRYRIGSSSSKVSEGS-------SKEPWNNLWKAKTLSRAKLCVWKVLGDII 600
           V+SAYHL    R   S++  S  S       S   W+ +W+     + K+ +WKV  +I+
Sbjct: 360 VRSAYHLTSTLRDRESATSSSTSSLSWNGSLSGIKWSQVWQLAIPPKVKIFIWKVALNIL 419

Query: 601 PTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIWINFIPEMETLLYTCKREWT 660
           P + N+ KR + +   C  C    E  +HV+  C  A+++W+      +  L +      
Sbjct: 420 PVRANLCKRKIPVENVCGVCGEEGETILHVLKNCHYARQVWL----LSQLGLRSDATSAD 479

Query: 661 PTDCWDWMISNLNVEE-IESTIIIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQAREE 720
               W   I   + EE + +  +I W+IWK RN     +Y+F     +K+       R  
Sbjct: 480 SLSSWVEEIMKSHGEEGLSAFFMIAWSIWKHRN-----EYIFSG---VKMTPFNCVQRAN 539

Query: 721 LCSLDRRNTNFQKARLESCESHGEWTPPEANTWKLNCDASWNDNLEVGGIGWV------- 780
               D  N N  +A  ES  +   W  P  + +K+N D + +      G+G V       
Sbjct: 540 KLLADFHNAN-DRAAPESISAARSWLAPPGDLFKVNIDGALHLEDRSAGVGVVVRDHNGD 599

Query: 781 ---------KEKWPIKLLKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEG 812
                           +++A A  +GLK      +     + LE+DS   ++ +  + E 
Sbjct: 600 LIAAMSKRISNTQSAAVIEAIAAREGLKFALELGIQE---VILESDSVNTIRALTSQEEN 621

BLAST of Tan0019863 vs. NCBI nr
Match: KAF7824053.1 (hypothetical protein G2W53_022197 [Senna tora])

HSP 1 Score: 220.7 bits (561), Expect = 5.1e-53
Identity = 187/690 (27.10%), Postives = 291/690 (42.17%), Query Frame = 0

Query: 245  RSGGLMLLWKEPTHLSIISFSKANIDVIIKD--IQGDWRFTGFYGDPAEENRMNSWMLLD 304
            R+GGL L W     L++ SFS  +IDVI+ D  +   WR TG +G P E+++  +W LL 
Sbjct: 472  RAGGLALFWYNTLDLTLASFSSNHIDVIVSDAGLNIKWRLTGVHGFPEEQSKHQTWDLLR 531

Query: 305  RFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCRGNKFTW 364
              +S  DLPWL  GDFNE++   EK GG  KS + M  F D    C   D G +G  FTW
Sbjct: 532  LLASNSDLPWLCYGDFNEIMFASEKQGGTAKSDRSMQAFRDACNHCGFTDMGFKGYPFTW 591

Query: 365  RKSR--HHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTS-H 424
               R   HN  +ERLDR F  +  L++     ++H++  +SDH  +   + FD++  S H
Sbjct: 592  NNGRRDQHN-IQERLDRVFATEEWLLKFPYTEVNHISHFSSDHCAL--EISFDSLPPSNH 651

Query: 425  RRRRPNARFEENWVGCEEARVLIRGHWMESISRSPADLKEKITS---------------- 484
             RR+   RFEE W   E  + +I   W  + + S     EK+TS                
Sbjct: 652  TRRQRLFRFEEAWCSDERCKDVINAVWQGTDAFS-----EKLTSVASNLSFMEATLAPNA 711

Query: 485  -----CILK------------------LKTWDRQRLKG-----------------SLKKA 544
                 CI+K                  +  W+   +                    +   
Sbjct: 712  SFTWRCIMKARWVLQLGAGWRVGNGFQINIWEDNWISPITGLRILSPRPLDSDLVMVADL 771

Query: 545  IQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKR 604
            I  +  TW  +   N   P +   I++ PL  R  +D  IW  +    +SVKSAYH+  +
Sbjct: 772  INHENRTWKRDLI-NSAFPSEARIILSLPLSWRNIQDKFIWPFEKHLSYSVKSAYHIISK 831

Query: 605  YRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFC 664
                 +S   S  S+K  W  +WK     + ++  W++  + +PT +N+ KRGV I   C
Sbjct: 832  ---NMASLSPSTSSTKVNWARIWKLTVTPKIRIFFWRLCHNALPTCLNLGKRGVQILNEC 891

Query: 665  CFCKVHREDTIHVMWGCKIAKRIW----INFIPEMETLLYTCKREWTPTDCWDWMISNLN 724
              C++  EDT+H   GC  A+ +W    + F P +           +     DW+ S L 
Sbjct: 892  PRCRLPNEDTVHTFIGCTWAQLVWFLSPLGFFPILH----------SNEAFIDWLHSKLE 951

Query: 725  VEEIES---TIIIMWNIWKAR-NFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTN 784
             E IE       + W IW  R NF+   K++ V +     + ++     E    D+    
Sbjct: 952  DESIEVLNWIATVCWAIWNDRNNFLFNAKHVSVEE----CVHNVLSLMAEYSLHDK---- 1011

Query: 785  FQKARLESCESH--GEWTPPEANTWKLNCDASWNDNLE----------VGGIGWVKEKW- 844
             Q    + C S    +W  P++   KLN DA+   + E           G + +V     
Sbjct: 1012 -QSLSNDPCVSSLISKWKKPKSYFHKLNVDAARLSDTEWSLGAVVRDDCGDLHFVAATRI 1071

Query: 845  ----PIKLLKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEGMSELYNFVE 849
                   + +A AI  GL  V   DV +   + +ETD     K+ +  A+  S +   ++
Sbjct: 1072 QCLDDSSIAEALAIRWGLLLVIQLDVTN---LEVETDCLIACKSYHDGADD-SLMDTILQ 1126

BLAST of Tan0019863 vs. NCBI nr
Match: GAU38731.1 (hypothetical protein TSUD_208420 [Trifolium subterraneum])

HSP 1 Score: 213.8 bits (543), Expect = 6.2e-51
Identity = 173/622 (27.81%), Postives = 273/622 (43.89%), Query Frame = 0

Query: 241 EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQ-GDWRFTGFYGDPAEENRMNSWM 300
           ++ GR GG+ ++W++  + SI ++S  +ID+ + D+Q G WR TGFYG P    R +SW 
Sbjct: 25  DRDGRGGGVAVMWRKVVNCSITNYSLNHIDIEVDDLQRGKWRLTGFYGYPEGSRRRDSWN 84

Query: 301 LLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCRGNK 360
            L + S+   LPW + GDFN++LS +EK G + + + L+N F + +    LVD   +G  
Sbjct: 85  FLRQLSNASQLPWCIIGDFNDILSSDEKQGRSQRPQWLINGFREAVSDSGLVDIHWKGYP 144

Query: 361 FTWRKS-RHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTS 420
           FTW KS     A +E+LDR   N          ++  LT  ASDH P+L  L+ D     
Sbjct: 145 FTWFKSLGTERAVEEKLDRAMANDIWCNMFQYATVECLTTTASDHYPLL--LECDPKPIQ 204

Query: 421 HRRRRPNARFEENWVGCEEARVLIRGHW----MESISRSPADLKEKITSCI--------- 480
           HR  +   +FE  W    E    ++ HW      +I+R   D    +TS           
Sbjct: 205 HRHLK-QFKFENAWFAEPEFDTFVKQHWETYGNTTITRKLDDCASDLTSWSGHKWSIGTG 264

Query: 481 LKLKTWDRQRLKG--SLKK---------------AIQRKEETWNEEAARNGVSPQDYID- 540
             +  WD+  L    S+KK                +    + W+    R G+   D  D 
Sbjct: 265 HNISLWDQNWLSDGTSIKKPDNIDSQLNNLTVADLLHHNAKEWDSGLIR-GLLNDDIADK 324

Query: 541 IMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWK 600
           I++TPL      D I W  +  G+++VKSAY        G    +V EG     W+ +W+
Sbjct: 325 ILHTPLLESVQNDKITWQHEKNGLYTVKSAYRFCISTIPGRDQHRV-EGK----WHLIWQ 384

Query: 601 AKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW 660
            +   + K  +W++  + +PT+  +  RGV     C  C    ED+ H+ + C+ +   W
Sbjct: 385 TQMPPKIKNFMWRICRNCLPTRARLHDRGVTCPINCVLCDAGDEDSNHLFFSCQNSINCW 444

Query: 661 INF-----IPEMETLLYTCKREWTPTDCWDWMISNLNVEEIESTIIIMWNIWKARNFINT 720
                   I +   L  + K      + ++ M+  LN +       +MW+IWK RN +  
Sbjct: 445 QQMGLWSSIMQHRNLTISVKE-----NVFNIML-QLNEDSRAVFACVMWSIWKQRNDV-- 504

Query: 721 VKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARLESCESHG----EWTPPEANTW 780
              ++ N+   ++ R +   R        RN    + R  + + H     EWT P+A TW
Sbjct: 505 ---IWRNE---RVHRTVVCERANSLLTGWRNAREVRDRYNN-QQHSPQRFEWTRPDAGTW 564

Query: 781 KLNCDASWNDNLEVGGIG-----------WVKEKWPIKLL-----KARAILDGLKAVTSC 805
           K N DAS++ +    GIG             K +W   +L     +A  +L  LK V   
Sbjct: 565 KCNVDASFSRSRNKVGIGVCIRDDQGQFVLAKTEWYSPILDVDTGEALGLLSALKWVKDL 619

BLAST of Tan0019863 vs. NCBI nr
Match: MBA0733287.1 (hypothetical protein [Gossypium gossypioides])

HSP 1 Score: 201.1 bits (510), Expect = 4.1e-47
Identity = 174/630 (27.62%), Postives = 280/630 (44.44%), Query Frame = 0

Query: 259 LSIISFSKANIDVIIKDIQG--DWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGG 318
           + + SFSK +IDV+I D +    WRFTGFYG P  ++R  SW  L R +S  ++PWLV  
Sbjct: 229 IGLRSFSKRHIDVLIDDNEKGVKWRFTGFYGSPYVQDRNVSWDTLKRLASGVEIPWLVCV 288

Query: 319 DFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCRGNKFTW-RKSRHHNATKERL 378
           DFNE++   EK GG P+ ++ M  F   +  C L D G  G+ FTW R +      +ERL
Sbjct: 289 DFNEIMYGFEKKGGIPRYERRMECFRKTLDDCQLYDVGFEGSWFTWERGNLPETNIRERL 348

Query: 379 DRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGC 438
           DR  +N   +    +V + HL+   SDH P+L     DT     +    + +FE  W+  
Sbjct: 349 DRGVVNMGWITMFPEVKVKHLSHSFSDHCPLL----IDTTEKGVKVGNNDFKFETWWLLE 408

Query: 439 EEARVLIRGHWMESISRSPADLKEKITSCILKLKTWDRQRL-KGSLKKAIQRKE------ 498
           +    +++G W      S  DL +K+    + LK W  + + K  +  +  R+E      
Sbjct: 409 DSFIEVVKGIW----ESSTGDLMQKLEILKIGLKEWAGEGVDKIDINNSTSRREIKLVAD 468

Query: 499 ------ETWNEEAARNGVSPQDYI-DIMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLA 558
                   WN    +N   P+D +  I+  PL      D   W  ++ G FSV+S Y L 
Sbjct: 469 LLDHSNRKWNLGLIQN-TFPEDIVRKILQIPLAETAHDDFQAWKGELSGEFSVRSTYKLL 528

Query: 559 KRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINP 618
           +   +   S  + +  +K+ +  LW  +  S+    VW++  D IP  +N+  R V  N 
Sbjct: 529 QNANM-DPSDYLLQAETKDFYKKLWSLQLPSKIAFTVWRISWDFIPCFVNLRIRRVVSND 588

Query: 619 FCCFCKVHREDTIHVMWGCKIAKRIW--INFIPEMETLLYTCKREWTPTDCWDWMISNLN 678
            C  C    E ++HV   C     +W  +NF   M    +T   EW       W+    N
Sbjct: 589 RCPRCSSEVEGSLHVFRDCPTTTEVWHLLNFEWVMNNRSHTI-WEWL-----TWVFKRGN 648

Query: 679 VEEIESTIIIMWNIWKARNFINTVKYLFVNDDL-LKIIRDISQAREELCSLDRRNTNFQK 738
            ++  S    +W IW +RN +   + +     L L I R +S         +++  N  K
Sbjct: 649 NDQCFSFCYALWWIWFSRNQLIHERNIIPGRALVLNIQRYVS---------EQKGLNGLK 708

Query: 739 ARLESCESH--GEWTPPEANTWKLNCDASWNDNLE---VGGIGWVKEKWPIKL------- 798
            +  +C S+   E TP    T +++ DA+++ N      G +GW      + L       
Sbjct: 709 TKAITCRSYRVQELTP----TARIHFDAAYDSNTSSSASGLVGWDTRGILVALKTIIHRN 768

Query: 799 ------LKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEGMSELYNFVEAI 849
                  +A A L+G+K   S  ++  K+M    DS  V+K     +   S +   +  I
Sbjct: 769 VPSPFAAEAHACLEGVKLGISLRIHSVKLM---GDSKTVIKKCQESSTDKSAIGAIIRDI 824

BLAST of Tan0019863 vs. NCBI nr
Match: RYR18269.1 (hypothetical protein Ahy_B03g062876 [Arachis hypogaea])

HSP 1 Score: 186.4 bits (472), Expect = 1.1e-42
Identity = 158/646 (24.46%), Postives = 281/646 (43.50%), Query Frame = 0

Query: 241 EKVGRSGGLMLLWKEPTHLSII----SFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMN 300
           E  G SGGL LLWK  T++++     ++ KANI+ I  D+  +W+    YG+P  + R  
Sbjct: 5   EPRGLSGGLSLLWKSNTNINVYELCDNYIKANIN-INNDL--NWQGIFVYGNPVFQKRRR 64

Query: 301 SWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCR 360
            W  L   +   ++P    GDFN++L++ EK G  P+ +  +  F   +   +L+D   +
Sbjct: 65  LWHELTVSNMSKEVPQAYLGDFNDILNQYEKVGIHPQPRIYLETFRRFVDDNDLIDIDLK 124

Query: 361 GNKFTWRKSRHHNA-TKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTI 420
           GNK+TW  +  +N  T++RLDR  +N   L     V++       SDH  ++       +
Sbjct: 125 GNKYTWFSNPRNNVITRKRLDRVLVNWKWLQIYQNVNLRASPAVTSDHCALI-------L 184

Query: 421 NTSHR-RRRPNARFEENWVGCEEARVLIRGHW-MESISRSP-ADLKEKITSCILKLKTWD 480
           +T  R R + + +FE  WV  EE + +I+  W  E  SR+      +K   CI +L  W 
Sbjct: 185 DTQQRVRIKKDFKFEAYWVEHEECKEVIQRSWKWEDGSRNCWNQFTKKRNRCIRELMEWS 244

Query: 481 RQRLKGSLKKAIQRKEETWN-EEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGI 540
            ++ K + KK  ++K E    +EAA           I  TP+     KD  +W     G 
Sbjct: 245 SRKFKRADKKIERKKIELHQIQEAAEL---------ITKTPISLINKKDHFVWQHRKDGQ 304

Query: 541 FSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKA----KTLSRAKLCVWKVLGDIIP 600
           ++V++ YH+AK  +      ++ + S+ + W  +W+A        + ++ +WK +  I+P
Sbjct: 305 YTVRTGYHVAKEEKDSKEEGRICKASTSQDWREIWEAIWRLPVPQKVRMFLWKTVHRILP 364

Query: 601 TKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIWINFIPEMETLLYTCK--REW 660
              N+ +R + + P C  C+   E   H +  C   + +W     ++    Y  +  REW
Sbjct: 365 VNTNLHQRRITMTPTCSICQKENETIEHALLLCPWTRAVWFESSIQIVPTAYNVRSFREW 424

Query: 661 TPTDCWDWMISNLNVEEIESTI----IIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQ 720
                    I   +  E E T+     + W IWKARN      ++F   +++   + I+Q
Sbjct: 425 IMDKI--KRIKTESGSEQEKTLCKLGCVCWCIWKARN-----HHIFQQTEIIP-QKVITQ 484

Query: 721 AREELCSLDRRNTNFQKARLESCESHG-----EWTPPEANTWKLNCDASWNDNLEVGGIG 780
           +        +      KA +      G      W PP  N  K+N DA++  +     + 
Sbjct: 485 SEYLTAEFHKATQESSKANIPDTGRGGVRKRITWRPPPKNRLKVNTDAAFYQDTGTAALA 544

Query: 781 WVKEKWPIKLL---------------KARAILDGLKAVTSCDVNHRKMMTLETDSSEVVK 840
                W  K++               +A+A  + L  + +  + +     +ETDS  +V+
Sbjct: 545 AKVRDWQGKVITGTTATFKTISPLIAEAQAYREALVLIKNLQIPN---CIIETDSLPLVQ 604

Query: 841 NINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIA 848
            I      +++    +  I  +   +  V     PR  N +AH +A
Sbjct: 605 AIKARTL-IAKADAIIRDILQLLEEAPDVGVTWTPRGGNKLAHQLA 619

BLAST of Tan0019863 vs. ExPASy TrEMBL
Match: A0A2Z6N4T0 (Uncharacterized protein OS=Trifolium subterraneum OX=3900 GN=TSUD_208420 PE=4 SV=1)

HSP 1 Score: 213.8 bits (543), Expect = 3.0e-51
Identity = 173/622 (27.81%), Postives = 273/622 (43.89%), Query Frame = 0

Query: 241 EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQ-GDWRFTGFYGDPAEENRMNSWM 300
           ++ GR GG+ ++W++  + SI ++S  +ID+ + D+Q G WR TGFYG P    R +SW 
Sbjct: 25  DRDGRGGGVAVMWRKVVNCSITNYSLNHIDIEVDDLQRGKWRLTGFYGYPEGSRRRDSWN 84

Query: 301 LLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCRGNK 360
            L + S+   LPW + GDFN++LS +EK G + + + L+N F + +    LVD   +G  
Sbjct: 85  FLRQLSNASQLPWCIIGDFNDILSSDEKQGRSQRPQWLINGFREAVSDSGLVDIHWKGYP 144

Query: 361 FTWRKS-RHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTS 420
           FTW KS     A +E+LDR   N          ++  LT  ASDH P+L  L+ D     
Sbjct: 145 FTWFKSLGTERAVEEKLDRAMANDIWCNMFQYATVECLTTTASDHYPLL--LECDPKPIQ 204

Query: 421 HRRRRPNARFEENWVGCEEARVLIRGHW----MESISRSPADLKEKITSCI--------- 480
           HR  +   +FE  W    E    ++ HW      +I+R   D    +TS           
Sbjct: 205 HRHLK-QFKFENAWFAEPEFDTFVKQHWETYGNTTITRKLDDCASDLTSWSGHKWSIGTG 264

Query: 481 LKLKTWDRQRLKG--SLKK---------------AIQRKEETWNEEAARNGVSPQDYID- 540
             +  WD+  L    S+KK                +    + W+    R G+   D  D 
Sbjct: 265 HNISLWDQNWLSDGTSIKKPDNIDSQLNNLTVADLLHHNAKEWDSGLIR-GLLNDDIADK 324

Query: 541 IMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWK 600
           I++TPL      D I W  +  G+++VKSAY        G    +V EG     W+ +W+
Sbjct: 325 ILHTPLLESVQNDKITWQHEKNGLYTVKSAYRFCISTIPGRDQHRV-EGK----WHLIWQ 384

Query: 601 AKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW 660
            +   + K  +W++  + +PT+  +  RGV     C  C    ED+ H+ + C+ +   W
Sbjct: 385 TQMPPKIKNFMWRICRNCLPTRARLHDRGVTCPINCVLCDAGDEDSNHLFFSCQNSINCW 444

Query: 661 INF-----IPEMETLLYTCKREWTPTDCWDWMISNLNVEEIESTIIIMWNIWKARNFINT 720
                   I +   L  + K      + ++ M+  LN +       +MW+IWK RN +  
Sbjct: 445 QQMGLWSSIMQHRNLTISVKE-----NVFNIML-QLNEDSRAVFACVMWSIWKQRNDV-- 504

Query: 721 VKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARLESCESHG----EWTPPEANTW 780
              ++ N+   ++ R +   R        RN    + R  + + H     EWT P+A TW
Sbjct: 505 ---IWRNE---RVHRTVVCERANSLLTGWRNAREVRDRYNN-QQHSPQRFEWTRPDAGTW 564

Query: 781 KLNCDASWNDNLEVGGIG-----------WVKEKWPIKLL-----KARAILDGLKAVTSC 805
           K N DAS++ +    GIG             K +W   +L     +A  +L  LK V   
Sbjct: 565 KCNVDASFSRSRNKVGIGVCIRDDQGQFVLAKTEWYSPILDVDTGEALGLLSALKWVKDL 619

BLAST of Tan0019863 vs. ExPASy TrEMBL
Match: A0A444ZVS3 (Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B03g062876 PE=4 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 5.1e-43
Identity = 158/646 (24.46%), Postives = 281/646 (43.50%), Query Frame = 0

Query: 241 EKVGRSGGLMLLWKEPTHLSII----SFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMN 300
           E  G SGGL LLWK  T++++     ++ KANI+ I  D+  +W+    YG+P  + R  
Sbjct: 5   EPRGLSGGLSLLWKSNTNINVYELCDNYIKANIN-INNDL--NWQGIFVYGNPVFQKRRR 64

Query: 301 SWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCR 360
            W  L   +   ++P    GDFN++L++ EK G  P+ +  +  F   +   +L+D   +
Sbjct: 65  LWHELTVSNMSKEVPQAYLGDFNDILNQYEKVGIHPQPRIYLETFRRFVDDNDLIDIDLK 124

Query: 361 GNKFTWRKSRHHNA-TKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTI 420
           GNK+TW  +  +N  T++RLDR  +N   L     V++       SDH  ++       +
Sbjct: 125 GNKYTWFSNPRNNVITRKRLDRVLVNWKWLQIYQNVNLRASPAVTSDHCALI-------L 184

Query: 421 NTSHR-RRRPNARFEENWVGCEEARVLIRGHW-MESISRSP-ADLKEKITSCILKLKTWD 480
           +T  R R + + +FE  WV  EE + +I+  W  E  SR+      +K   CI +L  W 
Sbjct: 185 DTQQRVRIKKDFKFEAYWVEHEECKEVIQRSWKWEDGSRNCWNQFTKKRNRCIRELMEWS 244

Query: 481 RQRLKGSLKKAIQRKEETWN-EEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGI 540
            ++ K + KK  ++K E    +EAA           I  TP+     KD  +W     G 
Sbjct: 245 SRKFKRADKKIERKKIELHQIQEAAEL---------ITKTPISLINKKDHFVWQHRKDGQ 304

Query: 541 FSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKA----KTLSRAKLCVWKVLGDIIP 600
           ++V++ YH+AK  +      ++ + S+ + W  +W+A        + ++ +WK +  I+P
Sbjct: 305 YTVRTGYHVAKEEKDSKEEGRICKASTSQDWREIWEAIWRLPVPQKVRMFLWKTVHRILP 364

Query: 601 TKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIWINFIPEMETLLYTCK--REW 660
              N+ +R + + P C  C+   E   H +  C   + +W     ++    Y  +  REW
Sbjct: 365 VNTNLHQRRITMTPTCSICQKENETIEHALLLCPWTRAVWFESSIQIVPTAYNVRSFREW 424

Query: 661 TPTDCWDWMISNLNVEEIESTI----IIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQ 720
                    I   +  E E T+     + W IWKARN      ++F   +++   + I+Q
Sbjct: 425 IMDKI--KRIKTESGSEQEKTLCKLGCVCWCIWKARN-----HHIFQQTEIIP-QKVITQ 484

Query: 721 AREELCSLDRRNTNFQKARLESCESHG-----EWTPPEANTWKLNCDASWNDNLEVGGIG 780
           +        +      KA +      G      W PP  N  K+N DA++  +     + 
Sbjct: 485 SEYLTAEFHKATQESSKANIPDTGRGGVRKRITWRPPPKNRLKVNTDAAFYQDTGTAALA 544

Query: 781 WVKEKWPIKLL---------------KARAILDGLKAVTSCDVNHRKMMTLETDSSEVVK 840
                W  K++               +A+A  + L  + +  + +     +ETDS  +V+
Sbjct: 545 AKVRDWQGKVITGTTATFKTISPLIAEAQAYREALVLIKNLQIPN---CIIETDSLPLVQ 604

Query: 841 NINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIA 848
            I      +++    +  I  +   +  V     PR  N +AH +A
Sbjct: 605 AIKARTL-IAKADAIIRDILQLLEEAPDVGVTWTPRGGNKLAHQLA 619

BLAST of Tan0019863 vs. ExPASy TrEMBL
Match: A0A5C7IW34 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_001315 PE=4 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 1.4e-40
Identity = 111/334 (33.23%), Postives = 155/334 (46.41%), Query Frame = 0

Query: 241 EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGD-WRFTGFYGDPAEENRMNSWM 300
           ++VG SGGL LLWKE  ++ I SFS ++ID I+ D +G+ WRF GFY       R +SW 
Sbjct: 74  DQVGLSGGLALLWKEGFNVQIKSFSSSHIDSIVVDSRGNCWRFNGFYSSLKYGERQHSWT 133

Query: 301 LLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCRGNK 360
           LL   S LF+LPWL   DFNE+    EK  G  K   L++ F D +  C   D G  G+ 
Sbjct: 134 LLRHLSELFNLPWLCCRDFNEISMASEKKRGNDKPYSLLSAFRDFLNLCEFRDLGFIGSP 193

Query: 361 FTWRKSRHH-NATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTS 420
           FTW   R   NA  E LD      S        S+ HL++  SDH+P+L  ++F +++ +
Sbjct: 194 FTWWNKRDGINAIFEILDHGLGTSSWCNLFPSNSVCHLSYWGSDHRPLL--IEFGSLSQT 253

Query: 421 HRRRRPNA----RFEENWVGCEEARVLIRGHWMESISRSP-ADLKEKITSCILKLKTWDR 480
                P+       EE W+  ++   +I   W ES   S   D++EKI +C L L  W  
Sbjct: 254 RHSNGPSRSRRFHMEEMWIKFQDCEDIISSSWKESSKASEMKDIQEKINTCALNLSRWSH 313

Query: 481 QRLKGSLKKAIQRKE--------------------------ETWNEEAARNGVSPQDYID 540
           Q+  G++K  +  K                             WN    RN   P D   
Sbjct: 314 QKFGGNVKDCLMLKNCLKAYEAASGQSCVAFNLALDSTIRIRDWNSALVRNSFLPDDANL 373

Query: 541 IMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLA 542
           I+N P       D + W  D +G +SV+  Y LA
Sbjct: 374 ILNLPQLSLNRDDTLCWHFDKRGFYSVRRGYKLA 405

BLAST of Tan0019863 vs. ExPASy TrEMBL
Match: A0A445CZL3 (Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_A05g022025 PE=4 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 1.4e-40
Identity = 143/574 (24.91%), Postives = 238/574 (41.46%), Query Frame = 0

Query: 241 EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGD-WRFTGFYGDPAEENRMNSWM 300
           E  G SGGL LLW E  ++ I  + + +I   I D +G  W     YG+P    R   W 
Sbjct: 38  EPRGLSGGLCLLWNEIYNVDIYFWCENHIKTRIDDRKGKIWECNFIYGNPCFGRRKEQWR 97

Query: 301 LLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCRGNK 360
            + R +S    P +  GDFN++LS+EEK G  PK +  +  F   +    L+D   +G +
Sbjct: 98  AITRNNSNRGEPQVFIGDFNDILSQEEKIGLHPKPQSQVREFRQFVDMNYLMDLDIKGGR 157

Query: 361 FT-WRKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTS 420
           FT +   R+   T+E++DR  +N        + S+  L   +SDH+P++ ++        
Sbjct: 158 FTGFGNPRNGVITREKIDRALVNWEWRALYPQASLKALPAISSDHRPLILNM------NQ 217

Query: 421 HRRRRPNARFEENWVGCEEARVLIRGHWMESISRSPADLKEKITSCI---LKLKTWDRQR 480
            +R+  N +FE  W   EE   ++R  W +   +    LKE     I    ++  W    
Sbjct: 218 IQRKEKNFKFEAFWTDHEECENIVRKGWEKEDIQGRQLLKENAKWSIGNGARVSIWKDNW 277

Query: 481 LKGSLK------------KAIQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVII 540
           + G  K            K +  + E W+     +    +   +I++TP+     +D++ 
Sbjct: 278 ITGRSKPLNSNSTNDFRVKDLIVEGEGWDRRKIESNFPQEICKEILSTPISVMNKEDILY 337

Query: 541 WGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSK-EPWNNLWKAKTLSRAKLCVWKVL 600
           W     G +S+K+ Y+ A+R     +    S    K E W  +W+ +   + ++ +WK  
Sbjct: 338 WPWREDGNYSIKTGYYAARRTEQSDNHRNPSTSEDKREIWREVWRMEVPQKIRMFLWKAC 397

Query: 601 GDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIWINFIPEMETLLYTCK 660
            DI+P   N+ KR +  +P C  C    E   H +  C  A+  W           +  +
Sbjct: 398 QDILPVGSNLYKRKIASDPKCQICLKSPETVEHALLLCDWARATW-----------FGAE 457

Query: 661 REWTP-----TDCWDWMISNL---------NVEE-IESTIIIMWNIWKARNFINTVKYLF 720
            +WTP     T   +W++  +         N E  I     +MW IWK RN       +F
Sbjct: 458 GQWTPTVKTVTSIGNWIVECIKKLRAGGGENQERGISKLGFLMWEIWKTRN-----NKMF 517

Query: 721 VNDDL--------LKIIRDISQAREELCSLDRRNTNFQKARLESCESHGEWTPPEANTWK 774
              ++         KI+  I     +    +++  N  K  L       +W PP +N  K
Sbjct: 518 QQQEVNPRGTICRAKILEAIYWKLADTQQPNKKEGNHSKTNLV------KWRPPPSNWLK 577

BLAST of Tan0019863 vs. ExPASy TrEMBL
Match: A0A6J1DUG8 (uncharacterized protein LOC111024135 OS=Momordica charantia OX=3673 GN=LOC111024135 PE=4 SV=1)

HSP 1 Score: 174.9 bits (442), Expect = 1.5e-39
Identity = 99/254 (38.98%), Postives = 139/254 (54.72%), Query Frame = 0

Query: 238 VAREKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMNS 297
           V+    G+SGGLMLLW   +++ I S S  +ID II D  G WRFTGFYG+P    R  S
Sbjct: 57  VSVASTGKSGGLMLLWNSDSNVRIQSMSPGHIDSIITDKYGSWRFTGFYGNPCTYKRSAS 116

Query: 298 WMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCRG 357
           W LL+R + + DLPW++GGDFNE++S  EK GG  +++  M               GC  
Sbjct: 117 WKLLERLARMMDLPWIIGGDFNEIVSMTEKMGGVCRNESQMR--------------GCP- 176

Query: 358 NKFTWRKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTIN- 417
               W          ERLDR+ +N+SML +   + ++HL   +SDH+PILA   F+    
Sbjct: 177 ---IW----------ERLDRFLINESMLNKCLNLKVTHLELLSSDHRPILASWDFELPRA 236

Query: 418 -TSHRRRRPNARFEENWVGCEEARVLIRGHWMESISRSPADLKEKITSCILKLKTWDRQR 477
            T H++R+   RFEE+W+  +  R +I G W           + KI SC+ +L  W++ R
Sbjct: 237 WTCHKQRK--IRFEESWLQIDGCRDIITGTWGSLPGIGIEAFQAKICSCLSRLNEWNKIR 280

Query: 478 LKGSLKKAIQRKEE 490
           L  SLK AI  KE+
Sbjct: 297 LNRSLKGAIAYKEK 280

BLAST of Tan0019863 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 102.8 bits (255), Expect = 1.4e-21
Identity = 98/358 (27.37%), Postives = 156/358 (43.58%), Query Frame = 0

Query: 514 GPRGAKDVIIWGEDMKGIFSVKSAYH-LAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLS 573
           G R   D   W     G ++VKS Y  L +     SS  +VSE S    +  +WK++T  
Sbjct: 205 GGRRILDSYTWDYTSSGDYTVKSGYWVLTQIINKRSSPQEVSEPSLNPIYQKIWKSQTSP 264

Query: 574 RAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW-INFI 633
           + +  +WK L + +P    +  R +     C  C   +E   H+++ C  A+  W I+ I
Sbjct: 265 KIQHFLWKCLSNSLPVAGALAYRHLSKESACIRCPSCKETVNHLLFKCTFARLTWAISSI 324

Query: 634 PEMETLLYTCKREWTPTD----CWDWMISNLNVE-EIESTII--IMWNIWKARNFINTVK 693
           P           EW  +      W + + N N + E  S ++  ++W +WK RN +    
Sbjct: 325 P------IPLGGEWADSIYVNLYWVFNLGNGNPQWEKASQLVPWLLWRLWKNRNELVFRG 384

Query: 694 YLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARLESCESHGEWTPPEANTWKLNCDA 753
             F   ++L+   D  +        +   T  Q  R  SC   G W PP     K N DA
Sbjct: 385 REFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVNR-SSC---GRWRPPPHQWVKCNTDA 444

Query: 754 SWNDNLEVGGIGWV--KEKWPIKLLKARAILDGLKAVTSCDVNHRKMMTL---------- 813
           +WN + E  GIGWV   EK  +K + ARA L  LK+V   ++   +   L          
Sbjct: 445 TWNRDNERCGIGWVLRNEKGEVKWMGARA-LPKLKSVLEAELEAMRWAVLSLSRFQYNYV 504

Query: 814 --ETDSSEVVKNINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIAR 849
             E+DS  +++ +N + E    L   ++ +  + S+   VKFV  PR  N++A  +AR
Sbjct: 505 IFESDSQVLIEILNND-EIWPSLKPTIQDLQRLLSQFTEVKFVFIPREGNTLAERVAR 550

BLAST of Tan0019863 vs. TAIR 10
Match: AT1G40390.1 (DNAse I-like superfamily protein )

HSP 1 Score: 56.2 bits (134), Expect = 1.5e-07
Identity = 32/96 (33.33%), Postives = 48/96 (50.00%), Query Frame = 0

Query: 291 EENRMNSWMLLDRFSS---LFDLPWLVGGDFNELLSEEEKWGGAPKSKKL--MNNFADCI 350
           E  R + W  + R S+   L + PWLV GDFN++ S  E +   P +  L  + +   C+
Sbjct: 98  EAERRSLWDDITRLSASSPLCNSPWLVVGDFNQIASVTEHYSLMPSNISLQGLEDLQACM 157

Query: 351 FRCNLVDTGCRGNKFTWRKSRHHNATKERLDRYFLN 382
              +LVD  CRG  +TW   +  N    +LDR  +N
Sbjct: 158 RDSDLVDLPCRGVLYTWSNHQQDNPILRKLDRAIVN 193

BLAST of Tan0019863 vs. TAIR 10
Match: AT3G26855.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 50.8 bits (120), Expect = 6.5e-06
Identity = 19/62 (30.65%), Postives = 33/62 (53.23%), Query Frame = 0

Query: 564 NLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIA 623
           ++W  K   + KL +WK L + +P    ++ R + I PFC  C+   E   H+++ C  A
Sbjct: 8   DIWSLKISPKIKLLIWKALNNALPVGAQLLSRNISIEPFCTRCR-DFETITHILFNCPFA 67

Query: 624 KR 626
           +R
Sbjct: 68  QR 68

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAF8408042.13.2e-5527.57hypothetical protein HHK36_007182 [Tetracentron sinense][more]
KAF7824053.15.1e-5327.10hypothetical protein G2W53_022197 [Senna tora][more]
GAU38731.16.2e-5127.81hypothetical protein TSUD_208420 [Trifolium subterraneum][more]
MBA0733287.14.1e-4727.62hypothetical protein [Gossypium gossypioides][more]
RYR18269.11.1e-4224.46hypothetical protein Ahy_B03g062876 [Arachis hypogaea][more]
Match NameE-valueIdentityDescription
A0A2Z6N4T03.0e-5127.81Uncharacterized protein OS=Trifolium subterraneum OX=3900 GN=TSUD_208420 PE=4 SV... [more]
A0A444ZVS35.1e-4324.46Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_B03g062876 PE=4 SV=1[more]
A0A5C7IW341.4e-4033.23Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_001315 PE=4 SV=1[more]
A0A445CZL31.4e-4024.91Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_A05g022025 PE=4 SV=1[more]
A0A6J1DUG81.5e-3938.98uncharacterized protein LOC111024135 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
Match NameE-valueIdentityDescription
AT4G29090.11.4e-2127.37Ribonuclease H-like superfamily protein [more]
AT1G40390.11.5e-0733.33DNAse I-like superfamily protein [more]
AT3G26855.16.5e-0630.65RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 218..410
e-value: 5.8E-21
score: 77.3
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 231..411
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 532..627
e-value: 1.2E-15
score: 57.9
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 748..848
e-value: 4.2E-8
score: 33.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 164..187
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..65
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 40..62
NoneNo IPR availablePANTHERPTHR33116:SF39RETROTRANSPOSON, UNCLASSIFIED-LIKE PROTEINcoord: 488..679
coord: 243..453
NoneNo IPR availablePANTHERPTHR33116REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATEDcoord: 488..679
coord: 243..453
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 740..848
e-value: 1.87968E-13
score: 65.7984
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 736..853

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0019863.1Tan0019863.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity