Lag0041193 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0041193
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr13: 13521702 .. 13523514 (-)
RNA-Seq ExpressionLag0041193
SyntenyLag0041193
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGACGGAAAGCTCCACTAGTGCGTCCATGGAGGAAACTGCAAATCCGTCTTCTCAGACGTTTAGTCCCGGTAACAAAATATCTATAGTCAAGCTTACTGATGATAATTTTCTGTTATGGAAATTTCAGATCCTCATGGCTTTAGAAGGTTACAACCTTGAAAAATACCTAGAAGATGATCCGCCTGCGAAAAACCCTAATTACTGCCTCTGAAGGTTCCTCGTCCGCTGAACCAGTTCGACAGGAGACTCTAAATCCCGCCTATACCCTATGGAAGAAACAAGATCGAATGATCTCGTCGTGGCTAGTTGGTTCCATGTCTGAGGAAATACTCCATCAAATGATACATTGTACATCCTCCAAGGAGATTTGGGTAAGTCTCAAACAGATATTCACCACTCGAAATCTTGCCCAGATGATGAAAATAAAAACCAAGCTCCAAACAATACAAAAAGGAGGTATGTCTTTAAAAGAATACTTCTCGAAAATTCAGCAATATATTGATGCTCTTGCTGTTGTGGGGAAACCGGTAGAAGTTGAAGATCATATCCTTTTTATTTTAGCTGGTTTGGGATCTGAATATGAATCTATGGTGTTCGTTATCTCTGCTAAAATTGGTCCTCAAACGGTCCAAGAAGTTATGTCTCTGTTGTTAACTCAGGAAAATCGAATTGAAAGCAAAATAGCTTCCACTGAAAGCTCTCTTCCCTCGGCGAATCTCATGGTTCATTCTAAACCGCCAGAGTCCGACTTGCAAAAGTCTAATACTAATCATTTTTCTCCCAATCCTGGTAGCGGTAACAGAGGAAGAGGTGGTGGTCGTGGAGGTTTTAACACAAATCGTGGAGGTCGTTCCTGGAACAATCGCAACCGACCACAGTGTCAAGTCTGTGGGAAATTCAACCATACAGCTCCCAAATGCTTCTTCCGATATGCTCCATTTGGATCCTCAAATACTCCAGGTTCGTTCTCTCCAAATTTTAACCAATTTAATCGACCTCCCTCATATCCTCAGATGGAAGCCATGGTGACTTCCCCTGATCTGAATCAAGATACCAATTGGTATCCGGACTCCGGTGCTACCAATCACCTTACTCATTCCTTCAACAACCTCTCGATTGGAACTGAATACGGTGGTGGCAATCAAGTGCACGTGGGAAATGGAGCAGGTTTGCCTATCCTTAATTTTAGCTTTACTTCATTTTCTTCACATGTCTGTTCTAATAGAATTTTTCGATTAAACAACTTACTTCATGTGTCTTCTATCACCAAAAATTTAATCAGTGTTAGTCAATTTGCTAAGGACAATGGAGTTTATTTTGAGTTTCATCCTACCCTTTGCTATGTGAAGGACCAAGTCTTTGGGCAGGTTTTACTCCAAGGGACTCTCCATGATGGACTTTATCGCTCACATTTATATAGTTCATCGTTGGTGGAACCTCAAGATAAACTGCCAGTGCAAGCTCTCACTTCTCAACTTTTTTTGTCTCCTTCTAATGTTAACTGTTTTGTGTTTGATCTTTGGCATAGGCGTCTAGGCCACTCTTCTCTTTCTACTGTTAAAAGTGTCATTCAGAGGTTTAATCCTCGATTGTTGATAAATAACAAATTTCAATTCTGTTCTGCGTGTGCTATGGGAAAAGTTCACAATCTTCCTTTTCATAATTCCACTACTGTCTATACAGCTCCTCTTCAACTAATTATTGTTACTGATCTATGGGGGCCTACCTATATACCATCTTCTCAAGGTTATAGATACTGTATTAGCTTCATAGATGCATTCAGTAGATACACCTGGTTTACTTCTTGA

mRNA sequence

ATGTCGACGGAAAGCTCCACTAGTGCGTCCATGGAGGAAACTGCAAATCCGTCTTCTCAGACGTTTAGTCCCGGTAACAAAATATCTATAGTCAAGCTTACTGATGATAATTTTCTGTTATGGAAATTTCAGATCCTCATGGCTTTAGAAGGTTCCTCGTCCGCTGAACCAGTTCGACAGGAGACTCTAAATCCCGCCTATACCCTATGGAAGAAACAAGATCGAATGATCTCGTCGTGGCTAGTTGGTTCCATGTCTGAGGAAATACTCCATCAAATGATACATTGTACATCCTCCAAGGAGATTTGGGTAAGTCTCAAACAGATATTCACCACTCGAAATCTTGCCCAGATGATGAAAATAAAAACCAAGCTCCAAACAATACAAAAAGGAGGTATGTCTTTAAAAGAATACTTCTCGAAAATTCAGCAATATATTGATGCTCTTGCTGTTGTGGGGAAACCGGTAGAAGTTGAAGATCATATCCTTTTTATTTTAGCTGGTTTGGGATCTGAATATGAATCTATGGTGTTCGTTATCTCTGCTAAAATTGGTCCTCAAACGGTCCAAGAAGTTATGTCTCTGTTGTTAACTCAGGAAAATCGAATTGAAAGCAAAATAGCTTCCACTGAAAGCTCTCTTCCCTCGGCGAATCTCATGGTTCATTCTAAACCGCCAGAGTCCGACTTGCAAAAGTCTAATACTAATCATTTTTCTCCCAATCCTGGTAGCGGTAACAGAGGAAGAGGTGGTGGTCGTGGAGGTTTTAACACAAATCGTGGAGGTCGTTCCTGGAACAATCGCAACCGACCACAGTGTCAAGTCTGTGGGAAATTCAACCATACAGCTCCCAAATGCTTCTTCCGATATGCTCCATTTGGATCCTCAAATACTCCAGGTTCGTTCTCTCCAAATTTTAACCAATTTAATCGACCTCCCTCATATCCTCAGATGGAAGCCATGGTGACTTCCCCTGATCTGAATCAAGATACCAATTGGTATCCGGACTCCGGTGCTACCAATCACCTTACTCATTCCTTCAACAACCTCTCGATTGGAACTGAATACGGTGGTGGCAATCAAGTGCACGTGGGAAATGGAGCAGGTTTGCCTATCCTTAATTTTAGCTTTACTTCATTTTCTTCACATGTCTGTTCTAATAGAATTTTTCGATTAAACAACTTACTTCATGTGTCTTCTATCACCAAAAATTTAATCAGTGTTAGTCAATTTGCTAAGGACAATGGAGTTTATTTTGAGTTTCATCCTACCCTTTGCTATGTGAAGGACCAAGTCTTTGGGCAGGTTTTACTCCAAGGGACTCTCCATGATGGACTTTATCGCTCACATTTATATAGTTCATCGTTGGTGGAACCTCAAGATAAACTGCCAGTGCAAGCTCTCACTTCTCAACTTTTTTTGTCTCCTTCTAATGTTAACTGTTTTGTGTTTGATCTTTGGCATAGGCGTCTAGGCCACTCTTCTCTTTCTACTGTTAAAAGTGTCATTCAGAGGTTTAATCCTCGATTGTTGATAAATAACAAATTTCAATTCTGTTCTGCGTGTGCTATGGGAAAAGTTCACAATCTTCCTTTTCATAATTCCACTACTGTCTATACAGCTCCTCTTCAACTAATTATTGTTACTGATCTATGGGGGCCTACCTATATACCATCTTCTCAAGGTTATAGATACTGTATTAGCTTCATAGATGCATTCAGTAGATACACCTGGTTTACTTCTTGA

Coding sequence (CDS)

ATGTCGACGGAAAGCTCCACTAGTGCGTCCATGGAGGAAACTGCAAATCCGTCTTCTCAGACGTTTAGTCCCGGTAACAAAATATCTATAGTCAAGCTTACTGATGATAATTTTCTGTTATGGAAATTTCAGATCCTCATGGCTTTAGAAGGTTCCTCGTCCGCTGAACCAGTTCGACAGGAGACTCTAAATCCCGCCTATACCCTATGGAAGAAACAAGATCGAATGATCTCGTCGTGGCTAGTTGGTTCCATGTCTGAGGAAATACTCCATCAAATGATACATTGTACATCCTCCAAGGAGATTTGGGTAAGTCTCAAACAGATATTCACCACTCGAAATCTTGCCCAGATGATGAAAATAAAAACCAAGCTCCAAACAATACAAAAAGGAGGTATGTCTTTAAAAGAATACTTCTCGAAAATTCAGCAATATATTGATGCTCTTGCTGTTGTGGGGAAACCGGTAGAAGTTGAAGATCATATCCTTTTTATTTTAGCTGGTTTGGGATCTGAATATGAATCTATGGTGTTCGTTATCTCTGCTAAAATTGGTCCTCAAACGGTCCAAGAAGTTATGTCTCTGTTGTTAACTCAGGAAAATCGAATTGAAAGCAAAATAGCTTCCACTGAAAGCTCTCTTCCCTCGGCGAATCTCATGGTTCATTCTAAACCGCCAGAGTCCGACTTGCAAAAGTCTAATACTAATCATTTTTCTCCCAATCCTGGTAGCGGTAACAGAGGAAGAGGTGGTGGTCGTGGAGGTTTTAACACAAATCGTGGAGGTCGTTCCTGGAACAATCGCAACCGACCACAGTGTCAAGTCTGTGGGAAATTCAACCATACAGCTCCCAAATGCTTCTTCCGATATGCTCCATTTGGATCCTCAAATACTCCAGGTTCGTTCTCTCCAAATTTTAACCAATTTAATCGACCTCCCTCATATCCTCAGATGGAAGCCATGGTGACTTCCCCTGATCTGAATCAAGATACCAATTGGTATCCGGACTCCGGTGCTACCAATCACCTTACTCATTCCTTCAACAACCTCTCGATTGGAACTGAATACGGTGGTGGCAATCAAGTGCACGTGGGAAATGGAGCAGGTTTGCCTATCCTTAATTTTAGCTTTACTTCATTTTCTTCACATGTCTGTTCTAATAGAATTTTTCGATTAAACAACTTACTTCATGTGTCTTCTATCACCAAAAATTTAATCAGTGTTAGTCAATTTGCTAAGGACAATGGAGTTTATTTTGAGTTTCATCCTACCCTTTGCTATGTGAAGGACCAAGTCTTTGGGCAGGTTTTACTCCAAGGGACTCTCCATGATGGACTTTATCGCTCACATTTATATAGTTCATCGTTGGTGGAACCTCAAGATAAACTGCCAGTGCAAGCTCTCACTTCTCAACTTTTTTTGTCTCCTTCTAATGTTAACTGTTTTGTGTTTGATCTTTGGCATAGGCGTCTAGGCCACTCTTCTCTTTCTACTGTTAAAAGTGTCATTCAGAGGTTTAATCCTCGATTGTTGATAAATAACAAATTTCAATTCTGTTCTGCGTGTGCTATGGGAAAAGTTCACAATCTTCCTTTTCATAATTCCACTACTGTCTATACAGCTCCTCTTCAACTAATTATTGTTACTGATCTATGGGGGCCTACCTATATACCATCTTCTCAAGGTTATAGATACTGTATTAGCTTCATAGATGCATTCAGTAGATACACCTGGTTTACTTCTTGA

Protein sequence

MSTESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEGSSSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTWFTS
Homology
BLAST of Lag0041193 vs. NCBI nr
Match: KAA0048297.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 559.3 bits (1440), Expect = 3.9e-155
Identity = 323/605 (53.39%), Postives = 407/605 (67.27%), Query Frame = 0

Query: 2   STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSS 61
           ST S       E ++P +Q F  GNKIS+VKL DD FLLWKFQIL ALE         S 
Sbjct: 3   STSSLLGVENTEASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESE 62

Query: 62  AEPVRQ-------------ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKE 121
           +EP  +              T NPAY +WK+QDR+ISSWL+GSMSEEIL+QM+HC S+KE
Sbjct: 63  SEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKE 122

Query: 122 IWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDH 181
           IW +L+ IF++R LAQ M+ K KL  I+KG M LKEYF KI Q +DALA + KPV  +DH
Sbjct: 123 IWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDH 182

Query: 182 ILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMV 241
           IL+ILAGLGS+Y+SM+ VISA+    +VQEVMSLLLTQE++ ESK+ S E++LPS N++ 
Sbjct: 183 ILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLIS-ETALPSVNIVT 242

Query: 242 HS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF 301
            +  K  ES + ++N N++  N     R   GGRG   +NRG R   NRN+PQCQ+C K 
Sbjct: 243 QTTEKGAESYI-RTNQNNYHNNHSYNQR---GGRGNGRSNRGRR--GNRNKPQCQICAKL 302

Query: 302 NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDS 361
            ++A +CFFRY P  +S+  +P S + ++   N   ++PQM AMV + DLN D+NWYPDS
Sbjct: 303 GYSADRCFFRYTPRSNSSGYSPNSHNTSYTNMN---NHPQMSAMVAALDLNIDSNWYPDS 362

Query: 362 GATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLH 421
           GATNHLTHS +NLSIG+EYGGGNQ++  NG+GLPI ++   SF+S     + F LNNLL 
Sbjct: 363 GATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQ 422

Query: 422 VSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLV 481
           V SITKNLISVSQFAKDN V+FEFHPTLCYVKD   GQVLLQG L+DGLY+        +
Sbjct: 423 VPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYK------FTI 482

Query: 482 EPQDKL--PVQALTSQLF--LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLIN 541
           EP  K      + T  +F  + P + N  + DLWHRRLGH  L  VK+V+   +      
Sbjct: 483 EPSHKRLHHSNSNTKPVFNTVVPKS-NTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTI 542

Query: 542 NKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF 579
           NK  FC ACA+GK H LPF +S T+YT PLQL I  DLWGP    S  G+RY ISF+DA+
Sbjct: 543 NKLNFCEACALGKHHALPFSHSLTLYTHPLQL-ITCDLWGPAVNVSHNGFRYYISFVDAY 589

BLAST of Lag0041193 vs. NCBI nr
Match: TYK10642.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 559.3 bits (1440), Expect = 3.9e-155
Identity = 323/605 (53.39%), Postives = 407/605 (67.27%), Query Frame = 0

Query: 2   STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSS 61
           ST S       E ++P +Q F  GNKIS+VKL DD FLLWKFQIL ALE         S 
Sbjct: 3   STSSLLGVENTEASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESE 62

Query: 62  AEPVRQ-------------ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKE 121
           +EP  +              T NPAY +WK+QDR+ISSWL+GSMSEEIL+QM+HC S+KE
Sbjct: 63  SEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKE 122

Query: 122 IWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDH 181
           IW +L+ IF++R LAQ M+ K KL  I+KG M LKEYF KI Q +DALA + KPV  +DH
Sbjct: 123 IWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDH 182

Query: 182 ILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMV 241
           IL+ILAGLGS+Y+SM+ VISA+    +VQEVMSLLLTQE++ ESK+ S E++LPS N++ 
Sbjct: 183 ILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLIS-ETALPSVNIVT 242

Query: 242 HS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF 301
            +  K  ES + ++N N++  N     R   GGRG   +NRG R   NRN+PQCQ+C K 
Sbjct: 243 QTTEKGAESYI-RTNQNNYHNNHSYNQR---GGRGNGRSNRGRR--GNRNKPQCQICAKL 302

Query: 302 NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDS 361
            ++A +CFFRY P  +S+  +P S + ++   N   ++PQM AMV + DLN D+NWYPDS
Sbjct: 303 GYSADRCFFRYTPRSNSSGYSPNSHNTSYTNMN---NHPQMSAMVAALDLNIDSNWYPDS 362

Query: 362 GATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLH 421
           GATNHLTHS +NLSIG+EYGGGNQ++  NG+GLPI ++   SF+S     + F LNNLL 
Sbjct: 363 GATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQ 422

Query: 422 VSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLV 481
           V SITKNLISVSQFAKDN V+FEFHPTLCYVKD   GQVLLQG L+DGLY+        +
Sbjct: 423 VPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYK------FTI 482

Query: 482 EPQDKL--PVQALTSQLF--LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLIN 541
           EP  K      + T  +F  + P + N  + DLWHRRLGH  L  VK+V+   +      
Sbjct: 483 EPSHKRLHHSNSNTKPVFNTVVPKS-NTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTI 542

Query: 542 NKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF 579
           NK  FC ACA+GK H LPF +S T+YT PLQL I  DLWGP    S  G+RY ISF+DA+
Sbjct: 543 NKLNFCEACALGKHHALPFSHSLTLYTHPLQL-ITCDLWGPAVNVSHNGFRYYISFVDAY 589

BLAST of Lag0041193 vs. NCBI nr
Match: RVW60229.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 397.1 bits (1019), Expect = 2.6e-106
Identity = 241/615 (39.19%), Postives = 351/615 (57.07%), Query Frame = 0

Query: 23  SPGNKISIVKLTDDNFLLWKFQILMALEG--------SSSAEPVRQET-------LNPAY 82
           SP +++  ++L DDNFL+WK+QI  A+ G         +   P +  T        NP +
Sbjct: 144 SPLSQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMVTDKIGVLVPNPKF 203

Query: 83  TLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQT 142
             +++QD ++ SWL+ S+    L Q++ C+S+ E+W ++ Q F +++ A++M  K+++Q 
Sbjct: 204 RDYQRQDHLLISWLLSSIGSAFLPQVVGCSSAFEVWNTISQNFNSQSSAKVMFYKSQMQM 263

Query: 143 IQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQ 202
           ++K G+++++Y +K++ Y D LA  G  +   DHIL I+ GLG EYES++ VIS+K    
Sbjct: 264 LKKDGLTMRDYLTKMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSP 323

Query: 203 TVQEVMSLLLTQENRIESKIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNR 262
           ++Q V S L+  E RI  KI+S + S+   +   +  P  S     N+N + P+ G  NR
Sbjct: 324 SLQYVTSTLIAHEGRIAHKISSNDLSVNYTSQYSNRGPSSS----WNSNGY-PSSGFQNR 383

Query: 263 GRGGG----RGGFNTNRG---GRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTP- 322
            + GG    RG F  NRG   GR+     +PQCQ+C KF HT  +CF+RY P    N P 
Sbjct: 384 NQFGGNQVTRGSFVHNRGRGRGRA-QGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPA 443

Query: 323 -----------------GSFSP----NFNQFNRPPS--YPQMEAMVTSPDLNQDTNWYPD 382
                            GS S     N  +++   +  Y +MEAMV +P+  Q+  W+PD
Sbjct: 444 NGPTPGVLGSGARNGASGSISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPD 503

Query: 383 SGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLL 442
           SGATNH+TH   NL+ G EY G +++H+GNG GL I +   + F S    N++  L N+L
Sbjct: 504 SGATNHVTHDLGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNIL 563

Query: 443 HVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHL----- 502
            V +I KNL+SVSQFA+DN VYFEFHP +C+VKD+    +LLQG LH GLY+ +L     
Sbjct: 564 RVPAIKKNLLSVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLF 623

Query: 503 -YSSSLVEPQDKLPVQALTSQL-------FLSPSNVNCFVFDLWHRRLGHSSLSTVKSVI 562
             +S L    DK  +    + L       F   +N +  VFDLWH+RLGH +   V  V+
Sbjct: 624 GKASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVL 683

Query: 563 QRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGY 579
                     +    CSAC +GK HNLPF  S TVYT PLQL +V+DLWGP  I SS G+
Sbjct: 684 NDNKIPFSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQL-VVSDLWGPAPINSSYGF 743

BLAST of Lag0041193 vs. NCBI nr
Match: KZV26181.1 (hypothetical protein F511_06348 [Dorcoceras hygrometricum])

HSP 1 Score: 396.7 bits (1018), Expect = 3.3e-106
Identity = 227/524 (43.32%), Postives = 318/524 (60.69%), Query Frame = 0

Query: 61  ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMK 120
           E +NP +  W +QD+++ S+L+ SMSE    QMI C +S ++W  + Q+F TR+ A++M+
Sbjct: 9   EVMNPNFVTWNRQDQLLFSFLLASMSESAQSQMIGCQTSSQLWTRVTQLFATRSKARVMQ 68

Query: 121 IKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVI 180
            K +LQT++KG +S+K+Y  K++ YID LA  G  +  +D IL IL G+G EYES+V  +
Sbjct: 69  YKLQLQTLKKGNLSMKDYLGKMKGYIDILAACGNSIPEDDQILHILGGVGPEYESVVVHV 128

Query: 181 SAKIGPQTVQEVMSLLLTQENRIES-KIASTESSLPSANLMVHSKPPESDLQKSNTNHFS 240
           ++++   ++ EV +LLL  E RIE+  I    ++ PS N+        S  +  NT+   
Sbjct: 129 TSRVESLSLSEVGALLLAHEGRIETYNITGGHTASPSVNVTT----APSQRKAENTSQSQ 188

Query: 241 PNPGSGNRGRGGGRGGFNTNRGGRS-WNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNT 300
           P      RGRG GR G    RGGR  W+N  RP CQ+CG   H A  C++R+       +
Sbjct: 189 P----VYRGRGRGRNG----RGGRKPWHNNGRPVCQICGIPGHVAEICYYRFDKEFVPKS 248

Query: 301 PGSFSPNFNQFNR-PPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYG 360
            G    +  QFNR  PSYP      T  +   +  WYPDSGA++H+T+   NLS+ +EY 
Sbjct: 249 SGVSRTSQQQFNRSSPSYPPSAFASTKSESASEEWWYPDSGASHHVTNDLGNLSVSSEYT 308

Query: 361 GGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGV 420
           GG++V VGNGAGL I N   ++ +    S+R F L NLLHV  ITKNLISVS+FA DN V
Sbjct: 309 GGSKVQVGNGAGLSISNIGESNLNMFP-SSRPFLLKNLLHVPLITKNLISVSKFAYDNHV 368

Query: 421 YFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPS 480
           YFEFHP+ C VKD     VLL+GTLH+GLYR +L S           +Q+  S + +   
Sbjct: 369 YFEFHPSFCLVKDPATHVVLLRGTLHNGLYRFNLKSRISGPLHSPACLQSSVSPIKVPDQ 428

Query: 481 NVNCF---VFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHN 540
           +  C      D WH RLGH S++TVK V+   N R+  N+   FCS+C +GK H LPF  
Sbjct: 429 SPLCLPQNTLDKWHLRLGHPSIATVKQVLLDCNERISKNDNISFCSSCQLGKNHLLPFPQ 488

Query: 541 STTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW 579
           STT ++AP + ++ +DLWGP +IPS  G RY ISF+DA++RYTW
Sbjct: 489 STTNFSAPFE-VVYSDLWGPAHIPSRNGSRYYISFVDAYTRYTW 518

BLAST of Lag0041193 vs. NCBI nr
Match: KAF7832320.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora])

HSP 1 Score: 374.4 bits (960), Expect = 1.8e-99
Identity = 242/600 (40.33%), Postives = 333/600 (55.50%), Query Frame = 0

Query: 31  VKLTDDNFLLWKFQILMALEGS----------------SSAEPVRQETLNPAYTLWKKQD 90
           +KL + N+LLW+ QI+ A++G                  S E    E L+  Y  WKKQD
Sbjct: 32  IKLDEKNYLLWRLQIMAAIQGHDLEQYIAGKSSIPAKFDSNEDRLAEKLSDKYVAWKKQD 91

Query: 91  RMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMS 150
           +++ SWL+ SM+E ++ ++I CT S E+W  ++Q F T   A++ + +T+L+ I+KG  S
Sbjct: 92  QLVLSWLISSMTESMITRVIGCTHSYEVWDRVQQFFGTNTRAKVHQFRTELRNIKKGNRS 151

Query: 151 LKEYFSKIQQYIDALAVVG-KPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVM 210
           + EY  KI+   DAL  +G   V   +H+  +L GL  EYES V  I  +  P TV E+ 
Sbjct: 152 MTEYLLKIKSITDALIAIGSSEVSDHEHVQCVLEGLPQEYESFVTGIHMRTEPCTVYELE 211

Query: 211 SLLLTQENRIESKIASTESSLPSANL-------MVHSKPPESDLQKSNTNHFSPNPGSG- 270
            LL+ QE R+E  + +T +  PSAN+          S  P +   +S+   F+   G G 
Sbjct: 212 PLLIAQEIRVEKNLKATITETPSANVANTDSKDKNSSAKPNNYQNQSSNRQFNNQRGGGR 271

Query: 271 -----NRGRGGG--RGGFNTNRGGRSWNNRNRP--QCQVCGKFNHTAPKCFFRYAPFGSS 330
                 RGRG G  RGG++  RGG S NN NRP   CQVC K  H A  C+ R+      
Sbjct: 272 QSFGQGRGRGNGSPRGGYSP-RGGYS-NNSNRPYVMCQVCSKPGHIASACYHRF-DSAYH 331

Query: 331 NTPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEY 390
            +   +S +F QF +  S P M A + +P++  D+ W+PDSGATNH+T   NNL  G+EY
Sbjct: 332 LSEQQYSQHFGQFRQRTS-PNMSAFIATPEVVTDSAWFPDSGATNHVTSDANNLMTGSEY 391

Query: 391 GGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNG 450
            G  Q+H+GNG GL I +   +   S    N    LN+LLHV +ITKNLISVS+FAKDN 
Sbjct: 392 TGPEQLHMGNGTGLLISSVGQSLVKSSK-PNLHLTLNHLLHVPNITKNLISVSKFAKDNC 451

Query: 451 VYFEFHPTLCYVKDQVFGQVLLQGTL-HDGLYR---------SHLYSSSLVEPQDKLPVQ 510
           V+FEFH   C VK QV  QVLL+GT+  DGLY+         S+ Y SS   P    P  
Sbjct: 452 VFFEFHSNYCVVKCQVTKQVLLKGTVRQDGLYQFDEFQLCHLSNPYLSSTSSPSSSPPCA 511

Query: 511 ALTSQLFLSPSNVNCFVFDL--------WHRRLGHSSLSTVKSVIQRFNPRLLINNKFQF 570
            + S    S S  + FVF++        WH RLGH++ + V +V++  N  +   +  +F
Sbjct: 512 FVNSASSSSVSVPSTFVFNVTASNLYSTWHYRLGHANSAVVNNVLKLCNVSISNKSTIEF 571

Query: 571 CSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW 579
           C AC  GK H LP   S ++YT PL+L + TDLWGP  I S  GY Y ISF+DAFSRY W
Sbjct: 572 CDACCRGKHHKLPSPPSQSIYTRPLEL-VYTDLWGPAPISSCNGYLYYISFVDAFSRYVW 625

BLAST of Lag0041193 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 8.7e-65
Identity = 185/581 (31.84%), Postives = 294/581 (50.60%), Query Frame = 0

Query: 26  NKISIVKLTDDNFLLWK---------FQILMALEGSSSAEPVRQET-----LNPAYTLWK 85
           N  ++ KLT  N+L+W          +++   L+GS++  P    T     +NP YT WK
Sbjct: 19  NMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWK 78

Query: 86  KQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKG 145
           +QD++I S ++G++S  +   +   T++ +IW +L++I+   +   + +++T+L+   KG
Sbjct: 79  RQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKG 138

Query: 146 GMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQE 205
             ++ +Y   +    D LA++GKP++ ++ +  +L  L  EY+ ++  I+AK  P T+ E
Sbjct: 139 TKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTE 198

Query: 206 VMSLLLTQENRIESKIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGG 265
           +   LL  E++I +  ++T   + +AN + H     +    +N N+ + N    NR    
Sbjct: 199 IHERLLNHESKILAVSSATVIPI-TANAVSH----RNTTTTNNNNNGNRNNRYDNR---- 258

Query: 266 GRGGFNTNRGGRSW----------NNRNRP---QCQVCGKFNHTAPKCFFRYAPFGSSNT 325
                N N   + W          NN+++P   +CQ+CG   H+A +C        S N+
Sbjct: 259 -----NNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNS 318

Query: 326 PGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGG 385
               SP        P  P+    + SP      NW  DSGAT+H+T  FNNLS+   Y G
Sbjct: 319 QQPPSP------FTPWQPRANLALGSP--YSSNNWLLDSGATHHITSDFNNLSLHQPYTG 378

Query: 386 GNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVY 445
           G+ V V +G+ +PI +   TS S+    +R   L+N+L+V +I KNLISV +    NGV 
Sbjct: 379 GDDVMVADGSTIPISHTGSTSLST---KSRPLNLHNILYVPNIHKNLISVYRLCNANGVS 438

Query: 446 FEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPSN 505
            EF P    VKD   G  LLQG   D LY   + SS  V              LF SPS+
Sbjct: 439 VEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQPV-------------SLFASPSS 498

Query: 506 VNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLL-INNKFQFCSACAMGKVHNLPFHNSTT 565
                   WH RLGH + S + SVI  ++  +L  ++KF  CS C + K + +PF  ST 
Sbjct: 499 K--ATHSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTI 557

Query: 566 VYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW 579
             T PL+  I +D+W  + I S   YRY + F+D F+RYTW
Sbjct: 559 NSTRPLE-YIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTW 557

BLAST of Lag0041193 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 222.2 bits (565), Expect = 1.5e-56
Identity = 179/575 (31.13%), Postives = 283/575 (49.22%), Query Frame = 0

Query: 26  NKISIVKLTDDNFLLWK---------FQILMALEGSSSAEPVRQET-----LNPAYTLWK 85
           N  ++ KLT  N+L+W          +++   L+GS+   P    T     +NP YT W+
Sbjct: 19  NMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPATIGTDAVPRVNPDYTRWR 78

Query: 86  KQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKG 145
           +QD++I S ++G++S  +   +   T++ +IW +L++I+   +   +    T+L+ I + 
Sbjct: 79  RQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHV----TQLRFITR- 138

Query: 146 GMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQE 205
                          D LA++GKP++ ++ +  +L  L  +Y+ ++  I+AK  P ++ E
Sbjct: 139 --------------FDQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTE 198

Query: 206 VMSLLLTQENRIESKIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGG 265
           +   L+ +E+++ + + S E    +AN++ H        + +NTN    N G  NR    
Sbjct: 199 IHERLINRESKLLA-LNSAEVVPITANVVTH--------RNTNTNRNQNNRGD-NRNYNN 258

Query: 266 GRGGFN----TNRGGRSWNNRNRP---QCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSP 325
                N    ++ G RS N + +P   +CQ+C    H+A +C   +    ++N   S SP
Sbjct: 259 NNNRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSP 318

Query: 326 NFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHV 385
                   P  P+    V SP      NW  DSGAT+H+T  FNNLS    Y GG+ V +
Sbjct: 319 ------FTPWQPRANLAVNSP--YNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMI 378

Query: 386 GNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPT 445
            +G+ +PI   + T  +S   S+R   LN +L+V +I KNLISV +    N V  EF P 
Sbjct: 379 ADGSTIPI---THTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPA 438

Query: 446 LCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVF 505
              VKD   G  LLQG   D LY   + SS  V              +F SP +      
Sbjct: 439 SFQVKDLNTGVPLLQGKTKDELYEWPIASSQAV-------------SMFASPCSK--ATH 498

Query: 506 DLWHRRLGHSSLSTVKSVIQRFN-PRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPL 565
             WH RLGH SL+ + SVI   + P L  ++K   CS C + K H +PF NST   + PL
Sbjct: 499 SSWHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPL 536

Query: 566 QLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW 579
           +  I +D+W  + I S   YRY + F+D F+RYTW
Sbjct: 559 E-YIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTW 536

BLAST of Lag0041193 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 1.4e-14
Identity = 124/571 (21.72%), Postives = 215/571 (37.65%), Query Frame = 0

Query: 25  GNKISIVKLTDDN-FLLWK-----FQILMALEGSSSAEPVRQETLNPAYTLWKKQDRMIS 84
           G K  + K   DN F  W+       I   L      +  + +T+      W   D   +
Sbjct: 3   GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAED--WADLDERAA 62

Query: 85  SWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKG-GMSLKE 144
           S +   +S+++++ +I   +++ IW  L+ ++ ++ L   + +K +L  +    G +   
Sbjct: 63  SAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLS 122

Query: 145 YFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLL 204
           + +     I  LA +G  +E ED  + +L  L S Y+++   I        +++V S LL
Sbjct: 123 HLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALL 182

Query: 205 TQENRIESKIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFN 264
             E                       K PE+  Q   T           RGR   R   N
Sbjct: 183 LNEKM--------------------RKKPENQGQALITE---------GRGRSYQRSSNN 242

Query: 265 TNRGGRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTPGSFSPNFNQFNRPPSYPQ 324
             R G    ++NR + +V   +N   P  F R  P       G  S   N  N       
Sbjct: 243 YGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCP-NPRKGKGETSGQKNDDNTAAMVQN 302

Query: 325 MEAMVTSPDLNQ--------DTNWYPDSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAG 384
            + +V   +  +        ++ W  D+ A++H T    +L      G    V +GN + 
Sbjct: 303 NDNVVLFINEEEECMHLSGPESEWVVDTAASHHAT-PVRDLFCRYVAGDFGTVKMGNTSY 362

Query: 385 LPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVK 444
             I         ++V    +  L ++ HV  +  NLIS     +D    +  +      K
Sbjct: 363 SKIAGIGDICIKTNVGCTLV--LKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTK 422

Query: 445 DQVFGQVLLQGTLHDGLYRSH--LYSSSLVEPQDKLPVQALTSQLFLSPSNVNCFVFDLW 504
             +   V+ +G     LYR++  +    L   QD++ V                   DLW
Sbjct: 423 GSL---VIAKGVARGTLYRTNAEICQGELNAAQDEISV-------------------DLW 482

Query: 505 HRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLII 564
           H+R+GH S   ++ + ++           + C  C  GK H + F  S+      L L +
Sbjct: 483 HKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDL-V 515

Query: 565 VTDLWGPTYIPSSQGYRYCISFIDAFSRYTW 579
            +D+ GP  I S  G +Y ++FID  SR  W
Sbjct: 543 YSDVCGPMEIESMGGNKYFVTFIDDASRKLW 515

BLAST of Lag0041193 vs. ExPASy TrEMBL
Match: A0A5A7U233 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G00060 PE=4 SV=1)

HSP 1 Score: 559.3 bits (1440), Expect = 1.9e-155
Identity = 323/605 (53.39%), Postives = 407/605 (67.27%), Query Frame = 0

Query: 2   STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSS 61
           ST S       E ++P +Q F  GNKIS+VKL DD FLLWKFQIL ALE         S 
Sbjct: 3   STSSLLGVENTEASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESE 62

Query: 62  AEPVRQ-------------ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKE 121
           +EP  +              T NPAY +WK+QDR+ISSWL+GSMSEEIL+QM+HC S+KE
Sbjct: 63  SEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKE 122

Query: 122 IWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDH 181
           IW +L+ IF++R LAQ M+ K KL  I+KG M LKEYF KI Q +DALA + KPV  +DH
Sbjct: 123 IWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDH 182

Query: 182 ILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMV 241
           IL+ILAGLGS+Y+SM+ VISA+    +VQEVMSLLLTQE++ ESK+ S E++LPS N++ 
Sbjct: 183 ILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLIS-ETALPSVNIVT 242

Query: 242 HS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF 301
            +  K  ES + ++N N++  N     R   GGRG   +NRG R   NRN+PQCQ+C K 
Sbjct: 243 QTTEKGAESYI-RTNQNNYHNNHSYNQR---GGRGNGRSNRGRR--GNRNKPQCQICAKL 302

Query: 302 NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDS 361
            ++A +CFFRY P  +S+  +P S + ++   N   ++PQM AMV + DLN D+NWYPDS
Sbjct: 303 GYSADRCFFRYTPRSNSSGYSPNSHNTSYTNMN---NHPQMSAMVAALDLNIDSNWYPDS 362

Query: 362 GATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLH 421
           GATNHLTHS +NLSIG+EYGGGNQ++  NG+GLPI ++   SF+S     + F LNNLL 
Sbjct: 363 GATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQ 422

Query: 422 VSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLV 481
           V SITKNLISVSQFAKDN V+FEFHPTLCYVKD   GQVLLQG L+DGLY+        +
Sbjct: 423 VPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYK------FTI 482

Query: 482 EPQDKL--PVQALTSQLF--LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLIN 541
           EP  K      + T  +F  + P + N  + DLWHRRLGH  L  VK+V+   +      
Sbjct: 483 EPSHKRLHHSNSNTKPVFNTVVPKS-NTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTI 542

Query: 542 NKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF 579
           NK  FC ACA+GK H LPF +S T+YT PLQL I  DLWGP    S  G+RY ISF+DA+
Sbjct: 543 NKLNFCEACALGKHHALPFSHSLTLYTHPLQL-ITCDLWGPAVNVSHNGFRYYISFVDAY 589

BLAST of Lag0041193 vs. ExPASy TrEMBL
Match: A0A5D3CH97 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00040 PE=4 SV=1)

HSP 1 Score: 559.3 bits (1440), Expect = 1.9e-155
Identity = 323/605 (53.39%), Postives = 407/605 (67.27%), Query Frame = 0

Query: 2   STESSTSASMEETANPSSQTFSPGNKISIVKLTDDNFLLWKFQILMALEG-------SSS 61
           ST S       E ++P +Q F  GNKIS+VKL DD FLLWKFQIL ALE         S 
Sbjct: 3   STSSLLGVENTEASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESE 62

Query: 62  AEPVRQ-------------ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKE 121
           +EP  +              T NPAY +WK+QDR+ISSWL+GSMSEEIL+QM+HC S+KE
Sbjct: 63  SEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKE 122

Query: 122 IWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDH 181
           IW +L+ IF++R LAQ M+ K KL  I+KG M LKEYF KI Q +DALA + KPV  +DH
Sbjct: 123 IWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDH 182

Query: 182 ILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESSLPSANLMV 241
           IL+ILAGLGS+Y+SM+ VISA+    +VQEVMSLLLTQE++ ESK+ S E++LPS N++ 
Sbjct: 183 ILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLIS-ETALPSVNIVT 242

Query: 242 HS--KPPESDLQKSNTNHFSPNPGSGNRGRGGGRGGFNTNRGGRSWNNRNRPQCQVCGKF 301
            +  K  ES + ++N N++  N     R   GGRG   +NRG R   NRN+PQCQ+C K 
Sbjct: 243 QTTEKGAESYI-RTNQNNYHNNHSYNQR---GGRGNGRSNRGRR--GNRNKPQCQICAKL 302

Query: 302 NHTAPKCFFRYAPFGSSN--TPGSFSPNFNQFNRPPSYPQMEAMVTSPDLNQDTNWYPDS 361
            ++A +CFFRY P  +S+  +P S + ++   N   ++PQM AMV + DLN D+NWYPDS
Sbjct: 303 GYSADRCFFRYTPRSNSSGYSPNSHNTSYTNMN---NHPQMSAMVAALDLNIDSNWYPDS 362

Query: 362 GATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLH 421
           GATNHLTHS +NLSIG+EYGGGNQ++  NG+GLPI ++   SF+S     + F LNNLL 
Sbjct: 363 GATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQ 422

Query: 422 VSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLV 481
           V SITKNLISVSQFAKDN V+FEFHPTLCYVKD   GQVLLQG L+DGLY+        +
Sbjct: 423 VPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYK------FTI 482

Query: 482 EPQDKL--PVQALTSQLF--LSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFNPRLLIN 541
           EP  K      + T  +F  + P + N  + DLWHRRLGH  L  VK+V+   +      
Sbjct: 483 EPSHKRLHHSNSNTKPVFNTVVPKS-NTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTI 542

Query: 542 NKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAF 579
           NK  FC ACA+GK H LPF +S T+YT PLQL I  DLWGP    S  G+RY ISF+DA+
Sbjct: 543 NKLNFCEACALGKHHALPFSHSLTLYTHPLQL-ITCDLWGPAVNVSHNGFRYYISFVDAY 589

BLAST of Lag0041193 vs. ExPASy TrEMBL
Match: A0A438FJP6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1134 PE=4 SV=1)

HSP 1 Score: 397.1 bits (1019), Expect = 1.2e-106
Identity = 241/615 (39.19%), Postives = 351/615 (57.07%), Query Frame = 0

Query: 23  SPGNKISIVKLTDDNFLLWKFQILMALEG--------SSSAEPVRQET-------LNPAY 82
           SP +++  ++L DDNFL+WK+QI  A+ G         +   P +  T        NP +
Sbjct: 144 SPLSQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMVTDKIGVLVPNPKF 203

Query: 83  TLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMKIKTKLQT 142
             +++QD ++ SWL+ S+    L Q++ C+S+ E+W ++ Q F +++ A++M  K+++Q 
Sbjct: 204 RDYQRQDHLLISWLLSSIGSAFLPQVVGCSSAFEVWNTISQNFNSQSSAKVMFYKSQMQM 263

Query: 143 IQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVISAKIGPQ 202
           ++K G+++++Y +K++ Y D LA  G  +   DHIL I+ GLG EYES++ VIS+K    
Sbjct: 264 LKKDGLTMRDYLTKMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSP 323

Query: 203 TVQEVMSLLLTQENRIESKIASTESSLPSANLMVHSKPPESDLQKSNTNHFSPNPGSGNR 262
           ++Q V S L+  E RI  KI+S + S+   +   +  P  S     N+N + P+ G  NR
Sbjct: 324 SLQYVTSTLIAHEGRIAHKISSNDLSVNYTSQYSNRGPSSS----WNSNGY-PSSGFQNR 383

Query: 263 GRGGG----RGGFNTNRG---GRSWNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNTP- 322
            + GG    RG F  NRG   GR+     +PQCQ+C KF HT  +CF+RY P    N P 
Sbjct: 384 NQFGGNQVTRGSFVHNRGRGRGRA-QGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPA 443

Query: 323 -----------------GSFSP----NFNQFNRPPS--YPQMEAMVTSPDLNQDTNWYPD 382
                            GS S     N  +++   +  Y +MEAMV +P+  Q+  W+PD
Sbjct: 444 NGPTPGVLGSGARNGASGSISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPD 503

Query: 383 SGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLL 442
           SGATNH+TH   NL+ G EY G +++H+GNG GL I +   + F S    N++  L N+L
Sbjct: 504 SGATNHVTHDLGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNIL 563

Query: 443 HVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHL----- 502
            V +I KNL+SVSQFA+DN VYFEFHP +C+VKD+    +LLQG LH GLY+ +L     
Sbjct: 564 RVPAIKKNLLSVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLF 623

Query: 503 -YSSSLVEPQDKLPVQALTSQL-------FLSPSNVNCFVFDLWHRRLGHSSLSTVKSVI 562
             +S L    DK  +    + L       F   +N +  VFDLWH+RLGH +   V  V+
Sbjct: 624 GKASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVL 683

Query: 563 QRFNPRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGY 579
                     +    CSAC +GK HNLPF  S TVYT PLQL +V+DLWGP  I SS G+
Sbjct: 684 NDNKIPFSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQL-VVSDLWGPAPINSSYGF 743

BLAST of Lag0041193 vs. ExPASy TrEMBL
Match: A0A2Z7AWA7 (Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472368 GN=F511_06348 PE=4 SV=1)

HSP 1 Score: 396.7 bits (1018), Expect = 1.6e-106
Identity = 227/524 (43.32%), Postives = 318/524 (60.69%), Query Frame = 0

Query: 61  ETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTTRNLAQMMK 120
           E +NP +  W +QD+++ S+L+ SMSE    QMI C +S ++W  + Q+F TR+ A++M+
Sbjct: 9   EVMNPNFVTWNRQDQLLFSFLLASMSESAQSQMIGCQTSSQLWTRVTQLFATRSKARVMQ 68

Query: 121 IKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSEYESMVFVI 180
            K +LQT++KG +S+K+Y  K++ YID LA  G  +  +D IL IL G+G EYES+V  +
Sbjct: 69  YKLQLQTLKKGNLSMKDYLGKMKGYIDILAACGNSIPEDDQILHILGGVGPEYESVVVHV 128

Query: 181 SAKIGPQTVQEVMSLLLTQENRIES-KIASTESSLPSANLMVHSKPPESDLQKSNTNHFS 240
           ++++   ++ EV +LLL  E RIE+  I    ++ PS N+        S  +  NT+   
Sbjct: 129 TSRVESLSLSEVGALLLAHEGRIETYNITGGHTASPSVNVTT----APSQRKAENTSQSQ 188

Query: 241 PNPGSGNRGRGGGRGGFNTNRGGRS-WNNRNRPQCQVCGKFNHTAPKCFFRYAPFGSSNT 300
           P      RGRG GR G    RGGR  W+N  RP CQ+CG   H A  C++R+       +
Sbjct: 189 P----VYRGRGRGRNG----RGGRKPWHNNGRPVCQICGIPGHVAEICYYRFDKEFVPKS 248

Query: 301 PGSFSPNFNQFNR-PPSYPQMEAMVTSPDLNQDTNWYPDSGATNHLTHSFNNLSIGTEYG 360
            G    +  QFNR  PSYP      T  +   +  WYPDSGA++H+T+   NLS+ +EY 
Sbjct: 249 SGVSRTSQQQFNRSSPSYPPSAFASTKSESASEEWWYPDSGASHHVTNDLGNLSVSSEYT 308

Query: 361 GGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNLLHVSSITKNLISVSQFAKDNGV 420
           GG++V VGNGAGL I N   ++ +    S+R F L NLLHV  ITKNLISVS+FA DN V
Sbjct: 309 GGSKVQVGNGAGLSISNIGESNLNMFP-SSRPFLLKNLLHVPLITKNLISVSKFAYDNHV 368

Query: 421 YFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSSLVEPQDKLPVQALTSQLFLSPS 480
           YFEFHP+ C VKD     VLL+GTLH+GLYR +L S           +Q+  S + +   
Sbjct: 369 YFEFHPSFCLVKDPATHVVLLRGTLHNGLYRFNLKSRISGPLHSPACLQSSVSPIKVPDQ 428

Query: 481 NVNCF---VFDLWHRRLGHSSLSTVKSVIQRFNPRLLINNKFQFCSACAMGKVHNLPFHN 540
           +  C      D WH RLGH S++TVK V+   N R+  N+   FCS+C +GK H LPF  
Sbjct: 429 SPLCLPQNTLDKWHLRLGHPSIATVKQVLLDCNERISKNDNISFCSSCQLGKNHLLPFPQ 488

Query: 541 STTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCISFIDAFSRYTW 579
           STT ++AP + ++ +DLWGP +IPS  G RY ISF+DA++RYTW
Sbjct: 489 STTNFSAPFE-VVYSDLWGPAHIPSRNGSRYYISFVDAYTRYTW 518

BLAST of Lag0041193 vs. ExPASy TrEMBL
Match: A0A438H844 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_3152 PE=4 SV=1)

HSP 1 Score: 369.8 bits (948), Expect = 2.1e-98
Identity = 238/611 (38.95%), Postives = 344/611 (56.30%), Query Frame = 0

Query: 11  MEETANPSSQTFSPGNKISIV--KLTDDNFLLWKFQILMALEGSS--------------- 70
           MEET+  +S  F P +    V  KL + NFL+W+ QIL  L G                 
Sbjct: 1   MEETSRTTS--FLPVSFPHPVSSKLDNHNFLVWRKQILTTLRGHKLQHFLSETSVLPSEF 60

Query: 71  -SAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTT 130
            S++   Q  +NP +  W++QD++I SWL+ S+++ +L +M++C +S ++W +L+  F T
Sbjct: 61  LSSDDETQNHVNPKFQDWEQQDQLIMSWLLASITDALLTRMVNCDTSAQVWKTLELYFAT 120

Query: 131 RNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPVEVEDHILFILAGLGSE 190
           +  A++ + KT+L   +KG +S+ +Y  KI+  +D LA+VG  + V+DHI  I  GL  +
Sbjct: 121 QVRAKVTQFKTQLHNTKKGDLSISDYLLKIRNVVDLLALVGHKISVKDHIDAIFEGLPQD 180

Query: 191 YESMVFVISAKIGPQTVQEVMSLLLTQENRIESKIASTESSLPS-ANLMVHSK--PPESD 250
           YE+ +  +++++ P TV+E+  LLL QE+RIE  I   + S PS A+L+  ++   P  +
Sbjct: 181 YETFIISVNSRLDPYTVEEIEVLLLAQESRIEKNIKIADLSTPSLAHLITTNRNGSPHFN 240

Query: 251 LQKSNTN-HFSPNPGSGNRGRGGGRGGFNTNRGGR----SWNNRNRPQCQVCGKFNHTAP 310
            + S  N +F P   SGN G    RG F     GR    SW   N+PQCQ+CG+  H   
Sbjct: 241 YRASTRNSNFRPPTHSGN-GMQHFRGNFTQQGRGRHGRGSWKGNNKPQCQLCGRIGHVVM 300

Query: 311 KCFFRYAPFGSSNTPGSFSPNFNQFNRPPSYPQM---------EAMVTSPDLNQDTNWYP 370
           +C++R+    S   P     N  Q N    + Q+             T+ ++ QD NWYP
Sbjct: 301 QCYYRFDQ--SFTGPSQLQGNRPQGNMAHLHQQLSENFFPGTSSVKPTTAEIIQDNNWYP 360

Query: 371 DSGATNHLTHSFNNLSIGTEYGGGNQVHVGNGAGLPILNFSFTSFSSHVCSNRIFRLNNL 430
           DSGAT+HLT + NNL   +++   ++V VGNG GLPI +   TSFSS    ++   L  L
Sbjct: 361 DSGATHHLTPNLNNLLTKSQFPSSDEVFVGNGKGLPIHHIGHTSFSSSFIPSKTLALKQL 420

Query: 431 LHVSSITKNLISVSQFAKDNGVYFEFHPTLCYVKDQVFGQVLLQGTLHDGLYRSHLYSSS 490
           LHV  ITKNL+SVS+FA DN V+FEFHPT C+VKD     VL+ G L  GLY   ++ ++
Sbjct: 421 LHVPEITKNLLSVSKFAADNHVFFEFHPTSCFVKDLSTRTVLMHGQLKGGLY---VFDNT 480

Query: 491 LVEPQDKLPVQ--------ALTSQLFLSPSNVNCFVFDLWHRRLGHSSLSTVKSVIQRFN 550
               Q KLP+         AL S+    P++ +   F LWH RLGH S   V  V+ + N
Sbjct: 481 ----QLKLPLHNSSCFASTALPSKEPTVPAS-STSPFTLWHNRLGHPSSHIVSLVLNKCN 540

Query: 551 PRLLINNKFQFCSACAMGKVHNLPFHNSTTVYTAPLQLIIVTDLWGPTYIPSSQGYRYCI 579
              L       CSAC MGK+H  PF +S + YT PL+L I TDLWGP   PSS G++Y I
Sbjct: 541 LPHLNKIPSLICSACCMGKIHKSPFLHSKSSYTKPLEL-IHTDLWGPISTPSSHGHQYYI 597

BLAST of Lag0041193 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 78.6 bits (192), Expect = 1.9e-14
Identity = 67/270 (24.81%), Postives = 135/270 (50.00%), Query Frame = 0

Query: 39  LLWKFQILMALEGSSSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIH--C 98
           L   F +L  ++GSS+  P+ ++        WK++D ++  W+ G++++ +L  +I   C
Sbjct: 43  LCLSFGVLGHIDGSSTPTPMTEKR-------WKERDGLVKMWIYGTITDSLLDTIIKVGC 102

Query: 99  TSSKEIWVSLKQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYIDALAVVGKPV 158
           T ++++W+SL+ +F     A+ ++ + +L+T     +S+ EY  K++   D L  V  P+
Sbjct: 103 T-ARDLWLSLENLFRDNKEARALQFENELRTTTIDDLSVHEYCQKLKSLSDLLTNVDSPI 162

Query: 159 EVEDHILFILAGLGSEYESMVFVISAKIGPQTVQEVMSLLLTQENRI--ESKIASTESSL 218
                ++ +L GL  +Y+ ++ VI  K    +  E  S+LL +E+R+  +SK + + ++ 
Sbjct: 163 SDRVLVMHLLNGLTEKYDYILNVIKHKSPFPSFTEARSMLLMEESRLSNKSKSSLSHTNH 222

Query: 219 PSANLMVHSKPPESDLQKSNTNHFSPNPGSG-----NRGRGGGRGGFNTNRGGRSWNNRN 278
           PS + ++ + P + +      ++ + N G G     NRG G   G +N N   R     N
Sbjct: 223 PSLSNVLFTVPRQQERYPQEYHNNNSNMGRGRSKKKNRGGGSSDGRYNNNNNWR----LN 282

Query: 279 RPQCQVCG------KFNHTAPKCFFRYAPF 294
           +P   + G       + H  P+ F +   F
Sbjct: 283 QPPTWIYGPPQSPYMYPHGGPQFFHKKTYF 300

BLAST of Lag0041193 vs. TAIR 10
Match: AT1G21280.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 63.2 bits (152), Expect = 8.2e-10
Identity = 37/152 (24.34%), Postives = 83/152 (54.61%), Query Frame = 0

Query: 5   SSTSASMEETANPSSQTFSP-----GNKISIVKLT--DDNFLLWKFQILMALE-----GS 64
           + T  S+  T++P S  + P      +  SI KL+  +DN++ WK +    L      G 
Sbjct: 2   AETIKSVSPTSDPDSPYYLPPDIHHPSDFSIQKLSKDEDNYVAWKIRFRSFLRVTKKFGF 61

Query: 65  SSAEPVRQETLNPAYTLWKKQDRMISSWLVGSMSEEILHQMIHCTSSKEIWVSLKQIFTT 124
                 + +  +P Y  W++ + M+  WL+ SM++++L  +++  ++ ++W  L+++F  
Sbjct: 62  IDGTLPKPDPFSPLYQPWEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMWEDLRRVFVP 121

Query: 125 RNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQ 145
               ++ +++ +L T+++GG S++EYF K+ +
Sbjct: 122 CVDLKIYQLRRRLATLRQGGDSVEEYFGKLSK 153

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0048297.13.9e-15553.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
TYK10642.13.9e-15553.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
RVW60229.12.6e-10639.19Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
KZV26181.13.3e-10643.32hypothetical protein F511_06348 [Dorcoceras hygrometricum][more]
KAF7832320.11.8e-9940.33Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora][more]
Match NameE-valueIdentityDescription
Q94HW28.7e-6531.84Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT941.5e-5631.13Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109781.4e-1421.72Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Match NameE-valueIdentityDescription
A0A5A7U2331.9e-15553.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5D3CH971.9e-15553.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A438FJP61.2e-10639.19Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A2Z7AWA71.6e-10643.32Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472... [more]
A0A438H8442.1e-9838.95Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976... [more]
Match NameE-valueIdentityDescription
AT5G48050.11.9e-1424.81CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... [more]
AT1G21280.18.2e-1024.34CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Ha... [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 70..203
e-value: 3.2E-20
score: 72.3
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 227..243
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 214..269
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..21
NoneNo IPR availablePANTHERPTHR34222:SF48SUBFAMILY NOT NAMEDcoord: 61..380
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 61..380
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 483..526
e-value: 1.2E-9
score: 37.9

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0041193.1Lag0041193.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005488 binding