Lag0017663 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0017663
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr5: 6476477 .. 6478243 (-)
RNA-Seq ExpressionLag0017663
SyntenyLag0017663
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGGAAGATATCCTTCATCAGATGATTCATTGCTCTTCAACGAAAGCTATATGGTCTTGTCTCGGGCAAATCTTCACTACACGTAACTTGGCCCAAATGATGAAAATCAAAACCAAGTTACAAACCATCCAGAAGGGAGGTATGTCACTCAAAGAATATTTTTCAAAAATCCAACAATATGTTGATGCTTTGTCTGCTGTTGGGAAACCGGTGGATGTTGAGGACCATATCTTATTTATTTTATCTGGTTTGGGCTCTGACTATGAATCGATGGTGTCTGTCATTTCTGCTAAAATTGGTCCTCAGTCGGTTCACGAAGTTATGTCGCTTTTATTGACTCAAGAAAATCGTAATGAAAGTAAGTTGTCAAGTTCTGAAACTGCCCTCCCCTCTGTGAACCTTACGGTGAGTCCGAAACCTCCTGATTCTGAGTCCCCGAAACCTAATCCCAATCCATATCCTTCATCCTTCACTGGTGGGAATCGAGGGCGTGGTGGTGGTCGTTCTAGTTCCAACCGTGGAGGACGCACCTGGAACAACCGTAATCGGATTCAATGTCAAGTGTGTGGGAAATTTGGCCACACTGCTCAACGTTGCTATTTTCGTTATGCCCCGCCTGGTCCCTCTAATAATCCTGCCTCATTTTCTCCACACTTTAATCAGTCTAATCGTCCAAATCAATTTCCACAGATGGCTGCTATGCTCACTGCTCCTGATATTAATCATGATACCAGCTGGTACCCTGACTCCGGTGCAACGAATCATCTTACTCATTCCTTTGGTAATCTCTCAGTAGGTACCGAGTATGGTGGCGGAAATCAAGTTCATGTGGGAAATGGAGCAGGTTTGCCAATACTTAACTATGGTTACTCTTCTTTTTCTTCTCCTACTTGTGCTAATCGGGTCTTCTTTTTAAATAATCTTCTTCATGTCCCTTCCATAACGAAAAATCTTATTAGTGTGAGTCAGTTTGCTAGAGATAATGGTGTATTTTTTGAGTTTCATCCAACGTTGTGTTATGTGAAGGATCAAGCATCTGGTCGGGTTCTGCTCCAAGGGACTCTCCATGAAGGTCTCTACCGCTTCAACCTCTCAGTTCCCTCGTCTTCCACGCCGATTAAGAAGGATACAGCGGTTCAGACTCTTCTTTCTCAGTCTCTCTCTTCTTCTCCGACTGTCTTATCTGTGTCTGGTGGTTCTGACTTAAATGTATGGCATAGACGTCTCGACCATCCTAGTTTAGCCATTGTTAAATCTGTTTTACGGTTACAACAGCCTCAAATGTCCATAAATAATGATTTTCAGTTCTGTACTGCCTGTGCGTTGGGGAAAACTCATAGTTTACCTTTTTTTCCCTCTCATACAGTGTACTCTGCCCCTCTTCAATTAATAGTATCAGATCTTTGGGGCCCTGCTTATATACCTTCTGTCTCAGGCTATCGTTACTATATTACATTTGTGGATGTCTTTAGTCGGTATACATGGATCTATTTTTTGAAATCTAAGTCTGATGCTTTGAATGTGTTTCTTAAATTCAAATTACATGTGGAAAATCTTCTAGGTTTATCTATCAAAACCTTCCAATCTGACAGTGGAGGTGAGTTCAAATCTTTTTCTTCCATGTTGAATAGCTATGGCATTTCTCATCGCTTTACTTGTCCTCACACTTCCAAACAAAACGGCATTGTTGAGCGTAAGCACAGACATGTAGTGGATACAGGGTTAGCTCTTCTTTCTCATTCCTCTATGCCTTTAAAATATTGA

mRNA sequence

ATGTCGGAAGATATCCTTCATCAGATGATTCATTGCTCTTCAACGAAAGCTATATGGTCTTGTCTCGGGCAAATCTTCACTACACGTAACTTGGCCCAAATGATGAAAATCAAAACCAAGTTACAAACCATCCAGAAGGGAGGTATGTCACTCAAAGAATATTTTTCAAAAATCCAACAATATGTTGATGCTTTGTCTGCTGTTGGGAAACCGGTGGATGTTGAGGACCATATCTTATTTATTTTATCTGGTTTGGGCTCTGACTATGAATCGATGGTGTCTGTCATTTCTGCTAAAATTGGTCCTCAGTCGGTTCACGAAGTTATGTCGCTTTTATTGACTCAAGAAAATCGTAATGAAAGTAAGTTGTCAAGTTCTGAAACTGCCCTCCCCTCTGTGAACCTTACGGTGAGTCCGAAACCTCCTGATTCTGAGTCCCCGAAACCTAATCCCAATCCATATCCTTCATCCTTCACTGGTGGGAATCGAGGGCGTGGTGGTGGTCGTTCTAGTTCCAACCGTGGAGGACGCACCTGGAACAACCGTAATCGGATTCAATGTCAAGTGTGTGGGAAATTTGGCCACACTGCTCAACGTTGCTATTTTCGTTATGCCCCGCCTGGTCCCTCTAATAATCCTGCCTCATTTTCTCCACACTTTAATCAGTCTAATCGTCCAAATCAATTTCCACAGATGGCTGCTATGCTCACTGCTCCTGATATTAATCATGATACCAGCTGGTACCCTGACTCCGGTGCAACGAATCATCTTACTCATTCCTTTGGTAATCTCTCAGTAGGTACCGAGTATGGTGGCGGAAATCAAGTTCATGTGGGAAATGGAGCAGGTTTGCCAATACTTAACTATGGTTACTCTTCTTTTTCTTCTCCTACTTGTGCTAATCGGGTCTTCTTTTTAAATAATCTTCTTCATGTCCCTTCCATAACGAAAAATCTTATTAGTGTGAGTCAGTTTGCTAGAGATAATGGTGTATTTTTTGAGTTTCATCCAACGTTGTGTTATGTGAAGGATCAAGCATCTGGTCGGGTTCTGCTCCAAGGGACTCTCCATGAAGGTCTCTACCGCTTCAACCTCTCAGTTCCCTCGTCTTCCACGCCGATTAAGAAGGATACAGCGGTTCAGACTCTTCTTTCTCAGTCTCTCTCTTCTTCTCCGACTGTCTTATCTGTGTCTGGTGGTTCTGACTTAAATGTATGGCATAGACGTCTCGACCATCCTAGTTTAGCCATTGTTAAATCTGTTTTACGGTTACAACAGCCTCAAATGTCCATAAATAATGATTTTCAGTTCTGTACTGCCTGTGCGTTGGGGAAAACTCATAGTTTACCTTTTTTTCCCTCTCATACAGTGTACTCTGCCCCTCTTCAATTAATAGTATCAGATCTTTGGGGCCCTGCTTATATACCTTCTGTCTCAGGCTATCGTTACTATATTACATTTGTGGATGTCTTTAGTCGGTATACATGGATCTATTTTTTGAAATCTAAGTCTGATGCTTTGAATGTGTTTCTTAAATTCAAATTACATGTGGAAAATCTTCTAGGTTTATCTATCAAAACCTTCCAATCTGACAGTGGAGGTGAGTTCAAATCTTTTTCTTCCATGTTGAATAGCTATGGCATTTCTCATCGCTTTACTTGTCCTCACACTTCCAAACAAAACGGCATTGTTGAGCGTAAGCACAGACATGTAGTGGATACAGGGTTAGCTCTTCTTTCTCATTCCTCTATGCCTTTAAAATATTGA

Coding sequence (CDS)

ATGTCGGAAGATATCCTTCATCAGATGATTCATTGCTCTTCAACGAAAGCTATATGGTCTTGTCTCGGGCAAATCTTCACTACACGTAACTTGGCCCAAATGATGAAAATCAAAACCAAGTTACAAACCATCCAGAAGGGAGGTATGTCACTCAAAGAATATTTTTCAAAAATCCAACAATATGTTGATGCTTTGTCTGCTGTTGGGAAACCGGTGGATGTTGAGGACCATATCTTATTTATTTTATCTGGTTTGGGCTCTGACTATGAATCGATGGTGTCTGTCATTTCTGCTAAAATTGGTCCTCAGTCGGTTCACGAAGTTATGTCGCTTTTATTGACTCAAGAAAATCGTAATGAAAGTAAGTTGTCAAGTTCTGAAACTGCCCTCCCCTCTGTGAACCTTACGGTGAGTCCGAAACCTCCTGATTCTGAGTCCCCGAAACCTAATCCCAATCCATATCCTTCATCCTTCACTGGTGGGAATCGAGGGCGTGGTGGTGGTCGTTCTAGTTCCAACCGTGGAGGACGCACCTGGAACAACCGTAATCGGATTCAATGTCAAGTGTGTGGGAAATTTGGCCACACTGCTCAACGTTGCTATTTTCGTTATGCCCCGCCTGGTCCCTCTAATAATCCTGCCTCATTTTCTCCACACTTTAATCAGTCTAATCGTCCAAATCAATTTCCACAGATGGCTGCTATGCTCACTGCTCCTGATATTAATCATGATACCAGCTGGTACCCTGACTCCGGTGCAACGAATCATCTTACTCATTCCTTTGGTAATCTCTCAGTAGGTACCGAGTATGGTGGCGGAAATCAAGTTCATGTGGGAAATGGAGCAGGTTTGCCAATACTTAACTATGGTTACTCTTCTTTTTCTTCTCCTACTTGTGCTAATCGGGTCTTCTTTTTAAATAATCTTCTTCATGTCCCTTCCATAACGAAAAATCTTATTAGTGTGAGTCAGTTTGCTAGAGATAATGGTGTATTTTTTGAGTTTCATCCAACGTTGTGTTATGTGAAGGATCAAGCATCTGGTCGGGTTCTGCTCCAAGGGACTCTCCATGAAGGTCTCTACCGCTTCAACCTCTCAGTTCCCTCGTCTTCCACGCCGATTAAGAAGGATACAGCGGTTCAGACTCTTCTTTCTCAGTCTCTCTCTTCTTCTCCGACTGTCTTATCTGTGTCTGGTGGTTCTGACTTAAATGTATGGCATAGACGTCTCGACCATCCTAGTTTAGCCATTGTTAAATCTGTTTTACGGTTACAACAGCCTCAAATGTCCATAAATAATGATTTTCAGTTCTGTACTGCCTGTGCGTTGGGGAAAACTCATAGTTTACCTTTTTTTCCCTCTCATACAGTGTACTCTGCCCCTCTTCAATTAATAGTATCAGATCTTTGGGGCCCTGCTTATATACCTTCTGTCTCAGGCTATCGTTACTATATTACATTTGTGGATGTCTTTAGTCGGTATACATGGATCTATTTTTTGAAATCTAAGTCTGATGCTTTGAATGTGTTTCTTAAATTCAAATTACATGTGGAAAATCTTCTAGGTTTATCTATCAAAACCTTCCAATCTGACAGTGGAGGTGAGTTCAAATCTTTTTCTTCCATGTTGAATAGCTATGGCATTTCTCATCGCTTTACTTGTCCTCACACTTCCAAACAAAACGGCATTGTTGAGCGTAAGCACAGACATGTAGTGGATACAGGGTTAGCTCTTCTTTCTCATTCCTCTATGCCTTTAAAATATTGA

Protein sequence

MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY
Homology
BLAST of Lag0017663 vs. NCBI nr
Match: KAA0048297.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 606.3 bits (1562), Expect = 2.8e-169
Identity = 331/591 (56.01%), Postives = 412/591 (69.71%), Query Frame = 0

Query: 1   MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQ 60
           MSE+IL+QM+HC S K IW  L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q
Sbjct: 106 MSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQ 165

Query: 61  YVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNE 120
            VDAL+++ KPV  +DHIL+IL+GLGSDY+SM+SVISA+    SV EVMSLLLTQE++NE
Sbjct: 166 CVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNE 225

Query: 121 SKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTW 180
           SKL  SETALPSVN+        +ES  + N N Y ++ +   RG G G   SNRG R  
Sbjct: 226 SKL-ISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRG-GRGNGRSNRGRR-- 285

Query: 181 NNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLT 240
            NRN+ QCQ+C K G++A RC+FRY    P +N + +SP  H       N  PQM+AM+ 
Sbjct: 286 GNRNKPQCQICAKLGYSADRCFFRYT---PRSNSSGYSPNSHNTSYTNMNNHPQMSAMVA 345

Query: 241 APDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP 300
           A D+N D++WYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +YG  SF+S 
Sbjct: 346 ALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSS 405

Query: 301 TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLH 360
           T   + F LNNLL VPSITKNLISVSQFA+DN VFFEFHPTLCYVKD  +G+VLLQG L+
Sbjct: 406 TLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLN 465

Query: 361 EGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAI 420
           +GLY+F +  PS       ++  + + +  +  S T L       L++WHRRL HP L I
Sbjct: 466 DGLYKFTIE-PSHKRLHHSNSNTKPVFNTVVPKSNTPL-------LDLWHRRLGHPHLPI 525

Query: 421 VKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPS 480
           VK+VL          N   FC ACALGK H+LPF  S T+Y+ PLQLI  DLWGPA   S
Sbjct: 526 VKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVS 585

Query: 481 VSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFK 540
            +G+RYYI+FVD +SRYTWIYFL SKSDA   F KFK  VE  LG SIK+ Q+D G EFK
Sbjct: 586 HNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFK 645

Query: 541 SFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
            F   L+ +GI HR TCP+TSKQN IVERKHR++++ GL LLS +++PL +
Sbjct: 646 PFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSF 681

BLAST of Lag0017663 vs. NCBI nr
Match: TYK10642.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 606.3 bits (1562), Expect = 2.8e-169
Identity = 331/591 (56.01%), Postives = 412/591 (69.71%), Query Frame = 0

Query: 1   MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQ 60
           MSE+IL+QM+HC S K IW  L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q
Sbjct: 106 MSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQ 165

Query: 61  YVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNE 120
            VDAL+++ KPV  +DHIL+IL+GLGSDY+SM+SVISA+    SV EVMSLLLTQE++NE
Sbjct: 166 CVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNE 225

Query: 121 SKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTW 180
           SKL  SETALPSVN+        +ES  + N N Y ++ +   RG G G   SNRG R  
Sbjct: 226 SKL-ISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRG-GRGNGRSNRGRR-- 285

Query: 181 NNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLT 240
            NRN+ QCQ+C K G++A RC+FRY    P +N + +SP  H       N  PQM+AM+ 
Sbjct: 286 GNRNKPQCQICAKLGYSADRCFFRYT---PRSNSSGYSPNSHNTSYTNMNNHPQMSAMVA 345

Query: 241 APDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP 300
           A D+N D++WYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +YG  SF+S 
Sbjct: 346 ALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSS 405

Query: 301 TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLH 360
           T   + F LNNLL VPSITKNLISVSQFA+DN VFFEFHPTLCYVKD  +G+VLLQG L+
Sbjct: 406 TLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLN 465

Query: 361 EGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAI 420
           +GLY+F +  PS       ++  + + +  +  S T L       L++WHRRL HP L I
Sbjct: 466 DGLYKFTIE-PSHKRLHHSNSNTKPVFNTVVPKSNTPL-------LDLWHRRLGHPHLPI 525

Query: 421 VKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPS 480
           VK+VL          N   FC ACALGK H+LPF  S T+Y+ PLQLI  DLWGPA   S
Sbjct: 526 VKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVS 585

Query: 481 VSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFK 540
            +G+RYYI+FVD +SRYTWIYFL SKSDA   F KFK  VE  LG SIK+ Q+D G EFK
Sbjct: 586 HNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFK 645

Query: 541 SFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
            F   L+ +GI HR TCP+TSKQN IVERKHR++++ GL LLS +++PL +
Sbjct: 646 PFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSF 681

BLAST of Lag0017663 vs. NCBI nr
Match: KZV26181.1 (hypothetical protein F511_06348 [Dorcoceras hygrometricum])

HSP 1 Score: 497.3 bits (1279), Expect = 1.8e-136
Identity = 276/593 (46.54%), Postives = 385/593 (64.92%), Query Frame = 0

Query: 1   MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQ 60
           MSE    QMI C ++  +W+ + Q+F TR+ A++M+ K +LQT++KG +S+K+Y  K++ 
Sbjct: 33  MSESAQSQMIGCQTSSQLWTRVTQLFATRSKARVMQYKLQLQTLKKGNLSMKDYLGKMKG 92

Query: 61  YVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNE 120
           Y+D L+A G  +  +D IL IL G+G +YES+V  +++++   S+ EV +LLL  E R E
Sbjct: 93  YIDILAACGNSIPEDDQILHILGGVGPEYESVVVHVTSRVESLSLSEVGALLLAHEGRIE 152

Query: 121 S-KLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGR-T 180
           +  ++   TA PSVN+T +P    +E+   +   Y        RGRG GR  + RGGR  
Sbjct: 153 TYNITGGHTASPSVNVTTAPSQRKAENTSQSQPVY--------RGRGRGR--NGRGGRKP 212

Query: 181 WNNRNRIQCQVCGKFGHTAQRCYFRYAP---PGPSNNPASFSPHFNQSNRPNQFPQMAAM 240
           W+N  R  CQ+CG  GH A+ CY+R+     P  S    +    FN+S+    +P  A  
Sbjct: 213 WHNNGRPVCQICGIPGHVAEICYYRFDKEFVPKSSGVSRTSQQQFNRSS--PSYPPSAFA 272

Query: 241 LTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFS 300
            T  +   +  WYPDSGA++H+T+  GNLSV +EY GG++V VGNGAGL I N G S+ +
Sbjct: 273 STKSESASEEWWYPDSGASHHVTNDLGNLSVSSEYTGGSKVQVGNGAGLSISNIGESNLN 332

Query: 301 SPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGT 360
               ++R F L NLLHVP ITKNLISVS+FA DN V+FEFHP+ C VKD A+  VLL+GT
Sbjct: 333 M-FPSSRPFLLKNLLHVPLITKNLISVSKFAYDNHVYFEFHPSFCLVKDPATHVVLLRGT 392

Query: 361 LHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSL 420
           LH GLYRFNL     S P+     +Q+ +S       + L +   + L+ WH RL HPS+
Sbjct: 393 LHNGLYRFNLK-SRISGPLHSPACLQSSVSPIKVPDQSPLCLPQNT-LDKWHLRLGHPSI 452

Query: 421 AIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYI 480
           A VK VL     ++S N++  FC++C LGK H LPF  S T +SAP +++ SDLWGPA+I
Sbjct: 453 ATVKQVLLDCNERISKNDNISFCSSCQLGKNHLLPFPQSTTNFSAPFEVVYSDLWGPAHI 512

Query: 481 PSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGE 540
           PS +G RYYI+FVD ++RYTWIYFLK KS+    F+ F+ + E      IKT Q+D GGE
Sbjct: 513 PSRNGSRYYISFVDAYTRYTWIYFLKLKSEVTQTFINFQKYTELHFNAKIKTLQTDGGGE 572

Query: 541 FKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
           F+S ++   S GI HRF+CP+TSKQNG+VERKHRHVVDTGL+LL+H+S+P ++
Sbjct: 573 FRSLTAYCQSNGILHRFSCPYTSKQNGVVERKHRHVVDTGLSLLAHASLPFEF 610

BLAST of Lag0017663 vs. NCBI nr
Match: RVW60229.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 474.9 bits (1221), Expect = 9.8e-130
Identity = 268/622 (43.09%), Postives = 370/622 (59.49%), Query Frame = 0

Query: 6   LHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDAL 65
           L Q++ CSS   +W+ + Q F +++ A++M  K+++Q ++K G+++++Y +K++ Y D L
Sbjct: 226 LPQVVGCSSAFEVWNTISQNFNSQSSAKVMFYKSQMQMLKKDGLTMRDYLTKMKNYCDLL 285

Query: 66  SAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSS 125
           +  G  +   DHIL I+ GLG +YES+++VIS+K    S+  V S L+  E R   K+SS
Sbjct: 286 ATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQYVTSTLIAHEGRIAHKISS 345

Query: 126 SETALPSVNLTVSPKPPDSESPKPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRT 185
           ++    SVN T S       S   N N YPSS       F G    RG    +  RG   
Sbjct: 346 NDL---SVNYT-SQYSNRGPSSSWNSNGYPSSGFQNRNQFGGNQVTRGSFVHNRGRGRGR 405

Query: 186 WNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPA------------------SFSPHF 245
                + QCQ+C KFGHT  RC++RY P    N PA                  S S   
Sbjct: 406 AQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPGVLGSGARNGASGSISSAG 465

Query: 246 N------QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGN 305
           N       +     + +M AM+  P+   +  W+PDSGATNH+TH  GNL+ G EY G +
Sbjct: 466 NVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNHVTHDLGNLNSGAEYNGNS 525

Query: 306 QVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFE 365
           ++H+GNG GL I + G S F S +  N+V FL N+L VP+I KNL+SVSQFARDN V+FE
Sbjct: 526 KIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLLSVSQFARDNNVYFE 585

Query: 366 FHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSSSTPIKKDTAVQTLLSQSL-- 425
           FHP +C+VKD+++  +LLQG LH+GLY+FNLS      +S   +  D    T  + SL  
Sbjct: 586 FHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASLVH 645

Query: 426 --SSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKT 485
             +S     + S     ++WH+RL HP+  IV  VL   +   S  +    C+AC LGK+
Sbjct: 646 NDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLGKS 705

Query: 486 HSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDA 545
           H+LPF  S TVY+ PLQL+VSDLWGPA I S  G+ YY++FVD +SRYTW+YFLK+KS  
Sbjct: 706 HNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKSQT 765

Query: 546 LNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVER 589
              FL FK   E   G  +KTFQ+D GGEF+S  +     GI HR +CPHTSKQNGI+ER
Sbjct: 766 REAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGIIER 825

BLAST of Lag0017663 vs. NCBI nr
Match: RVW44519.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 448.0 bits (1151), Expect = 1.3e-121
Identity = 255/580 (43.97%), Postives = 342/580 (58.97%), Query Frame = 0

Query: 48  GMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHE 107
           G+++++Y +K++ Y D L+  G  +   DHIL I+ GLG +YES+++VIS+K    S+  
Sbjct: 132 GLTMRDYLTKMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQY 191

Query: 108 VMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSS-------FTG 167
           V S L+  E R   K+SS++    SVN T S       S   N N YPSS       F G
Sbjct: 192 VTSTLIAHEGRIAHKISSNDL---SVNYT-SQYSNRGPSSSWNSNGYPSSGFQNRNQFGG 251

Query: 168 GNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPA------ 227
               RG    +  RG        + QCQ+C KFGHT  RC++RY P    N PA      
Sbjct: 252 NQVTRGSFVHNRGRGRGRAQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPG 311

Query: 228 ------------SFSPHFN------QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNH 287
                       S S   N       +     + +M AM+  P+   +  W+PDSGATNH
Sbjct: 312 VLGSGARNGASGSISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNH 371

Query: 288 LTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSIT 347
           +TH  GNL+ G EY G +++H+GNG GL I + G S F S +  N+V FL N+L VP+I 
Sbjct: 372 VTHDLGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIK 431

Query: 348 KNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSSST 407
           KNL+SVSQFARDN V+FEFHP +C+VKD+++  +LLQG LH+GLY+FNLS      +S  
Sbjct: 432 KNLLSVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGL 491

Query: 408 PIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQ 467
            +  D    T  + SL    +S     + S     ++WH+RL HP+  IV  VL   +  
Sbjct: 492 SLSNDKNELTCCNASLVHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLNDNKIP 551

Query: 468 MSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFV 527
            S  +    C+AC LGK+H+LPF  S TVY+ PLQL+VSDLWGPA I S  G+ YY++FV
Sbjct: 552 FSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFV 611

Query: 528 DVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGI 587
           D +SRYTW+YFLK+KS     FL FK   E   G  +KTFQ+D GGEF+S  +     GI
Sbjct: 612 DAYSRYTWVYFLKTKSQTREAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGI 671

Query: 588 SHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
            HR +CPHTSKQNGI+ERKHRH+V+ GL LL+ +S+PLKY
Sbjct: 672 IHRLSCPHTSKQNGIIERKHRHIVELGLTLLAQASLPLKY 707

BLAST of Lag0017663 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 1.2e-74
Identity = 197/587 (33.56%), Postives = 302/587 (51.45%), Query Frame = 0

Query: 18  IWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPVDVEDH 77
           IW  L +I+   +   + +++T+L+   KG  ++ +Y   +    D L+ +GKP+D ++ 
Sbjct: 109 IWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQ 168

Query: 78  ILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSSSETALPSVNLTV 137
           +  +L  L  +Y+ ++  I+AK  P ++ E+   LL  E++      SS T +P     V
Sbjct: 169 VERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESK--ILAVSSATVIPITANAV 228

Query: 138 SPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTW----------NNRNRI-- 197
           S +   + +   N          GNR       ++N   + W          NN+++   
Sbjct: 229 SHRNTTTTNNNNN----------GNRNNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYL 288

Query: 198 -QCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSPHFNQSNRPNQFP--QMAAMLTAPDIN 257
            +CQ+CG  GH+A+RC          +    F    N    P+ F   Q  A L      
Sbjct: 289 GKCQICGVQGHSAKRC----------SQLQHFLSSVNSQQPPSPFTPWQPRANLALGSPY 348

Query: 258 HDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANR 317
              +W  DSGAT+H+T  F NLS+   Y GG+ V V +G+ +PI + G +S S+    +R
Sbjct: 349 SSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLST---KSR 408

Query: 318 VFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYR 377
              L+N+L+VP+I KNLISV +    NGV  EF P    VKD  +G  LLQG   + LY 
Sbjct: 409 PLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYE 468

Query: 378 FNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVL 437
           + +   +SS P+            SL +SP     S  +  + WH RL HP+ +I+ SV+
Sbjct: 469 WPI---ASSQPV------------SLFASP-----SSKATHSSWHARLGHPAPSILNSVI 528

Query: 438 -RLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGY 497
                  ++ ++ F  C+ C + K++ +PF  S    + PL+ I SD+W  + I S   Y
Sbjct: 529 SNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNY 588

Query: 498 RYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSS 557
           RYY+ FVD F+RYTW+Y LK KS     F+ FK  +EN     I TF SD+GGEF +   
Sbjct: 589 RYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWE 648

Query: 558 MLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
             + +GISH  + PHT + NG+ ERKHRH+V+TGL LLSH+S+P  Y
Sbjct: 649 YFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTY 649

BLAST of Lag0017663 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 268.9 bits (686), Expect = 1.4e-70
Identity = 189/530 (35.66%), Postives = 277/530 (52.26%), Query Frame = 0

Query: 63  DALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESK 122
           D L+ +GKP+D ++ +  +L  L  DY+ ++  I+AK  P S+ E+   L+ +E++  + 
Sbjct: 135 DQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLA- 194

Query: 123 LSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTWNNR 182
           L+S+E    + N+       ++ + +   N   +     N  R      S+ G R+ N +
Sbjct: 195 LNSAEVVPITANVVTH---RNTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQ 254

Query: 183 NRI---QCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQMAAMLTAP 242
            +    +CQ+C   GH+A+RC   +     +N   S SP      R N        L   
Sbjct: 255 PKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPWQPRAN--------LAVN 314

Query: 243 DINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTC 302
              +  +W  DSGAT+H+T  F NLS    Y GG+ V + +G+ +PI + G  S S PT 
Sbjct: 315 SPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTG--SASLPT- 374

Query: 303 ANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEG 362
           ++R   LN +L+VP+I KNLISV +    N V  EF P    VKD  +G  LLQG   + 
Sbjct: 375 SSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDE 434

Query: 363 LYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVK 422
           LY +         PI    AV      S+ +SP   +         WH RL HPSLAI+ 
Sbjct: 435 LYEW---------PIASSQAV------SMFASPCSKATHSS-----WHSRLGHPSLAILN 494

Query: 423 SVLRLQQ-PQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSV 482
           SV+     P ++ ++    C+ C + K+H +PF  S    S PL+ I SD+W  + I S+
Sbjct: 495 SVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWS-SPILSI 554

Query: 483 SGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKS 542
             YRYY+ FVD F+RYTW+Y LK KS   + F+ FK  VEN     I T  SD+GGEF  
Sbjct: 555 DNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVV 614

Query: 543 FSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
               L+ +GISH  + PHT + NG+ ERKHRH+V+ GL LLSH+S+P  Y
Sbjct: 615 LRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTY 628

BLAST of Lag0017663 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 145.2 bits (365), Expect = 2.3e-33
Identity = 148/613 (24.14%), Postives = 250/613 (40.78%), Query Frame = 0

Query: 1   MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKG-GMSLKEYFSKIQ 60
           +S+D+++ +I   + + IW+ L  ++ ++ L   + +K +L  +    G +   + +   
Sbjct: 67  LSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFN 126

Query: 61  QYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRN 120
             +  L+ +G  ++ ED  + +L+ L S Y+++ + I        + +V S LL  E   
Sbjct: 127 GLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLLNEKMR 186

Query: 121 ESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSN--RGGR 180
                               K P+++           +     RGR   RSS+N  R G 
Sbjct: 187 --------------------KKPENQG---------QALITEGRGRSYQRSSNNYGRSGA 246

Query: 181 TWNNRNRIQ-----CQVCGKFGHTAQRCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQM 240
              ++NR +     C  C + GH  + C      P P       S   N  N        
Sbjct: 247 RGKSKNRSKSRVRNCYNCNQPGHFKRDC------PNPRKGKGETSGQKNDDN-------T 306

Query: 241 AAMLTAPD-----INH----------DTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVH 300
           AAM+   D     IN           ++ W  D+ A++H T    +L      G    V 
Sbjct: 307 AAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHAT-PVRDLFCRYVAGDFGTVK 366

Query: 301 VGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHP 360
           +GN +   I   G       T       L ++ HVP +  NLIS     RD    +E + 
Sbjct: 367 MGNTSYSKIA--GIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDG---YESYF 426

Query: 361 TLCYVKDQASGRVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSV 420
                +      V+ +G     LYR N  +                    L+++   +SV
Sbjct: 427 ANQKWRLTKGSLVIAKGVARGTLYRTNAEI----------------CQGELNAAQDEISV 486

Query: 421 SGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTV 480
                 ++WH+R+ H S   ++ + +      +     + C  C  GK H + F  S   
Sbjct: 487 ------DLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSER 546

Query: 481 YSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHV 540
               L L+ SD+ GP  I S+ G +Y++TF+D  SR  W+Y LK+K     VF KF   V
Sbjct: 547 KLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALV 606

Query: 541 ENLLGLSIKTFQSDSGGEFKS--FSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTG 589
           E   G  +K  +SD+GGE+ S  F    +S+GI H  T P T + NG+ ER +R +V+  
Sbjct: 607 ERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKV 609

BLAST of Lag0017663 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 80.5 bits (197), Expect = 7.0e-14
Identity = 128/606 (21.12%), Postives = 235/606 (38.78%), Query Frame = 0

Query: 1   MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQ-KGGMSLKEYFSKIQ 60
           +S+  L+      + + I   L  ++  ++LA  + ++ +L +++    MSL  +F    
Sbjct: 63  LSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLKLSSEMSLLSHFHIFD 122

Query: 61  QYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQ-SVHEVMSLLLTQENR 120
           + +  L A G  ++  D I  +L  L S Y+ +++ I        ++  V + LL QE +
Sbjct: 123 ELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLSEENLTLAFVKNRLLDQEIK 182

Query: 121 NESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRT 180
              K   ++T+   +N  V            N N Y ++           R +  +    
Sbjct: 183 --IKNDHNDTSKKVMNAIV----------HNNNNTYKNNLF-------KNRVTKPKKIFK 242

Query: 181 WNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSPHFNQSNRPNQFPQMAAMLTA 240
            N++ +++C  CG+ GH  + C F Y      NN    +    Q+   +    M   +  
Sbjct: 243 GNSKYKVKCHHCGREGHIKKDC-FHY--KRILNNKNKENEKQVQTATSHGIAFMVKEVNN 302

Query: 241 PDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPT 300
             +  +  +  DSGA++HL        +  E    + V V     + +   G   +++  
Sbjct: 303 TSVMDNCGFVLDSGASDHL--------INDESLYTDSVEVVPPLKIAVAKQGEFIYATKR 362

Query: 301 CANRV-----FFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQ 360
              R+       L ++L       NL+SV +  ++ G+  EF                  
Sbjct: 363 GIVRLRNDHEITLEDVLFCKEAAGNLMSVKRL-QEAGMSIEF------------------ 422

Query: 361 GTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTV------LSVSGGSDLNVWH 420
                           S   I K+  +    S  L++ P +      ++    ++  +WH
Sbjct: 423 --------------DKSGVTISKNGLMVVKNSGMLNNVPVINFQAYSINAKHKNNFRLWH 482

Query: 421 RRLDHPSLAIVKSVLR--LQQPQMSINN---DFQFCTACALGKTHSLPF--FPSHTVYSA 480
            R  H S   +  + R  +   Q  +NN     + C  C  GK   LPF      T    
Sbjct: 483 ERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHIKR 542

Query: 481 PLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENL 540
           PL ++ SD+ GP    ++    Y++ FVD F+ Y   Y +K KSD  ++F  F    E  
Sbjct: 543 PLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAH 602

Query: 541 LGLSIKTFQSDSGGEFKS--FSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLAL 585
             L +     D+G E+ S          GIS+  T PHT + NG+ ER  R + +    +
Sbjct: 603 FNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTM 605

BLAST of Lag0017663 vs. ExPASy Swiss-Prot
Match: Q07791 (Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-DR3 PE=3 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 2.1e-13
Identity = 75/296 (25.34%), Postives = 128/296 (43.24%), Query Frame = 0

Query: 310 LHVPSITKNLISVSQFARDNGVFFEFHPTLCYVK---DQASGRVLLQGTLHEGLYRFNLS 369
           LH P+I  +L+S+S+ A  N        T C+ +   +++ G VL     H   Y  +  
Sbjct: 513 LHTPNIAYDLLSLSELANQN-------ITACFTRNTLERSDGTVLAPIVKHGDFYWLSKK 572

Query: 370 --VPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLR- 429
             +PS    I K T      S+S++  P  L           HR L H +   ++  L+ 
Sbjct: 573 YLIPSH---ISKLTINNVNKSKSVNKYPYPLI----------HRMLGHANFRSIQKSLKK 632

Query: 430 -----LQQPQMSINNDFQF-CTACALGK-THSLPFFPSHTVYS---APLQLIVSDLWGPA 489
                L++  +  +N   + C  C +GK T       S   Y     P Q + +D++GP 
Sbjct: 633 NAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHIKGSRLKYQESYEPFQYLHTDIFGPV 692

Query: 490 YIPSVSGYRYYITFVDVFSRYTWIYFL--KSKSDALNVFLKFKLHVENLLGLSIKTFQSD 549
           +    S   Y+I+F D  +R+ W+Y L  + +   LNVF      ++N     +   Q D
Sbjct: 693 HHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMD 752

Query: 550 SGGEF--KSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMP 586
            G E+  K+      + GI+  +T    S+ +G+ ER +R +++    LL  S +P
Sbjct: 753 RGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLP 788

BLAST of Lag0017663 vs. ExPASy TrEMBL
Match: A0A5A7U233 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G00060 PE=4 SV=1)

HSP 1 Score: 606.3 bits (1562), Expect = 1.4e-169
Identity = 331/591 (56.01%), Postives = 412/591 (69.71%), Query Frame = 0

Query: 1   MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQ 60
           MSE+IL+QM+HC S K IW  L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q
Sbjct: 106 MSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQ 165

Query: 61  YVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNE 120
            VDAL+++ KPV  +DHIL+IL+GLGSDY+SM+SVISA+    SV EVMSLLLTQE++NE
Sbjct: 166 CVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNE 225

Query: 121 SKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTW 180
           SKL  SETALPSVN+        +ES  + N N Y ++ +   RG G G   SNRG R  
Sbjct: 226 SKL-ISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRG-GRGNGRSNRGRR-- 285

Query: 181 NNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLT 240
            NRN+ QCQ+C K G++A RC+FRY    P +N + +SP  H       N  PQM+AM+ 
Sbjct: 286 GNRNKPQCQICAKLGYSADRCFFRYT---PRSNSSGYSPNSHNTSYTNMNNHPQMSAMVA 345

Query: 241 APDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP 300
           A D+N D++WYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +YG  SF+S 
Sbjct: 346 ALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSS 405

Query: 301 TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLH 360
           T   + F LNNLL VPSITKNLISVSQFA+DN VFFEFHPTLCYVKD  +G+VLLQG L+
Sbjct: 406 TLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLN 465

Query: 361 EGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAI 420
           +GLY+F +  PS       ++  + + +  +  S T L       L++WHRRL HP L I
Sbjct: 466 DGLYKFTIE-PSHKRLHHSNSNTKPVFNTVVPKSNTPL-------LDLWHRRLGHPHLPI 525

Query: 421 VKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPS 480
           VK+VL          N   FC ACALGK H+LPF  S T+Y+ PLQLI  DLWGPA   S
Sbjct: 526 VKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVS 585

Query: 481 VSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFK 540
            +G+RYYI+FVD +SRYTWIYFL SKSDA   F KFK  VE  LG SIK+ Q+D G EFK
Sbjct: 586 HNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFK 645

Query: 541 SFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
            F   L+ +GI HR TCP+TSKQN IVERKHR++++ GL LLS +++PL +
Sbjct: 646 PFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSF 681

BLAST of Lag0017663 vs. ExPASy TrEMBL
Match: A0A5D3CH97 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00040 PE=4 SV=1)

HSP 1 Score: 606.3 bits (1562), Expect = 1.4e-169
Identity = 331/591 (56.01%), Postives = 412/591 (69.71%), Query Frame = 0

Query: 1   MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQ 60
           MSE+IL+QM+HC S K IW  L  IF++R LAQ M+ K KL  I+KG M LKEYF KI Q
Sbjct: 106 MSEEILNQMLHCKSAKEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQ 165

Query: 61  YVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNE 120
            VDAL+++ KPV  +DHIL+IL+GLGSDY+SM+SVISA+    SV EVMSLLLTQE++NE
Sbjct: 166 CVDALASINKPVSSDDHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNE 225

Query: 121 SKLSSSETALPSVNLTVSPKPPDSES-PKPNPNPYPSSFTGGNRGRGGGRSSSNRGGRTW 180
           SKL  SETALPSVN+        +ES  + N N Y ++ +   RG G G   SNRG R  
Sbjct: 226 SKL-ISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRG-GRGNGRSNRGRR-- 285

Query: 181 NNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPASFSP--HFNQSNRPNQFPQMAAMLT 240
            NRN+ QCQ+C K G++A RC+FRY    P +N + +SP  H       N  PQM+AM+ 
Sbjct: 286 GNRNKPQCQICAKLGYSADRCFFRYT---PRSNSSGYSPNSHNTSYTNMNNHPQMSAMVA 345

Query: 241 APDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSP 300
           A D+N D++WYPDSGATNHLTHS  NLS+G+EYGGGNQ++  NG+GLPI +YG  SF+S 
Sbjct: 346 ALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSS 405

Query: 301 TCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLH 360
           T   + F LNNLL VPSITKNLISVSQFA+DN VFFEFHPTLCYVKD  +G+VLLQG L+
Sbjct: 406 TLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLN 465

Query: 361 EGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSLAI 420
           +GLY+F +  PS       ++  + + +  +  S T L       L++WHRRL HP L I
Sbjct: 466 DGLYKFTIE-PSHKRLHHSNSNTKPVFNTVVPKSNTPL-------LDLWHRRLGHPHLPI 525

Query: 421 VKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPS 480
           VK+VL          N   FC ACALGK H+LPF  S T+Y+ PLQLI  DLWGPA   S
Sbjct: 526 VKAVLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVS 585

Query: 481 VSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFK 540
            +G+RYYI+FVD +SRYTWIYFL SKSDA   F KFK  VE  LG SIK+ Q+D G EFK
Sbjct: 586 HNGFRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFK 645

Query: 541 SFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
            F   L+ +GI HR TCP+TSKQN IVERKHR++++ GL LLS +++PL +
Sbjct: 646 PFKPFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSF 681

BLAST of Lag0017663 vs. ExPASy TrEMBL
Match: A0A2Z7AWA7 (Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472368 GN=F511_06348 PE=4 SV=1)

HSP 1 Score: 497.3 bits (1279), Expect = 8.9e-137
Identity = 276/593 (46.54%), Postives = 385/593 (64.92%), Query Frame = 0

Query: 1   MSEDILHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQ 60
           MSE    QMI C ++  +W+ + Q+F TR+ A++M+ K +LQT++KG +S+K+Y  K++ 
Sbjct: 33  MSESAQSQMIGCQTSSQLWTRVTQLFATRSKARVMQYKLQLQTLKKGNLSMKDYLGKMKG 92

Query: 61  YVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNE 120
           Y+D L+A G  +  +D IL IL G+G +YES+V  +++++   S+ EV +LLL  E R E
Sbjct: 93  YIDILAACGNSIPEDDQILHILGGVGPEYESVVVHVTSRVESLSLSEVGALLLAHEGRIE 152

Query: 121 S-KLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRSSSNRGGR-T 180
           +  ++   TA PSVN+T +P    +E+   +   Y        RGRG GR  + RGGR  
Sbjct: 153 TYNITGGHTASPSVNVTTAPSQRKAENTSQSQPVY--------RGRGRGR--NGRGGRKP 212

Query: 181 WNNRNRIQCQVCGKFGHTAQRCYFRYAP---PGPSNNPASFSPHFNQSNRPNQFPQMAAM 240
           W+N  R  CQ+CG  GH A+ CY+R+     P  S    +    FN+S+    +P  A  
Sbjct: 213 WHNNGRPVCQICGIPGHVAEICYYRFDKEFVPKSSGVSRTSQQQFNRSS--PSYPPSAFA 272

Query: 241 LTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFS 300
            T  +   +  WYPDSGA++H+T+  GNLSV +EY GG++V VGNGAGL I N G S+ +
Sbjct: 273 STKSESASEEWWYPDSGASHHVTNDLGNLSVSSEYTGGSKVQVGNGAGLSISNIGESNLN 332

Query: 301 SPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGT 360
               ++R F L NLLHVP ITKNLISVS+FA DN V+FEFHP+ C VKD A+  VLL+GT
Sbjct: 333 M-FPSSRPFLLKNLLHVPLITKNLISVSKFAYDNHVYFEFHPSFCLVKDPATHVVLLRGT 392

Query: 361 LHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHRRLDHPSL 420
           LH GLYRFNL     S P+     +Q+ +S       + L +   + L+ WH RL HPS+
Sbjct: 393 LHNGLYRFNLK-SRISGPLHSPACLQSSVSPIKVPDQSPLCLPQNT-LDKWHLRLGHPSI 452

Query: 421 AIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYI 480
           A VK VL     ++S N++  FC++C LGK H LPF  S T +SAP +++ SDLWGPA+I
Sbjct: 453 ATVKQVLLDCNERISKNDNISFCSSCQLGKNHLLPFPQSTTNFSAPFEVVYSDLWGPAHI 512

Query: 481 PSVSGYRYYITFVDVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGE 540
           PS +G RYYI+FVD ++RYTWIYFLK KS+    F+ F+ + E      IKT Q+D GGE
Sbjct: 513 PSRNGSRYYISFVDAYTRYTWIYFLKLKSEVTQTFINFQKYTELHFNAKIKTLQTDGGGE 572

Query: 541 FKSFSSMLNSYGISHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
           F+S ++   S GI HRF+CP+TSKQNG+VERKHRHVVDTGL+LL+H+S+P ++
Sbjct: 573 FRSLTAYCQSNGILHRFSCPYTSKQNGVVERKHRHVVDTGLSLLAHASLPFEF 610

BLAST of Lag0017663 vs. ExPASy TrEMBL
Match: A0A438FJP6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1134 PE=4 SV=1)

HSP 1 Score: 474.9 bits (1221), Expect = 4.7e-130
Identity = 268/622 (43.09%), Postives = 370/622 (59.49%), Query Frame = 0

Query: 6   LHQMIHCSSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDAL 65
           L Q++ CSS   +W+ + Q F +++ A++M  K+++Q ++K G+++++Y +K++ Y D L
Sbjct: 226 LPQVVGCSSAFEVWNTISQNFNSQSSAKVMFYKSQMQMLKKDGLTMRDYLTKMKNYCDLL 285

Query: 66  SAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSS 125
           +  G  +   DHIL I+ GLG +YES+++VIS+K    S+  V S L+  E R   K+SS
Sbjct: 286 ATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQYVTSTLIAHEGRIAHKISS 345

Query: 126 SETALPSVNLTVSPKPPDSESPKPNPNPYPSS-------FTGGNRGRGGGRSSSNRGGRT 185
           ++    SVN T S       S   N N YPSS       F G    RG    +  RG   
Sbjct: 346 NDL---SVNYT-SQYSNRGPSSSWNSNGYPSSGFQNRNQFGGNQVTRGSFVHNRGRGRGR 405

Query: 186 WNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPA------------------SFSPHF 245
                + QCQ+C KFGHT  RC++RY P    N PA                  S S   
Sbjct: 406 AQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPGVLGSGARNGASGSISSAG 465

Query: 246 N------QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNHLTHSFGNLSVGTEYGGGN 305
           N       +     + +M AM+  P+   +  W+PDSGATNH+TH  GNL+ G EY G +
Sbjct: 466 NVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNHVTHDLGNLNSGAEYNGNS 525

Query: 306 QVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSITKNLISVSQFARDNGVFFE 365
           ++H+GNG GL I + G S F S +  N+V FL N+L VP+I KNL+SVSQFARDN V+FE
Sbjct: 526 KIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLLSVSQFARDNNVYFE 585

Query: 366 FHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSSSTPIKKDTAVQTLLSQSL-- 425
           FHP +C+VKD+++  +LLQG LH+GLY+FNLS      +S   +  D    T  + SL  
Sbjct: 586 FHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASLVH 645

Query: 426 --SSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKT 485
             +S     + S     ++WH+RL HP+  IV  VL   +   S  +    C+AC LGK+
Sbjct: 646 NDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLGKS 705

Query: 486 HSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFVDVFSRYTWIYFLKSKSDA 545
           H+LPF  S TVY+ PLQL+VSDLWGPA I S  G+ YY++FVD +SRYTW+YFLK+KS  
Sbjct: 706 HNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKSQT 765

Query: 546 LNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGISHRFTCPHTSKQNGIVER 589
              FL FK   E   G  +KTFQ+D GGEF+S  +     GI HR +CPHTSKQNGI+ER
Sbjct: 766 REAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGIIER 825

BLAST of Lag0017663 vs. ExPASy TrEMBL
Match: A0A438EA49 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2917 PE=4 SV=1)

HSP 1 Score: 448.0 bits (1151), Expect = 6.2e-122
Identity = 255/580 (43.97%), Postives = 342/580 (58.97%), Query Frame = 0

Query: 48  GMSLKEYFSKIQQYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHE 107
           G+++++Y +K++ Y D L+  G  +   DHIL I+ GLG +YES+++VIS+K    S+  
Sbjct: 132 GLTMRDYLTKMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQY 191

Query: 108 VMSLLLTQENRNESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSS-------FTG 167
           V S L+  E R   K+SS++    SVN T S       S   N N YPSS       F G
Sbjct: 192 VTSTLIAHEGRIAHKISSNDL---SVNYT-SQYSNRGPSSSWNSNGYPSSGFQNRNQFGG 251

Query: 168 GNRGRGGGRSSSNRGGRTWNNRNRIQCQVCGKFGHTAQRCYFRYAPPGPSNNPA------ 227
               RG    +  RG        + QCQ+C KFGHT  RC++RY P    N PA      
Sbjct: 252 NQVTRGSFVHNRGRGRGRAQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPG 311

Query: 228 ------------SFSPHFN------QSNRPNQFPQMAAMLTAPDINHDTSWYPDSGATNH 287
                       S S   N       +     + +M AM+  P+   +  W+PDSGATNH
Sbjct: 312 VLGSGARNGASGSISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNH 371

Query: 288 LTHSFGNLSVGTEYGGGNQVHVGNGAGLPILNYGYSSFSSPTCANRVFFLNNLLHVPSIT 347
           +TH  GNL+ G EY G +++H+GNG GL I + G S F S +  N+V FL N+L VP+I 
Sbjct: 372 VTHDLGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIK 431

Query: 348 KNLISVSQFARDNGVFFEFHPTLCYVKDQASGRVLLQGTLHEGLYRFNLSV----PSSST 407
           KNL+SVSQFARDN V+FEFHP +C+VKD+++  +LLQG LH+GLY+FNLS      +S  
Sbjct: 432 KNLLSVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGL 491

Query: 408 PIKKDTAVQTLLSQSL----SSSPTVLSVSGGSDLNVWHRRLDHPSLAIVKSVLRLQQPQ 467
            +  D    T  + SL    +S     + S     ++WH+RL HP+  IV  VL   +  
Sbjct: 492 SLSNDKNELTCCNASLVHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLNDNKIP 551

Query: 468 MSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSDLWGPAYIPSVSGYRYYITFV 527
            S  +    C+AC LGK+H+LPF  S TVY+ PLQL+VSDLWGPA I S  G+ YY++FV
Sbjct: 552 FSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFV 611

Query: 528 DVFSRYTWIYFLKSKSDALNVFLKFKLHVENLLGLSIKTFQSDSGGEFKSFSSMLNSYGI 587
           D +SRYTW+YFLK+KS     FL FK   E   G  +KTFQ+D GGEF+S  +     GI
Sbjct: 612 DAYSRYTWVYFLKTKSQTREAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGI 671

Query: 588 SHRFTCPHTSKQNGIVERKHRHVVDTGLALLSHSSMPLKY 589
            HR +CPHTSKQNGI+ERKHRH+V+ GL LL+ +S+PLKY
Sbjct: 672 IHRLSCPHTSKQNGIIERKHRHIVELGLTLLAQASLPLKY 707

BLAST of Lag0017663 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 59.3 bits (142), Expect = 1.2e-08
Identity = 55/191 (28.80%), Postives = 93/191 (48.69%), Query Frame = 0

Query: 1   MSEDILHQMIHCSST-KAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQ 60
           +++ +L  +I    T + +W  L  +F     A+ ++ + +L+T     +S+ EY  K++
Sbjct: 82  ITDSLLDTIIKVGCTARDLWLSLENLFRDNKEARALQFENELRTTTIDDLSVHEYCQKLK 141

Query: 61  QYVDALSAVGKPVDVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENR- 120
              D L+ V  P+     ++ +L+GL   Y+ +++VI  K    S  E  S+LL +E+R 
Sbjct: 142 SLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILNVIKHKSPFPSFTEARSMLLMEESRL 201

Query: 121 -NESKLSSSETALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRS-SSNRGG 180
            N+SK S S T  PS++  +   P   E        YP  +   N   G GRS   NRGG
Sbjct: 202 SNKSKSSLSHTNHPSLSNVLFTVPRQQER-------YPQEYHNNNSNMGRGRSKKKNRGG 261

Query: 181 RT----WNNRN 184
            +    +NN N
Sbjct: 262 GSSDGRYNNNN 265

BLAST of Lag0017663 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 52.8 bits (125), Expect = 1.1e-06
Identity = 32/128 (25.00%), Postives = 53/128 (41.41%), Query Frame = 0

Query: 349 RVLLQGTLHEGLYRFNLSVPSSSTPIKKDTAVQTLLSQSLSSSPTVLSVSGGSDLNVWHR 408
           R +L+G  H+ LY                     +L  S+ +  + L+ +   +  +WH 
Sbjct: 36  RTILKGNRHDSLY---------------------ILQGSVETGESNLAETAKDETRLWHS 95

Query: 409 RLDHPSLAIVKSVLRLQQPQMSINNDFQFCTACALGKTHSLPFFPSHTVYSAPLQLIVSD 468
           RL H S   ++ +++      S  +  +FC  C  GKTH + F         PL  + SD
Sbjct: 96  RLAHMSQRGMELLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTKNPLDYVHSD 142

Query: 469 LWGPAYIP 477
           LWG   +P
Sbjct: 156 LWGAPSVP 142

BLAST of Lag0017663 vs. TAIR 10
Match: AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 45.4 bits (106), Expect = 1.8e-04
Identity = 46/184 (25.00%), Postives = 91/184 (49.46%), Query Frame = 0

Query: 13  SSTKAIWSCLGQIFTTRNLAQMMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALSAVGKPV 72
           S+++ IW  +   F     A+ +++ ++L+T   G M + +Y+ K+++  D+L  V  PV
Sbjct: 93  STSRDIWLRIKNQFRNNKDARALRLDSELRTKDIGDMRVADYYRKMKKLADSLRNVDVPV 152

Query: 73  DVEDHILFILSGLGSDYESMVSVISAKIGPQSVHEVMSLLLTQENRNESKLSSSET---- 132
              + ++++L+GL   ++++++VI  +    S  +  ++L  +E+R +  +  + T    
Sbjct: 153 TDRNLVMYVLNGLNPKFDNIINVIKHRQPFPSFDDAATMLQEEEDRLKRAIKPNPTHVDH 212

Query: 133 ALPSVNLTVSPKPPDSESPKPNPNPYPSSFTGGNRGRGGGRS-SSNRGGR-------TWN 185
           +  S  L  S  PP +   +   N        G RGRG G +    RGGR       T+N
Sbjct: 213 SSSSTVLACSEAPPVTNFQRSGGNQM------GYRGRGRGNNIFRGRGGRFSYYNMPTFN 270

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0048297.12.8e-16956.01Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
TYK10642.12.8e-16956.01Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
KZV26181.11.8e-13646.54hypothetical protein F511_06348 [Dorcoceras hygrometricum][more]
RVW60229.19.8e-13043.09Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
RVW44519.11.3e-12143.97Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
Q94HW21.2e-7433.56Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT941.4e-7035.66Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109782.3e-3324.14Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041467.0e-1421.12Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q077912.1e-1325.34Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
A0A5A7U2331.4e-16956.01Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5D3CH971.4e-16956.01Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A2Z7AWA78.9e-13746.54Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472... [more]
A0A438FJP64.7e-13043.09Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438EA496.2e-12243.97Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Match NameE-valueIdentityDescription
AT5G48050.11.2e-0828.80CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... [more]
ATMG00300.11.1e-0625.00Gag-Pol-related retrotransposon family protein [more]
AT1G34070.11.8e-0425.00CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 460..557
e-value: 4.4E-12
score: 46.2
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 457..588
score: 18.389437
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 387..445
e-value: 1.6E-9
score: 37.5
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 454..587
e-value: 1.6E-27
score: 98.1
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 1..121
e-value: 2.1E-16
score: 59.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 155..180
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 120..180
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 120..141
NoneNo IPR availablePANTHERPTHR34222:SF48SUBFAMILY NOT NAMEDcoord: 2..297
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 2..297
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 457..586

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0017663.1Lag0017663.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003676 nucleic acid binding