Tan0020073 (gene) Snake gourd v1

Overview
Name: Tan0020073
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Location: LG02: 8611242 .. 8612267 (+)
RNA-Seq Expression: Tan0020073
Synteny: Tan0020073
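
The Location field above gives the span of the feature on linkage group LG02. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for GFF-style annotation, but not stated on this page), the genomic length of the feature could be derived as shown below. The function name `parse_location` is hypothetical and only illustrates the parsing step.

```python
import re

def parse_location(text):
    """Parse a string like 'LG02: 8611242 .. 8612267 (+)'.

    Returns (seqid, start, end, strand). The format is inferred from
    this record only; other records may use a different layout.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("LG02: 8611242 .. 8612267 (+)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the span is the length of the unspliced gene sequence, introns included.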
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATGCATGTAGCATTATGGCAGGTACTTCAAAACACTCAAAGCATACGTGGACGAAAGTAGAGGATGCGAGGTTGGTAGAGTCACTGGTGTATTTAGTACATAATGGGTGGCGATCAGACAATGGGACATTCAGGCCTGGGTATCTCCAACATCTTCAGAAGATGCTAGCAGAGAAATTGCCAAATTCATCATTAGAACTAAATACCATCGACTGCAAAGTGAGAACTCTGGAAAAAACAATACAATCTTCATTGCGAGAGATGCTTGGGAATGGTTGTAGTGGTTTTGGTTGGAATGAAGAATTTAAATGTGTTGAAGCAGAGAAAGAGGTATTCGATGCGTGGGTTAAGGTAAGATAGTGTTATTATAACATATTATTATTAGCATTCATGGTAGATATATGTAATATATTTATTCACATGTAGAGCCATACAAATGCCAAAGGGATGAGGAACAAGCCATTTTCGCACTATGATGAGCTGGCAGTTGTCTTCGGAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCTAATGGAAATGACCTCTAATGTTGCGGAACAAATAGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATAGGGACAAAACAACGAACGATGGAGAATCCAGGACTTGGTGACGTAGGGGAAGATGACTTGCCAGACACTCCTACTAGTAGGCGTAATACATCTGACATGTCTTCTAGATGTACTGGGAGCAAAAGAAAACGATCGTCCTTCCAGACTGAATTAATTGATGTTGTGCGGACAACAATGGATATGCATACCAGTCACATGCAACAACTTCTATCATGACAGAAGGAGAAGTATGAATTGGAGGCCACACGAAGGAAGGAAGTAGTCGATCTCTTGTATCAGATAGAAGGATTGACTGAGCATGATCGTGTCTCCCTGATTGACTTGCTTGTGACTGATATCCAGAAGACTGATTGTTTTCTACAGGTTCCGCCTCAATCGAGAAAGGCGTATTGCATGCGTCTTCTAGGAAGGACTGAATGA

mRNA sequence

ATGCATGCATGTAGCATTATGGCAGGTACTTCAAAACACTCAAAGCATACGTGGACGAAAGTAGAGGATGCGAGGTTGGTAGAGTCACTGGTGTATTTAGTACATAATGGGTGGCGATCAGACAATGGGACATTCAGGCCTGGGTATCTCCAACATCTTCAGAAGATGCTAGCAGAGAAATTGCCAAATTCATCATTAGAACTAAATACCATCGACTGCAAAGTGAGAACTCTGGAAAAAACAATACAATCTTCATTGCGAGAGATGCTTGGGAATGGTTGTAGTGGTTTTGGTTGGAATGAAGAATTTAAATGTGTTGAAGCAGAGAAAGAGGTATTCGATGCGTGGGTTAAGAGCCATACAAATGCCAAAGGGATGAGGAACAAGCCATTTTCGCACTATGATGAGCTGGCAGTTGTCTTCGGAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCTAATGGAAATGACCTCTAATGTTGCGGAACAAATAGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATAGGGACAAAACAACGAACGATGGAGAATCCAGGACTTGGTGACGTAGGGGAAGATGACTTGCCAGACACTCCTACTAGTAGGCGTAATACATCTGACATGTCTTCTAGATGTACTGGGAGCAAAAGAAAACGATCGTCCTTCCAGACTGAATTAATTGATGTTAAGGAGAAGTATGAATTGGAGGCCACACGAAGGAAGGAAGTAGTCGATCTCTTGTATCAGATAGAAGGATTGACTGAGCATGATCGTGTCTCCCTGATTGACTTGCTTGTGACTGATATCCAGAAGACTGATTGTTTTCTACAGGTTCCGCCTCAATCGAGAAAGGCGTATTGCATGCGTCTTCTAGGAAGGACTGAATGA

Coding sequence (CDS)

ATGCATGCATGTAGCATTATGGCAGGTACTTCAAAACACTCAAAGCATACGTGGACGAAAGTAGAGGATGCGAGGTTGGTAGAGTCACTGGTGTATTTAGTACATAATGGGTGGCGATCAGACAATGGGACATTCAGGCCTGGGTATCTCCAACATCTTCAGAAGATGCTAGCAGAGAAATTGCCAAATTCATCATTAGAACTAAATACCATCGACTGCAAAGTGAGAACTCTGGAAAAAACAATACAATCTTCATTGCGAGAGATGCTTGGGAATGGTTGTAGTGGTTTTGGTTGGAATGAAGAATTTAAATGTGTTGAAGCAGAGAAAGAGGTATTCGATGCGTGGGTTAAGAGCCATACAAATGCCAAAGGGATGAGGAACAAGCCATTTTCGCACTATGATGAGCTGGCAGTTGTCTTCGGAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCTAATGGAAATGACCTCTAATGTTGCGGAACAAATAGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATAGGGACAAAACAACGAACGATGGAGAATCCAGGACTTGGTGACGTAGGGGAAGATGACTTGCCAGACACTCCTACTAGTAGGCGTAATACATCTGACATGTCTTCTAGATGTACTGGGAGCAAAAGAAAACGATCGTCCTTCCAGACTGAATTAATTGATGTTAAGGAGAAGTATGAATTGGAGGCCACACGAAGGAAGGAAGTAGTCGATCTCTTGTATCAGATAGAAGGATTGACTGAGCATGATCGTGTCTCCCTGATTGACTTGCTTGTGACTGATATCCAGAAGACTGATTGTTTTCTACAGGTTCCGCCTCAATCGAGAAAGGCGTATTGCATGCGTCTTCTAGGAAGGACTGAATGA

Protein sequence

MHACSIMAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGDVGEDDLPDTPTSRRNTSDMSSRCTGSKRKRSSFQTELIDVKEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKAYCMRLLGRTE
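
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first 30 and last 12 nucleotides of the CDS, assuming the standard genetic code (NCBI translation table 1); the page does not state which table was used. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCATGCATGTAGCATTATGGCAGGTACT"  # first 30 nt of the CDS
cds_end = "AGGACTGAATGA"                      # last 12 nt of the CDS

print(translate(cds_start))  # MHACSIMAGT
print(translate(cds_end))    # RTE
```

Both fragments agree with the start (MHACSIMAGT) and end (RTE) of the protein sequence listed above.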
Homology
BLAST of Tan0020073 vs. NCBI nr
Match: XP_038887234.1 (uncharacterized protein LOC120077425 [Benincasa hispida])

HSP 1 Score: 283.1 bits (723), Expect = 2.8e-72
Identity = 158/308 (51.30%), Positives = 196/308 (63.64%), Query Frame = 0

Query: 7   MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSL 66
           M G SK SKH W+KVEDARLVE+L+YLV  GWRSDNGTFRPGYLQHL+++L EK+P  +L
Sbjct: 1   MTGNSKRSKHVWSKVEDARLVEALLYLVETGWRSDNGTFRPGYLQHLEQILHEKVPGCAL 60

Query: 67  ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGM 126
             NTI+CKVR+L+K   +++ EML    SGF WNEEFKCV+ E+E+FD WV+SH NAKGM
Sbjct: 61  NKNTIECKVRSLKKQ-YNAVSEMLSQ--SGFNWNEEFKCVQVEREIFDLWVRSHPNAKGM 120

Query: 127 RNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMEN 186
             KPF HYD+L+ VFGKDRA                            D    + R  E+
Sbjct: 121 WKKPFPHYDDLSAVFGKDRA----------------------------DCHTPEVRQTES 180

Query: 187 PGLGDVGEDDLPDTPTSRRNTSDMSSRCTGSKRKRSSFQTELIDV--------------- 246
           P   D  +++  +  T R +    SSR  GSKRKRSSFQ E+ID+               
Sbjct: 181 PLNQDEIDEEPAEQSTGRASVPTESSR--GSKRKRSSFQVEMIDIVKSTVEMQSTHMGRL 240

Query: 247 ----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA 296
                EKYELE    KEVV+ +Y I+ L E+D+V+LIDL+VTDIQKTDCFL VP  +RK 
Sbjct: 241 ASWQNEKYELEL---KEVVNAIYNIDDLEENDQVTLIDLIVTDIQKTDCFLAVPEHARKR 272
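
The percentages in an HSP summary line are the match counts divided by the alignment length, gaps included. A minimal sketch that recomputes them for the first HSP above (158 identical and 196 similar positions over 308 aligned columns); the helper name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

identity = percent(158, 308)   # identical residues
positives = percent(196, 308)  # identical plus conservatively substituted residues

print(f"{identity:.2f} {positives:.2f}")  # 51.30 63.64
```

Because the denominator is the alignment length rather than the query length, HSPs with long gapped stretches report lower percentages than the ungapped regions alone would suggest.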

BLAST of Tan0020073 vs. NCBI nr
Match: XP_038896380.1 (uncharacterized protein LOC120084641 [Benincasa hispida])

HSP 1 Score: 278.5 bits (711), Expect = 6.8e-71
Identity = 156/308 (50.65%), Positives = 193/308 (62.66%), Query Frame = 0

Query: 7   MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSL 66
           MAG+ K SKH W+KVED +LVE+L+YLV  GWRSDNGTFR GYLQ+L+++L EK+P  +L
Sbjct: 1   MAGSGKRSKHVWSKVEDTKLVEALLYLVETGWRSDNGTFRLGYLQYLERILHEKVPGCAL 60

Query: 67  ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGM 126
             NTI+CKVR+L+K   +++ EML    SGFGWNEEFKCV+ EKE+FD WV+SH NAKGM
Sbjct: 61  NQNTIECKVRSLKKQ-YNAVSEMLSQ--SGFGWNEEFKCVQVEKEIFDLWVRSHLNAKGM 120

Query: 127 RNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMEN 186
            NK F HYD+L+ VFGKDRA     E     +    ++I+EE                  
Sbjct: 121 WNKSFLHYDDLSTVFGKDRANCHTPEVCQAESPLNQDEIDEE------------------ 180

Query: 187 PGLGDVGEDDLPDTPTSRRNTSDMSSRCTGSKRKRSSFQTELIDV--------------- 246
                       +  T R +    SSR  GSKRKR SFQ E+ID+               
Sbjct: 181 ----------PAEQSTGRASVLAESSR--GSKRKRPSFQAEMIDIMRSTVEMQSTHMGRL 240

Query: 247 ----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA 296
               KEKYELE  RRKEVV+ +Y I+GL E D+V+ IDLLVTDIQKTDCFL VP  +RK 
Sbjct: 241 ASWQKEKYELEFGRRKEVVNAIYSIDGLDEDDQVTFIDLLVTDIQKTDCFLAVPEHARKR 275

BLAST of Tan0020073 vs. NCBI nr
Match: XP_038895773.1 (uncharacterized protein LOC120083935 [Benincasa hispida])

HSP 1 Score: 272.7 bits (696), Expect = 3.7e-69
Identity = 148/284 (52.11%), Positives = 187/284 (65.85%), Query Frame = 0

Query: 12  KHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSLELNTI 71
           K SKH W+KVEDA+ VE+L+YLV  GWRSDNGTFR  YLQHL+++  EK+   +L  NTI
Sbjct: 44  KRSKHVWSKVEDAKFVEALLYLVDTGWRSDNGTFRLEYLQHLERIHHEKVLGCALNQNTI 103

Query: 72  DCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPF 131
           +CKVR+L+K   +++ EML    SGF WNEEFKCV+ E+E+FD WV+SH NAKGM NKPF
Sbjct: 104 ECKVRSLKKQC-NAVSEMLSQ--SGFDWNEEFKCVQVEREIFDPWVRSHPNAKGMWNKPF 163

Query: 132 SHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMENPGLGD 191
            HYD+L+ VFGK +A G  +E    MT+N   + E+EIRLGSQD       T E+  +G 
Sbjct: 164 PHYDDLSTVFGKYKAVGQSSEDPYVMTTNAFREFEDEIRLGSQDC-----HTPESTHMG- 223

Query: 192 VGEDDLPDTPTSRRNTSDMSSRCTGSKRKRSSFQTELIDVKEKYELEATRRKEVVDLLYQ 251
                                       + +S+Q      KEKYELE  RRKEVV+ +Y 
Sbjct: 224 ----------------------------RLASWQ------KEKYELEFGRRKEVVNAIYN 283

Query: 252 IEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKAYCMRLLGR 296
           I+GL E D+V+LIDLLVTDIQKT+CFL VP  +RK YC+RLLGR
Sbjct: 284 IDGLDEDDQVTLIDLLVTDIQKTNCFLAVPEHARKRYCLRLLGR 284

BLAST of Tan0020073 vs. NCBI nr
Match: XP_038902479.1 (uncharacterized protein At2g29880-like [Benincasa hispida])

HSP 1 Score: 226.9 bits (577), Expect = 2.3e-55
Identity = 118/218 (54.13%), Positives = 148/218 (67.89%), Query Frame = 0

Query: 7   MAGTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKMLAEKLPNSSL 66
           M    K SKH W+KVEDA+LVE+L+YLV  GWRSDNGTFRPGYLQHL+++L EK+P  +L
Sbjct: 1   MTSNGKRSKHIWSKVEDAKLVEALLYLVETGWRSDNGTFRPGYLQHLERILHEKVPGCTL 60

Query: 67  ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGM 126
             NTI+CKVR+L+K   + + EML    SGF WNEEFKCV+ E+E+FD WV SH NAK M
Sbjct: 61  NQNTIECKVRSLKKQ-YNIVSEMLSQ--SGFDWNEEFKCVQVEREIFDLWVLSHPNAKRM 120

Query: 127 RNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTMEN 186
            NKPF HYD+ + VFGKDR  G  +E    M +N   + E+EIRLGSQD    + R  E+
Sbjct: 121 WNKPFPHYDDFSTVFGKDRVVGKSSEDPYVMATNAFREFEDEIRLGSQDCQTPEVRQTES 180

Query: 187 PGLGDVGEDDLPDTPTSRRNTSDMSSRCTGSKRKRSSF 225
           P   D  +++  +  T R +    SSR  GSKRKR SF
Sbjct: 181 PLNQDEIDEEPAEQSTGRASVPAKSSR--GSKRKRPSF 213

BLAST of Tan0020073 vs. NCBI nr
Match: XP_008441954.1 (PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] >KAA0047736.1 retrotransposon protein [Cucumis melo var. makuwa] >TYK08388.1 retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 219.9 bits (559), Expect = 2.9e-53
Identity = 130/306 (42.48%), Positives = 182/306 (59.48%), Query Frame = 0

Query: 7   MAGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSS 66
           MA  S+  KHTWTK E+ + VE LV LV + GWRSDNGTF+PGYL  LQ+M+AEKLP ++
Sbjct: 1   MASLSRAPKHTWTKEEEEKFVECLVELVSSGGWRSDNGTFQPGYLAQLQRMMAEKLPGTN 60

Query: 67  L-ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAK 126
           + E +TIDC V++L+KT   ++ EM G  CSGFGWNEEF+C+ AE+++FD+W+KSH  AK
Sbjct: 61  IQESSTIDCHVKSLKKTYH-AIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAK 120

Query: 127 GMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTM 186
           G+ +K F +YD+L+ VFGKDRATG  +ET   + SNV+    + I LG  D       TM
Sbjct: 121 GLLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLG--DSHDEDIPTM 180

Query: 187 ENPGLGDVGEDDL----PDTPTSRRNTSDMSSRCTGSKRKRS--------SFQTELIDV- 246
            + G+  +  D++        + RRN S +S R  GS+R  +         F  E +   
Sbjct: 181 YSQGV-HMSPDEMFGIRAGQASERRNCSSVSKRKRGSERYETVEVIRSVMEFGNEQLKAI 240

Query: 247 ----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA 294
               KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P + +  
Sbjct: 241 ADWPKEKRAMEVEMRAQVVKQLQDIPKLRSQDRAKLMQILFRSLEAIEGFLSIPTELKLE 300

BLAST of Tan0020073 vs. ExPASy TrEMBL
Match: A0A5A7U0H7 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G002060 PE=4 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 1.4e-53
Identity = 130/306 (42.48%), Positives = 182/306 (59.48%), Query Frame = 0

Query: 7   MAGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSS 66
           MA  S+  KHTWTK E+ + VE LV LV + GWRSDNGTF+PGYL  LQ+M+AEKLP ++
Sbjct: 1   MASLSRAPKHTWTKEEEEKFVECLVELVSSGGWRSDNGTFQPGYLAQLQRMMAEKLPGTN 60

Query: 67  L-ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAK 126
           + E +TIDC V++L+KT   ++ EM G  CSGFGWNEEF+C+ AE+++FD+W+KSH  AK
Sbjct: 61  IQESSTIDCHVKSLKKTYH-AIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAK 120

Query: 127 GMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTM 186
           G+ +K F +YD+L+ VFGKDRATG  +ET   + SNV+    + I LG  D       TM
Sbjct: 121 GLLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLG--DSHDEDIPTM 180

Query: 187 ENPGLGDVGEDDL----PDTPTSRRNTSDMSSRCTGSKRKRS--------SFQTELIDV- 246
            + G+  +  D++        + RRN S +S R  GS+R  +         F  E +   
Sbjct: 181 YSQGV-HMSPDEMFGIRAGQASERRNCSSVSKRKRGSERYETVEVIRSVMEFGNEQLKAI 240

Query: 247 ----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA 294
               KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P + +  
Sbjct: 241 ADWPKEKRAMEVEMRAQVVKQLQDIPKLRSQDRAKLMQILFRSLEAIEGFLSIPTELKLE 300

BLAST of Tan0020073 vs. ExPASy TrEMBL
Match: A0A1S3B4L3 (uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=4 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 1.4e-53
Identity = 130/306 (42.48%), Positives = 182/306 (59.48%), Query Frame = 0

Query: 7   MAGTSKHSKHTWTKVEDARLVESLVYLVHN-GWRSDNGTFRPGYLQHLQKMLAEKLPNSS 66
           MA  S+  KHTWTK E+ + VE LV LV + GWRSDNGTF+PGYL  LQ+M+AEKLP ++
Sbjct: 1   MASLSRAPKHTWTKEEEEKFVECLVELVSSGGWRSDNGTFQPGYLAQLQRMMAEKLPGTN 60

Query: 67  L-ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAK 126
           + E +TIDC V++L+KT   ++ EM G  CSGFGWNEEF+C+ AE+++FD+W+KSH  AK
Sbjct: 61  IQESSTIDCHVKSLKKTYH-AIAEMRGPSCSGFGWNEEFQCIIAERDLFDSWIKSHPAAK 120

Query: 127 GMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTM 186
           G+ +K F +YD+L+ VFGKDRATG  +ET   + SNV+    + I LG  D       TM
Sbjct: 121 GLLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLG--DSHDEDIPTM 180

Query: 187 ENPGLGDVGEDDL----PDTPTSRRNTSDMSSRCTGSKRKRS--------SFQTELIDV- 246
            + G+  +  D++        + RRN S +S R  GS+R  +         F  E +   
Sbjct: 181 YSQGV-HMSPDEMFGIRAGQASERRNCSSVSKRKRGSERYETVEVIRSVMEFGNEQLKAI 240

Query: 247 ----KEKYELEATRRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRKA 294
               KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P + +  
Sbjct: 241 ADWPKEKRAMEVEMRAQVVKQLQDIPKLRSQDRAKLMQILFRSLEAIEGFLSIPTELKLE 300

BLAST of Tan0020073 vs. ExPASy TrEMBL
Match: A0A5A7UME4 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold615G00290 PE=4 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 1.6e-49
Identity = 132/307 (43.00%), Positives = 177/307 (57.65%), Query Frame = 0

Query: 7   MAGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSS 66
           M  +S+  KHTWTK E+A LVE LV LV+  GWRSDNGTFRPGYL  L +M+A K+P S+
Sbjct: 1   MTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNGTFRPGYLNQLARMMAFKIPGSN 60

Query: 67  LELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKG 126
           +  +TID +++ L K +  +L EM G  CSGFGWN+E KC+ AEKEVFD W  SH  AKG
Sbjct: 61  IHASTIDSRIK-LMKRMFHALAEMRGPNCSGFGWNDEKKCIVAEKEVFDDW--SHPAAKG 120

Query: 127 MRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFIGTKQRTME 186
           + NK F HYDEL+ VFGKDRATG  AE+  ++ SN     + E      D   T    M 
Sbjct: 121 LLNKSFVHYDELSYVFGKDRATGGRAESFADIGSNDPPGYDAEAADAMPD---TDFPPMY 180

Query: 187 NPGLGDVGEDDLPDTPTSRRNTSDMSSRCTGSKRKRSSFQTELIDVKEK----------- 246
           +PGL ++  DDL +T T+R   S+  +  +GSKRKR    T+  D+              
Sbjct: 181 SPGL-NMSPDDLMETRTAR--VSERRNVSSGSKRKRPGHATDSGDIVRTAIEYGNEQLHR 240

Query: 247 -------YELEATR-RKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSRK 294
                     +AT+ R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP   + 
Sbjct: 241 IAEWPILQRQDATQTRQEIVRHLEAIPELTLMDRCRLMRILMRNVDDMKAFLEVPDHMKY 298

BLAST of Tan0020073 vs. ExPASy TrEMBL
Match: E5GCB5 (Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 1.6e-49
Identity = 133/309 (43.04%), Positives = 180/309 (58.25%), Query Frame = 0

Query: 6   IMAGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNS 65
           IM  +S+  KHTWTK E+A LVE LV LV+  GWRSDNGTFRPGYL  L +M+A K+P S
Sbjct: 355 IMTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNGTFRPGYLNQLARMMAFKIPGS 414

Query: 66  SLELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAK 125
           ++  +TID +++ L K +  +L EM G  CSGFGWN+E KC+ AEKEVFD W  SH  AK
Sbjct: 415 NIHASTIDSRIK-LMKRMFHALAEMRGPNCSGFGWNDEKKCIVAEKEVFDDW--SHPAAK 474

Query: 126 GMRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFI-GTKQRT 185
           G+ NK F HYDEL+ VFGKDRATG  AE+  ++ SN     +     G+ D +  T    
Sbjct: 475 GLLNKSFVHYDELSYVFGKDRATGGRAESFADIGSNDPPGYD----AGAADAMPDTDFPP 534

Query: 186 MENPGLGDVGEDDLPDTPTSRRNTSDMSSRCTGSKRKRSSFQTELIDVKEK--------- 245
           M +PGL ++  DDL +T T+R   S+  +  +GSKRKR    T+  D+            
Sbjct: 535 MYSPGL-NMSPDDLMETRTAR--VSERRNVSSGSKRKRPGHATDSGDIVRTAIEYGNEQL 594

Query: 246 ---------YELEATR-RKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQS 294
                       +AT+ R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP   
Sbjct: 595 HRIAEWPILQRQDATQTRQEIVRHLEAIPELTLMDRCRLMRILMRNVDDMKAFLEVPDHM 653

BLAST of Tan0020073 vs. ExPASy TrEMBL
Match: A0A5D3CBF7 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1112G00350 PE=4 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 4.6e-49
Identity = 132/308 (42.86%), Positives = 179/308 (58.12%), Query Frame = 0

Query: 7   MAGTSKHSKHTWTKVEDARLVESLVYLVH-NGWRSDNGTFRPGYLQHLQKMLAEKLPNSS 66
           M  +S+  KHTWTK E+A LVE LV LV+  GWRSDNGTFRPGYL  L +M+A K+P S+
Sbjct: 1   MTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNGTFRPGYLNQLARMMAFKIPGSN 60

Query: 67  LELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKG 126
           +  +TID +++ L K +  +L EM G  CSGFGWN+E KC+ AEKEVFD W  SH  AKG
Sbjct: 61  IHASTIDSRIK-LMKRMFHALAEMRGPNCSGFGWNDEKKCIVAEKEVFDDW--SHPAAKG 120

Query: 127 MRNKPFSHYDELAVVFGKDRATGIGAETLMEMTSNVAEQIEEEIRLGSQDFI-GTKQRTM 186
           + NK F HYDEL+ VFGKDRATG  AE+  ++ SN     +     G+ D +  T    M
Sbjct: 121 LLNKSFVHYDELSYVFGKDRATGGRAESFADIGSNDPPGYD----AGAADAMPDTDFPPM 180

Query: 187 ENPGLGDVGEDDLPDTPTSRRNTSDMSSRCTGSKRKRSSFQTELIDVKEK---------- 246
            +PGL ++  DDL +T T+R   S+  +  +GSKRKR    T+  D+             
Sbjct: 181 YSPGL-NMSPDDLMETRTAR--VSERRNVSSGSKRKRPGHATDSGDIVRTAIEYGNEQLH 240

Query: 247 --------YELEATR-RKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQKTDCFLQVPPQSR 294
                      +AT+ R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP   +
Sbjct: 241 RIAEWPILQRQDATQTRQEIVRHLEAIPELTLMDRCRLMRILMRNVDDMKAFLEVPDHMK 298

BLAST of Tan0020073 vs. TAIR 10
Match: AT1G30140.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 60.1 bits (144), Expect = 3.5e-09
Identity = 45/152 (29.61%), Positives = 69/152 (45.39%), Query Frame = 0

Query: 9   GTSKHSKHTWTKVEDARLVESLVYLVHNGWRSDNGTFRPGYLQHLQKML--AEKLPNSSL 68
           G  K   + WT  E   L+E    L+   WR  +G    G L    K+L    K    + 
Sbjct: 8   GKEKGPYNQWTPDETDVLIE----LIRQNWRDSSGII--GKLTVESKLLPALNKRLGCNK 67

Query: 69  ELNTIDCKVRTLEKTIQSSLREMLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGM 128
                  +++ L+   QS L   L    SGFGW+ E K   A  EV+  ++K+H N K M
Sbjct: 68  NHKNYMSRLKFLKNLYQSYLD--LKRFSSGFGWDPETKKFTAPDEVWRDYLKAHPNHKHM 127

Query: 129 RNKPFSHYDELAVVFGKDRATGIGAETLMEMT 159
           + +   H+++L ++FG   ATG  A  + + T
Sbjct: 128 QTESIDHFEDLQIIFGDVVATGSFAVGMSDST 151

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038887234.1  2.8e-72  51.30         uncharacterized protein LOC120077425 [Benincasa hispida]
XP_038896380.1  6.8e-71  50.65         uncharacterized protein LOC120084641 [Benincasa hispida]
XP_038895773.1  3.7e-69  52.11         uncharacterized protein LOC120083935 [Benincasa hispida]
XP_038902479.1  2.3e-55  54.13         uncharacterized protein At2g29880-like [Benincasa hispida]
XP_008441954.1  2.9e-53  42.48         PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] >KAA0047736.1 ret...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A5A7U0H7      1.4e-53  42.48         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B4L3      1.4e-53  42.48         uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=...
A0A5A7UME4      1.6e-49  43.00         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
E5GCB5          1.6e-49  43.04         Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1
A0A5D3CBF7      4.6e-49  42.86         Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT1G30140.1     3.5e-09  29.61         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description       Source       Source Term  Source Description                                Alignment
IPR024752  Myb/SANT-like domain  PFAM         PF12776      Myb_DNA-bind_3                                    coord: 17..114; e-value: 1.2E-5; score: 26.2
None       No IPR available      MOBIDB_LITE  mobidb-lite  disorder_prediction                               coord: 203..219
None       No IPR available      MOBIDB_LITE  mobidb-lite  disorder_prediction                               coord: 181..219
None       No IPR available      PANTHER      PTHR46250    MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATED  coord: 10..294

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0020073.1  Tan0020073.1  mRNA