Tan0009582 (gene) Snake gourd v1

Overview
Name: Tan0009582
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotrans_gag domain-containing protein
Location: LG04: 65603279 .. 65603680 (-)
RNA-Seq Expression: Tan0009582
Synteny: Tan0009582
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCAAGGGGAGAACTGCAGGCTTGCACCCTTTTGATCCTGAGATTGAAAGGACCTATAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAGATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCAGCCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGTTAATAACTTTGAACTCAAAACAGGGCTAATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCCTAGCCACAGAAGATCCTAACTTTCATCTTAAAATTTTTTTGGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCTAATAGTATTCGCTTACGTCTCTTTCCTTTTTCCTTGCAGGGTAGAGCCAAAGAATAG

mRNA sequence

ATGCCCAAGGGGAGAACTGCAGGCTTGCACCCTTTTGATCCTGAGATTGAAAGGACCTATAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAGATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCAGCCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGTTAATAACTTTGAACTCAAAACAGGGCTAATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCCTAGCCACAGAAGATCCTAACTTTCATCTTAAAATTTTTTTGGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCTAATAGTATTCGCTTACGTCTCTTTCCTTTTTCCTTGCAGGGTAGAGCCAAAGAATAG

Coding sequence (CDS)

ATGCCCAAGGGGAGAACTGCAGGCTTGCACCCTTTTGATCCTGAGATTGAAAGGACCTATAGGAGACTCCTAAGGGAAGGAAGAGCAGAACCTCAGGAGATGGCTGATCAGGAGCTCATCAACAACCCTAAACCTATCAGGGAGTATTTCCAGCCTGTGTTTAATTCTGAGCAGGCTGGAATAGTCCATGCCCCTATCAATGTTAATAACTTTGAACTCAAAACAGGGCTAATACAAATGGTTAGGGAAGGTGCTTTCAGAGGCCTAGCCACAGAAGATCCTAACTTTCATCTTAAAATTTTTTTGGAAATCTGTGGTACATTTAAAATGAATGGTATCTCTGCTAATAGTATTCGCTTACGTCTCTTTCCTTTTTCCTTGCAGGGTAGAGCCAAAGAATAG

Protein sequence

MPKGRTAGLHPFDPEIERTYRRLLREGRAEPQEMADQELINNPKPIREYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGISANSIRLRLFPFSLQGRAKE
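
The relationship between the coordinates, the CDS, and the protein sequence above can be checked with a short script. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and inclusive, 1-based coordinates; the function names `span_length` and `translate` are hypothetical helpers introduced here for illustration, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# The first 24 nt of the CDS listed above.
cds_prefix = "ATGCCCAAGGGGAGAACTGCAGGC"
print(translate(cds_prefix))
print(span_length(65603279, 65603680))
```

Under these assumptions the locus spans 402 bp, which corresponds to 134 codons: the 133 residues of the protein sequence plus the terminal stop codon.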
Homology
BLAST of Tan0009582 vs. NCBI nr
Match: WP_217833153.1 (retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002])

HSP 1 Score: 156.8 bits (395), Expect = 1.3e-34
Identity = 81/133 (60.90%), Positives = 97/133 (72.93%), Query Frame = 0

Query: 1   MPKGRTAGLHPFDPEIERTYRRLLREGRAEPQEMADQELINNPKPIREYFQPVFNSEQAG 60
           MP+  T  L P DPEI+RTYRR LR    +  EMA++     PK IR+YFQP   + Q G
Sbjct: 17  MPRDNT-NLLPLDPEIDRTYRRNLRALLNQTTEMAEE----IPKAIRDYFQPTLPASQPG 76

Query: 61  IVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGISANSIRL 120
           I++ PINVNNFELK GLIQM RE AFRG   EDP+ HL+ FLEICGT KMNG+S ++I+L
Sbjct: 77  IMNVPINVNNFELKPGLIQMARELAFRGRTNEDPHKHLRSFLEICGTVKMNGVSNDAIKL 136

Query: 121 RLFPFSLQGRAKE 134
           RLFPFSLQ RAK+
Sbjct: 137 RLFPFSLQDRAKD 144
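
The Identity and Positives figures in each HSP header are counts over the alignment length, expressed as percentages. The sketch below shows that arithmetic and, as an assumption about the usual BLAST midline convention (a letter marks an identical residue, `+` marks a conservative substitution), how the counts could be derived from a midline string. The helper names `percent` and `midline_counts` are hypothetical.

```python
def percent(count: int, alignment_length: int) -> str:
    """Format a count as a percentage of the alignment length, two decimals."""
    return f"{100 * count / alignment_length:.2f}"


def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for a BLAST midline string.

    Assumed convention: letters are identical residues, '+' marks a
    conservative substitution, spaces are mismatches or gaps.
    Positives include identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# Counts reported for the first HSP (WP_217833153.1).
print(percent(81, 133))  # identity
print(percent(97, 133))  # positives
```

The denominator is the number of alignment columns, including gapped positions, rather than the full length of either sequence.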

BLAST of Tan0009582 vs. NCBI nr
Match: KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])

HSP 1 Score: 125.6 bits (314), Expect = 3.3e-25
Identity = 61/132 (46.21%), Positives = 87/132 (65.91%), Query Frame = 0

Query: 1   MPKGRTAGLHPFDPEIERTYRRLLREGRAEPQEMADQELINNPKPIREYFQPVFNSEQAG 60
           M + R+  + P DPEIERT R L R    +   MA+++    P+ +++Y +PV N   + 
Sbjct: 1   MRRARSRDIIPVDPEIERTLRSLRRN---KILAMAEEDREVLPRTLKDYVRPVVNGNYSS 60

Query: 61  IVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGISANSIRL 120
           I+  PIN NNFELK  LI MV++  F G   +DPN HL +FLEIC T K+NG++ ++IRL
Sbjct: 61  IMRQPINANNFELKPALISMVQQAQFSGSPLDDPNIHLAMFLEICDTVKINGVTEDTIRL 120

Query: 121 RLFPFSLQGRAK 133
           RLFPFSL+ +A+
Sbjct: 121 RLFPFSLRDKAR 129

BLAST of Tan0009582 vs. NCBI nr
Match: XP_022843226.1 (uncharacterized protein LOC111366761 [Olea europaea var. sylvestris])

HSP 1 Score: 122.1 bits (305), Expect = 3.6e-24
Identity = 66/137 (48.18%), Positives = 84/137 (61.31%), Query Frame = 0

Query: 1   MPKGRTAGLHPFDPEIERTYRRLLREGRAEPQEMADQEL-----INNPKPIREYFQPVFN 60
           M + R   L   DPE ERT+R L    R E + MA+Q++      N  + IR+Y +PV N
Sbjct: 94  MRRARNLDLLHVDPEPERTFRILRGIQRNEREAMAEQDVRAANEDNQQRAIRDYIRPVVN 153

Query: 61  SEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGISA 120
              +GI    I   NFELK GLI MV++  F G A EDPN HL  FLEIC T KMNG++ 
Sbjct: 154 DNYSGIARPAIVAKNFELKPGLIDMVQQNQFGGAAVEDPNAHLGSFLEICDTVKMNGVTE 213

Query: 121 NSIRLRLFPFSLQGRAK 133
           ++IRLRLF FSL+ +AK
Sbjct: 214 DAIRLRLFSFSLRDKAK 230

BLAST of Tan0009582 vs. NCBI nr
Match: XP_022157708.1 (uncharacterized protein LOC111024361 [Momordica charantia])

HSP 1 Score: 117.1 bits (292), Expect = 1.2e-22
Identity = 60/101 (59.41%), Positives = 70/101 (69.31%), Query Frame = 0

Query: 30  EPQEMADQELINNPKPIREYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGL 89
           +P E    E  NN   IR+Y QP F     GI++ PIN NN ELK GLIQMVRE  FRG 
Sbjct: 12  QPMERPQLEQ-NNQMTIRDYCQPNF-PNHVGIINLPINANNSELKPGLIQMVRENTFRGN 71

Query: 90  ATEDPNFHLKIFLEICGTFKMNGISANSIRLRLFPFSLQGR 131
           ATEDPN HL IFL++CGT KMNG+  ++IRLRLFP SLQ +
Sbjct: 72  ATEDPNNHLTIFLDVCGTVKMNGVIDDAIRLRLFPLSLQDK 110

BLAST of Tan0009582 vs. NCBI nr
Match: XP_012833448.1 (PREDICTED: uncharacterized protein LOC105954320 [Erythranthe guttata])

HSP 1 Score: 115.5 bits (288), Expect = 3.4e-22
Identity = 68/151 (45.03%), Positives = 85/151 (56.29%), Query Frame = 0

Query: 5   RTAGLHPFDPEIERTYRRL----------LREGRAEPQEMA---DQEL---INNPKP--- 64
           R   + P+DPEIERT RRL          + E +  P+ +A   DQE       P+P   
Sbjct: 355 RNLEIIPYDPEIERTLRRLRATRNQQSEPMAEDQGNPEHLAFEYDQEPGRDHEQPRPNLP 414

Query: 65  ----IREYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIF 124
               +REY  P  N   +GI    I  NNFELKTGLI MV    F G AT DPN HL  F
Sbjct: 415 PERTMREYRTPAMNENYSGIRKPTIAANNFELKTGLINMVMANQFSGAATADPNLHLANF 474

Query: 125 LEICGTFKMNGISANSIRLRLFPFSLQGRAK 133
           LEIC T K+NG+S ++IRL+LF FS++ +AK
Sbjct: 475 LEICDTIKVNGVSDDAIRLKLFSFSVRDKAK 505

BLAST of Tan0009582 vs. ExPASy TrEMBL
Match: A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 5.7e-23
Identity = 60/101 (59.41%), Positives = 70/101 (69.31%), Query Frame = 0

Query: 30  EPQEMADQELINNPKPIREYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGL 89
           +P E    E  NN   IR+Y QP F     GI++ PIN NN ELK GLIQMVRE  FRG 
Sbjct: 12  QPMERPQLEQ-NNQMTIRDYCQPNF-PNHVGIINLPINANNSELKPGLIQMVRENTFRGN 71

Query: 90  ATEDPNFHLKIFLEICGTFKMNGISANSIRLRLFPFSLQGR 131
           ATEDPN HL IFL++CGT KMNG+  ++IRLRLFP SLQ +
Sbjct: 72  ATEDPNNHLTIFLDVCGTVKMNGVIDDAIRLRLFPLSLQDK 110

BLAST of Tan0009582 vs. ExPASy TrEMBL
Match: A0A1U8Q202 (uncharacterized protein LOC109114208 OS=Nelumbo nucifera OX=4432 GN=LOC109114208 PE=4 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 4.8e-22
Identity = 60/126 (47.62%), Positives = 80/126 (63.49%), Query Frame = 0

Query: 11  PFDPEIERTY---RRLLREGRAEPQEMADQELINNPKPIREYFQPVFNSEQAGIVHAPIN 70
           P+DPEIERT     R  R+ RAE +EMA+      P+ + +Y +P        IV   I+
Sbjct: 5   PYDPEIERTLCIRLRAARQVRAETEEMAE------PRTMMDYAKPTLTGAALSIVRPAIS 64

Query: 71  VNNFELKTGLIQMVREGA-FRGLATEDPNFHLKIFLEICGTFKMNGISANSIRLRLFPFS 130
            NNFE+K  +IQM++    F G+A EDPN H+  FLEIC TFK NG+S + +RLRLFPFS
Sbjct: 65  ANNFEIKPAIIQMIQNTVQFCGMANEDPNSHIANFLEICDTFKHNGVSDDVVRLRLFPFS 124

Query: 131 LQGRAK 133
           L+ +AK
Sbjct: 125 LKDKAK 124

BLAST of Tan0009582 vs. ExPASy TrEMBL
Match: A0A2H9ZY12 (Uncharacterized protein OS=Apostasia shenzhenica OX=1088818 GN=AXF42_Ash020512 PE=4 SV=1)

HSP 1 Score: 113.6 bits (283), Expect = 6.3e-22
Identity = 61/147 (41.50%), Positives = 84/147 (57.14%), Query Frame = 0

Query: 1   MPKGRTAGLHPFDPEIERTYRRLLREGRAEP-------------QEMADQELINNP-KPI 60
           M +     L PFDPEIERT  ++ R+ + +               +M DQ  I    + +
Sbjct: 1   MTRSSKKDLAPFDPEIERTIAKITRQLKEQEVRGQLNKVKKDLFTKMEDQPTIQEAGRAL 60

Query: 61  REYFQPVFNSEQAGIVHAPINVNNFELKTGLIQMVREGA-FRGLATEDPNFHLKIFLEIC 120
           REY  P  N     +V   +  NNFE+K  LIQM+++   F GL ++DPN H+  FLEIC
Sbjct: 61  REYALPSINGANTSVVRPAVQANNFEIKPALIQMIQQSVQFYGLPSDDPNTHIANFLEIC 120

Query: 121 GTFKMNGISANSIRLRLFPFSLQGRAK 133
            TFK NG+S ++IRLRLFPFSL+ +AK
Sbjct: 121 DTFKHNGVSDDAIRLRLFPFSLKDKAK 147

BLAST of Tan0009582 vs. ExPASy TrEMBL
Match: A0A803PJA6 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 2.4e-21
Identity = 61/139 (43.88%), Positives = 81/139 (58.27%), Query Frame = 0

Query: 13  DPEIERTYRRLLREGRAEPQEMADQELINN-------------------PKPIREYFQPV 72
           +PEIERT    L++ +A+ +       INN                   P+ +R+Y  PV
Sbjct: 14  NPEIERT----LKQSKAKKKIDFTMAAINNNNSNGLNNNQSAPAAADAQPRAVRDYCLPV 73

Query: 73  FNSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGI 132
            N    GI +  I  NNFELK  LI MV++  F GLAT+DPN HL IFLE+C T KMNG+
Sbjct: 74  VNKNLTGIANPVIAANNFELKPALINMVQQNHFGGLATKDPNIHLAIFLEVCATMKMNGL 133

BLAST of Tan0009582 vs. ExPASy TrEMBL
Match: A0A6P6W382 (uncharacterized protein LOC113729769 OS=Coffea arabica OX=13443 GN=LOC113729769 PE=4 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 4.1e-21
Identity = 61/138 (44.20%), Positives = 80/138 (57.97%), Query Frame = 0

Query: 11  PFDPEIERTYRRLLRE-GRAEPQEM------------ADQELINNP---KPIREYFQPVF 70
           PFDPEIER  RR  R     E QE+             ++E+  N    + +R++  P  
Sbjct: 10  PFDPEIERALRRQRRNTPHQEEQEIWQPIEEILIELPFEEEIAENEPNRRILRDFALPET 69

Query: 71  NSEQAGIVHAPINVNNFELKTGLIQMVREGAFRGLATEDPNFHLKIFLEICGTFKMNGIS 130
              Q  I    +N NNFE+K  LIQMV++  + G ATEDPN HL  FLEIC T K NG+S
Sbjct: 70  QGSQTSIARPMVNANNFEIKPSLIQMVQQSQYGGNATEDPNSHLSTFLEICDTIKFNGVS 129

Query: 131 ANSIRLRLFPFSLQGRAK 133
            ++I+LRLFPFSL+ +AK
Sbjct: 130 DDAIKLRLFPFSLKDKAK 147

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value    Identity (%)   Description
WP_217833153.1    1.3e-34    60.90          retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70...
KAG7990634.1      3.3e-25    46.21          hypothetical protein I3843_02G035100 [Carya illinoinensis]
XP_022843226.1    3.6e-24    48.18          uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]
XP_022157708.1    1.2e-22    59.41          uncharacterized protein LOC111024361 [Momordica charantia]
XP_012833448.1    3.4e-22    45.03          PREDICTED: uncharacterized protein LOC105954320 [Erythranthe guttata]

ExPASy TrEMBL
Match Name        E-value    Identity (%)   Description
A0A6J1DU19        5.7e-23    59.41          uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A1U8Q202        4.8e-22    47.62          uncharacterized protein LOC109114208 OS=Nelumbo nucifera OX=4432 GN=LOC109114208...
A0A2H9ZY12        6.3e-22    41.50          Uncharacterized protein OS=Apostasia shenzhenica OX=1088818 GN=AXF42_Ash020512 P...
A0A803PJA6        2.4e-21    43.88          Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A6P6W382        4.1e-21    44.20          uncharacterized protein LOC113729769 OS=Coffea arabica OX=13443 GN=LOC113729769 ...

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0009582.1    Tan0009582.1    mRNA