Tan0008235 (gene) Snake gourd v1

Overview
Name: Tan0008235
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DUF761 domain-containing protein
Location: LG04: 80638037 .. 80638952 (-)
RNA-Seq Expression: Tan0008235
Synteny: Tan0008235
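
The Location field packs the linkage group, the start and end coordinates, and the strand into one string. The following is a minimal sketch of how that string could be split and the genomic span computed. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper rather than part of any database API.

```python
import re

def parse_location(text):
    """Split a string such as 'LG04: 80638037 .. 80638952 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so span = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("LG04: 80638037 .. 80638952 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 916 bp on the minus strand of LG04.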
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTTTCTTCTTCCTTCCTCGCTAATGGCAGAGCAAATCATTTCTGTAGAATCTCCAACCGCTACTACAGACTTGATCGCCATTAACGAGGATAAGAATAAGAACAAGTTGGTCGTCCTCGACTGTAATAACAATAACGATCATGGCATCGTCGTCCCTAAGAAGATGAAGAAGAAGAGCGTTAACATTCTCAAGGTGGCTTTGATGCTCCTTCGTCGCCGCTCCAGCAAATCCAATGTCGCCGCCGTTGATGTCGCCTCCAAGAGTATGTGGAAGCGCTTTGTTGGCTCCATGCGCCCTTTGCATTTGCAGAGCCCTGAGCAAGAAGGTGTTGTTGTTCCTCCCTTGTCATTGCCCTCGACCGAGCCCTTGACTTTGGAGCCTATAGCGGTCGGGTTGCGCGCCTCCCCGTCCCTCGAGAGCTTTGAAGATGTTATGTCGTCGCCGTTTTCTCCTATTCATGCTCCTAAATCACCCTCCTCTTCTGTTGATGGGATGAGCAGGTATCATACTCTTTTCATAAAATTGAGATCTCGATCATCTACCTTGACATTTTGAACTTGTTTATAGCCAAATTTTAAGAGTTGGTGCATTGATACTTAATTTTTTTAAATTACAAATTTATCATGAAACATGTAAAAGTTAGGTGAGATTTTCTTTCGGTGGATGCTTAGTTACCCCTTTCATGTTTTTAACTCAGGTACGCCTCGGCCATCAACCTTCATGATCTAGACCAAAACGACGACAATGTCGAAGACGAGAACGACGGTGAAGCCGAGGCAAACATGAACGGAGGCGACGAGATGATCGACGCGAAAGCAGAGATGTTCATAGCTCAATTCTACGAACAAATGAGGCTTCAACGTTCGGACTCTGATATCCGTTACCACGAGATGATTAAGAGATCGATTGGCTAA

mRNA sequence

CTTTTCTTCTTCCTTCCTCGCTAATGGCAGAGCAAATCATTTCTGTAGAATCTCCAACCGCTACTACAGACTTGATCGCCATTAACGAGGATAAGAATAAGAACAAGTTGGTCGTCCTCGACTGTAATAACAATAACGATCATGGCATCGTCGTCCCTAAGAAGATGAAGAAGAAGAGCGTTAACATTCTCAAGGTGGCTTTGATGCTCCTTCGTCGCCGCTCCAGCAAATCCAATGTCGCCGCCGTTGATGTCGCCTCCAAGAGTATGTGGAAGCGCTTTGTTGGCTCCATGCGCCCTTTGCATTTGCAGAGCCCTGAGCAAGAAGGTGTTGTTGTTCCTCCCTTGTCATTGCCCTCGACCGAGCCCTTGACTTTGGAGCCTATAGCGGTCGGGTTGCGCGCCTCCCCGTCCCTCGAGAGCTTTGAAGATGTTATGTCGTCGCCGTTTTCTCCTATTCATGCTCCTAAATCACCCTCCTCTTCTGTTGATGGGATGAGCAGGTACGCCTCGGCCATCAACCTTCATGATCTAGACCAAAACGACGACAATGTCGAAGACGAGAACGACGGTGAAGCCGAGGCAAACATGAACGGAGGCGACGAGATGATCGACGCGAAAGCAGAGATGTTCATAGCTCAATTCTACGAACAAATGAGGCTTCAACGTTCGGACTCTGATATCCGTTACCACGAGATGATTAAGAGATCGATTGGCTAA

Coding sequence (CDS)

ATGGCAGAGCAAATCATTTCTGTAGAATCTCCAACCGCTACTACAGACTTGATCGCCATTAACGAGGATAAGAATAAGAACAAGTTGGTCGTCCTCGACTGTAATAACAATAACGATCATGGCATCGTCGTCCCTAAGAAGATGAAGAAGAAGAGCGTTAACATTCTCAAGGTGGCTTTGATGCTCCTTCGTCGCCGCTCCAGCAAATCCAATGTCGCCGCCGTTGATGTCGCCTCCAAGAGTATGTGGAAGCGCTTTGTTGGCTCCATGCGCCCTTTGCATTTGCAGAGCCCTGAGCAAGAAGGTGTTGTTGTTCCTCCCTTGTCATTGCCCTCGACCGAGCCCTTGACTTTGGAGCCTATAGCGGTCGGGTTGCGCGCCTCCCCGTCCCTCGAGAGCTTTGAAGATGTTATGTCGTCGCCGTTTTCTCCTATTCATGCTCCTAAATCACCCTCCTCTTCTGTTGATGGGATGAGCAGGTACGCCTCGGCCATCAACCTTCATGATCTAGACCAAAACGACGACAATGTCGAAGACGAGAACGACGGTGAAGCCGAGGCAAACATGAACGGAGGCGACGAGATGATCGACGCGAAAGCAGAGATGTTCATAGCTCAATTCTACGAACAAATGAGGCTTCAACGTTCGGACTCTGATATCCGTTACCACGAGATGATTAAGAGATCGATTGGCTAA

Protein sequence

MAEQIISVESPTATTDLIAINEDKNKNKLVVLDCNNNNDHGIVVPKKMKKKSVNILKVALMLLRRRSSKSNVAAVDVASKSMWKRFVGSMRPLHLQSPEQEGVVVPPLSLPSTEPLTLEPIAVGLRASPSLESFEDVMSSPFSPIHAPKSPSSSVDGMSRYASAINLHDLDQNDDNVEDENDGEAEANMNGGDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIKRSIG
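
The protein sequence above is the conceptual translation of the coding sequence. As a small consistency check, the sketch below translates the first 30 nucleotides and the last 15 nucleotides of the CDS with the standard genetic code. The `translate` function and the compact codon-table construction are illustrative and are not taken from the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCAGAGCAAATCATTTCTGTAGAATCT"  # first 30 nt of the CDS above
cds_end = "AGATCGATTGGCTAA"                    # last 15 nt, including the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MAEQIISVES and RSIG, which match the first ten and last four residues of the protein sequence listed above.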
Homology
BLAST of Tan0008235 vs. NCBI nr
Match: XP_038906029.1 (uncharacterized protein LOC120091932 [Benincasa hispida])

HSP 1 Score: 209.1 bits (531), Expect = 3.9e-50
Identity = 139/240 (57.92%), Positives = 162/240 (67.50%), Query Frame = 0

Query: 1   MAEQIISV-----ESPTATTDL----IAINEDKNKNKLVVLDCNNNNDHGIVVPKKMKKK 60
           MAEQ ISV      S ++ T+L    + IN+DK+ N + V       D   +  KK KKK
Sbjct: 1   MAEQPISVMTVNSSSSSSETELPNLPLQINDDKSNNNVKV-------DDSFITKKKKKKK 60

Query: 61  SVNILKVALMLLRRRSSKSNVAAVDVASKSMWKRFVGSMRPLHLQSPEQEGVVVPPLSLP 120
           S+NILKVALMLLRRRS KS   AVDVASK MW R +G+MRPLHLQ  +Q   +    SLP
Sbjct: 61  SINILKVALMLLRRRSGKSKNPAVDVASKGMWNRLIGAMRPLHLQRDDQSPPIQVVPSLP 120

Query: 121 STEPLTLEPIAVGLRASPSLESFEDVMSSPFSPIHAPKSPSSSVDGM-SRYASAINLHDL 180
           +TEP    P+   L  SPS++SFEDV SS           SSSVDGM SRYASAINLH+L
Sbjct: 121 TTEP--APPL---LHPSPSVDSFEDVNSS-----------SSSVDGMSSRYASAINLHEL 180

Query: 181 DQNDDNVEDENDGEAEANMNGGDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIKRSI 231
           DQND + E+EN G+  ANM G DEMID KAEMFIAQFY QMRLQRSDSDI Y+EMIK+SI
Sbjct: 181 DQNDQDDENENSGKVYANMKGEDEMIDVKAEMFIAQFYVQMRLQRSDSDICYNEMIKKSI 217
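
The percentages in each HSP summary line are the identical or positive-scoring column counts divided by the alignment length, gaps included. The sketch below recomputes them from a summary line. The regular expression and the `hsp_percentages` helper are assumptions made for illustration and are not part of BLAST or of this database. The pattern accepts both "Positives" and the "Postives" spelling that appears in some exports.

```python
import re

# Matches e.g. "Identity = 139/240 (57.92%), Positives = 162/240 (67.50%)"
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from the counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))

line = "Identity = 139/240 (57.92%), Positives = 162/240 (67.50%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Benincasa hispida hit this gives 57.92 and 67.5, in agreement with the reported values.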

BLAST of Tan0008235 vs. NCBI nr
Match: XP_004152232.2 (uncharacterized protein LOC101222123 [Cucumis sativus] >KGN52840.1 hypothetical protein Csa_015370 [Cucumis sativus])

HSP 1 Score: 190.3 bits (482), Expect = 1.9e-44
Identity = 129/228 (56.58%), Positives = 152/228 (66.67%), Query Frame = 0

Query: 17  LIAINEDKNKNKLVVLDCNNNNDHGIVV-----PKKMKKKSVNILKVALMLLRRRSSKSN 76
           ++ +N    +  L + + + N DH + V     PKK KKKS NILKVALMLLR+RS K N
Sbjct: 8   VMTVNSSSYETVLQITNDDTNYDHNVKVEHECEPKKKKKKSTNILKVALMLLRQRSRKPN 67

Query: 77  V-----AAVD-VASKSMWKRFVGSMRPLHLQSPEQEGVVVPPLSLPSTEPLTLEPIAVGL 136
           V     A VD V SK MW R VG+MRPLHLQS   E   VP L   STEP    P+   L
Sbjct: 68  VVVNNSAIVDHVGSKGMWNRLVGAMRPLHLQS--DESTTVPSLPAASTEP----PLP-RL 127

Query: 137 RASPSLESFEDVMSSPFSPIHAPKSPSSSVDGMSRYASAINLHDLDQNDDNVEDENDGEA 196
            +SPSL++FEDV SS           SSSVDGMSRYASA NL DLDQND   E+ N+ + 
Sbjct: 128 PSSPSLDNFEDVNSS-----------SSSVDGMSRYASAANLQDLDQNDGEDEEINNDKV 187

Query: 197 EANMNG--GDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIKRSIG 232
            +N++G   DEMIDAKAEMFIAQFYEQM+LQRS+SDIRY+EMIKRSIG
Sbjct: 188 SSNVDGEDEDEMIDAKAEMFIAQFYEQMKLQRSESDIRYNEMIKRSIG 217

BLAST of Tan0008235 vs. NCBI nr
Match: XP_023524600.1 (GATA zinc finger domain-containing protein 8-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 183.0 bits (463), Expect = 3.0e-42
Identity = 126/238 (52.94%), Positives = 159/238 (66.81%), Query Frame = 0

Query: 1   MAEQIISVESPTATTDLIAINEDKNKNKLVVLDCNNNNDHGIVVPK----KMKKKSVNIL 60
           MA+ +IS+ES  + TD     ++KN N  +V + +NN ++G+  PK    K K KS+NIL
Sbjct: 1   MADNLISLES--SITD-----DNKNNNNNLVGNNDNNGNNGVAAPKNTTTKKKNKSLNIL 60

Query: 61  KVALMLLRRRSSKSNVAAVDVASKSMWKRFVGSMRPLHLQSPEQEGVVVPPLSLPSTEPL 120
           +VALMLLRRRSSK N A+V+VASK MW R V S+RPLH+QS         P+ +PS    
Sbjct: 61  RVALMLLRRRSSKPNAASVEVASKGMWNRLVASIRPLHVQS-NHSPQHPQPIIVPSMPDA 120

Query: 121 TLEPIAVGLRASPSLESFEDVMSSPFSPIHAPKSPSSSVDGMSRYASAINLHDLDQNDDN 180
           T       LRASPS++ FEDV S          S +SSVDGMSRYASAINL + DQN+D+
Sbjct: 121 T-------LRASPSIDCFEDVKS----------SSASSVDGMSRYASAINLQEFDQNNDD 180

Query: 181 VEDENDGEAEANM---NGGDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIKRSIG 232
             +END + E  M   +  D+MIDAKAEMFIA+FYEQMR  RS+SD+RY EMIKRSIG
Sbjct: 181 -NNENDNQVEPKMDEDDNVDDMIDAKAEMFIARFYEQMR--RSNSDVRYREMIKRSIG 210

BLAST of Tan0008235 vs. NCBI nr
Match: XP_022998628.1 (uncharacterized protein LOC111493213 [Cucurbita maxima])

HSP 1 Score: 182.2 bits (461), Expect = 5.2e-42
Identity = 125/237 (52.74%), Positives = 157/237 (66.24%), Query Frame = 0

Query: 1   MAEQIISVESPTATTDLIAINEDKNKNKLVVLDCNNNNDHGIVVPKK---MKKKSVNILK 60
           MA+ +IS+ES        +I +D NKN  +V    NN ++G+  PK     K KS+NIL+
Sbjct: 1   MADNLISLES--------SITDDNNKNNNLV---GNNGNNGVAPPKNTTTKKNKSLNILR 60

Query: 61  VALMLLRRRSSKSNVAAVDVASKSMWKRFVGSMRPLHLQSPEQEGVVVPPLSLPSTEPLT 120
           VALMLLRRRSSK N A+V+VASK MW R V S+RPLH+QS         P+ +PS    T
Sbjct: 61  VALMLLRRRSSKPNAASVEVASKGMWNRLVASIRPLHVQS-NHSPQHPQPIIVPSFPDAT 120

Query: 121 LEPIAVGLRASPSLESFEDVMSSPFSPIHAPKSPSSSVDGMSRYASAINLHDLDQNDDNV 180
                  LR SPS++ FEDV S          S +SSVDGMSRYASAINL +LDQN+++ 
Sbjct: 121 -------LRTSPSIDCFEDVKS----------SSASSVDGMSRYASAINLQELDQNNED- 180

Query: 181 EDENDGEAEANMN---GGDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIKRSIG 232
           +++ND EAE  M+     D+MIDAKAEMFIAQFYEQ+R  RS+SD+RY EMIKRSIG
Sbjct: 181 DNDNDNEAEPKMDEDENADDMIDAKAEMFIAQFYEQIR--RSNSDVRYREMIKRSIG 205

BLAST of Tan0008235 vs. NCBI nr
Match: XP_016901496.1 (PREDICTED: uncharacterized protein LOC103494745 [Cucumis melo] >KAA0044405.1 DUF761 domain-containing protein [Cucumis melo var. makuwa] >TYK29532.1 DUF761 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 181.8 bits (460), Expect = 6.7e-42
Identity = 121/204 (59.31%), Positives = 143/204 (70.10%), Query Frame = 0

Query: 37  NNDHGIV----VPKKMKKKSVNILKVALMLLRRRSSKSNV----AAVDVASKSMWKRFVG 96
           +NDH  V     PKK KK+S NILKVAL LLR+RS K NV    AA+DV SK MW R VG
Sbjct: 35  DNDHVKVDEYCAPKK-KKRSTNILKVALKLLRQRSRKPNVTNVPAAIDVGSKGMWNRLVG 94

Query: 97  SMRPLHLQSPEQEGVVVPPLSLP-STEPLTLEPIAVGLRASPSLESFEDVMSSPFSPIHA 156
           +MRPLHLQS   E   +P  SLP ST+P    P+     +SPS+++FEDV S        
Sbjct: 95  AMRPLHLQS--DESTTIP--SLPTSTDPHPPPPL---FPSSPSVDNFEDVNS-------- 154

Query: 157 PKSPSSSVDGMSRYASAINLHDLDQNDDNVEDENDGEAEANMNGGDEMIDAKAEMFIAQF 216
             S SSSVDGMSRYAS  NL  LDQND+  E+ N+ +  ANM+G DEMIDAKAEMFIAQF
Sbjct: 155 --SSSSSVDGMSRYASVDNLQALDQNDEEEEERNNDKFYANMDGEDEMIDAKAEMFIAQF 214

Query: 217 YEQMRLQRSDSDIRYHEMIKRSIG 232
           YEQ++LQRS+SD+RY+EMIKRSIG
Sbjct: 215 YEQIKLQRSESDVRYNEMIKRSIG 220

BLAST of Tan0008235 vs. ExPASy TrEMBL
Match: A0A0A0KT88 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G002570 PE=4 SV=1)

HSP 1 Score: 190.3 bits (482), Expect = 9.2e-45
Identity = 129/228 (56.58%), Positives = 152/228 (66.67%), Query Frame = 0

Query: 17  LIAINEDKNKNKLVVLDCNNNNDHGIVV-----PKKMKKKSVNILKVALMLLRRRSSKSN 76
           ++ +N    +  L + + + N DH + V     PKK KKKS NILKVALMLLR+RS K N
Sbjct: 8   VMTVNSSSYETVLQITNDDTNYDHNVKVEHECEPKKKKKKSTNILKVALMLLRQRSRKPN 67

Query: 77  V-----AAVD-VASKSMWKRFVGSMRPLHLQSPEQEGVVVPPLSLPSTEPLTLEPIAVGL 136
           V     A VD V SK MW R VG+MRPLHLQS   E   VP L   STEP    P+   L
Sbjct: 68  VVVNNSAIVDHVGSKGMWNRLVGAMRPLHLQS--DESTTVPSLPAASTEP----PLP-RL 127

Query: 137 RASPSLESFEDVMSSPFSPIHAPKSPSSSVDGMSRYASAINLHDLDQNDDNVEDENDGEA 196
            +SPSL++FEDV SS           SSSVDGMSRYASA NL DLDQND   E+ N+ + 
Sbjct: 128 PSSPSLDNFEDVNSS-----------SSSVDGMSRYASAANLQDLDQNDGEDEEINNDKV 187

Query: 197 EANMNG--GDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIKRSIG 232
            +N++G   DEMIDAKAEMFIAQFYEQM+LQRS+SDIRY+EMIKRSIG
Sbjct: 188 SSNVDGEDEDEMIDAKAEMFIAQFYEQMKLQRSESDIRYNEMIKRSIG 217

BLAST of Tan0008235 vs. ExPASy TrEMBL
Match: A0A6J1KHA3 (uncharacterized protein LOC111493213 OS=Cucurbita maxima OX=3661 GN=LOC111493213 PE=4 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 2.5e-42
Identity = 125/237 (52.74%), Positives = 157/237 (66.24%), Query Frame = 0

Query: 1   MAEQIISVESPTATTDLIAINEDKNKNKLVVLDCNNNNDHGIVVPKK---MKKKSVNILK 60
           MA+ +IS+ES        +I +D NKN  +V    NN ++G+  PK     K KS+NIL+
Sbjct: 1   MADNLISLES--------SITDDNNKNNNLV---GNNGNNGVAPPKNTTTKKNKSLNILR 60

Query: 61  VALMLLRRRSSKSNVAAVDVASKSMWKRFVGSMRPLHLQSPEQEGVVVPPLSLPSTEPLT 120
           VALMLLRRRSSK N A+V+VASK MW R V S+RPLH+QS         P+ +PS    T
Sbjct: 61  VALMLLRRRSSKPNAASVEVASKGMWNRLVASIRPLHVQS-NHSPQHPQPIIVPSFPDAT 120

Query: 121 LEPIAVGLRASPSLESFEDVMSSPFSPIHAPKSPSSSVDGMSRYASAINLHDLDQNDDNV 180
                  LR SPS++ FEDV S          S +SSVDGMSRYASAINL +LDQN+++ 
Sbjct: 121 -------LRTSPSIDCFEDVKS----------SSASSVDGMSRYASAINLQELDQNNED- 180

Query: 181 EDENDGEAEANMN---GGDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIKRSIG 232
           +++ND EAE  M+     D+MIDAKAEMFIAQFYEQ+R  RS+SD+RY EMIKRSIG
Sbjct: 181 DNDNDNEAEPKMDEDENADDMIDAKAEMFIAQFYEQIR--RSNSDVRYREMIKRSIG 205

BLAST of Tan0008235 vs. ExPASy TrEMBL
Match: A0A5A7TSQ6 (DUF761 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G001350 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 3.3e-42
Identity = 121/204 (59.31%), Positives = 143/204 (70.10%), Query Frame = 0

Query: 37  NNDHGIV----VPKKMKKKSVNILKVALMLLRRRSSKSNV----AAVDVASKSMWKRFVG 96
           +NDH  V     PKK KK+S NILKVAL LLR+RS K NV    AA+DV SK MW R VG
Sbjct: 35  DNDHVKVDEYCAPKK-KKRSTNILKVALKLLRQRSRKPNVTNVPAAIDVGSKGMWNRLVG 94

Query: 97  SMRPLHLQSPEQEGVVVPPLSLP-STEPLTLEPIAVGLRASPSLESFEDVMSSPFSPIHA 156
           +MRPLHLQS   E   +P  SLP ST+P    P+     +SPS+++FEDV S        
Sbjct: 95  AMRPLHLQS--DESTTIP--SLPTSTDPHPPPPL---FPSSPSVDNFEDVNS-------- 154

Query: 157 PKSPSSSVDGMSRYASAINLHDLDQNDDNVEDENDGEAEANMNGGDEMIDAKAEMFIAQF 216
             S SSSVDGMSRYAS  NL  LDQND+  E+ N+ +  ANM+G DEMIDAKAEMFIAQF
Sbjct: 155 --SSSSSVDGMSRYASVDNLQALDQNDEEEEERNNDKFYANMDGEDEMIDAKAEMFIAQF 214

Query: 217 YEQMRLQRSDSDIRYHEMIKRSIG 232
           YEQ++LQRS+SD+RY+EMIKRSIG
Sbjct: 215 YEQIKLQRSESDVRYNEMIKRSIG 220

BLAST of Tan0008235 vs. ExPASy TrEMBL
Match: A0A1S4DZU7 (uncharacterized protein LOC103494745 OS=Cucumis melo OX=3656 GN=LOC103494745 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 3.3e-42
Identity = 121/204 (59.31%), Positives = 143/204 (70.10%), Query Frame = 0

Query: 37  NNDHGIV----VPKKMKKKSVNILKVALMLLRRRSSKSNV----AAVDVASKSMWKRFVG 96
           +NDH  V     PKK KK+S NILKVAL LLR+RS K NV    AA+DV SK MW R VG
Sbjct: 35  DNDHVKVDEYCAPKK-KKRSTNILKVALKLLRQRSRKPNVTNVPAAIDVGSKGMWNRLVG 94

Query: 97  SMRPLHLQSPEQEGVVVPPLSLP-STEPLTLEPIAVGLRASPSLESFEDVMSSPFSPIHA 156
           +MRPLHLQS   E   +P  SLP ST+P    P+     +SPS+++FEDV S        
Sbjct: 95  AMRPLHLQS--DESTTIP--SLPTSTDPHPPPPL---FPSSPSVDNFEDVNS-------- 154

Query: 157 PKSPSSSVDGMSRYASAINLHDLDQNDDNVEDENDGEAEANMNGGDEMIDAKAEMFIAQF 216
             S SSSVDGMSRYAS  NL  LDQND+  E+ N+ +  ANM+G DEMIDAKAEMFIAQF
Sbjct: 155 --SSSSSVDGMSRYASVDNLQALDQNDEEEEERNNDKFYANMDGEDEMIDAKAEMFIAQF 214

Query: 217 YEQMRLQRSDSDIRYHEMIKRSIG 232
           YEQ++LQRS+SD+RY+EMIKRSIG
Sbjct: 215 YEQIKLQRSESDVRYNEMIKRSIG 220

BLAST of Tan0008235 vs. ExPASy TrEMBL
Match: A0A6J1GBN8 (GATA zinc finger domain-containing protein 8-like OS=Cucurbita moschata OX=3662 GN=LOC111452503 PE=4 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 9.5e-42
Identity = 125/238 (52.52%), Positives = 154/238 (64.71%), Query Frame = 0

Query: 1   MAEQIISVESPTATTDLIAINEDKNKNKLVVLDCNNNNDHGIVVPK---KMKKKSVNILK 60
           MA+ + S++S       I  + +KN N LV     NN ++G+  PK   K K KS+NIL+
Sbjct: 1   MADNLKSLQSS------ITDDNNKNNNNLV----GNNGNNGVAPPKNTTKKKNKSLNILR 60

Query: 61  VALMLLRRRSSKSNVAAVDVASKSMWKRFVGSMRPLHLQSPEQEGVVVPPLSLPSTEPLT 120
           VALMLLRRRSSK N A+V+VASK MW R V S+RPLH+QS         P+ +PS     
Sbjct: 61  VALMLLRRRSSKPNAASVEVASKGMWNRLVASIRPLHVQS-NHSPQHPQPIIVPSM---- 120

Query: 121 LEPIAVGLRASPSLESFEDVMSSPFSPIHAPKSPSSSVDGMSRYASAINLHDLDQNDDNV 180
             P A  LRASPS++ FEDV S          S +SSVDGMSRYASAINL +LDQN+D+ 
Sbjct: 121 --PDATTLRASPSIDCFEDVKS----------SSASSVDGMSRYASAINLQELDQNNDDD 180

Query: 181 EDENDGEAEANM-----NGGDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIKRSI 231
           +D ND + +A          D+MIDAKAEMFIAQFYEQMR  RS+SD+RY EMIKRSI
Sbjct: 181 DDNNDNDNQAEPKIDEDENADDMIDAKAEMFIAQFYEQMR--RSNSDVRYLEMIKRSI 209

BLAST of Tan0008235 vs. TAIR 10
Match: AT4G02160.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G61710.1); Has 35 Blast hits to 35 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 35; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 70.1 bits (170), Expect = 2.7e-12
Identity = 63/177 (35.59%), Positives = 89/177 (50.28%), Query Frame = 0

Query: 46  KKMKKKSVNILKVALMLLRRRSSKSNVAAVDVASKSMWKRFVGSMRPLHLQSPEQEGVVV 105
           KK + +  +++ V L +LRRR     +      +   W+R V S   L     + + V V
Sbjct: 19  KKKRSRGFHVIGVVLYMLRRRRRSKPL------NNGFWRRVVESFGQL-----KNDNVTV 78

Query: 106 PPLSLPSTEPLTLEPIAVGLRASPSLESFEDVMSSPFSPIHAPKSPSSSVDGMSRYASAI 165
               LPS+  +T+ P +          S +D +S     + A  S  SS  G+S Y SA 
Sbjct: 79  ----LPSSSNITILPPSSSPVTDEVPASSDDQVSEMVEVLTATSSSCSS--GISGYGSAK 138

Query: 166 NLHDLDQNDDNVEDENDGEAEANMNGGDEMIDAKAEMFIAQFYEQMRLQRSDSDIRY 223
           +L D+D  D   ED++D E   N +GGDEMIDAKAE FI +FYEQMR+Q      RY
Sbjct: 139 SLRDMDCLD---EDDDDDENYGNDDGGDEMIDAKAEEFIVRFYEQMRMQNQAYTERY 175

BLAST of Tan0008235 vs. TAIR 10
Match: AT5G61710.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02160.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 56.2 bits (134), Expect = 4.0e-08
Identity = 50/170 (29.41%), Positives = 82/170 (48.24%), Query Frame = 0

Query: 46  KKMKKKSVNILKVALMLLRRRSSKSNVAAVDVASKSMWKRFVGSMRPLHLQSPEQEGVVV 105
           KK   + +++  V + +LRRR +         ++   W+R V S+R +  +         
Sbjct: 5   KKKSSRGMHMFSVVMFMLRRRRT---------SNTRFWRRVVESVRKVRSE--------- 64

Query: 106 PPLSLPSTEPLTLEPIAVGLRASPSLESFEDVMSSPFSPIHAPKSPSSSVDGMSRYASAI 165
                     +T+ P+   +         +D +S       AP S SSS  G+S Y SA+
Sbjct: 65  ----------ITIMPV-TEMGGDDGDNEVDDRLSETMEVFTAPSSSSSS--GISGYGSAM 124

Query: 166 NLHDLD-QNDDNVEDENDGEAEANMNGGDEMIDAKAEMFIAQFYEQMRLQ 215
           +L DLD   DD+++     E  +++ GGD+MID KAE FI +FY QM++Q
Sbjct: 125 SLRDLDYPYDDDID-----ECYSDVQGGDDMIDEKAEEFIVRFYAQMKMQ 138

BLAST of Tan0008235 vs. TAIR 10
Match: AT4G26130.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G56980.1); Has 121 Blast hits to 116 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 113; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 43.1 bits (100), Expect = 3.5e-04
Identity = 20/38 (52.63%), Positives = 29/38 (76.32%), Query Frame = 0

Query: 190 NGGDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIK 228
           +GG+E +D KA  FI +F +Q++LQR DS +RY EM+K
Sbjct: 247 DGGEEGVDDKASNFINKFKQQLKLQRLDSFLRYREMLK 284

BLAST of Tan0008235 vs. TAIR 10
Match: AT5G56980.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G26130.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 42.0 bits (97), Expect = 7.8e-04
Identity = 20/44 (45.45%), Positives = 30/44 (68.18%), Query Frame = 0

Query: 184 EAEANMNGGDEMIDAKAEMFIAQFYEQMRLQRSDSDIRYHEMIK 228
           E   +   G++ +DAKA  FI +F +Q++LQR DS +RY EM+K
Sbjct: 334 ERSTSFGDGEDGVDAKASDFINKFKQQLKLQRLDSILRYKEMLK 377

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value    Identity (%)   Description
XP_038906029.1    3.9e-50    57.92          uncharacterized protein LOC120091932 [Benincasa hispida]
XP_004152232.2    1.9e-44    56.58          uncharacterized protein LOC101222123 [Cucumis sativus] >KGN52840.1 hypothetical ...
XP_023524600.1    3.0e-42    52.94          GATA zinc finger domain-containing protein 8-like [Cucurbita pepo subsp. pepo]
XP_022998628.1    5.2e-42    52.74          uncharacterized protein LOC111493213 [Cucurbita maxima]
XP_016901496.1    6.7e-42    59.31          PREDICTED: uncharacterized protein LOC103494745 [Cucumis melo] >KAA0044405.1 DUF...

ExPASy TrEMBL
Match Name        E-value    Identity (%)   Description
A0A0A0KT88        9.2e-45    56.58          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G002570 PE=4 SV=1
A0A6J1KHA3        2.5e-42    52.74          uncharacterized protein LOC111493213 OS=Cucurbita maxima OX=3661 GN=LOC111493213...
A0A5A7TSQ6        3.3e-42    59.31          DUF761 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676...
A0A1S4DZU7        3.3e-42    59.31          uncharacterized protein LOC103494745 OS=Cucumis melo OX=3656 GN=LOC103494745 PE=...
A0A6J1GBN8        9.5e-42    52.52          GATA zinc finger domain-containing protein 8-like OS=Cucurbita moschata OX=3662 ...

TAIR 10
Match Name        E-value    Identity (%)   Description
AT4G02160.1       2.7e-12    35.59          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G61710.1       4.0e-08    29.41          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G26130.1       3.5e-04    52.63          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G56980.1       7.8e-04    45.45          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term     IPR Description                             Source    Source Term      Source Description     Alignment
IPR008480    Protein of unknown function DUF761, plant   PFAM      PF05553          DUF761                 coord: 193..228; e-value: 3.5E-15; score: 55.3
None         No IPR available                            PANTHER   PTHR36378        COTTON FIBER PROTEIN   coord: 20..225
None         No IPR available                            PANTHER   PTHR36378:SF1    COTTON FIBER PROTEIN   coord: 20..225

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0008235.1    Tan0008235.1    mRNA