Tan0018542 (gene) Snake gourd v1

Overview
Name: Tan0018542
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Location: LG04: 7269734 .. 7272302 (+)
RNA-Seq Expression: Tan0018542
Synteny: Tan0018542
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATATTAAAAAATGGAAATATAAATTTAAAAACCCAAATTCCGAAGTGAAGACTCGGCCCGAGCGAGTTGGGAACCGTCGCTGGTCTTCGTCTTCCTCTCCTTCATCCGCCTTATCTCTCTATTCTCTCTCTCTCTGTTCTTATTTTTTGTTCTGCTATTCTGATTGCTTTGTCTCCCGCTTTGTAAACAGCCTTTTCTCCTTTTTACTCTTTTTCTCTTTATATTTCTGCGTAACAGTTTCCCCGTTTACGCCCACCACATCCAAGGTTTTGCTTCTTTCATTTTTGTATATAATACTGCTCAACTTTTTCGAAGTCTCATACTCAACTGCTGCCGCTTAATTTTGAGGTATTCTTCTTTTTCTCTTGTTTCTTATTGTGGCTCTCGTGTAGTTGAGTTTGAATCTGTTGATTTCGTTGTGAGGACTGGGAATTCTCGCTTTGGTTGTTTGATTGCTTATTGCGCGTGGATCTAGTGTTTGATTTCTGAAATTTTTAGATCTTTTTCGTGTTAAGAGATGATTTAGTTGTACTCTGAACTCTAAGGTGGATGAATCGTTGGGATTTTTAGGGAACGAACTGATGTCTTCCAGTTTAATTTTCTCCGGAATTTTAGGTTTCGGCGATTGATTGAATCGTTTGTGAGTTTAGTTTTTGAAATTTTGAATTCGTGCAGGTTTTGTTTATTTTCCCCCCCTCTATTTGCTTTAATAGATCCACTTTTTTCTATTTTACTTGATTTAAACGGGGCGAGTCTTTCGATTCCATCTGTGCGTGTTGTCTTAATTTTTAATAGTTAAGACAGTGATTTAAAGGATCATGCCAACTAATGAGCTAATTTTTGTCCGTTCCTTTTAACTTTAATTAATTTTAATTATTCGAGAAATTGCCGAACAGATTCCGATCTGCTGAGGGGACAAAATCAAACTTTGAATTTTCTTCCGCTTTCATCGTTTGTTTTGTTTGGTGCCATTTGTCGCGAGGATGATTTCTTTCTTTTGTTAAATATATGTTTTTGGTTCTTTTCTCCTCTCTCTCCACAGCTTTATTTGTCCCTTCAAATAACCGAAGACGAATATGAATCACTGCGCCATTCTGTCAAACGCCTTCTCGGGCCACGAGGAGATGAGAACCTCTGTTCCGGGTCCCATTTCTGACCTCAGAGATCAAGTTGTTTGCCCTAAACCACGGCGTCTTAGGAATCTTAAGGTCACCGTCAACGGCCACGCCGATAGCTCTCTCCGGTGGAATCTCAGGTTAAATTTTCTGACTCTTATCTTTTATTTTACAAACCCCCCTTTTGAAATACCTTCCTTTTGTTAATTTTACGGTGCTAATTCTTCTTGGAACTGAATTTCAGTCACCAAGTAGAGCAAATTGACATGGCAGCCGGACCGGATCTGCTTGACTTCCTCCTGACAAAAGTATGCATGCTCTTATCTTGTCTCTTCCTTTTTAGTTTCTTCAGATCATAAGAGATTAGTATTTTGGAATTTAGAGGAGGCCCTCGTAGCTTAAGTTGTTAATAATCATGGATTCATTCATGTGGTGTTAATCTTACAGGGTGGTTGCAGCGTGGACCAATCGTTTACGCAGTTGGCTTCGTCGCCCCCTTTTTTATGTGGGTCTCCGCCGAGCAGAGTAGCCAACCCATTGATTCAGGACGCCCGATTTGGGGATGAAAAATTCATCCCCTTTGCACCGATTGCTTCACCGGCGGGTCAGTTGTCGCCCTCTACAGCTGCTTCCAGGAAAGGAAACCGCGTAAGGGCGAGTTTTGGGAACAAACCAACGGTGAGGATTGAGGGTTTCGATTGCCTTGACAGGGATAGGCAGAATTGCAGCATCCCTGCCTTCGCTTAGAAACCCATCTCAATCAATATCAAATCCATAGAGTAATACAAGACAAGCATATTATAGTTTCAAGAGAAGCAAGATTCCCAATTACAAAACAATATCGAGGGTCTCTCAAGAGAAAGAAAAGGATACACAATTTGCCCT
TTTTTTTGTGAATACGTTTGAAGTGTTCTTTGTAAATGTAAATAAAAATCAATGCCCATGTATACTAAAGCTGTGTATTTTTTAGGCTTTTTGAGTCGATGGAGAGCAAACGATGCGTGGATGGTCAACCTCTTGTATTATATTTTTGTAATTAGAAGTCTTAAGGTTGAGAGAATTTTCATTTGAAGTTGAAAGTAAGAGAGAAAGAGAGAGAGAGAGGCAATCTGTACATATTGCAGAACTTGAAATGGAGTCTGTAGAAGTGATCAACATGTGAATAATTGTAGAACAATTTCTTATTGTGAAGTTTTGTTCATCTTTCTTTCTGGGTTTGAATTTGGGTTTGGAAGTTTGCTTGCTTAAGCATAACGCTAGCGATGGATCCATTAGATTTGCTTTTTCAGGGCATCTTCTTCATGGAATCGCCGACCTCTCGGCTTTTGCTAAAGCAACTTTGCTGGTGGAATTAATACCTCTTTCTTTGTCTCTCCTCTCTGTTCAAAGCTACTTTGTTGCAGAGCTTGTCTCTTTATTGGAGTGTTGATTTCTGAATTCTGACACAATTTTAT

mRNA sequence

ATATTAAAAAATGGAAATATAAATTTAAAAACCCAAATTCCGAAGTGAAGACTCGGCCCGAGCGAGTTGGGAACCGTCGCTGGTCTTCGTCTTCCTCTCCTTCATCCGCCTTATCTCTCTATTCTCTCTCTCTCTGTTCTTATTTTTTGTTCTGCTATTCTGATTGCTTTGTCTCCCGCTTTGTAAACAGCCTTTTCTCCTTTTTACTCTTTTTCTCTTTATATTTCTGCGTAACAGTTTCCCCGTTTACGCCCACCACATCCAAGGTTTTGCTTCTTTCATTTTTGTATATAATACTGCTCAACTTTTTCGAAGTCTCATACTCAACTGCTGCCGCTTAATTTTGAGCTTTATTTGTCCCTTCAAATAACCGAAGACGAATATGAATCACTGCGCCATTCTGTCAAACGCCTTCTCGGGCCACGAGGAGATGAGAACCTCTGTTCCGGGTCCCATTTCTGACCTCAGAGATCAAGTTGTTTGCCCTAAACCACGGCGTCTTAGGAATCTTAAGGTCACCGTCAACGGCCACGCCGATAGCTCTCTCCGGTGGAATCTCAGTCACCAAGTAGAGCAAATTGACATGGCAGCCGGACCGGATCTGCTTGACTTCCTCCTGACAAAAGGTGGTTGCAGCGTGGACCAATCGTTTACGCAGTTGGCTTCGTCGCCCCCTTTTTTATGTGGGTCTCCGCCGAGCAGAGTAGCCAACCCATTGATTCAGGACGCCCGATTTGGGGATGAAAAATTCATCCCCTTTGCACCGATTGCTTCACCGGCGGGTCAGTTGTCGCCCTCTACAGCTGCTTCCAGGAAAGGAAACCGCGTAAGGGCGAGTTTTGGGAACAAACCAACGGTGAGGATTGAGGGTTTCGATTGCCTTGACAGGGATAGGCAGAATTGCAGCATCCCTGCCTTCGCTTAGAAACCCATCTCAATCAATATCAAATCCATAGAGTAATACAAGACAAGCATATTATAGTTTCAAGAGAAGCAAGATTCCCAATTACAAAACAATATCGAGGGTCTCTCAAGAGAAAGAAAAGGATACACAATTTGCCCTTTTTTTTGTGAATACGTTTGAAGTGTTCTTTGTAAATGTAAATAAAAATCAATGCCCATGTATACTAAAGCTGTGTATTTTTTAGGCTTTTTGAGTCGATGGAGAGCAAACGATGCGTGGATGGTCAACCTCTTGTATTATATTTTTGTAATTAGAAGTCTTAAGGTTGAGAGAATTTTCATTTGAAGTTGAAAGTAAGAGAGAAAGAGAGAGAGAGAGGCAATCTGTACATATTGCAGAACTTGAAATGGAGTCTGTAGAAGTGATCAACATGTGAATAATTGTAGAACAATTTCTTATTGTGAAGTTTTGTTCATCTTTCTTTCTGGGTTTGAATTTGGGTTTGGAAGTTTGCTTGCTTAAGCATAACGCTAGCGATGGATCCATTAGATTTGCTTTTTCAGGGCATCTTCTTCATGGAATCGCCGACCTCTCGGCTTTTGCTAAAGCAACTTTGCTGGTGGAATTAATACCTCTTTCTTTGTCTCTCCTCTCTGTTCAAAGCTACTTTGTTGCAGAGCTTGTCTCTTTATTGGAGTGTTGATTTCTGAATTCTGACACAATTTTAT

Coding sequence (CDS)

ATGAATCACTGCGCCATTCTGTCAAACGCCTTCTCGGGCCACGAGGAGATGAGAACCTCTGTTCCGGGTCCCATTTCTGACCTCAGAGATCAAGTTGTTTGCCCTAAACCACGGCGTCTTAGGAATCTTAAGGTCACCGTCAACGGCCACGCCGATAGCTCTCTCCGGTGGAATCTCAGTCACCAAGTAGAGCAAATTGACATGGCAGCCGGACCGGATCTGCTTGACTTCCTCCTGACAAAAGGTGGTTGCAGCGTGGACCAATCGTTTACGCAGTTGGCTTCGTCGCCCCCTTTTTTATGTGGGTCTCCGCCGAGCAGAGTAGCCAACCCATTGATTCAGGACGCCCGATTTGGGGATGAAAAATTCATCCCCTTTGCACCGATTGCTTCACCGGCGGGTCAGTTGTCGCCCTCTACAGCTGCTTCCAGGAAAGGAAACCGCGTAAGGGCGAGTTTTGGGAACAAACCAACGGTGAGGATTGAGGGTTTCGATTGCCTTGACAGGGATAGGCAGAATTGCAGCATCCCTGCCTTCGCTTAG

Protein sequence

MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
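
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below is a minimal illustration, assuming the standard nuclear genetic code applies; `translate` is a hypothetical helper introduced here (it is not provided by the database), and only the first 30 bases of the CDS are used for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
cds_prefix = "ATGAATCACTGCGCCATTCTGTCAAACGCC"
print(translate(cds_prefix))  # MNHCAILSNA
```

The printed residues agree with the first ten residues of the protein sequence; passing the full CDS string to the same function would extend the check to the whole record.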
Homology
BLAST of Tan0018542 vs. NCBI nr
Match: XP_022957437.1 (uncharacterized protein LOC111458833 [Cucurbita moschata] >KAG7032074.1 hypothetical protein SDJN02_06117 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 344.4 bits (882), Expect = 6.1e-91
Identity = 167/180 (92.78%), Positives = 170/180 (94.44%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLS
Sbjct: 1   MNHCAILSNAFSSHEEMRAPVPGPISDLRDQLVCPKPRRLSNPKVTVIGHADSSLRWNLS 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD
Sbjct: 61  HQVEQIDMATGPDLLDFLLTRGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120

Query: 121 EKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 180
           EKFIPFAPIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Sbjct: 121 EKFIPFAPIASPSGQLSPST-ASRKGGRVRASFGNKPTVRIEGFDCRDRDRQNCSIPAFA 179
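
The percentages in each HSP header follow from the match counts divided by the alignment length, which counts gap columns as well as residues. As a small sketch (the helper name `percent` is hypothetical), the figures reported for this first hit can be reproduced as follows:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of alignment columns, rounded to 2 places."""
    return round(100 * matches / alignment_length, 2)

# Counts taken from the HSP header of the first NCBI nr hit.
print(percent(167, 180))  # identity
print(percent(170, 180))  # positives
```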

BLAST of Tan0018542 vs. NCBI nr
Match: XP_022150691.1 (uncharacterized protein LOC111018762 [Momordica charantia])

HSP 1 Score: 344.4 bits (882), Expect = 6.1e-91
Identity = 168/181 (92.82%), Positives = 172/181 (95.03%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSN FSGHEEMRTSVPGPISDLRDQ+VCPKPRRL NLKVTVNGHAD+SLRWNL 
Sbjct: 1   MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLC 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVEQIDMAAGPDLLDFLLTK GCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD
Sbjct: 61  HQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120

Query: 121 EKFIPFAPI-ASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAF 180
           EKFIPFAPI ASP+ QLSPST ASRKG RVRA+FGNKP VRIEGFDCLDRDRQNCSIPAF
Sbjct: 121 EKFIPFAPIAASPSVQLSPST-ASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAF 180

BLAST of Tan0018542 vs. NCBI nr
Match: KAG6601286.1 (hypothetical protein SDJN03_06519, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 344.4 bits (882), Expect = 6.1e-91
Identity = 167/180 (92.78%), Positives = 170/180 (94.44%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLS
Sbjct: 1   MNHCAILSNAFSSHEEMRAPVPGPISDLRDQLVCPKPRRLSNPKVTVIGHADSSLRWNLS 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD
Sbjct: 61  HQVEQIDMATGPDLLDFLLTRGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120

Query: 121 EKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 180
           EKFIPFAPIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Sbjct: 121 EKFIPFAPIASPSGQLSPST-ASRKGGRVRASFGNKPTVRIEGFDCRDRDRQNCSIPAFA 179

BLAST of Tan0018542 vs. NCBI nr
Match: XP_022986983.1 (uncharacterized protein LOC111484540 [Cucurbita maxima])

HSP 1 Score: 341.7 bits (875), Expect = 4.0e-90
Identity = 166/180 (92.22%), Positives = 169/180 (93.89%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLS
Sbjct: 1   MNHCAILSNAFSSHEEMRDPVPGPISDLRDQLVCPKPRRLSNPKVTVIGHADSSLRWNLS 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD
Sbjct: 61  HQVEQIDMATGPDLLDFLLTRGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120

Query: 121 EKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 180
           EKFIP APIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Sbjct: 121 EKFIPLAPIASPSGQLSPST-ASRKGGRVRASFGNKPTVRIEGFDCRDRDRQNCSIPAFA 179

BLAST of Tan0018542 vs. NCBI nr
Match: XP_023517111.1 (uncharacterized protein LOC111780963 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 341.3 bits (874), Expect = 5.2e-90
Identity = 165/180 (91.67%), Positives = 168/180 (93.33%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSNAFS HEEMR  VPGP SDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLS
Sbjct: 1   MNHCAILSNAFSSHEEMRAPVPGPFSDLRDQLVCPKPRRLSNPKVTVIGHADSSLRWNLS 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVEQIDM  GPDLLDFLLT+GGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD
Sbjct: 61  HQVEQIDMGTGPDLLDFLLTRGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120

Query: 121 EKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 180
           EKFIPFAPIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Sbjct: 121 EKFIPFAPIASPSGQLSPST-ASRKGGRVRASFGNKPTVRIEGFDCRDRDRQNCSIPAFA 179

BLAST of Tan0018542 vs. ExPASy TrEMBL
Match: A0A6J1DC97 (uncharacterized protein LOC111018762 OS=Momordica charantia OX=3673 GN=LOC111018762 PE=4 SV=1)

HSP 1 Score: 344.4 bits (882), Expect = 3.0e-91
Identity = 168/181 (92.82%), Positives = 172/181 (95.03%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSN FSGHEEMRTSVPGPISDLRDQ+VCPKPRRL NLKVTVNGHAD+SLRWNL 
Sbjct: 1   MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLC 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVEQIDMAAGPDLLDFLLTK GCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD
Sbjct: 61  HQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120

Query: 121 EKFIPFAPI-ASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAF 180
           EKFIPFAPI ASP+ QLSPST ASRKG RVRA+FGNKP VRIEGFDCLDRDRQNCSIPAF
Sbjct: 121 EKFIPFAPIAASPSVQLSPST-ASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAF 180

BLAST of Tan0018542 vs. ExPASy TrEMBL
Match: A0A6J1GZ47 (uncharacterized protein LOC111458833 OS=Cucurbita moschata OX=3662 GN=LOC111458833 PE=4 SV=1)

HSP 1 Score: 344.4 bits (882), Expect = 3.0e-91
Identity = 167/180 (92.78%), Positives = 170/180 (94.44%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLS
Sbjct: 1   MNHCAILSNAFSSHEEMRAPVPGPISDLRDQLVCPKPRRLSNPKVTVIGHADSSLRWNLS 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD
Sbjct: 61  HQVEQIDMATGPDLLDFLLTRGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120

Query: 121 EKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 180
           EKFIPFAPIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Sbjct: 121 EKFIPFAPIASPSGQLSPST-ASRKGGRVRASFGNKPTVRIEGFDCRDRDRQNCSIPAFA 179

BLAST of Tan0018542 vs. ExPASy TrEMBL
Match: A0A6J1JI50 (uncharacterized protein LOC111484540 OS=Cucurbita maxima OX=3661 GN=LOC111484540 PE=4 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 1.9e-90
Identity = 166/180 (92.22%), Positives = 169/180 (93.89%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLS
Sbjct: 1   MNHCAILSNAFSSHEEMRDPVPGPISDLRDQLVCPKPRRLSNPKVTVIGHADSSLRWNLS 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD
Sbjct: 61  HQVEQIDMATGPDLLDFLLTRGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120

Query: 121 EKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 180
           EKFIP APIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Sbjct: 121 EKFIPLAPIASPSGQLSPST-ASRKGGRVRASFGNKPTVRIEGFDCRDRDRQNCSIPAFA 179

BLAST of Tan0018542 vs. ExPASy TrEMBL
Match: A0A5D3CB96 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G001020 PE=4 SV=1)

HSP 1 Score: 329.3 bits (843), Expect = 9.8e-87
Identity = 161/180 (89.44%), Positives = 167/180 (92.78%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSNAFSGHEEMRTSVP PISD RDQ+VCPKPRRL     TVN H+D+SLRWNLS
Sbjct: 1   MNHCAILSNAFSGHEEMRTSVPCPISDFRDQLVCPKPRRL-----TVNAHSDTSLRWNLS 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVE IDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARF +
Sbjct: 61  HQVEPIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFRE 120

Query: 121 EKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 180
           EKFIPF PIASP+GQLSPST +SRKG RVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
Sbjct: 121 EKFIPFTPIASPSGQLSPST-SSRKGGRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 174

BLAST of Tan0018542 vs. ExPASy TrEMBL
Match: A0A1S3BG95 (uncharacterized protein LOC103489287 OS=Cucumis melo OX=3656 GN=LOC103489287 PE=4 SV=1)

HSP 1 Score: 329.3 bits (843), Expect = 9.8e-87
Identity = 161/180 (89.44%), Positives = 167/180 (92.78%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHCAILSNAFSGHEEMRTSVP PISD RDQ+VCPKPRRL     TVN H+D+SLRWNLS
Sbjct: 1   MNHCAILSNAFSGHEEMRTSVPCPISDFRDQLVCPKPRRL-----TVNAHSDTSLRWNLS 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
           HQVE IDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARF +
Sbjct: 61  HQVEPIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFRE 120

Query: 121 EKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 180
           EKFIPF PIASP+GQLSPST +SRKG RVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
Sbjct: 121 EKFIPFTPIASPSGQLSPST-SSRKGGRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA 174

BLAST of Tan0018542 vs. TAIR 10
Match: AT1G68490.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G13390.2); Has 125 Blast hits to 125 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 162.2 bits (409), Expect = 4.0e-40
Identity = 94/187 (50.27%), Positives = 115/187 (61.50%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRR--LRNLKVTVNGHADSSLRWN 60
           MNH A+  NAF+   ++R+S    +   +  VVCPKPRR  LRN     + H   SLR  
Sbjct: 1   MNHFAVQPNAFAAGGDLRSSSVSVVERDQTTVVCPKPRRIGLRN----NHHHPSRSLRCY 60

Query: 61  LSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSP-PFLCGSPPSRVANPLIQDAR 120
            SHQ+E  +  A  D+LD +LTK G   +Q   Q+  SP PFLCGSPPSRVANPL QDAR
Sbjct: 61  FSHQLELCESKAETDILDIILTKDGYGAEQVNKQVIDSPSPFLCGSPPSRVANPLTQDAR 120

Query: 121 FGDEKFIPFAPIASPAG---QLSPSTAASRKGN-RVRASFGNKPTVRIEGFDCLDRDRQN 180
           F DE     + I    G     SPS+++ RKG   VR +FGN P VR+EGFDCLDRD +N
Sbjct: 121 FRDEIVSVSSVIPPQLGLPPSSSPSSSSGRKGGCVVRGNFGNSPKVRVEGFDCLDRDSRN 180

BLAST of Tan0018542 vs. TAIR 10
Match: AT1G13390.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G68490.1); Has 114 Blast hits to 114 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 114; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 137.5 bits (345), Expect = 1.1e-32
Identity = 85/185 (45.95%), Positives = 109/185 (58.92%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MN C I  NAF   EEMR +    +SD RD V+CPKPRR+  L    N H+  SLRW L+
Sbjct: 2   MNSCGIQQNAF---EEMRRN--AAVSDRRDAVICPKPRRVGAL----NHHSSRSLRWQLN 61

Query: 61  HQVEQIDMAAGPDLLDFLLTK-GGCSVDQSFTQLASSPP-FLCGSPPSRVANPLIQDARF 120
           HQ+E  +  +G ++LDF+LTK GG   +Q  T+   +PP F  GSPPSRV+NPL +D+ F
Sbjct: 62  HQMELCESNSGSEILDFILTKGGGGGGEQDQTRTVMTPPLFFTGSPPSRVSNPLTKDSLF 121

Query: 121 GDEKFIPFAPIASPAGQLSPSTAAS-RKGNRVRA--SFGNKPTVRIEGFDCLDRDRQNCS 180
            +E  +  +P  S      P   +S R G+ V A  SFGN P VR+ GFDC DR   N S
Sbjct: 122 REELLMVASPSPSTPRATKPQPPSSPRNGSCVMAATSFGNNPVVRVVGFDC-DRRSSNRS 176

BLAST of Tan0018542 vs. TAIR 10
Match: AT1G13390.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G68490.1); Has 114 Blast hits to 114 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 114; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 137.5 bits (345), Expect = 1.1e-32
Identity = 85/185 (45.95%), Positives = 109/185 (58.92%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MN C I  NAF   EEMR +    +SD RD V+CPKPRR+  L    N H+  SLRW L+
Sbjct: 2   MNSCGIQQNAF---EEMRRN--AAVSDRRDAVICPKPRRVGAL----NHHSSRSLRWQLN 61

Query: 61  HQVEQIDMAAGPDLLDFLLTK-GGCSVDQSFTQLASSPP-FLCGSPPSRVANPLIQDARF 120
           HQ+E  +  +G ++LDF+LTK GG   +Q  T+   +PP F  GSPPSRV+NPL +D+ F
Sbjct: 62  HQMELCESNSGSEILDFILTKGGGGGGEQDQTRTVMTPPLFFTGSPPSRVSNPLTKDSLF 121

Query: 121 GDEKFIPFAPIASPAGQLSPSTAAS-RKGNRVRA--SFGNKPTVRIEGFDCLDRDRQNCS 180
            +E  +  +P  S      P   +S R G+ V A  SFGN P VR+ GFDC DR   N S
Sbjct: 122 REELLMVASPSPSTPRATKPQPPSSPRNGSCVMAATSFGNNPVVRVVGFDC-DRRSSNRS 176

BLAST of Tan0018542 vs. TAIR 10
Match: AT3G02555.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G16110.1); Has 130 Blast hits to 130 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 130; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 137.1 bits (344), Expect = 1.4e-32
Identity = 88/181 (48.62%), Positives = 100/181 (55.25%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHC++  NAF   EE R  VP   S   D VVCPKPRR  N+           L ++LS
Sbjct: 1   MNHCSLQQNAFLSREESRGFVP-IYSHPVDSVVCPKPRRANNV------IRPFRLHFSLS 60

Query: 61  HQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARFGD 120
              +  D  AG DLLD    K   S        + SPPF  GSPPSR ANPL QDARFGD
Sbjct: 61  GADDVCDSKAGEDLLDIFRRKESVS--------SRSPPFFLGSPPSRAANPLAQDARFGD 120

Query: 121 EKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKP-TVRIEGFDCLDRDRQNCSIPAF 180
           EK    +P  SP   L PS +  + G   R  FG KP TVR+EGFDCL+RDR N SIPA 
Sbjct: 121 EKLNTVSPSLSP---LLPSASRVKSGCG-RMKFGVKPATVRVEGFDCLNRDRPNSSIPAM 162

BLAST of Tan0018542 vs. TAIR 10
Match: AT5G16110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G02555.1); Has 133 Blast hits to 133 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 133; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 134.0 bits (336), Expect = 1.2e-31
Identity = 87/191 (45.55%), Positives = 109/191 (57.07%), Query Frame = 0

Query: 1   MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLS 60
           MNHC +  NAF   EEM         D +D VVCPKPRR+  L      +    LR ++S
Sbjct: 67  MNHCNLQQNAFMSREEMMG------FDRKDLVVCPKPRRVGLLA----NNVIRPLRLHMS 126

Query: 61  HQVEQI-DMAAGPDLLDFLLTK-GGCSVDQSFTQLASSPPFLCGSPPSRVANPLIQDARF 120
                + D  AG +LL+ +  K    ++ Q    L+SSPP+  GSPPSR ANPL QDARF
Sbjct: 127 QAAADLCDSKAGAELLEIIRRKEDNGTIGQ---LLSSSPPYFPGSPPSRAANPLAQDARF 186

Query: 121 GDEKFIPFAPIA------SPAGQLSPSTAASRKGNR--VRASFG-NKPTVRIEGFDCLDR 180
            DEK  P +P +      S  G  SPS+++S   +R  VR  FG N P VR+EGFDCL+R
Sbjct: 187 RDEKLNPISPNSPFLQPYSATGFPSPSSSSSSSSSRGCVRMKFGLNSPAVRVEGFDCLNR 244

The following BLAST results are available for this feature:
Match Name        E-value    Identity (%)    Description
XP_022957437.1    6.1e-91    92.78           uncharacterized protein LOC111458833 [Cucurbita moschata] >KAG7032074.1 hypothet...
XP_022150691.1    6.1e-91    92.82           uncharacterized protein LOC111018762 [Momordica charantia]
KAG6601286.1      6.1e-91    92.78           hypothetical protein SDJN03_06519, partial [Cucurbita argyrosperma subsp. sorori...
XP_022986983.1    4.0e-90    92.22           uncharacterized protein LOC111484540 [Cucurbita maxima]
XP_023517111.1    5.2e-90    91.67           uncharacterized protein LOC111780963 [Cucurbita pepo subsp. pepo]

Match Name        E-value    Identity (%)    Description
A0A6J1DC97        3.0e-91    92.82           uncharacterized protein LOC111018762 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1GZ47        3.0e-91    92.78           uncharacterized protein LOC111458833 OS=Cucurbita moschata OX=3662 GN=LOC1114588...
A0A6J1JI50        1.9e-90    92.22           uncharacterized protein LOC111484540 OS=Cucurbita maxima OX=3661 GN=LOC111484540...
A0A5D3CB96        9.8e-87    89.44           Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BG95        9.8e-87    89.44           uncharacterized protein LOC103489287 OS=Cucumis melo OX=3656 GN=LOC103489287 PE=...

Match Name        E-value    Identity (%)    Description
AT1G68490.1       4.0e-40    50.27           unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G13390.1       1.1e-32    45.95           unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G13390.2       1.1e-32    45.95           unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G02555.1       1.4e-32    48.62           unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G16110.1       1.2e-31    45.55           unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term    IPR Description     Source     Source Term       Source Description       Alignment
None        No IPR available    PANTHER    PTHR33384:SF18    BNAA07G24630D PROTEIN    coord: 1..180
None        No IPR available    PANTHER    PTHR33384         EXPRESSED PROTEIN        coord: 1..180

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0018542.1    Tan0018542.1    mRNA