Tan0018851 (gene) Snake gourd v1

Overview
NameTan0018851
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNA-binding protein
LocationLG04: 9420103 .. 9422797 (+)
RNA-Seq ExpressionTan0018851
SyntenyTan0018851
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCTGAACTCTCAAGTTCGATCGTTGTCGTCTTCGTGTCTGCCTCTCTTCTTCTCTTTCGTTAGTGGTTGGTAAATTTTTCTGCTCGATTTCTCTTCATCTTAGTTGTTCAATCATCCTGATTTGTTGGTTTCAAACCCTAATGATATAGTTTTGTTTTGCTATAATTTTGATTTTCTTCTGGGGTTCTCATTTTTATTGTTTCCAGTGGACCATTGGATCGGATCGAAGTTCTAAAAGGAGGAGCCATGTCCTTCATGAAAGGAGATTTACTAACTAGAACCAGAAAGCTCGTCAAGGGCTTGGCCAAAGCAGAACCTGTCTGGCTCAAAGCCATGGAACAGTCAGTTCTTCTATTCTTGACTCCTCGCACTTCTGTTCTCTTTACAAGTATTTCATTTTTTGAGATTCTGACCATGTTTTTTTTGTTCTTACATTTTAGTGAGACGGCATAATTATAATCCTTTCCCTTTTTAGCTTGTCAAATTCTATTTGATTACCTTTTGTTCTTTTCTTCGAATCTCATTTTTCTTAGATTAGTAAAACTCTTTGGTATACTATTAGATTCAAGTTTCTCAATTTATCATGTTTTTGACATCAGTTTTTGTGAAAAATTTTCCCAGCAGGTTTCTGTTCATTGAATTAAATAAACTAAAACTACTTATTGAGATTAGTTTCTCTGGAAGGGCTCAAAGTTTTGGGGATTTTGTTTTCCTATTGTTTCTGTTGAATGTTCGTGATTTGAGCTCATTCTGGTATCAATGAATTCATAAAAAGTTACCATGCTCCTAGCTTCTAACTTTTTGAAACAGTTATCTTATCTTCATCCATTAAACTCTTGTTCAGGGCACCGCCTCCTACATTTCCTCGAGTAGATGGAACAGTCAAAACAATCAGTCTTCCTGAGGATGTATATGTAAAAAAGTTCTTCCAGAAACATCCAGATTCAAAATATGAAGATGCCATCAAGTATGCTGCTCCTTTTATATATGTAGTTTGTCTCATTTTACTCCATCATACATGTTTGAAGTGAGGGATTAAGTTCACTGTTTGACAGCAACCTGTCTGCCTTGAAGTAAGATTAGATTCTCGGGTTACATTTAAGAGTTTGGAGTACAAGTAGAATACCCCTGATGGATATCCATGAATTGAAGTTCCATTTGCATGCTGCTTAATGGTATCATTGACATATTTGGGAGTGATTTTGAAATAGTTAAAATCACTTTCGTCGTATTCAAAATTACTCCGAAACATTAAGTTGGTTTTGAATGATTAAAGCCAAGGCATATTTCGGAGTAGTTTTGAACGTGACAAAAGTGATTTTAACCCTTTCAAAATCACTCCCAAACATGCCATAAGAGGCTTTGCCATTGCTTTTCTTGTTGATTGTTTGTGTGCATTGTGCTTTGTACATTCAACCACGTGTGTGCCTACCATGAAGAATTTCAAGCACTGCTAATTCTTCTGAGGGCCCAGTATTATCAAAATTTTAGTGAAGCTACTTTTTACGGCTAATTATACGAAGTAAGTTGGGGTTTAAAAGAGCTGACTTTTTCTGATAAAAAGAAAAAAAAAACGAACTGACCTTTTGTTCTATTTCCTGTTGATTGGTATGTTTGTTTGCTTCTAATCAATAAGTAGAGAGTGGAGACATCATTGGTTTAGAATAGATCAGCTGATAAAAGGTTTCTGATGTCACTTTGCATTTTGGGAAGGATGCCGAATTGAGGTTGTCATAATTCTCCCTCTTCTTTGAGATAGAATCTCTAATGGACATCAGCTTTGCTAAAATAGATGCTTCAATTGGAAACATATATACATCTATGTGGAGTTTTATCTTGCATTAATGAAATCCTGATATCCAAAGCATGAAAAAAAAAAAAAAGAGAAAGTTCATAAGTTGTTGTATTTTGTTGCAACAGGTTTTGTAGTTTCGATCCTCCCCCAGCTCGAATATTCGGCATGCGGGTGCTTGAATTGAAGGAACAAGGTGTCAGTGAGGAGGAAGCCATGGCAGTAGCCAATGTATGGATTATGGTTCTTTTGTTCATATAAACTTCTCTTGAATCCCCCCATTTTGTCAATAATGTCTTTTGTCATCCAGATGGAATACCGAGCAGAGAAAAAGGCAAAAAAGAAAGCTTACTCACGCTTGAAGCAAATTGCTCGTCTTCAAGGGAAGAAACCTCCTCCTAACCCTTATCCAAGTGCTATTAAGGAGATACAAGCCGAGGAAAGGAAATTCGTTCGGGATCGTTTCTTCAACCCCAAGATTAAAGAGATTGCACAAAGGTTGAAGGAAGAGAGAGCAGCTGAAATGCAGGACAGAATGCGAGGCAGTGGCTGGTGATGAAGAGTTCTAAATTACATCGCTCAACTTCCATCCTCCCATCTTAACATCAGGAAACTAGCTTTTGCAATAATTTTGGCAGTTTAGCAATCAAGAAGTACTTACAATTCCTACATTCAAGTGAATATTACTTCTGAAGAATTGGATCAAACATGTTCAAAAATTTTCCTTTGTTGATTGCTGTTCGATTAAATTTTTGTTTTCCTCTGAGAGAAGGTTGCTAGATTATTCAACCATGACTGATGATGTCGTCATAAGGTGGTTGATGGATACCTACACAATTGTTCAATTAGATTAATTTTTATTCAAATTATCATGTACTCTCGATTAGAGAAGTCAGAAGTTTG

mRNA sequence

GCTGAACTCTCAAGTTCGATCGTTGTCGTCTTCGTGTCTGCCTCTCTTCTTCTCTTTCGTTAGTGGTTGTGGACCATTGGATCGGATCGAAGTTCTAAAAGGAGGAGCCATGTCCTTCATGAAAGGAGATTTACTAACTAGAACCAGAAAGCTCGTCAAGGGCTTGGCCAAAGCAGAACCTGTCTGGCTCAAAGCCATGGAACAGGCACCGCCTCCTACATTTCCTCGAGTAGATGGAACAGTCAAAACAATCAGTCTTCCTGAGGATGTATATGTAAAAAAGTTCTTCCAGAAACATCCAGATTCAAAATATGAAGATGCCATCAAGTTTTGTAGTTTCGATCCTCCCCCAGCTCGAATATTCGGCATGCGGGTGCTTGAATTGAAGGAACAAGGTGTCAGTGAGGAGGAAGCCATGGCAGTAGCCAATATGGAATACCGAGCAGAGAAAAAGGCAAAAAAGAAAGCTTACTCACGCTTGAAGCAAATTGCTCGTCTTCAAGGGAAGAAACCTCCTCCTAACCCTTATCCAAGTGCTATTAAGGAGATACAAGCCGAGGAAAGGAAATTCGTTCGGGATCGTTTCTTCAACCCCAAGATTAAAGAGATTGCACAAAGGTTGAAGGAAGAGAGAGCAGCTGAAATGCAGGACAGAATGCGAGGCAGTGGCTGGTGATGAAGAGTTCTAAATTACATCGCTCAACTTCCATCCTCCCATCTTAACATCAGGAAACTAGCTTTTGCAATAATTTTGGCAGTTTAGCAATCAAGAAGTACTTACAATTCCTACATTCAAGTGAATATTACTTCTGAAGAATTGGATCAAACATGTTCAAAAATTTTCCTTTGTTGATTGCTGTTCGATTAAATTTTTGTTTTCCTCTGAGAGAAGGTTGCTAGATTATTCAACCATGACTGATGATGTCGTCATAAGGTGGTTGATGGATACCTACACAATTGTTCAATTAGATTAATTTTTATTCAAATTATCATGTACTCTCGATTAGAGAAGTCAGAAGTTTG

Coding sequence (CDS)

ATGTCCTTCATGAAAGGAGATTTACTAACTAGAACCAGAAAGCTCGTCAAGGGCTTGGCCAAAGCAGAACCTGTCTGGCTCAAAGCCATGGAACAGGCACCGCCTCCTACATTTCCTCGAGTAGATGGAACAGTCAAAACAATCAGTCTTCCTGAGGATGTATATGTAAAAAAGTTCTTCCAGAAACATCCAGATTCAAAATATGAAGATGCCATCAAGTTTTGTAGTTTCGATCCTCCCCCAGCTCGAATATTCGGCATGCGGGTGCTTGAATTGAAGGAACAAGGTGTCAGTGAGGAGGAAGCCATGGCAGTAGCCAATATGGAATACCGAGCAGAGAAAAAGGCAAAAAAGAAAGCTTACTCACGCTTGAAGCAAATTGCTCGTCTTCAAGGGAAGAAACCTCCTCCTAACCCTTATCCAAGTGCTATTAAGGAGATACAAGCCGAGGAAAGGAAATTCGTTCGGGATCGTTTCTTCAACCCCAAGATTAAAGAGATTGCACAAAGGTTGAAGGAAGAGAGAGCAGCTGAAATGCAGGACAGAATGCGAGGCAGTGGCTGGTGA

Protein sequence

MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFFQKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKAYSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQDRMRGSGW
Homology
BLAST of Tan0018851 vs. NCBI nr
Match: XP_022957544.1 (uncharacterized protein LOC111458914 [Cucurbita moschata] >XP_022957545.1 uncharacterized protein LOC111458914 [Cucurbita moschata] >XP_022957546.1 uncharacterized protein LOC111458914 [Cucurbita moschata] >XP_022957547.1 uncharacterized protein LOC111458914 [Cucurbita moschata] >XP_023513675.1 uncharacterized protein LOC111778212 [Cucurbita pepo subsp. pepo] >XP_023513683.1 uncharacterized protein LOC111778212 [Cucurbita pepo subsp. pepo] >XP_023513691.1 uncharacterized protein LOC111778212 [Cucurbita pepo subsp. pepo] >XP_023513699.1 uncharacterized protein LOC111778212 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 367.1 bits (941), Expect = 9.2e-98
Identity = 183/188 (97.34%), Postives = 188/188 (100.00%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSFMKGDLLTRTRKLVKGLAKAEP+WLKAMEQAPPPTFPR+DGT+KTI+LPEDVYVKKFF
Sbjct: 1   MSFMKGDLLTRTRKLVKGLAKAEPIWLKAMEQAPPPTFPRIDGTIKTITLPEDVYVKKFF 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           QKHPDSKYEDAIKFCSFDPPPARIFG+RVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA
Sbjct: 61  QKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ
Sbjct: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180

Query: 181 DRMRGSGW 189
           DRMRGSGW
Sbjct: 181 DRMRGSGW 188

BLAST of Tan0018851 vs. NCBI nr
Match: KAG6601448.1 (Galactoside 2-alpha-L-fucosyltransferase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 367.1 bits (941), Expect = 9.2e-98
Identity = 183/188 (97.34%), Postives = 188/188 (100.00%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSFMKGDLLTRTRKLVKGLAKAEP+WLKAMEQAPPPTFPR+DGT+KTI+LPEDVYVKKFF
Sbjct: 588 MSFMKGDLLTRTRKLVKGLAKAEPIWLKAMEQAPPPTFPRIDGTIKTITLPEDVYVKKFF 647

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           QKHPDSKYEDAIKFCSFDPPPARIFG+RVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA
Sbjct: 648 QKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 707

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ
Sbjct: 708 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 767

Query: 181 DRMRGSGW 189
           DRMRGSGW
Sbjct: 768 DRMRGSGW 775

BLAST of Tan0018851 vs. NCBI nr
Match: KAG7032230.1 (hypothetical protein SDJN02_06273 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 365.5 bits (937), Expect = 2.7e-97
Identity = 182/188 (96.81%), Postives = 188/188 (100.00%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSFMKGDLLTRTRKLVKGLAKAEP+WLKAMEQAPPPTFPR+DGT+KTI+LPEDVYV+KFF
Sbjct: 1   MSFMKGDLLTRTRKLVKGLAKAEPIWLKAMEQAPPPTFPRIDGTIKTITLPEDVYVEKFF 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           QKHPDSKYEDAIKFCSFDPPPARIFG+RVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA
Sbjct: 61  QKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ
Sbjct: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180

Query: 181 DRMRGSGW 189
           DRMRGSGW
Sbjct: 181 DRMRGSGW 188

BLAST of Tan0018851 vs. NCBI nr
Match: XP_022996766.1 (uncharacterized protein LOC111491881 [Cucurbita maxima] >XP_022996833.1 uncharacterized protein LOC111491881 [Cucurbita maxima] >XP_022996916.1 uncharacterized protein LOC111491881 [Cucurbita maxima] >XP_022996999.1 uncharacterized protein LOC111491881 [Cucurbita maxima])

HSP 1 Score: 364.0 bits (933), Expect = 7.8e-97
Identity = 182/188 (96.81%), Postives = 187/188 (99.47%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSFMKGDLL+RTRKLVKGLAKAEP+WLKAMEQAPPPTFPR+DG VKTI+LPEDVYVKKFF
Sbjct: 1   MSFMKGDLLSRTRKLVKGLAKAEPIWLKAMEQAPPPTFPRIDGAVKTITLPEDVYVKKFF 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           QKHPDSKYEDAIKFCSFDPPPARIFG+RVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA
Sbjct: 61  QKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ
Sbjct: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180

Query: 181 DRMRGSGW 189
           DRMRGSGW
Sbjct: 181 DRMRGSGW 188

BLAST of Tan0018851 vs. NCBI nr
Match: XP_022151151.1 (uncharacterized protein LOC111019147 [Momordica charantia])

HSP 1 Score: 361.3 bits (926), Expect = 5.0e-96
Identity = 181/188 (96.28%), Postives = 184/188 (97.87%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSFMKGDLL RTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGT+KTISLPEDVYVKKFF
Sbjct: 1   MSFMKGDLLARTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTIKTISLPEDVYVKKFF 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           QKHPDSKYEDAIKFCSFDPPPARIFG+RVLELKEQGV EEEAMAVANMEYRAEKK KKKA
Sbjct: 61  QKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVGEEEAMAVANMEYRAEKKVKKKA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           Y+RLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ
Sbjct: 121 YARLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180

Query: 181 DRMRGSGW 189
           DRMRG GW
Sbjct: 181 DRMRGGGW 188

BLAST of Tan0018851 vs. ExPASy TrEMBL
Match: A0A6J1GZF0 (uncharacterized protein LOC111458914 OS=Cucurbita moschata OX=3662 GN=LOC111458914 PE=4 SV=1)

HSP 1 Score: 367.1 bits (941), Expect = 4.5e-98
Identity = 183/188 (97.34%), Postives = 188/188 (100.00%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSFMKGDLLTRTRKLVKGLAKAEP+WLKAMEQAPPPTFPR+DGT+KTI+LPEDVYVKKFF
Sbjct: 1   MSFMKGDLLTRTRKLVKGLAKAEPIWLKAMEQAPPPTFPRIDGTIKTITLPEDVYVKKFF 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           QKHPDSKYEDAIKFCSFDPPPARIFG+RVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA
Sbjct: 61  QKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ
Sbjct: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180

Query: 181 DRMRGSGW 189
           DRMRGSGW
Sbjct: 181 DRMRGSGW 188

BLAST of Tan0018851 vs. ExPASy TrEMBL
Match: A0A6J1KC52 (uncharacterized protein LOC111491881 OS=Cucurbita maxima OX=3661 GN=LOC111491881 PE=4 SV=1)

HSP 1 Score: 364.0 bits (933), Expect = 3.8e-97
Identity = 182/188 (96.81%), Postives = 187/188 (99.47%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSFMKGDLL+RTRKLVKGLAKAEP+WLKAMEQAPPPTFPR+DG VKTI+LPEDVYVKKFF
Sbjct: 1   MSFMKGDLLSRTRKLVKGLAKAEPIWLKAMEQAPPPTFPRIDGAVKTITLPEDVYVKKFF 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           QKHPDSKYEDAIKFCSFDPPPARIFG+RVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA
Sbjct: 61  QKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ
Sbjct: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180

Query: 181 DRMRGSGW 189
           DRMRGSGW
Sbjct: 181 DRMRGSGW 188

BLAST of Tan0018851 vs. ExPASy TrEMBL
Match: A0A6J1DDQ7 (uncharacterized protein LOC111019147 OS=Momordica charantia OX=3673 GN=LOC111019147 PE=4 SV=1)

HSP 1 Score: 361.3 bits (926), Expect = 2.4e-96
Identity = 181/188 (96.28%), Postives = 184/188 (97.87%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSFMKGDLL RTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGT+KTISLPEDVYVKKFF
Sbjct: 1   MSFMKGDLLARTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTIKTISLPEDVYVKKFF 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           QKHPDSKYEDAIKFCSFDPPPARIFG+RVLELKEQGV EEEAMAVANMEYRAEKK KKKA
Sbjct: 61  QKHPDSKYEDAIKFCSFDPPPARIFGLRVLELKEQGVGEEEAMAVANMEYRAEKKVKKKA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           Y+RLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ
Sbjct: 121 YARLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180

Query: 181 DRMRGSGW 189
           DRMRG GW
Sbjct: 181 DRMRGGGW 188

BLAST of Tan0018851 vs. ExPASy TrEMBL
Match: A0A7N2LBH9 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 332.0 bits (850), Expect = 1.6e-87
Identity = 164/188 (87.23%), Postives = 177/188 (94.15%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSF+KGDLLTRTRKLVKGLAK++P+WLKAME APP TFPR DG VK ISLPEDVY+KKFF
Sbjct: 1   MSFLKGDLLTRTRKLVKGLAKSKPIWLKAMEHAPPATFPRADGKVKRISLPEDVYIKKFF 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           QKHPDSK+EDAIK CSFDPPPAR+FG+RVL+LKEQGVSEEEAMAVA+MEYRAEKKAKKKA
Sbjct: 61  QKHPDSKHEDAIKICSFDPPPARLFGLRVLDLKEQGVSEEEAMAVADMEYRAEKKAKKKA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERK+VRDRFFNPKI EI Q+LKEE+AAE Q
Sbjct: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKYVRDRFFNPKILEIVQKLKEEKAAEAQ 180

Query: 181 DRMRGSGW 189
           DR RG GW
Sbjct: 181 DRFRGGGW 188

BLAST of Tan0018851 vs. ExPASy TrEMBL
Match: A0A0A0KQQ3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G606340 PE=4 SV=1)

HSP 1 Score: 325.5 bits (833), Expect = 1.5e-85
Identity = 165/185 (89.19%), Postives = 174/185 (94.05%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MSFMKGDLLT+TRKLVKGLAKAEPVWLKAMEQAPPP+FPRVDGT+KTI+LPEDVYVKKFF
Sbjct: 1   MSFMKGDLLTKTRKLVKGLAKAEPVWLKAMEQAPPPSFPRVDGTIKTITLPEDVYVKKFF 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
           +KHPDS Y DAIKFC F+PPPARIF  RVLELKEQGV+EEEAM VANMEYRAEKK KK A
Sbjct: 61  KKHPDSYYHDAIKFCGFNPPPARIFAWRVLELKEQGVNEEEAMTVANMEYRAEKKMKKNA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           YSRLKQIARLQGKKPP NPYPSAIKEIQAEERKFVRDRFF+PKIKEIAQRLKEERAAEMQ
Sbjct: 121 YSRLKQIARLQGKKPPRNPYPSAIKEIQAEERKFVRDRFFDPKIKEIAQRLKEERAAEMQ 180

Query: 181 DRMRG 186
           +R  G
Sbjct: 181 ERTGG 185

BLAST of Tan0018851 vs. TAIR 10
Match: AT1G26750.1 (unknown protein; Has 44 Blast hits to 44 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 44; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 204.1 bits (518), Expect = 9.6e-53
Identity = 102/186 (54.84%), Postives = 134/186 (72.04%), Query Frame = 0

Query: 1   MSFMKGDLLTRTRKLVKGLAKAEPVWLKAMEQAPPPTFPRVDGTVKTISLPEDVYVKKFF 60
           MS+MKGDLL++TR+LV GLA  EPVWLKAME +PPP FPR +G ++ I LPED YV+KF 
Sbjct: 1   MSWMKGDLLSKTRRLVGGLATREPVWLKAMEASPPPVFPRSNGKIQKIVLPEDPYVRKFA 60

Query: 61  QKHPDSKYEDAIKFCSFDPPPARIFGMRVLELKEQGVSEEEAMAVANMEYRAEKKAKKKA 120
            KHP +K +D  K  +F P  AR++G RVLELKE G+SE +AM+VANMEY +E+K  KKA
Sbjct: 61  NKHPGTKIDDPAKISAFIPDQARVYGCRVLELKEHGISEGDAMSVANMEYLSERKEMKKA 120

Query: 121 YSRLKQIARLQGKKPPPNPYPSAIKEIQAEERKFVRDRFFNPKIKEIAQRLKEERAAEMQ 180
           Y RLK++A +Q K PPP PYPSA K +  + +   +DRF  P ++ +  +LK E+   +Q
Sbjct: 121 YKRLKELAVMQDKDPPPKPYPSAKKGLITQSKTSAKDRFQTPSVRRLVNQLKNEKDVLLQ 180

Query: 181 DRMRGS 187
           DR  GS
Sbjct: 181 DRTGGS 186

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022957544.19.2e-9897.34uncharacterized protein LOC111458914 [Cucurbita moschata] >XP_022957545.1 unchar... [more]
KAG6601448.19.2e-9897.34Galactoside 2-alpha-L-fucosyltransferase, partial [Cucurbita argyrosperma subsp.... [more]
KAG7032230.12.7e-9796.81hypothetical protein SDJN02_06273 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022996766.17.8e-9796.81uncharacterized protein LOC111491881 [Cucurbita maxima] >XP_022996833.1 uncharac... [more]
XP_022151151.15.0e-9696.28uncharacterized protein LOC111019147 [Momordica charantia][more]
Match NameE-valueIdentityDescription
A0A6J1GZF04.5e-9897.34uncharacterized protein LOC111458914 OS=Cucurbita moschata OX=3662 GN=LOC1114589... [more]
A0A6J1KC523.8e-9796.81uncharacterized protein LOC111491881 OS=Cucurbita maxima OX=3661 GN=LOC111491881... [more]
A0A6J1DDQ72.4e-9696.28uncharacterized protein LOC111019147 OS=Momordica charantia OX=3673 GN=LOC111019... [more]
A0A7N2LBH91.6e-8787.23Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1[more]
A0A0A0KQQ31.5e-8589.19Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G606340 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G26750.19.6e-5354.84unknown protein; Has 44 Blast hits to 44 proteins in 16 species: Archae - 0; Bac... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR35693EXPRESSED PROTEINcoord: 1..188
NoneNo IPR availablePANTHERPTHR35693:SF1EXPRESSED PROTEINcoord: 1..188

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0018851.1Tan0018851.1mRNA