Tan0017441 (gene) Snake gourd v1

Overview
Name: Tan0017441
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein GL2-INTERACTING REPRESSOR 2
Location: LG11: 8041941 .. 8047428 (+)
RNA-Seq Expression: Tan0017441
Synteny: Tan0017441
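
The location string in the overview can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: `parse_location` and `LOCATION_RE` are hypothetical names, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

# Hypothetical pattern for strings such as "LG11: 8041941 .. 8047428 (+)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Return (sequence id, start, end, strand) from a location string."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("LG11: 8041941 .. 8047428 (+)")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, strand, span)
```

Under that assumption the gene spans 5488 bases on the plus strand of LG11.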
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTGAAATCAGAAATTTCTGAATCCCAATTTGGGGCTTTCGACGAAATGGGTGTTTTTAGCGGCAATCTCATCTTCTCCCATTTTCCTTATCAAATGATTTTCAACAGTGGTATCTCTACCTGAAGATATGTACGAGAAAATGAAAATGGAAGTGAAGAAAGGGAAGAAAAGGAAGATTCGGGTTGTCGTTAATTCCAATTTCTGCAAAAGGGTTGGCGATTCTTTTATTTATTACTATGGAGTTTTGGGTCATCGGCATTTATGGGTTTTCAGCGGCAGTCAAGTCGGCGAGGAGAGGAGGGCTATGGTGCAATGTCCGGCGGCACTATACTGGTTAATCGGCGATCGATTGATTGCGTTTTGTAGATGATCAGGAAGGTACTGCCCCTTGAAAGTATTTGTTTCTTTTGATCATGAGTATGGGAGTGTTAAAAATTGTCGTGTTCTGTTTTGTTGTTGCTGAACTTTATATGAACTTAAGAAATCGTGGGTGAAGGATTAAATTTGGAATTGAACATTTGGATTAATGCTTCTGATTTGATTAAAGCTTTGATGACTGGAATTTAGTGAGATGTCATTGCTGGTTTTGTTGTTTATGAGAGCTCCTAGAATAATAGAGACCCATCTTAATGACAATTATTCTGTAGAATTCTCATATGAATATCTTTAATTAACCTTGATTGTAGCTCCTAGGGAATGCTAAGAGGTGGATACTTTGAATCTTTTATCACGTAAGCTTTTTTGGAACGACATGAAATATTTAACTTTCCGGATGTGAGATAAGGAAAGAATGAGGGAAGAATGAGGATAAGGAAAGAAAAATCCAATGCAACTTTTGAATGTGCGTAAGAAAGTTAAAGATTTAAATAGCCACTTTTTTTGCAATAAGTTTATACAGATCTCGGTGAGATAGAATCTTTCATTTGATTTTGAGTGTAAAATGATGTATGAAAATGGTGAAATCGATTAAGATGTGAGGAGATTTTGTTATGTCTTTAGTGTTATTACTGGTGTTAATACATAGAGTAATCATAATAAATAAATAAAGGCTTGAAACAATGAATGTAGAAAGTGAATTAAAGTTCTAGATTATTTAACGTTCAGTCAAGTTGTCAAGTCATATGTAATGGAGTATAAAAGTGATGGTGCTTATGACCCCCAGCTATAGACATTTCACAATTATGTAATGGATTCCAGACAAACGTGTTATAGAAACTTTAAAATGAATTGTGAAGTTATTGACGATATTATATATATGTATACAGCCTCTAATATTTGGCGTTTCACCTGTTTCAAATTCAGCCCAGTTGTACTAATCTCTATAAGATAAATAACTGAAAATAACATATCTTGTCATTTAGAATTTTTTCTATAAATATGTACATGTTAGTATCTACTATGTGACTTTCATCATTTTCAAATTCAGTGCTTCTTTCATTGATCTTAATTTTCAAATTCAGTGCTTCTTTCATTGATCTTAATGATATCACTGATTAAGATTCAAGAAGCTTCAAGTTGTAGAGTCCTCGAGTATAAAAGTTAGTTGCTGCGTTCACCTTTAAGGTAAAGTTTACTATTATGGGTGTCTGTTTTAAAGCTCTGTTGAAAAATAAATGTAAACTGTAAACTTGATGTTGTAATTCACAAAGGATATGGGGAGGCGTGCACAATTGAGTATAGAATGACCCAGCTATTGGTTAGTTTCCAAATTTTGATGTTTCATATGGACTATATACATCCCCCAACTTGAACTTTGTATTACTCGAACAAACAAATATGTAAGTTGCCCCTTCATGATAATTGCCATTGTCAGGATACTATGGGGTATTTGACTTGTAGTATCAGAAATTTGACTGATTATCCACCATAGTGATGGAAATTGGATTATGAACCAAGTTAATCTTTATTCTAAAATAAAATTATACTGGCTAGCTGCTAACTACGGGTTGAATGAAGAAGGGTCTAAATATGCTTACTCTTTCTCTTGTGACTTTAGTTGA
CTTTGGGCATGTGATGCATTTAGGTTTGCATTTATAATTATGAAAAAACTGAATCAAGTATGGGTTAGCCTAGCTAGTGGTAATTGCGACAAGTAAAAGAAGTAAAAATTTTTGGGAACATGGGTTCATACTTGTGGCCAGCTTCCGAGGATGTTAAAATTCTACGAGTTCCTAATAGCTATTTTAGTGTAGAGGGTTGTCTCCCAAGTCTTTGGCCTAGGAAATTTATCCCTTCTAAGTGGTTGATTCAAACTGTCTTAAATTATATTAGTTCTTAATGCTGCCCTGTTCCAGAGGGATTATGAAAAGTAGAATGAATATTGCAAGTGCTTAATGTATAATACTAAAAACAATCATGACAGCTAATATTTAGGAAATATGAATGAAGATTACTTTCTGTATCCTTCTAGAACATATGGAGGTCTTCATTTTTGTAGTGGCCATGATTGTTGATCTTCTTATTCTTATTGCTACCAAGTTTTTAAGTCTGTTTCTATTTTATTCTAGGTGTTTTGTTATCATTTTACTTTTATTTTTTTTTAATTAAAAAAATGGAACCAGGGATTATTGTCTTCTTATCCTAATCAAGTTTTTATTTCTAAGGTTTTCTCTCTGTTATTATGATGCCAAACAAGGAAAAACCTAGATGAAGAACTTGTCTTCTCGGTTACAATCTTATCTCTATTGTTTTGTTCTTCAAGAGGTTAAGTAGCATTTACCACTAGAAGACAAATCTATTAATTGTTGTTCTACCAAACAATAATGGCACCAGATTGGCTCCTATATTTGCACCAAAGGAATCACTTCCTTTTTGTATATAATTCAAATGGTGCAACGGTTACTAAAATATTCATGCCATTAACCTTATGAATTTCATTGAAGTACTAAATTACAGAATTTGTTGAAACATTTTCCTTAAAATGGCTTTTTATACTTGTACTTCCATTCATCTCTCTTCTCCTCCCTTGGGTTTTACCTTGAATATTTTAACTAATGTACTGTAATTGTTGGTCAATAAATTCTGAAGAATCATATATTTGGCATCAAGATTTTCAGTTTGACTTTGTTTAAACTCCTATTGATAAATAGTTTAATGCACACCATTTGATAACTTTTTGAAAATTGTGTTTGTTTTCTCACAATTTCTCAACAATGATTTTCATCTTCTCTACAATGGTTTCCATCTTTCTTAAAAAAGTAATTGAGTCTTTAGCCAAATCCCAAAAACAAAAACAAGTTTTTTTAGTTTTCAAAACTTGGCTTGGTTTTTTAGAACACTTGTAAAAACTAGATAAAAAAACATAGAAACCCATTGGTGGAAGTAGTATTTATAGACTTAATTTTCAAAAACTAAAAACCAAAGACCAAATGGTTATCAAGGGTTTAGATTTTGTTTTTTTTGTTTACATTTGATACCCATTGAATTAGAAGCAAATATAACTCTGGACTGTTTATGCATACTTCAGTTTATGATCTCTCTCTATCTTGTTGAAGTTGAAGATACAAATAGAACTTTACTTCTATTAATGCAATGCCCTTGCTAGAACTTCAAACATTAAGAAGCCTTGGTTTTAATATTGTGTATTGAAATTTATTATCTACACCACTTCACGCTTATGCATACAAATGCTGGGTAAGGAAATATCCTTCTCAAATGGGACAAATAAATATTCCGGAAGTTCTCGAGCATAAATGCTTTTGCTAATCTTTTGAAATACTTACACTATCATGTACGGTTGTTTCTGTTCATCATACTGGCATCATTGTGATTGTTTCTGTGCTATTGCTTTTACTATTGAATATTTAAATTTTATATGTTACAAAATTTTCTATAGGCAGTATCTATCATTGTTTAGTAATTTAATTTATTGTTTTAACTCTTTGTTTGTTAGCATCTCGTCTTCGAGAAAAACTTAAATGATAGATTCTTTTCTTGGTAATCAATTTCGTCTATTTTGGAATGCTCTTAAAGAGGCTAACTTCCATCTACTTGTACAACT
CAGAGTCTTTAATTTAAGTAATCAAAATCACCCTAGTGGTAATGGCAACAGATGTTCCAACTTTTGCATTTTAGAAATTGCTATTATCACTTTTATTTTCTAACATTGTAACGGTTACTAAATACGACGTGACTATCCACGCAGGTTAGCATTTTATAATTAATTGTGAAGACTCTGTTTAACACAAACCTAAAGACAGCTTAACCGTGTGATTTCCTTACGGTTCGCTTTGCCTCTCCCATCATTTAAGAACTTGACAAATTTTCCAAAAAGGATACTCTACTTGCTTTTATTTATTTTTTACGAAACCATATCTGAGGCTTACAGATTTCCAGTTGAGCATCTATGTTTATTGTTTACTTATACCGATGTGCATCTATGCATTAATTTTACTGTTATTTCTTCTGTTTTTATTTTGAAATTGCAGTGTCTTACATTTTGAATTGGTTCAATTAGTTTTTAGCCAAGTACTAGATCGGACTTTACTTGACGAATCAGTTTACTGCAAAACCACACTCCTTTTATATAAATTGAGGAAATTTGTAGAAACCATTACTCACATCTCTTGTTTCTGGTATGGCTTCAAAACCTCAGAAAACCACTCAATATCTGGAGCATGCTGGTGATGCAAAGGACCGTTGTGGTGGCGACCTAGAGAGCCATGTTCATAGTCCTGAGGAGTTCTTGAGCGTGGATCAACTCAATCAAGACTTTGACAAATCTCTTGTTCTTAGAAGTTCGTCTGCTTATTCTTCCCCAATTAAATTAAAGGATAGTTCTAATCTGACTATGAAGCCATCAGAAGTTCCTGACAAGGTATAATTCTTCTCCAATTTCTAAATATCATAATCTGTGTTGGATTTTTCCCCTAAATGATTTTGGTAGTCTTGTTTTGATCTTGTTGTTTTTTCCCTAACCAGGAAGATTATGATCTCTTTCCTTTCCCCAAATAAATATTTTAGCCCATTAAGGAAAAATCTGCACATAATCTAGGCATATTTAGTTAAATTCTAATATATTTCCTATTTCATGTTTGGTGTTCTGGGAAAAAAATCCAAAAGAAGGAAACTAAAAGAAGATATGCAGCGGAAATGGGAACGGCTCCTAATGAAAATGATGCATATTTGGATCTCAAACTGTCACCTCCAGGGGTCTACTTAAGGGGTCAATCATCAAATGAATCAAAATCGTCATCCCCAAGATCTCAAGACTCATGCGTATCTGCTGAGGTCGAGTCAAATGCGAACTTGGAGAATAACCTTCGAGTTGAAGGCTCACCCTTGATTGTGATGGGATGTACTTTTTGCCTCCTTTATGTGATGGTGACAGATGCAGATCCCAGATGCCCTAAATGCAAAAATTCTGGCTTGCTCGACATTTTCCGTGGAAATCAGGCGAAGAGATCGAGAAAGAACTAGTTTTCTTTCAAGGAGTTTTCAAATGCTCCAGTAGTATGCTCTTGAACTGATGCACCCTCTGCTTTTTTTT

mRNA sequence

GTGAAATCAGAAATTTCTGAATCCCAATTTGGGGCTTTCGACGAAATGGGTGTTTTTAGCGGCAATCTCATCTTCTCCCATTTTCCTTATCAAATGATTTTCAACAGTGGTATCTCTACCTGAAGATATGTACGAGAAAATGAAAATGGAAGTGAAGAAAGGGAAGAAAAGGAAGATTCGGGTTGTCGTTAATTCCAATTTCTGCAAAAGGGTTGGCGATTCTTTTATTTATTACTATGGAGTTTTGGGTCATCGGCATTTATGGGTTTTCAGCGGCAGTCAAGTCGGCGAGGAGAGGAGGGCTATGGTGCAATGTCCGGCGGCACTATACTGGTTAATCGGCGATCGATTGATTGCGTTTTGTAGATGATCAGGAAGAAAACCACTCAATATCTGGAGCATGCTGGTGATGCAAAGGACCGTTGTGGTGGCGACCTAGAGAGCCATGTTCATAGTCCTGAGGAGTTCTTGAGCGTGGATCAACTCAATCAAGACTTTGACAAATCTCTTGTTCTTAGAAGTTCGTCTGCTTATTCTTCCCCAATTAAATTAAAGGATAGTTCTAATCTGACTATGAAGCCATCAGAAGTTCCTGACAAGAAGGAAACTAAAAGAAGATATGCAGCGGAAATGGGAACGGCTCCTAATGAAAATGATGCATATTTGGATCTCAAACTGTCACCTCCAGGGGTCTACTTAAGGGGTCAATCATCAAATGAATCAAAATCGTCATCCCCAAGATCTCAAGACTCATGCGTATCTGCTGAGGTCGAGTCAAATGCGAACTTGGAGAATAACCTTCGAGTTGAAGGCTCACCCTTGATTGTGATGGGATGTACTTTTTGCCTCCTTTATGTGATGGTGACAGATGCAGATCCCAGATGCCCTAAATGCAAAAATTCTGGCTTGCTCGACATTTTCCGTGGAAATCAGGCGAAGAGATCGAGAAAGAACTAGTTTTCTTTCAAGGAGTTTTCAAATGCTCCAGTAGTATGCTCTTGAACTGATGCACCCTCTGCTTTTTTTT

Coding sequence (CDS)

ATGATCAGGAAGAAAACCACTCAATATCTGGAGCATGCTGGTGATGCAAAGGACCGTTGTGGTGGCGACCTAGAGAGCCATGTTCATAGTCCTGAGGAGTTCTTGAGCGTGGATCAACTCAATCAAGACTTTGACAAATCTCTTGTTCTTAGAAGTTCGTCTGCTTATTCTTCCCCAATTAAATTAAAGGATAGTTCTAATCTGACTATGAAGCCATCAGAAGTTCCTGACAAGAAGGAAACTAAAAGAAGATATGCAGCGGAAATGGGAACGGCTCCTAATGAAAATGATGCATATTTGGATCTCAAACTGTCACCTCCAGGGGTCTACTTAAGGGGTCAATCATCAAATGAATCAAAATCGTCATCCCCAAGATCTCAAGACTCATGCGTATCTGCTGAGGTCGAGTCAAATGCGAACTTGGAGAATAACCTTCGAGTTGAAGGCTCACCCTTGATTGTGATGGGATGTACTTTTTGCCTCCTTTATGTGATGGTGACAGATGCAGATCCCAGATGCCCTAAATGCAAAAATTCTGGCTTGCTCGACATTTTCCGTGGAAATCAGGCGAAGAGATCGAGAAAGAACTAG

Protein sequence

MIRKKTTQYLEHAGDAKDRCGGDLESHVHSPEEFLSVDQLNQDFDKSLVLRSSSAYSSPIKLKDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNESKSSSPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNQAKRSRKN
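
The protein sequence can be checked against the coding sequence by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and uses only the first and last few codons of the CDS shown on this page; `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1) in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons and last four codons of the CDS above.
print(translate("ATGATCAGGAAGAAAACC"))
print(translate("AGAAAGAACTAG"))
```

The two fragments translate to the start (`MIRKKT`) and end (`RKN`) of the protein sequence listed above; passing the full CDS string to `translate` would allow a complete comparison.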
Homology
BLAST of Tan0017441 vs. ExPASy Swiss-Prot
Match: Q9FNI1 (Protein GL2-INTERACTING REPRESSOR 1 OS=Arabidopsis thaliana OX=3702 GN=GIR1 PE=1 SV=1)

HSP 1 Score: 63.2 bits (152), Expect = 3.9e-09
Identity = 39/96 (40.62%), Positives = 59/96 (61.46%), Query Frame = 0

Query: 100 LDLKLSPPGVYLRG--QSSNESKSSSPRS-QDSCVSAEVESNANLENNLRVEGSP----L 159
           L L LSPP    R   +S + S ++SP S   SCVS+E+  +   E ++R   SP    +
Sbjct: 10  LKLNLSPPTSSQRRMVRSPSRSATTSPTSPPSSCVSSEMNQD---EPSVRYSTSPETTSM 69

Query: 160 IVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGN 189
           +++GC  CL+YVM+++ DP+CPKCK++ LLD    N
Sbjct: 70  VLVGCPRCLMYVMLSEDDPKCPKCKSTVLLDFLHEN 102

BLAST of Tan0017441 vs. ExPASy Swiss-Prot
Match: Q9SRN4 (Protein GL2-INTERACTING REPRESSOR 2 OS=Arabidopsis thaliana OX=3702 GN=GIR2 PE=1 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 8.6e-09
Identity = 40/98 (40.82%), Positives = 59/98 (60.20%), Query Frame = 0

Query: 94  NENDAYLDLK--LSPPGVYLRGQSSNESKSSSP-RSQDSCVSAEVESNANLENNLRVEGS 153
           N+N   L+L+  LSPP      Q+S  S   SP RS  +  S+ V S  N E N  +  +
Sbjct: 5   NKNGPKLELRLNLSPP----PSQASQMSLVRSPNRSNTTSPSSCVSSETNQEENETI--T 64

Query: 154 PLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGN 189
            ++++GC  CL+YVM++D DP+CPKCK++ LLD  + N
Sbjct: 65  SMVLVGCPRCLMYVMLSDDDPKCPKCKSTVLLDFLQEN 96

BLAST of Tan0017441 vs. NCBI nr
Match: XP_038885898.1 (uncharacterized protein LOC120076204 [Benincasa hispida])

HSP 1 Score: 304.7 bits (779), Expect = 5.8e-79
Identity = 162/194 (83.51%), Positives = 172/194 (88.66%), Query Frame = 0

Query: 5   KTTQYLEHAGDAKDRCGGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSPIKLK 64
           +TTQYLEHAGDAKD  G DLES   SPE+F LSVDQLNQDF+KSLVL++SS+    I+LK
Sbjct: 84  ETTQYLEHAGDAKDCGGDDLESCARSPEDFSLSVDQLNQDFNKSLVLKNSSS----IELK 143

Query: 65  DSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNESKSSS 124
           DSSNLTMK SEVPDKKETKRRYA EMGTA NENDAYLDLKLSPPGVY RG+ SNESKSSS
Sbjct: 144 DSSNLTMKQSEVPDKKETKRRYAVEMGTAANENDAYLDLKLSPPGVYSRGKLSNESKSSS 203

Query: 125 PRSQDSCVSAEVESNANLE-NNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLL 184
           PRSQDSC+SAEVESN N E NNLRVE SPLIVMGCTFCLLYVMVTDADPRCPKCKN GLL
Sbjct: 204 PRSQDSCISAEVESNVNSENNNLRVESSPLIVMGCTFCLLYVMVTDADPRCPKCKNPGLL 263

Query: 185 DIFRGNQAKRSRKN 197
           D+FRGNQ KRSRKN
Sbjct: 264 DVFRGNQVKRSRKN 273

BLAST of Tan0017441 vs. NCBI nr
Match: XP_022939372.1 (uncharacterized protein LOC111445310 isoform X1 [Cucurbita moschata] >XP_022939373.1 uncharacterized protein LOC111445310 isoform X1 [Cucurbita moschata])

HSP 1 Score: 298.5 bits (763), Expect = 4.2e-77
Identity = 161/197 (81.73%), Positives = 175/197 (88.83%), Query Frame = 0

Query: 2   IRKKTTQYLEHAGDAKDRC-GGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSP 61
           + ++TTQYLE +GDAKD   GGDLES + SPEEF  SVDQLNQ+F+KSLVL+SSS+ SSP
Sbjct: 50  VEQETTQYLESSGDAKDCAGGGDLES-LCSPEEFSSSVDQLNQNFNKSLVLKSSSSRSSP 109

Query: 62  IKLKDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNES 121
           I++KDSS L+MK SEV DKKETKRRYAA MGT  NEN AYLDLKLSPPGVYLRG+SSNES
Sbjct: 110 IEVKDSSKLSMKQSEVTDKKETKRRYAAGMGTTGNENYAYLDLKLSPPGVYLRGKSSNES 169

Query: 122 KSSSPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNS 181
           KSSSP+SQDSCVSAEVESN NLENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNS
Sbjct: 170 KSSSPKSQDSCVSAEVESNVNLENNLEVEGSPLIVMGCTFCLLYVMVTDDDPRCPICKNS 229

Query: 182 GLLDIFRGNQAKRSRKN 197
           GLLDIFRGNQ KRSRKN
Sbjct: 230 GLLDIFRGNQPKRSRKN 245

BLAST of Tan0017441 vs. NCBI nr
Match: XP_022939374.1 (uncharacterized protein LOC111445310 isoform X2 [Cucurbita moschata])

HSP 1 Score: 298.5 bits (763), Expect = 4.2e-77
Identity = 161/197 (81.73%), Positives = 175/197 (88.83%), Query Frame = 0

Query: 2   IRKKTTQYLEHAGDAKDRC-GGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSP 61
           + ++TTQYLE +GDAKD   GGDLES + SPEEF  SVDQLNQ+F+KSLVL+SSS+ SSP
Sbjct: 43  VEQETTQYLESSGDAKDCAGGGDLES-LCSPEEFSSSVDQLNQNFNKSLVLKSSSSRSSP 102

Query: 62  IKLKDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNES 121
           I++KDSS L+MK SEV DKKETKRRYAA MGT  NEN AYLDLKLSPPGVYLRG+SSNES
Sbjct: 103 IEVKDSSKLSMKQSEVTDKKETKRRYAAGMGTTGNENYAYLDLKLSPPGVYLRGKSSNES 162

Query: 122 KSSSPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNS 181
           KSSSP+SQDSCVSAEVESN NLENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNS
Sbjct: 163 KSSSPKSQDSCVSAEVESNVNLENNLEVEGSPLIVMGCTFCLLYVMVTDDDPRCPICKNS 222

Query: 182 GLLDIFRGNQAKRSRKN 197
           GLLDIFRGNQ KRSRKN
Sbjct: 223 GLLDIFRGNQPKRSRKN 238

BLAST of Tan0017441 vs. NCBI nr
Match: XP_022939375.1 (uncharacterized protein LOC111445310 isoform X3 [Cucurbita moschata])

HSP 1 Score: 296.6 bits (758), Expect = 1.6e-76
Identity = 161/194 (82.99%), Positives = 173/194 (89.18%), Query Frame = 0

Query: 5   KTTQYLEHAGDAKDRC-GGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSPIKL 64
           +TTQYLE +GDAKD   GGDLES + SPEEF  SVDQLNQ+F+KSLVL+SSS+ SSPI++
Sbjct: 43  ETTQYLESSGDAKDCAGGGDLES-LCSPEEFSSSVDQLNQNFNKSLVLKSSSSRSSPIEV 102

Query: 65  KDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNESKSS 124
           KDSS L+MK SEV DKKETKRRYAA MGT  NEN AYLDLKLSPPGVYLRG+SSNESKSS
Sbjct: 103 KDSSKLSMKQSEVTDKKETKRRYAAGMGTTGNENYAYLDLKLSPPGVYLRGKSSNESKSS 162

Query: 125 SPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLL 184
           SP+SQDSCVSAEVESN NLENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLL
Sbjct: 163 SPKSQDSCVSAEVESNVNLENNLEVEGSPLIVMGCTFCLLYVMVTDDDPRCPICKNSGLL 222

Query: 185 DIFRGNQAKRSRKN 197
           DIFRGNQ KRSRKN
Sbjct: 223 DIFRGNQPKRSRKN 235

BLAST of Tan0017441 vs. NCBI nr
Match: XP_022939376.1 (uncharacterized protein LOC111445310 isoform X4 [Cucurbita moschata])

HSP 1 Score: 296.6 bits (758), Expect = 1.6e-76
Identity = 161/194 (82.99%), Positives = 173/194 (89.18%), Query Frame = 0

Query: 5   KTTQYLEHAGDAKDRC-GGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSPIKL 64
           +TTQYLE +GDAKD   GGDLES + SPEEF  SVDQLNQ+F+KSLVL+SSS+ SSPI++
Sbjct: 36  ETTQYLESSGDAKDCAGGGDLES-LCSPEEFSSSVDQLNQNFNKSLVLKSSSSRSSPIEV 95

Query: 65  KDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNESKSS 124
           KDSS L+MK SEV DKKETKRRYAA MGT  NEN AYLDLKLSPPGVYLRG+SSNESKSS
Sbjct: 96  KDSSKLSMKQSEVTDKKETKRRYAAGMGTTGNENYAYLDLKLSPPGVYLRGKSSNESKSS 155

Query: 125 SPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLL 184
           SP+SQDSCVSAEVESN NLENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLL
Sbjct: 156 SPKSQDSCVSAEVESNVNLENNLEVEGSPLIVMGCTFCLLYVMVTDDDPRCPICKNSGLL 215

Query: 185 DIFRGNQAKRSRKN 197
           DIFRGNQ KRSRKN
Sbjct: 216 DIFRGNQPKRSRKN 228

BLAST of Tan0017441 vs. ExPASy TrEMBL
Match: A0A6J1FLG8 (uncharacterized protein LOC111445310 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445310 PE=4 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 2.0e-77
Identity = 161/197 (81.73%), Positives = 175/197 (88.83%), Query Frame = 0

Query: 2   IRKKTTQYLEHAGDAKDRC-GGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSP 61
           + ++TTQYLE +GDAKD   GGDLES + SPEEF  SVDQLNQ+F+KSLVL+SSS+ SSP
Sbjct: 50  VEQETTQYLESSGDAKDCAGGGDLES-LCSPEEFSSSVDQLNQNFNKSLVLKSSSSRSSP 109

Query: 62  IKLKDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNES 121
           I++KDSS L+MK SEV DKKETKRRYAA MGT  NEN AYLDLKLSPPGVYLRG+SSNES
Sbjct: 110 IEVKDSSKLSMKQSEVTDKKETKRRYAAGMGTTGNENYAYLDLKLSPPGVYLRGKSSNES 169

Query: 122 KSSSPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNS 181
           KSSSP+SQDSCVSAEVESN NLENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNS
Sbjct: 170 KSSSPKSQDSCVSAEVESNVNLENNLEVEGSPLIVMGCTFCLLYVMVTDDDPRCPICKNS 229

Query: 182 GLLDIFRGNQAKRSRKN 197
           GLLDIFRGNQ KRSRKN
Sbjct: 230 GLLDIFRGNQPKRSRKN 245

BLAST of Tan0017441 vs. ExPASy TrEMBL
Match: A0A6J1FGZ2 (uncharacterized protein LOC111445310 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445310 PE=4 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 2.0e-77
Identity = 161/197 (81.73%), Positives = 175/197 (88.83%), Query Frame = 0

Query: 2   IRKKTTQYLEHAGDAKDRC-GGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSP 61
           + ++TTQYLE +GDAKD   GGDLES + SPEEF  SVDQLNQ+F+KSLVL+SSS+ SSP
Sbjct: 43  VEQETTQYLESSGDAKDCAGGGDLES-LCSPEEFSSSVDQLNQNFNKSLVLKSSSSRSSP 102

Query: 62  IKLKDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNES 121
           I++KDSS L+MK SEV DKKETKRRYAA MGT  NEN AYLDLKLSPPGVYLRG+SSNES
Sbjct: 103 IEVKDSSKLSMKQSEVTDKKETKRRYAAGMGTTGNENYAYLDLKLSPPGVYLRGKSSNES 162

Query: 122 KSSSPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNS 181
           KSSSP+SQDSCVSAEVESN NLENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNS
Sbjct: 163 KSSSPKSQDSCVSAEVESNVNLENNLEVEGSPLIVMGCTFCLLYVMVTDDDPRCPICKNS 222

Query: 182 GLLDIFRGNQAKRSRKN 197
           GLLDIFRGNQ KRSRKN
Sbjct: 223 GLLDIFRGNQPKRSRKN 238

BLAST of Tan0017441 vs. ExPASy TrEMBL
Match: A0A6J1FMI8 (uncharacterized protein LOC111445310 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111445310 PE=4 SV=1)

HSP 1 Score: 296.6 bits (758), Expect = 7.7e-77
Identity = 161/194 (82.99%), Positives = 173/194 (89.18%), Query Frame = 0

Query: 5   KTTQYLEHAGDAKDRC-GGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSPIKL 64
           +TTQYLE +GDAKD   GGDLES + SPEEF  SVDQLNQ+F+KSLVL+SSS+ SSPI++
Sbjct: 43  ETTQYLESSGDAKDCAGGGDLES-LCSPEEFSSSVDQLNQNFNKSLVLKSSSSRSSPIEV 102

Query: 65  KDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNESKSS 124
           KDSS L+MK SEV DKKETKRRYAA MGT  NEN AYLDLKLSPPGVYLRG+SSNESKSS
Sbjct: 103 KDSSKLSMKQSEVTDKKETKRRYAAGMGTTGNENYAYLDLKLSPPGVYLRGKSSNESKSS 162

Query: 125 SPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLL 184
           SP+SQDSCVSAEVESN NLENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLL
Sbjct: 163 SPKSQDSCVSAEVESNVNLENNLEVEGSPLIVMGCTFCLLYVMVTDDDPRCPICKNSGLL 222

Query: 185 DIFRGNQAKRSRKN 197
           DIFRGNQ KRSRKN
Sbjct: 223 DIFRGNQPKRSRKN 235

BLAST of Tan0017441 vs. ExPASy TrEMBL
Match: A0A6J1FFQ0 (uncharacterized protein LOC111445310 isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111445310 PE=4 SV=1)

HSP 1 Score: 296.6 bits (758), Expect = 7.7e-77
Identity = 161/194 (82.99%), Positives = 173/194 (89.18%), Query Frame = 0

Query: 5   KTTQYLEHAGDAKDRC-GGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSPIKL 64
           +TTQYLE +GDAKD   GGDLES + SPEEF  SVDQLNQ+F+KSLVL+SSS+ SSPI++
Sbjct: 36  ETTQYLESSGDAKDCAGGGDLES-LCSPEEFSSSVDQLNQNFNKSLVLKSSSSRSSPIEV 95

Query: 65  KDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNESKSS 124
           KDSS L+MK SEV DKKETKRRYAA MGT  NEN AYLDLKLSPPGVYLRG+SSNESKSS
Sbjct: 96  KDSSKLSMKQSEVTDKKETKRRYAAGMGTTGNENYAYLDLKLSPPGVYLRGKSSNESKSS 155

Query: 125 SPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLL 184
           SP+SQDSCVSAEVESN NLENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLL
Sbjct: 156 SPKSQDSCVSAEVESNVNLENNLEVEGSPLIVMGCTFCLLYVMVTDDDPRCPICKNSGLL 215

Query: 185 DIFRGNQAKRSRKN 197
           DIFRGNQ KRSRKN
Sbjct: 216 DIFRGNQPKRSRKN 228

BLAST of Tan0017441 vs. ExPASy TrEMBL
Match: A0A6J1FGM1 (uncharacterized protein LOC111445310 isoform X5 OS=Cucurbita moschata OX=3662 GN=LOC111445310 PE=4 SV=1)

HSP 1 Score: 296.6 bits (758), Expect = 7.7e-77
Identity = 161/194 (82.99%), Positives = 173/194 (89.18%), Query Frame = 0

Query: 5   KTTQYLEHAGDAKDRC-GGDLESHVHSPEEF-LSVDQLNQDFDKSLVLRSSSAYSSPIKL 64
           +TTQYLE +GDAKD   GGDLES + SPEEF  SVDQLNQ+F+KSLVL+SSS+ SSPI++
Sbjct: 9   ETTQYLESSGDAKDCAGGGDLES-LCSPEEFSSSVDQLNQNFNKSLVLKSSSSRSSPIEV 68

Query: 65  KDSSNLTMKPSEVPDKKETKRRYAAEMGTAPNENDAYLDLKLSPPGVYLRGQSSNESKSS 124
           KDSS L+MK SEV DKKETKRRYAA MGT  NEN AYLDLKLSPPGVYLRG+SSNESKSS
Sbjct: 69  KDSSKLSMKQSEVTDKKETKRRYAAGMGTTGNENYAYLDLKLSPPGVYLRGKSSNESKSS 128

Query: 125 SPRSQDSCVSAEVESNANLENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLL 184
           SP+SQDSCVSAEVESN NLENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLL
Sbjct: 129 SPKSQDSCVSAEVESNVNLENNLEVEGSPLIVMGCTFCLLYVMVTDDDPRCPICKNSGLL 188

Query: 185 DIFRGNQAKRSRKN 197
           DIFRGNQ KRSRKN
Sbjct: 189 DIFRGNQPKRSRKN 201

BLAST of Tan0017441 vs. TAIR 10
Match: AT5G06270.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11600.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 63.2 bits (152), Expect = 2.8e-10
Identity = 39/96 (40.62%), Positives = 59/96 (61.46%), Query Frame = 0

Query: 100 LDLKLSPPGVYLRG--QSSNESKSSSPRS-QDSCVSAEVESNANLENNLRVEGSP----L 159
           L L LSPP    R   +S + S ++SP S   SCVS+E+  +   E ++R   SP    +
Sbjct: 10  LKLNLSPPTSSQRRMVRSPSRSATTSPTSPPSSCVSSEMNQD---EPSVRYSTSPETTSM 69

Query: 160 IVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGN 189
           +++GC  CL+YVM+++ DP+CPKCK++ LLD    N
Sbjct: 70  VLVGCPRCLMYVMLSEDDPKCPKCKSTVLLDFLHEN 102

BLAST of Tan0017441 vs. TAIR 10
Match: AT3G11600.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response to karrikin; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G06270.1); Has 171 Blast hits to 171 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 62.0 bits (149), Expect = 6.1e-10
Identity = 40/98 (40.82%), Positives = 59/98 (60.20%), Query Frame = 0

Query: 94  NENDAYLDLK--LSPPGVYLRGQSSNESKSSSP-RSQDSCVSAEVESNANLENNLRVEGS 153
           N+N   L+L+  LSPP      Q+S  S   SP RS  +  S+ V S  N E N  +  +
Sbjct: 5   NKNGPKLELRLNLSPP----PSQASQMSLVRSPNRSNTTSPSSCVSSETNQEENETI--T 64

Query: 154 PLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGN 189
            ++++GC  CL+YVM++D DP+CPKCK++ LLD  + N
Sbjct: 65  SMVLVGCPRCLMYVMLSDDDPKCPKCKSTVLLDFLQEN 96

BLAST of Tan0017441 vs. TAIR 10
Match: AT3G52561.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11600.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 45.4 bits (106), Expect = 5.9e-05
Identity = 27/77 (35.06%), Positives = 41/77 (53.25%), Query Frame = 0

Query: 109 VYLRGQSSNESKSS-SPRSQDSCVSAEVESNANLENNL--RVEGSPLIVMGCTFCLLYVM 168
           V   G+    S SS +  SQ+SC++   E    + ++     E   ++VMGC  C++YVM
Sbjct: 21  VVANGRFEGRSPSSDTSSSQNSCLTRTEEVKEEVASSWVDEEEAPEMVVMGCRSCMMYVM 80

Query: 169 VTDADPRCPKCKNSGLL 183
           V     RCPKCK + L+
Sbjct: 81  VLQERQRCPKCKCTDLI 97

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
Q9FNI1          3.9e-09  40.63         Protein GL2-INTERACTING REPRESSOR 1 OS=Arabidopsis thaliana OX=3702 GN=GIR1 PE=1...
Q9SRN4          8.6e-09  40.82         Protein GL2-INTERACTING REPRESSOR 2 OS=Arabidopsis thaliana OX=3702 GN=GIR2 PE=1...

Match Name      E-value  Identity (%)  Description
XP_038885898.1  5.8e-79  83.51         uncharacterized protein LOC120076204 [Benincasa hispida]
XP_022939372.1  4.2e-77  81.73         uncharacterized protein LOC111445310 isoform X1 [Cucurbita moschata] >XP_0229393...
XP_022939374.1  4.2e-77  81.73         uncharacterized protein LOC111445310 isoform X2 [Cucurbita moschata]
XP_022939375.1  1.6e-76  82.99         uncharacterized protein LOC111445310 isoform X3 [Cucurbita moschata]
XP_022939376.1  1.6e-76  82.99         uncharacterized protein LOC111445310 isoform X4 [Cucurbita moschata]

Match Name      E-value  Identity (%)  Description
A0A6J1FLG8      2.0e-77  81.73         uncharacterized protein LOC111445310 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1FGZ2      2.0e-77  81.73         uncharacterized protein LOC111445310 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1FMI8      7.7e-77  82.99         uncharacterized protein LOC111445310 isoform X3 OS=Cucurbita moschata OX=3662 GN...
A0A6J1FFQ0      7.7e-77  82.99         uncharacterized protein LOC111445310 isoform X4 OS=Cucurbita moschata OX=3662 GN...
A0A6J1FGM1      7.7e-77  82.99         uncharacterized protein LOC111445310 isoform X5 OS=Cucurbita moschata OX=3662 GN...

Match Name      E-value  Identity (%)  Description
AT5G06270.1     2.8e-10  40.63         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G11600.1     6.1e-10  40.82         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: response...
AT3G52561.1     5.9e-05  35.06         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term     Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 1..30
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 13..30
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 112..136
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 60..74
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 60..87
None      No IPR available  PANTHER      PTHR33177       PUTATIVE-RELATED     coord: 91..196
None      No IPR available  PANTHER      PTHR33177:SF51  GB|AAF02129.1        coord: 91..196
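
Several of the mobidb-lite disorder predictions overlap (for example 1..30 and 13..30). A rough way to summarize them is to merge overlapping or adjacent ranges, as in the sketch below. `merge_intervals` is a hypothetical helper, the coordinates are assumed to be 1-based inclusive residue positions, and the protein length of 197 is taken from the BLAST query coordinates above.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous range when this one overlaps or touches it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates from the InterPro table above.
disorder = [(1, 30), (13, 30), (112, 136), (60, 74), (60, 87)]
merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

With these inputs the predictions collapse to three regions (1..30, 60..87 and 112..136) covering 83 of the 197 residues.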

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0017441.1  Tan0017441.1  mRNA
Tan0017441.2  Tan0017441.2  mRNA