Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ACGTGGTATTTCATAATTGATCCTTTAAGCTCTGTTTCCCCTGAAACACAACCCTGGAGCCTTGGAAAAATTATTGGATGTGAACGTCAGAATGAGGCAACCCCCTCCCAATTTGCAGAATCCCCATTGGCACCCATTTCTTCTTCTTCTTCTTCTTCTTCCTCTTTCTCTGTCGATTTTTCAATGGCTAATGTTAAGCAGAGGGAGGAAGGAAGGTCACGGAAGCCACGCCATCGGAAAAACTTTGAGATGCAGGATTTGCCCACCTTTGCCAAATGGCTAACGTCCTTCCACGACGATGGCATTTCCAAATCTAAGCGACCTAAGAGGACGCCTACCGTCGCTCTAGATTCGTCGGAGAAGGAGGGGGTAGCTATGAGCGATTCTAGATCTATGCGAGGGGGTTGTTGCTGCTGCTGGCAACGTTCCAAATCTACGCCGCGGGAGATTTCACCGAAATCCCAAATCTCTCTAAGTAAGAGAACGGTGGTTCAAGTTGAAGGGGTCGCTGCCGACGTTTCGGTTCACCGGGAGGAGAAGGTGGTGGCTACCGGCAGTGGGTGCTGGTGTTGCAGATGTTTGCCAACATTCCATATCTGTGGGAGGAGAAAAATGGCGGTGGTCGCCGACGTTCCGAATTTGCACAAAGAAGAGGGAGTGGCTACCGACGTTCCGGATCTGGCTACTAGTGGTCCAGATTTGGTGGAGGAAGGAAAGGCCACCGTCGATCCAGGTCTGTGCAAGGAAGGGAGTGGTTGTTGTGGATGCTGGAAATGTTTGCCGGCGCTTCAGATCTGCAGGAGGAGGAGGAGGATGAAAGTAGTTGCCGGCAATGTTCCAAATGGATCAGAGGAGGGGGTGGCTTCCGACGTTGTAAATCCGGGCAGGGAAGAGAAAGTGGTTGTCGGCGTTTCAGATCTGCAGGAGGAGAAGGAGGTGGCTGCTGTCGCCGTTCTCAAATTGCGTGGGGACGTAGATTCCAAATCGGCTGGTGAGGGAAGCTGTTGGCCGTTCCAGTTCTGCGTGAGAGGGTGGTTGCCCAAGTTCTTTCTGTGCGGGGGAAGGACGGCCGTCGATGCCTCAAATCATCGGGAGGAAGAAAAAGAAGAAGAAACGGCATCTGTCTGCGTTCCAGATCTCCAAAACGGCGTAGTTTCCGGCGACAATATAGATCATCGCAAGGAGGAGCGGGTGTCCGCCGGCGACATTCCGGTTCTTAGTAAAGAAGAGAAGGTGTCTGCCGGCGATTCTTCAGATCTTCACAAGGAAGAAGCGGTGGCTTCCGGCGTTCAAGATCTTAGCAAGGAGAAGGAGAAGGAGGAGGAGGGTGGCTGCAGGTGTTTCAAACTTGCAGGAAAAGGAAGTAGTCGCCGCGACCGCCGCCGGAGTTCCAGATCTAAGGAAGAAGGATGTTGGCCATTCAATATTTGTAGAGGGAGAATTAGCATTCCAAATTCGCTTGAAGAAGAGGGGTTGGTTGTCAATGGCGTCTCAAATGTTCATAACGAGGTGGAGGTGGAGGTGGAGGTGGCGGCTGTTGCCGGCGTTACGGGCGTGGTGGCTGCCACCGATTGTTCCAAACGACGACGGAGATGGCTGCCGACGTTCCAAATATTTAGGAGGAAGACGGTGGCTACAGATAAGGAGGACGGCGGCAGTGGGCGTTCAAACTCTAAGCGTAGAAGAGGGTGCCTACGACGACGGGGAAGAAACAAAGATAGAGAAAGAGATAGAAAATGAAATTCAAAAATTTTATTTTTATTTTTTTTTAAAAAATGATTCGTTAGCAAAATTAATAAAAACACGCAAACATATTCTTTTGTCACCTCAACTTGTATAGAATTAATATTTTTTTTTTAAAAAAAGATACTGTAAGGAAAGGAAAAAGTTTCTTGACCAAAATAGACAAATGCTGGGATGTACATGTTTTAATCAAATTAAAATTCCTGTTTGAACT
mRNA sequence
ACGTGGTATTTCATAATTGATCCTTTAAGCTCTGTTTCCCCTGAAACACAACCCTGGAGCCTTGGAAAAATTATTGGATGTGAACGTCAGAATGAGGCAACCCCCTCCCAATTTGCAGAATCCCCATTGGCACCCATTTCTTCTTCTTCTTCTTCTTCTTCCTCTTTCTCTGTCGATTTTTCAATGGCTAATGTTAAGCAGAGGGAGGAAGGAAGGTCACGGAAGCCACGCCATCGGAAAAACTTTGAGATGCAGGATTTGCCCACCTTTGCCAAATGGCTAACGTCCTTCCACGACGATGGCATTTCCAAATCTAAGCGACCTAAGAGGACGCCTACCGTCGCTCTAGATTCGTCGGAGAAGGAGGGGGTAGCTATGAGCGATTCTAGATCTATGCGAGGGGGTTGTTGCTGCTGCTGGCAACGTTCCAAATCTACGCCGCGGGAGATTTCACCGAAATCCCAAATCTCTCTAAGTAAGAGAACGGTGGTTCAAGTTGAAGGGGTCGCTGCCGACGTTTCGGTTCACCGGGAGGAGAAGGTGGTGGCTACCGGCAGTGGGTGCTGGTGTTGCAGATGTTTGCCAACATTCCATATCTGTGGGAGGAGAAAAATGGCGGTGGTCGCCGACGTTCCGAATTTGCACAAAGAAGAGGGAGTGGCTACCGACGTTCCGGATCTGGCTACTAGTGGTCCAGATTTGGTGGAGGAAGGAAAGGCCACCGTCGATCCAGGTCTGTGCAAGGAAGGGAGTGGTTGTTGTGGATGCTGGAAATGTTTGCCGGCGCTTCAGATCTGCAGGAGGAGGAGGAGGATGAAAGTAGTTGCCGGCAATGTTCCAAATGGATCAGAGGAGGGGGTGGCTTCCGACGTTGTAAATCCGGGCAGGGAAGAGAAAGTGGTTGTCGGCGTTTCAGATCTGCAGGAGGAGAAGGAGGTGGCTGCTGTCGCCGTTCTCAAATTGCGTGGGGACGTAGATTCCAAATCGGCTGGTGAGGGAAGCTGTTGGCCGTTCCAGTTCTGCGTGAGAGGGTGGTTGCCCAAGTTCTTTCTGTGCGGGGGAAGGACGGCCGTCGATGCCTCAAATCATCGGGAGGAAGAAAAAGAAGAAGAAACGGCATCTGTCTGCGTTCCAGATCTCCAAAACGGCGTAGTTTCCGGCGACAATATAGATCATCGCAAGGAGGAGCGGGTGTCCGCCGGCGACATTCCGGTTCTTAGTAAAGAAGAGAAGGTGTCTGCCGGCGATTCTTCAGATCTTCACAAGGAAGAAGCGGTGGCTTCCGGCGTTCAAGATCTTAGCAAGGAGAAGGAGAAGGAGGAGGAGGGTGGCTGCAGGTGTTTCAAACTTGCAGGAAAAGGAAGTAGTCGCCGCGACCGCCGCCGGAGTTCCAGATCTAAGGAAGAAGGATGTTGGCCATTCAATATTTGTAGAGGGAGAATTAGCATTCCAAATTCGCTTGAAGAAGAGGGGTTGGTTGTCAATGGCGTCTCAAATGTTCATAACGAGGTGGAGGTGGAGGTGGAGGTGGCGGCTGTTGCCGGCGTTACGGGCGTGGTGGCTGCCACCGATTGTTCCAAACGACGACGGAGATGGCTGCCGACGTTCCAAATATTTAGGAGGAAGACGGTGGCTACAGATAAGGAGGACGGCGGCAGTGGGCGTTCAAACTCTAAGCGTAGAAGAGGGTGCCTACGACGACGGGGAAGAAACAAAGATAGAGAAAGAGATAGAAAATGAAATTCAAAAATTTTATTTTTATTTTTTTTTAAAAAATGATTCGTTAGCAAAATTAATAAAAACACGCAAACATATTCTTTTGTCACCTCAACTTGTATAGAATTAATATTTTTTTTTTAAAAAAAGATACTGTAAGGAAAGGAAAAAGTTTCTTGACCAAAATAGACAAATGCTGGGATGTACATGTTTTAATCAAATTAAAATTCCTGTTTGAACT
Coding sequence (CDS)
ATGGCTAATGTTAAGCAGAGGGAGGAAGGAAGGTCACGGAAGCCACGCCATCGGAAAAACTTTGAGATGCAGGATTTGCCCACCTTTGCCAAATGGCTAACGTCCTTCCACGACGATGGCATTTCCAAATCTAAGCGACCTAAGAGGACGCCTACCGTCGCTCTAGATTCGTCGGAGAAGGAGGGGGTAGCTATGAGCGATTCTAGATCTATGCGAGGGGGTTGTTGCTGCTGCTGGCAACGTTCCAAATCTACGCCGCGGGAGATTTCACCGAAATCCCAAATCTCTCTAAGTAAGAGAACGGTGGTTCAAGTTGAAGGGGTCGCTGCCGACGTTTCGGTTCACCGGGAGGAGAAGGTGGTGGCTACCGGCAGTGGGTGCTGGTGTTGCAGATGTTTGCCAACATTCCATATCTGTGGGAGGAGAAAAATGGCGGTGGTCGCCGACGTTCCGAATTTGCACAAAGAAGAGGGAGTGGCTACCGACGTTCCGGATCTGGCTACTAGTGGTCCAGATTTGGTGGAGGAAGGAAAGGCCACCGTCGATCCAGGTCTGTGCAAGGAAGGGAGTGGTTGTTGTGGATGCTGGAAATGTTTGCCGGCGCTTCAGATCTGCAGGAGGAGGAGGAGGATGAAAGTAGTTGCCGGCAATGTTCCAAATGGATCAGAGGAGGGGGTGGCTTCCGACGTTGTAAATCCGGGCAGGGAAGAGAAAGTGGTTGTCGGCGTTTCAGATCTGCAGGAGGAGAAGGAGGTGGCTGCTGTCGCCGTTCTCAAATTGCGTGGGGACGTAGATTCCAAATCGGCTGGTGAGGGAAGCTGTTGGCCGTTCCAGTTCTGCGTGAGAGGGTGGTTGCCCAAGTTCTTTCTGTGCGGGGGAAGGACGGCCGTCGATGCCTCAAATCATCGGGAGGAAGAAAAAGAAGAAGAAACGGCATCTGTCTGCGTTCCAGATCTCCAAAACGGCGTAGTTTCCGGCGACAATATAGATCATCGCAAGGAGGAGCGGGTGTCCGCCGGCGACATTCCGGTTCTTAGTAAAGAAGAGAAGGTGTCTGCCGGCGATTCTTCAGATCTTCACAAGGAAGAAGCGGTGGCTTCCGGCGTTCAAGATCTTAGCAAGGAGAAGGAGAAGGAGGAGGAGGGTGGCTGCAGGTGTTTCAAACTTGCAGGAAAAGGAAGTAGTCGCCGCGACCGCCGCCGGAGTTCCAGATCTAAGGAAGAAGGATGTTGGCCATTCAATATTTGTAGAGGGAGAATTAGCATTCCAAATTCGCTTGAAGAAGAGGGGTTGGTTGTCAATGGCGTCTCAAATGTTCATAACGAGGTGGAGGTGGAGGTGGAGGTGGCGGCTGTTGCCGGCGTTACGGGCGTGGTGGCTGCCACCGATTGTTCCAAACGACGACGGAGATGGCTGCCGACGTTCCAAATATTTAGGAGGAAGACGGTGGCTACAGATAAGGAGGACGGCGGCAGTGGGCGTTCAAACTCTAAGCGTAGAAGAGGGTGCCTACGACGACGGGGAAGAAACAAAGATAGAGAAAGAGATAGAAAATGA
Protein sequence
MANVKQREEGRSRKPRHRKNFEMQDLPTFAKWLTSFHDDGISKSKRPKRTPTVALDSSEKEGVAMSDSRSMRGGCCCCWQRSKSTPREISPKSQISLSKRTVVQVEGVAADVSVHREEKVVATGSGCWCCRCLPTFHICGRRKMAVVADVPNLHKEEGVATDVPDLATSGPDLVEEGKATVDPGLCKEGSGCCGCWKCLPALQICRRRRRMKVVAGNVPNGSEEGVASDVVNPGREEKVVVGVSDLQEEKEVAAVAVLKLRGDVDSKSAGEGSCWPFQFCVRGWLPKFFLCGGRTAVDASNHREEEKEEETASVCVPDLQNGVVSGDNIDHRKEERVSAGDIPVLSKEEKVSAGDSSDLHKEEAVASGVQDLSKEKEKEEEGGCRCFKLAGKGSSRRDRRRSSRSKEEGCWPFNICRGRISIPNSLEEEGLVVNGVSNVHNEVEVEVEVAAVAGVTGVVAATDCSKRRRRWLPTFQIFRRKTVATDKEDGGSGRSNSKRRRGCLRRRGRNKDRERDRK
Homology
BLAST of Tan0005703 vs. NCBI nr
Match:
XP_038880648.1 (uncharacterized protein LOC120072275 [Benincasa hispida])
HSP 1 Score: 328.6 bits (841), Expect = 1.0e-85
Identity = 279/668 (41.77%), Postives = 326/668 (48.80%), Query Frame = 0
Query: 1 MANVKQREEGRSRKPRHRKNFEMQDLPTFAKWLTSFH-----DDGISKSK-----RP-KR 60
M KQ EE +SRKPR R+N +M++LPTF KWL S +D SKSK RP R
Sbjct: 6 MTTDKQMEEPKSRKPRQRRNLQMEELPTFTKWLNSVGHSGSCNDANSKSKPVNGNRPVVR 65
Query: 61 TPTVALDSSEKEGV-----------AMSDSRSMRGGCCCCWQRSKSTPREISPKSQISLS 120
TP V DSSE G A+ DS S R G CCCWQ SKST RE + K +SL
Sbjct: 66 TPVVVFDSSEDGGAVVADVPKDRVQAVDDSISARAG-CCCWQSSKSTRRECALKFHLSLR 125
Query: 121 KRTVV----QVEGVAADVSVHREEKVVATG-----SGCWCCRCLPTFHICGRRKMAVVAD 180
KR VV +VE V ++ EE VVA GC CRC PTF ICGRRK VV
Sbjct: 126 KRKVVTNGSEVEVVTVVSNLREEETVVAVSDVLEKDGCG-CRCFPTFQICGRRKSIVV-- 185
Query: 181 VPNLHKEEGVATDVPDLATS-----GPDLVEEGKATVDPG---LCKEGSG-CCGCWKCLP 240
L KE+G TD P+L T GPDLV+E + V G +EGSG CCG C P
Sbjct: 186 ---LQKEDGAVTDDPNLRTEEVANRGPDLVKEEEVVVVGGSDLRKEEGSGCCCGRCGCFP 245
Query: 241 ALQICRRRRRM---------------------------------------------KVVA 300
A QICRRR + K V
Sbjct: 246 AFQICRRRNVVASKEEAVVDVPEVEEEVANDKVNKQEGDSVECLQAFQSHICCAGRKTVD 305
Query: 301 GN------------------VPNGSEEG----------------------VASDVVNPGR 360
N VP+ +EG S+V NPGR
Sbjct: 306 DNPKTFEKEALASEDSSNVDVPDLQKEGSGGCCSCFKCMPTLHICGRRRNAVSEVPNPGR 365
Query: 361 EEKVVVGVSDLQEEKEVAAVAVLKLRGDVDSKSAGEGSCWPFQFCVRGWLPKFFLCGGRT 420
EEKVVV VSD E +EV A + SKS G CWPFQ C RGWLP+FFLCG R
Sbjct: 366 EEKVVVSVSDPPEGEEVVAA-----DEEWHSKSTQGGICWPFQICTRGWLPRFFLCGERI 425
Query: 421 AVDASNHREEE--------KEEETASVCVPDLQNGVVSGDNIDHRKEERVSAGDIPVLSK 480
VDASNHREEE KEE A + P ++ V+ D KE++V+A DIPV S
Sbjct: 426 TVDASNHREEEEKAPPDVQKEEVVAVIPDPQKESIAVADGIPDDGKEKQVAADDIPVQST 485
Query: 481 EEKVSAGDSSDLHKEEAVASG--VQDLSKEKEKEEEGG-CRCFKLAGKGSSRRDRRRSSR 519
EEK+SAG KEE +SG + + +E+EGG CRCFKL GK SRR RRS +
Sbjct: 486 EEKMSAG------KEENASSGSIQETFEPDLNQEDEGGCCRCFKLGGKEGSRRQHRRSPK 545
BLAST of Tan0005703 vs. NCBI nr
Match:
KGN57524.1 (hypothetical protein Csa_011487 [Cucumis sativus])
HSP 1 Score: 284.6 bits (727), Expect = 1.7e-72
Identity = 260/664 (39.16%), Postives = 320/664 (48.19%), Query Frame = 0
Query: 1 MANVKQREEGRSRKPRHRKNFEMQDLPTFAKWLTSFH-----DDGISKSK------RPKR 60
MA K +EE RSRKPRHR+N +M++ PTF KWLT+F +D SKSK RP +
Sbjct: 1 MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQ 60
Query: 61 TPTVALDSS------------EKEGV---AMSDSRSMRGGC-CCCWQRSKSTPREISPKS 120
P V L +S E+EGV A+ S S RGG CCCWQ SKST RE + K
Sbjct: 61 RPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAGCCCWQSSKSTRRECALKF 120
Query: 121 QISLSKRTVV----QVEGVAADVSVHREEKVVATGSGCWCCRCLPTFHICGRRKMAVVAD 180
ISL KR VV + E VAA ++ E V GC CRCL TF I RRK VV
Sbjct: 121 HISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCG-CRCLRTFKIFRRRKSRVVG- 180
Query: 181 VPNLHKEEGVATD-----VPDLATSGPDLVEEGKATVDPGLCKEGSG--CCGCWKCLPAL 240
V +L KEEG TD ++A+SG D+++E + + P KE CCG WKC P
Sbjct: 181 VSDLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTF 240
Query: 241 QICRRRR----RMKVVAG----------NVPNGSEEGVA--------------------- 300
QICRRR+ + +VV G NV E+ V
Sbjct: 241 QICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTS 300
Query: 301 ----------------------------------------------SDVVNPGREEKVVV 360
S V PGREEKV+V
Sbjct: 301 EKEPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIV 360
Query: 361 GVSDLQEEKEVAAVAVLKLRGDVDSKSAGEGSCWPFQFCVRGWLPKFFLCGGRTAVDASN 420
VSD E+ +V + R S+ G+CW GW P+F LCG TAVDA N
Sbjct: 361 DVSD---PPEMGSVVDGRER---HSRPVQGGTCW------SGWFPRFLLCGEGTAVDAPN 420
Query: 421 HREEEKEEETASVCVPDLQNGVVSGDNI-DHRKEERVSAGDIPVLSKEEK-VSAGDSSDL 480
HREEE++ + + + GD I DH KE+ V+A DIPV+++EE V AGD+ DL
Sbjct: 421 HREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDL 480
Query: 481 HKEEAVAS-GVQDLSKEK-----EKEEEGGCRCFKLAGKGSSRRDRRRSSRSKEEGCWPF 519
HKE+ V+S +QD+ KE+ EK E GGC C+ GK S R + RSSRS EGCW F
Sbjct: 481 HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW---GKESGSRQQHRSSRSM-EGCWSF 540
BLAST of Tan0005703 vs. NCBI nr
Match:
XP_022922695.1 (uncharacterized protein LOC111430613 isoform X4 [Cucurbita moschata])
HSP 1 Score: 237.7 bits (605), Expect = 2.3e-58
Identity = 275/890 (30.90%), Postives = 341/890 (38.31%), Query Frame = 0
Query: 5 KQREEGRSRKPRHRKNFEMQDLPTFAKWL-----TSFHDDGISKSKR-------PKRTP- 64
KQ+E+G+ +K HR+N ++++ PTF KWL +S DD SKS + +R P
Sbjct: 6 KQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRIPR 65
Query: 65 --------TVALD-SSEKEGVAMSDSRSMRGGCCCCWQRSKSTPREISPKSQISLSKRTV 124
VA D S E GVA+ +S S RGG CCCWQRSKST RE SL K V
Sbjct: 66 DSSSGGGGVVACDVSKEDGGVAIGESISARGG-CCCWQRSKSTQRECGLNFHFSLRKSKV 125
Query: 125 VQ--VEGVAADVSVHREEKVVATGS-----GCWCCRCLPTFHICGRRKM----------- 184
V EGVAA VS REE+VV G+ GC CRC PTFHICGRRK+
Sbjct: 126 VTNVSEGVAAGVSDVREEQVVEAGAVINREGCG-CRCSPTFHICGRRKVPPTGVSYLPEK 185
Query: 185 --------------AVVADVPNLHKEEGVATDVPD-----LATSGPDLVEEGKATVDPGL 244
V D PNLHKEEG ATD PD +ATS PDLV+EG+A PG
Sbjct: 186 RSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIAAPGP 245
Query: 245 CKEGS-GCCGCWKCLPALQICRRR------------------------------------ 304
CKEGS CC WKCLP+ +C R+
Sbjct: 246 CKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPELVKE 305
Query: 305 ------------------------------------------------------------ 364
Sbjct: 306 GEATVAPGPCKEESGYCFPQWKCLPSFPTCGRKLSVVEEEVTVDDPNVPEVANRPDLAKE 365
Query: 365 -----------------------------RRMKVVAG----------------------- 424
R KV AG
Sbjct: 366 GEATVAPDPRKEGSGCCSPRLKDVSEFQIRNSKVAAGKQEMTVDVPNVLEVEEVANDVVN 425
Query: 425 -------------------------------------NVP-------------------- 484
NVP
Sbjct: 426 KQDDICPEKQEVIGLPGFQICSSKVAAGKQEMTIDVPNVPEVEEVANDVVKKQKKEVDGD 485
Query: 485 --------------------------------NGSEEG--------------VASDVVNP 519
NGS G V SDV NP
Sbjct: 486 PNTGEAVASCSSSSNLHEEEEKINASVPELHKNGSGCGWFKWMPSFLICGSKVVSDVPNP 545
BLAST of Tan0005703 vs. NCBI nr
Match:
XP_022922694.1 (uncharacterized protein LOC111430613 isoform X3 [Cucurbita moschata])
HSP 1 Score: 208.4 bits (529), Expect = 1.5e-49
Identity = 275/964 (28.53%), Postives = 341/964 (35.37%), Query Frame = 0
Query: 5 KQREEGRSRKPRHRKNFEMQDLPTFAKWL-----TSFHDDGISKSKR-------PKRTP- 64
KQ+E+G+ +K HR+N ++++ PTF KWL +S DD SKS + +R P
Sbjct: 6 KQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRIPR 65
Query: 65 --------TVALD-SSEKEGVAMSDSRSMRGGCCCCWQRSKSTPREISPKSQISLSKRTV 124
VA D S E GVA+ +S S RGG CCCWQRSKST RE SL K V
Sbjct: 66 DSSSGGGGVVACDVSKEDGGVAIGESISARGG-CCCWQRSKSTQRECGLNFHFSLRKSKV 125
Query: 125 VQ--VEGVAADVSVHREEKVVATGS-----GCWCCRCLPTFHICGRRKM----------- 184
V EGVAA VS REE+VV G+ GC CRC PTFHICGRRK+
Sbjct: 126 VTNVSEGVAAGVSDVREEQVVEAGAVINREGCG-CRCSPTFHICGRRKVPPTGVSYLPEK 185
Query: 185 --------------AVVADVPNLHKEEGVATDVPD-----LATSGPDLVEEGKATVDPGL 244
V D PNLHKEEG ATD PD +ATS PDLV+EG+A PG
Sbjct: 186 RSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIAAPGP 245
Query: 245 CKEGS------------------------------------------------------- 304
CKEGS
Sbjct: 246 CKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPESVKE 305
Query: 305 ------------------------------------------------------------ 364
Sbjct: 306 GEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANGPELV 365
Query: 365 ----------------GCCGCWKCLPALQICRRR-------------------------- 424
CC WKCLPA +C R+
Sbjct: 366 QEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRKVATSKVELTVVDIPNLLEVEEVANGP 425
Query: 425 ------------------------------------------------------------ 484
Sbjct: 426 ELVKEGVATGSPGPYKKRSGSCCSQWKCLPSFPTCGRKLSVVEEEVTVDDPNVPEVANRP 485
Query: 485 ----------------------------------RRMKVVAG------------------ 519
R KV AG
Sbjct: 486 DLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQIRNSKVAAGKQEMTVDVPNVLEVEEVA 545
BLAST of Tan0005703 vs. NCBI nr
Match:
XP_022922693.1 (uncharacterized protein LOC111430613 isoform X2 [Cucurbita moschata])
HSP 1 Score: 190.7 bits (483), Expect = 3.2e-44
Identity = 276/1015 (27.19%), Postives = 341/1015 (33.60%), Query Frame = 0
Query: 5 KQREEGRSRKPRHRKNFEMQDLPTFAKWL-----TSFHDDGISKSKR-------PKRTP- 64
KQ+E+G+ +K HR+N ++++ PTF KWL +S DD SKS + +R P
Sbjct: 6 KQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRIPR 65
Query: 65 --------TVALD-SSEKEGVAMSDSRSMRGGCCCCWQRSKSTPREISPKSQISLSKRTV 124
VA D S E GVA+ +S S RGG CCCWQRSKST RE SL K V
Sbjct: 66 DSSSGGGGVVACDVSKEDGGVAIGESISARGG-CCCWQRSKSTQRECGLNFHFSLRKSKV 125
Query: 125 VQ--VEGVAADVSVHREEKVVATGS-----GCWCCRCLPTFHICGRRKM----------- 184
V EGVAA VS REE+VV G+ GC CRC PTFHICGRRK+
Sbjct: 126 VTNVSEGVAAGVSDVREEQVVEAGAVINREGCG-CRCSPTFHICGRRKVPPTGVSYLPEK 185
Query: 185 --------------AVVADVPNLHKEEGVATDVPD-----LATSGPDLVEEGKATVDPGL 244
V D PNLHKEEG ATD PD +ATS PDLV+EG+A PG
Sbjct: 186 RSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIAAPGP 245
Query: 245 CKEGS------------------------------------------------------- 304
CKEGS
Sbjct: 246 CKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPESVKE 305
Query: 305 ------------------------------------------------------------ 364
Sbjct: 306 GEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANGPELV 365
Query: 365 ----------------GCCGCWKCLPALQICRRR-------------------------- 424
CC WKCLPA +C R+
Sbjct: 366 QEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRKVATSKEELTDVDVPNLLEVEEVANGP 425
Query: 425 ------------------------------------------------------------ 484
Sbjct: 426 ELVKEGEATVAPGPCKEESGYCFPQWKCLPSFPTCGRKLSVVEEEVTVDDPNVPEVANRP 485
Query: 485 ----------------------------------RRMKVVAG------------------ 519
R KV AG
Sbjct: 486 DLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQIRNSKVAAGKQEMTVDVPNVLEVEEVA 545
BLAST of Tan0005703 vs. ExPASy TrEMBL
Match:
A0A0A0L996 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G204780 PE=4 SV=1)
HSP 1 Score: 284.6 bits (727), Expect = 8.0e-73
Identity = 260/664 (39.16%), Postives = 320/664 (48.19%), Query Frame = 0
Query: 1 MANVKQREEGRSRKPRHRKNFEMQDLPTFAKWLTSFH-----DDGISKSK------RPKR 60
MA K +EE RSRKPRHR+N +M++ PTF KWLT+F +D SKSK RP +
Sbjct: 1 MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQ 60
Query: 61 TPTVALDSS------------EKEGV---AMSDSRSMRGGC-CCCWQRSKSTPREISPKS 120
P V L +S E+EGV A+ S S RGG CCCWQ SKST RE + K
Sbjct: 61 RPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAGCCCWQSSKSTRRECALKF 120
Query: 121 QISLSKRTVV----QVEGVAADVSVHREEKVVATGSGCWCCRCLPTFHICGRRKMAVVAD 180
ISL KR VV + E VAA ++ E V GC CRCL TF I RRK VV
Sbjct: 121 HISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCG-CRCLRTFKIFRRRKSRVVG- 180
Query: 181 VPNLHKEEGVATD-----VPDLATSGPDLVEEGKATVDPGLCKEGSG--CCGCWKCLPAL 240
V +L KEEG TD ++A+SG D+++E + + P KE CCG WKC P
Sbjct: 181 VSDLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTF 240
Query: 241 QICRRRR----RMKVVAG----------NVPNGSEEGVA--------------------- 300
QICRRR+ + +VV G NV E+ V
Sbjct: 241 QICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTS 300
Query: 301 ----------------------------------------------SDVVNPGREEKVVV 360
S V PGREEKV+V
Sbjct: 301 EKEPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIV 360
Query: 361 GVSDLQEEKEVAAVAVLKLRGDVDSKSAGEGSCWPFQFCVRGWLPKFFLCGGRTAVDASN 420
VSD E+ +V + R S+ G+CW GW P+F LCG TAVDA N
Sbjct: 361 DVSD---PPEMGSVVDGRER---HSRPVQGGTCW------SGWFPRFLLCGEGTAVDAPN 420
Query: 421 HREEEKEEETASVCVPDLQNGVVSGDNI-DHRKEERVSAGDIPVLSKEEK-VSAGDSSDL 480
HREEE++ + + + GD I DH KE+ V+A DIPV+++EE V AGD+ DL
Sbjct: 421 HREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDL 480
Query: 481 HKEEAVAS-GVQDLSKEK-----EKEEEGGCRCFKLAGKGSSRRDRRRSSRSKEEGCWPF 519
HKE+ V+S +QD+ KE+ EK E GGC C+ GK S R + RSSRS EGCW F
Sbjct: 481 HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW---GKESGSRQQHRSSRSM-EGCWSF 540
BLAST of Tan0005703 vs. ExPASy TrEMBL
Match:
A0A6J1E9I0 (uncharacterized protein LOC111430613 isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111430613 PE=4 SV=1)
HSP 1 Score: 237.7 bits (605), Expect = 1.1e-58
Identity = 275/890 (30.90%), Postives = 341/890 (38.31%), Query Frame = 0
Query: 5 KQREEGRSRKPRHRKNFEMQDLPTFAKWL-----TSFHDDGISKSKR-------PKRTP- 64
KQ+E+G+ +K HR+N ++++ PTF KWL +S DD SKS + +R P
Sbjct: 6 KQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRIPR 65
Query: 65 --------TVALD-SSEKEGVAMSDSRSMRGGCCCCWQRSKSTPREISPKSQISLSKRTV 124
VA D S E GVA+ +S S RGG CCCWQRSKST RE SL K V
Sbjct: 66 DSSSGGGGVVACDVSKEDGGVAIGESISARGG-CCCWQRSKSTQRECGLNFHFSLRKSKV 125
Query: 125 VQ--VEGVAADVSVHREEKVVATGS-----GCWCCRCLPTFHICGRRKM----------- 184
V EGVAA VS REE+VV G+ GC CRC PTFHICGRRK+
Sbjct: 126 VTNVSEGVAAGVSDVREEQVVEAGAVINREGCG-CRCSPTFHICGRRKVPPTGVSYLPEK 185
Query: 185 --------------AVVADVPNLHKEEGVATDVPD-----LATSGPDLVEEGKATVDPGL 244
V D PNLHKEEG ATD PD +ATS PDLV+EG+A PG
Sbjct: 186 RSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIAAPGP 245
Query: 245 CKEGS-GCCGCWKCLPALQICRRR------------------------------------ 304
CKEGS CC WKCLP+ +C R+
Sbjct: 246 CKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPELVKE 305
Query: 305 ------------------------------------------------------------ 364
Sbjct: 306 GEATVAPGPCKEESGYCFPQWKCLPSFPTCGRKLSVVEEEVTVDDPNVPEVANRPDLAKE 365
Query: 365 -----------------------------RRMKVVAG----------------------- 424
R KV AG
Sbjct: 366 GEATVAPDPRKEGSGCCSPRLKDVSEFQIRNSKVAAGKQEMTVDVPNVLEVEEVANDVVN 425
Query: 425 -------------------------------------NVP-------------------- 484
NVP
Sbjct: 426 KQDDICPEKQEVIGLPGFQICSSKVAAGKQEMTIDVPNVPEVEEVANDVVKKQKKEVDGD 485
Query: 485 --------------------------------NGSEEG--------------VASDVVNP 519
NGS G V SDV NP
Sbjct: 486 PNTGEAVASCSSSSNLHEEEEKINASVPELHKNGSGCGWFKWMPSFLICGSKVVSDVPNP 545
BLAST of Tan0005703 vs. ExPASy TrEMBL
Match:
A0A6J1E452 (uncharacterized protein LOC111430613 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111430613 PE=4 SV=1)
HSP 1 Score: 208.4 bits (529), Expect = 7.3e-50
Identity = 275/964 (28.53%), Postives = 341/964 (35.37%), Query Frame = 0
Query: 5 KQREEGRSRKPRHRKNFEMQDLPTFAKWL-----TSFHDDGISKSKR-------PKRTP- 64
KQ+E+G+ +K HR+N ++++ PTF KWL +S DD SKS + +R P
Sbjct: 6 KQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRIPR 65
Query: 65 --------TVALD-SSEKEGVAMSDSRSMRGGCCCCWQRSKSTPREISPKSQISLSKRTV 124
VA D S E GVA+ +S S RGG CCCWQRSKST RE SL K V
Sbjct: 66 DSSSGGGGVVACDVSKEDGGVAIGESISARGG-CCCWQRSKSTQRECGLNFHFSLRKSKV 125
Query: 125 VQ--VEGVAADVSVHREEKVVATGS-----GCWCCRCLPTFHICGRRKM----------- 184
V EGVAA VS REE+VV G+ GC CRC PTFHICGRRK+
Sbjct: 126 VTNVSEGVAAGVSDVREEQVVEAGAVINREGCG-CRCSPTFHICGRRKVPPTGVSYLPEK 185
Query: 185 --------------AVVADVPNLHKEEGVATDVPD-----LATSGPDLVEEGKATVDPGL 244
V D PNLHKEEG ATD PD +ATS PDLV+EG+A PG
Sbjct: 186 RSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIAAPGP 245
Query: 245 CKEGS------------------------------------------------------- 304
CKEGS
Sbjct: 246 CKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPESVKE 305
Query: 305 ------------------------------------------------------------ 364
Sbjct: 306 GEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANGPELV 365
Query: 365 ----------------GCCGCWKCLPALQICRRR-------------------------- 424
CC WKCLPA +C R+
Sbjct: 366 QEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRKVATSKVELTVVDIPNLLEVEEVANGP 425
Query: 425 ------------------------------------------------------------ 484
Sbjct: 426 ELVKEGVATGSPGPYKKRSGSCCSQWKCLPSFPTCGRKLSVVEEEVTVDDPNVPEVANRP 485
Query: 485 ----------------------------------RRMKVVAG------------------ 519
R KV AG
Sbjct: 486 DLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQIRNSKVAAGKQEMTVDVPNVLEVEEVA 545
BLAST of Tan0005703 vs. ExPASy TrEMBL
Match:
A0A6J1E7K5 (uncharacterized protein LOC111430613 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111430613 PE=4 SV=1)
HSP 1 Score: 190.7 bits (483), Expect = 1.6e-44
Identity = 276/1015 (27.19%), Postives = 341/1015 (33.60%), Query Frame = 0
Query: 5 KQREEGRSRKPRHRKNFEMQDLPTFAKWL-----TSFHDDGISKSKR-------PKRTP- 64
KQ+E+G+ +K HR+N ++++ PTF KWL +S DD SKS + +R P
Sbjct: 6 KQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRIPR 65
Query: 65 --------TVALD-SSEKEGVAMSDSRSMRGGCCCCWQRSKSTPREISPKSQISLSKRTV 124
VA D S E GVA+ +S S RGG CCCWQRSKST RE SL K V
Sbjct: 66 DSSSGGGGVVACDVSKEDGGVAIGESISARGG-CCCWQRSKSTQRECGLNFHFSLRKSKV 125
Query: 125 VQ--VEGVAADVSVHREEKVVATGS-----GCWCCRCLPTFHICGRRKM----------- 184
V EGVAA VS REE+VV G+ GC CRC PTFHICGRRK+
Sbjct: 126 VTNVSEGVAAGVSDVREEQVVEAGAVINREGCG-CRCSPTFHICGRRKVPPTGVSYLPEK 185
Query: 185 --------------AVVADVPNLHKEEGVATDVPD-----LATSGPDLVEEGKATVDPGL 244
V D PNLHKEEG ATD PD +ATS PDLV+EG+A PG
Sbjct: 186 RSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIAAPGP 245
Query: 245 CKEGS------------------------------------------------------- 304
CKEGS
Sbjct: 246 CKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPESVKE 305
Query: 305 ------------------------------------------------------------ 364
Sbjct: 306 GEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANGPELV 365
Query: 365 ----------------GCCGCWKCLPALQICRRR-------------------------- 424
CC WKCLPA +C R+
Sbjct: 366 QEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRKVATSKEELTDVDVPNLLEVEEVANGP 425
Query: 425 ------------------------------------------------------------ 484
Sbjct: 426 ELVKEGEATVAPGPCKEESGYCFPQWKCLPSFPTCGRKLSVVEEEVTVDDPNVPEVANRP 485
Query: 485 ----------------------------------RRMKVVAG------------------ 519
R KV AG
Sbjct: 486 DLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQIRNSKVAAGKQEMTVDVPNVLEVEEVA 545
BLAST of Tan0005703 vs. ExPASy TrEMBL
Match:
A0A6J1E412 (uncharacterized protein LOC111430613 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430613 PE=4 SV=1)
HSP 1 Score: 190.7 bits (483), Expect = 1.6e-44
Identity = 276/1015 (27.19%), Postives = 341/1015 (33.60%), Query Frame = 0
Query: 5 KQREEGRSRKPRHRKNFEMQDLPTFAKWL-----TSFHDDGISKSKR-------PKRTP- 64
KQ+E+G+ +K HR+N ++++ PTF KWL +S DD SKS + +R P
Sbjct: 6 KQKEDGKPKKKCHRRNLQIEEFPTFTKWLGPRGRSSSRDDTSSKSYKRGVPYPPVRRIPR 65
Query: 65 --------TVALD-SSEKEGVAMSDSRSMRGGCCCCWQRSKSTPREISPKSQISLSKRTV 124
VA D S E GVA+ +S S RGG CCCWQRSKST RE SL K V
Sbjct: 66 DSSSGGGGVVACDVSKEDGGVAIGESISARGG-CCCWQRSKSTQRECGLNFHFSLRKSKV 125
Query: 125 VQ--VEGVAADVSVHREEKVVATGS-----GCWCCRCLPTFHICGRRKM----------- 184
V EGVAA VS REE+VV G+ GC CRC PTFHICGRRK+
Sbjct: 126 VTNVSEGVAAGVSDVREEQVVEAGAVINREGCG-CRCSPTFHICGRRKVPPTGVSYLPEK 185
Query: 185 --------------AVVADVPNLHKEEGVATDVPD-----LATSGPDLVEEGKATVDPGL 244
V D PNLHKEEG ATD PD +ATS PDLV+EG+A PG
Sbjct: 186 RSGRRKSRSIRRRGLPVVDAPNLHKEEGPATDGPDVLKEEVATSCPDLVKEGEAIAAPGP 245
Query: 245 CKEGS------------------------------------------------------- 304
CKEGS
Sbjct: 246 CKEGSRSCCSQWKCLPSFPMCGRKLSVVEEEVTIEEEVSVYDPNGPEVEEVANGPESVKE 305
Query: 305 ------------------------------------------------------------ 364
Sbjct: 306 GEATIAPAPCKEGSESCSQWKCSPSIPMCERKVATGEEELTVVNVPNLLDVEVANGPELV 365
Query: 365 ----------------GCCGCWKCLPALQICRRR-------------------------- 424
CC WKCLPA +C R+
Sbjct: 366 QEGEATDAPSLCKRSGSCCSQWKCLPAFPMCGRKVATSKVELTVVDIPNLLEVEEVANGP 425
Query: 425 ------------------------------------------------------------ 484
Sbjct: 426 ELVKEGVATGSPGPYKKRSGSCCSQWKCLPSFPTCGRKLSVVEEEVTVDDPNVPEVANRP 485
Query: 485 ----------------------------------RRMKVVAG------------------ 519
R KV AG
Sbjct: 486 DLAKEGEATVAPDPRKEGSGCCSPRLKDVSEFQIRNSKVAAGKQEMTVDVPNVLEVEEVA 545
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_038880648.1 | 1.0e-85 | 41.77 | uncharacterized protein LOC120072275 [Benincasa hispida] | [more] |
KGN57524.1 | 1.7e-72 | 39.16 | hypothetical protein Csa_011487 [Cucumis sativus] | [more] |
XP_022922695.1 | 2.3e-58 | 30.90 | uncharacterized protein LOC111430613 isoform X4 [Cucurbita moschata] | [more] |
XP_022922694.1 | 1.5e-49 | 28.53 | uncharacterized protein LOC111430613 isoform X3 [Cucurbita moschata] | [more] |
XP_022922693.1 | 3.2e-44 | 27.19 | uncharacterized protein LOC111430613 isoform X2 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0L996 | 8.0e-73 | 39.16 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G204780 PE=4 SV=1 | [more] |
A0A6J1E9I0 | 1.1e-58 | 30.90 | uncharacterized protein LOC111430613 isoform X4 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E452 | 7.3e-50 | 28.53 | uncharacterized protein LOC111430613 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E7K5 | 1.6e-44 | 27.19 | uncharacterized protein LOC111430613 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E412 | 1.6e-44 | 27.19 | uncharacterized protein LOC111430613 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |