Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAGCAAGAAAAAAAAAAAAAGAAACGCAAAAGCGATCTCATAATACGGCACAACAAAAGCAAACTCAAAGGCAAAGCTAAGAAAGAGAGAAAAAAGACCCATCATCTCGCTTTCTGTAAAGGAAAAAAGCCCAACCAAACCGAGTTCACCGGCGAAAATGGAGGAAGACGGCAACGCGCCGCCGCCGTTCTGGCTTCAATCCTCTACCTCCATGGACCAAGTCGACTACAATCGCCGCCGTCGCCTCAGCCGCGCATCGTCGTTCCTCCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATAGTTCTCTGTTTCATCTTGATTGTGATTCCTAAATTTGTACAGTTCGCTTCTCAATTGATTCGGCCTCAATCGGTCAAGAAGAGCTGGGATTCCCTCAATTTGGTTCTTGTTCTCTTCGCCATTGTTTGTGGATTTCTCAGTAGAAACACTGGTGATGATAATAGAGGCTCTTTTGAAGATCGGAGCGTTTCTTCGAGGCGGAGAATGAAGTCAAACCCTACGACTCCGCGCCGATGGGATGGATATTCCGATCATCGGCCGAATCATTTCACCGTCAATCGGATGAGGAGTAGTAGTTCGTATCCCGATCTACGTCTTCAGGAGTCTTCATTGGATGCCGGGGATGAACGGTGGCGATTTTACGATGATACTCATGTGCATAATCATCGGTTTGCGTCCTCCGATCAGCTTCATCACCGTCGTGAAGCTCGGCCGGAGCTTGAACGCGAAGATTCTGGTGCCAAAAGTATAGGTTTCGACAGATCTGAGATTCGTGAAGATGTATATTCACAACCGGCGATACCTTCTCCCCCGCGATCGCCGCCGCCGCGGGTGTCTCCTCCGCGATCTCCATCACCGCCTCCTACGCCTCCGCCTCCTGCTAATACGACTCCTAAAGTGGTTAAACGAAGGCCAAAGAGAACCCATAAGGTCCATAGCCATACGCCCGATGGAGCAATCGATCAACAGCAGAAGAATGACGATTCGGACGTAGCCGATTTTCGACGGATTCAGCTTCCACCACTCTCGCCGCCGTCATTTTATCGGGAATCGGAGCAGAAGAGCGGCAAAAACGAGAAGAAGAGAGGTGGCGCTCCAAAAGAAATTTGGTCCGCACTGAGGAGGAGGAAGAAGAAGCAAAGACAAAAGAGCGTCGAAAGCTTCGAGGCTATCATCGCCTCCCAAAACGCTTCAACATCGTCATTACCACCGCCGTCACCACCGCCGCCTCCGCCGCTCCCGCCGCCGTCAGTTCTGCAAAATCTATTTTCATCCAAGAAAGGAAAAGCAAAAAAGGTACAGTCCACACCTCCACCATCAATAGTCTCCTCAGAACCTAAACCAGAGATCGAAGATCAAAATCACCTCCTCAAACCTCACGATCCTCCAATGGAGCTTGAGAGACTGAGCAGTTTAAACGACGAAGAGTACAATACGCGCATTGGCGGTGAGTCGCCATTTCATCCGATTCCTCCGCCACCACCGCCGCCGCCGCCGTTCAGAATGCATGGAGACTTCGACAGTGTAGGAAGCAACAGCAGTACACCAAGAGCCATCTCGCCGGACATTGACGAGAGTGAAGCCGATGGACCGCCCGCGGCCGGCGAAATGAAACTCATGAAAGATTCAACAATTCCGATGTTCTGTTCAAGCCCAGATGTTAACAGTAAAGCCGATAATTTCATTGCAAGATTCAGAGCCGATTTGAAGTTGCAGAAGATGAATTCCATCAAAGAGAAGACGGCGAGGAAGAGATCTAACCTAGGCCGAACACCAGGCCCAGGCCCAAAGTAAATCAAGATAAGGCTCAGCCCATATACACTTTTTTTTCAATAATGAAAATTAAAAAAAAAAAGAAAAAAAAAATCTCAATTCTTTTGTTTGTTGTTATTTGTTGTTTTTTAAGAGCATGTTTTGTTTGAAGCTTTTTCGAATACCATGACATGATAGGACAAGTAGAAACATAGGCTGATATATATTTTAAGGTATTTTGGATATGAATTTTTTTTTGTTTTTTTTTTCTCTCAAATTATTTGCTTGGTGAGAAATGAAAGGATGCTATAATTTGAATA
mRNA sequence
CAAGCAAGAAAAAAAAAAAAAGAAACGCAAAAGCGATCTCATAATACGGCACAACAAAAGCAAACTCAAAGGCAAAGCTAAGAAAGAGAGAAAAAAGACCCATCATCTCGCTTTCTGTAAAGGAAAAAAGCCCAACCAAACCGAGTTCACCGGCGAAAATGGAGGAAGACGGCAACGCGCCGCCGCCGTTCTGGCTTCAATCCTCTACCTCCATGGACCAAGTCGACTACAATCGCCGCCGTCGCCTCAGCCGCGCATCGTCGTTCCTCCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATAGTTCTCTGTTTCATCTTGATTGTGATTCCTAAATTTGTACAGTTCGCTTCTCAATTGATTCGGCCTCAATCGGTCAAGAAGAGCTGGGATTCCCTCAATTTGGTTCTTGTTCTCTTCGCCATTGTTTGTGGATTTCTCAGTAGAAACACTGGTGATGATAATAGAGGCTCTTTTGAAGATCGGAGCGTTTCTTCGAGGCGGAGAATGAAGTCAAACCCTACGACTCCGCGCCGATGGGATGGATATTCCGATCATCGGCCGAATCATTTCACCGTCAATCGGATGAGGAGTAGTAGTTCGTATCCCGATCTACGTCTTCAGGAGTCTTCATTGGATGCCGGGGATGAACGGTGGCGATTTTACGATGATACTCATGTGCATAATCATCGGTTTGCGTCCTCCGATCAGCTTCATCACCGTCGTGAAGCTCGGCCGGAGCTTGAACGCGAAGATTCTGGTGCCAAAAGTATAGGTTTCGACAGATCTGAGATTCGTGAAGATGTATATTCACAACCGGCGATACCTTCTCCCCCGCGATCGCCGCCGCCGCGGGTGTCTCCTCCGCGATCTCCATCACCGCCTCCTACGCCTCCGCCTCCTGCTAATACGACTCCTAAAGTGGTTAAACGAAGGCCAAAGAGAACCCATAAGGTCCATAGCCATACGCCCGATGGAGCAATCGATCAACAGCAGAAGAATGACGATTCGGACGTAGCCGATTTTCGACGGATTCAGCTTCCACCACTCTCGCCGCCGTCATTTTATCGGGAATCGGAGCAGAAGAGCGGCAAAAACGAGAAGAAGAGAGGTGGCGCTCCAAAAGAAATTTGGTCCGCACTGAGGAGGAGGAAGAAGAAGCAAAGACAAAAGAGCGTCGAAAGCTTCGAGGCTATCATCGCCTCCCAAAACGCTTCAACATCGTCATTACCACCGCCGTCACCACCGCCGCCTCCGCCGCTCCCGCCGCCGTCAGTTCTGCAAAATCTATTTTCATCCAAGAAAGGAAAAGCAAAAAAGGTACAGTCCACACCTCCACCATCAATAGTCTCCTCAGAACCTAAACCAGAGATCGAAGATCAAAATCACCTCCTCAAACCTCACGATCCTCCAATGGAGCTTGAGAGACTGAGCAGTTTAAACGACGAAGAGTACAATACGCGCATTGGCGGTGAGTCGCCATTTCATCCGATTCCTCCGCCACCACCGCCGCCGCCGCCGTTCAGAATGCATGGAGACTTCGACAGTGTAGGAAGCAACAGCAGTACACCAAGAGCCATCTCGCCGGACATTGACGAGAGTGAAGCCGATGGACCGCCCGCGGCCGGCGAAATGAAACTCATGAAAGATTCAACAATTCCGATGTTCTGTTCAAGCCCAGATGTTAACAGTAAAGCCGATAATTTCATTGCAAGATTCAGAGCCGATTTGAAGTTGCAGAAGATGAATTCCATCAAAGAGAAGACGGCGAGGAAGAGATCTAACCTAGGCCGAACACCAGGCCCAGGCCCAAAGTAAATCAAGATAAGGCTCAGCCCATATACACTTTTTTTTCAATAATGAAAATTAAAAAAAAAAAGAAAAAAAAAATCTCAATTCTTTTGTTTGTTGTTATTTGTTGTTTTTTAAGAGCATGTTTTGTTTGAAGCTTTTTCGAATACCATGACATGATAGGACAAGTAGAAACATAGGCTGATATATATTTTAAGGTATTTTGGATATGAATTTTTTTTTGTTTTTTTTTTCTCTCAAATTATTTGCTTGGTGAGAAATGAAAGGATGCTATAATTTGAATA
Coding sequence (CDS)
ATGGAGGAAGACGGCAACGCGCCGCCGCCGTTCTGGCTTCAATCCTCTACCTCCATGGACCAAGTCGACTACAATCGCCGCCGTCGCCTCAGCCGCGCATCGTCGTTCCTCCTCAACTCCAGCGCCTTTCTCATTGTTTTGTTAGTAATAGTTCTCTGTTTCATCTTGATTGTGATTCCTAAATTTGTACAGTTCGCTTCTCAATTGATTCGGCCTCAATCGGTCAAGAAGAGCTGGGATTCCCTCAATTTGGTTCTTGTTCTCTTCGCCATTGTTTGTGGATTTCTCAGTAGAAACACTGGTGATGATAATAGAGGCTCTTTTGAAGATCGGAGCGTTTCTTCGAGGCGGAGAATGAAGTCAAACCCTACGACTCCGCGCCGATGGGATGGATATTCCGATCATCGGCCGAATCATTTCACCGTCAATCGGATGAGGAGTAGTAGTTCGTATCCCGATCTACGTCTTCAGGAGTCTTCATTGGATGCCGGGGATGAACGGTGGCGATTTTACGATGATACTCATGTGCATAATCATCGGTTTGCGTCCTCCGATCAGCTTCATCACCGTCGTGAAGCTCGGCCGGAGCTTGAACGCGAAGATTCTGGTGCCAAAAGTATAGGTTTCGACAGATCTGAGATTCGTGAAGATGTATATTCACAACCGGCGATACCTTCTCCCCCGCGATCGCCGCCGCCGCGGGTGTCTCCTCCGCGATCTCCATCACCGCCTCCTACGCCTCCGCCTCCTGCTAATACGACTCCTAAAGTGGTTAAACGAAGGCCAAAGAGAACCCATAAGGTCCATAGCCATACGCCCGATGGAGCAATCGATCAACAGCAGAAGAATGACGATTCGGACGTAGCCGATTTTCGACGGATTCAGCTTCCACCACTCTCGCCGCCGTCATTTTATCGGGAATCGGAGCAGAAGAGCGGCAAAAACGAGAAGAAGAGAGGTGGCGCTCCAAAAGAAATTTGGTCCGCACTGAGGAGGAGGAAGAAGAAGCAAAGACAAAAGAGCGTCGAAAGCTTCGAGGCTATCATCGCCTCCCAAAACGCTTCAACATCGTCATTACCACCGCCGTCACCACCGCCGCCTCCGCCGCTCCCGCCGCCGTCAGTTCTGCAAAATCTATTTTCATCCAAGAAAGGAAAAGCAAAAAAGGTACAGTCCACACCTCCACCATCAATAGTCTCCTCAGAACCTAAACCAGAGATCGAAGATCAAAATCACCTCCTCAAACCTCACGATCCTCCAATGGAGCTTGAGAGACTGAGCAGTTTAAACGACGAAGAGTACAATACGCGCATTGGCGGTGAGTCGCCATTTCATCCGATTCCTCCGCCACCACCGCCGCCGCCGCCGTTCAGAATGCATGGAGACTTCGACAGTGTAGGAAGCAACAGCAGTACACCAAGAGCCATCTCGCCGGACATTGACGAGAGTGAAGCCGATGGACCGCCCGCGGCCGGCGAAATGAAACTCATGAAAGATTCAACAATTCCGATGTTCTGTTCAAGCCCAGATGTTAACAGTAAAGCCGATAATTTCATTGCAAGATTCAGAGCCGATTTGAAGTTGCAGAAGATGAATTCCATCAAAGAGAAGACGGCGAGGAAGAGATCTAACCTAGGCCGAACACCAGGCCCAGGCCCAAAGTAA
Protein sequence
MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFEDRSVSSRRRMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHRFASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPGPGPK
Homology
BLAST of Tan0008072 vs. NCBI nr
Match:
XP_038896222.1 (serine/arginine repetitive matrix protein 1-like [Benincasa hispida])
HSP 1 Score: 927.9 bits (2397), Expect = 4.0e-266
Identity = 491/559 (87.84%), Postives = 516/559 (92.31%), Query Frame = 0
Query: 1 MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIP 60
MEEDGNAPPPFWLQSS S+ ++DYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIP
Sbjct: 1 MEEDGNAPPPFWLQSSNSLHELDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIP 60
Query: 61 KFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFEDRSVSSRRRMK 120
KFVQF SQLIRPQSVKKSWDSLNL+LVLFAIVCGFLSRNTGDD+R SFED SVSSRR MK
Sbjct: 61 KFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLSRNTGDDSRASFEDPSVSSRRTMK 120
Query: 121 SNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHR 180
SNPTTPRRWDGY+DHRPNH+T+NRMRSSSSYPDLRLQES+ DAGD RWRFYDDTHV NHR
Sbjct: 121 SNPTTPRRWDGYTDHRPNHYTLNRMRSSSSYPDLRLQESTFDAGDHRWRFYDDTHVTNHR 180
Query: 181 FASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSP--PRSPPPRVSPP 240
+ SSDQLH RRE RPELER DS AKSIGFDRSEIREDVYSQPAIPSP PRSPPPRVSPP
Sbjct: 181 YLSSDQLHRRRETRPELERLDSDAKSIGFDRSEIREDVYSQPAIPSPPRPRSPPPRVSPP 240
Query: 241 RSPSPPPTPPPPANTT--PKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQL 300
R PSPPPTPPPPANTT PKVVKRRPKRTHKVHSHTPD IDQQ +N DSDVA+F+RIQL
Sbjct: 241 RPPSPPPTPPPPANTTPPPKVVKRRPKRTHKVHSHTPDTEIDQQNENGDSDVANFQRIQL 300
Query: 301 PPLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNAST 360
PPLSPPSFYRESEQKS +NEKKRGGA KEIWSALRRRKKKQRQKS+ESFEAIIASQ AST
Sbjct: 301 PPLSPPSFYRESEQKSNRNEKKRGGASKEIWSALRRRKKKQRQKSIESFEAIIASQRAST 360
Query: 361 SSLPPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPS-IVSSEPKPEIEDQNHLLK 420
P SPPPPPPLP PSVLQNLFSSKKGK KKVQSTPPP SSEPKP+ ED+N +LK
Sbjct: 361 ----PSSPPPPPPLPSPSVLQNLFSSKKGKGKKVQSTPPPEPPASSEPKPKTEDRNQMLK 420
Query: 421 PHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
PH+PPMEL+RLSSLNDEEYNTRIGGESP+HPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA
Sbjct: 421 PHEPPMELDRLSSLNDEEYNTRIGGESPYHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
Query: 481 ISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIK 540
ISP++DESEADGPPA GE KL+KDSTIP+FCSSPDVNSKAD FIARFRADLKLQKMNSIK
Sbjct: 481 ISPEMDESEADGPPATGERKLVKDSTIPIFCSSPDVNSKADKFIARFRADLKLQKMNSIK 540
Query: 541 EKTARKRSNLGRTPGPGPK 555
EKTARKRSNLGRT GPGPK
Sbjct: 541 EKTARKRSNLGRTSGPGPK 555
BLAST of Tan0008072 vs. NCBI nr
Match:
XP_023548433.1 (protein enabled homolog [Cucurbita pepo subsp. pepo])
HSP 1 Score: 913.7 bits (2360), Expect = 7.8e-262
Identity = 482/559 (86.23%), Postives = 512/559 (91.59%), Query Frame = 0
Query: 1 MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVI 60
MEEDGNAPPPFWLQ S S+ ++D +RRR RLSRASSFLLNSSAFL+VLLVIVLCFI IVI
Sbjct: 1 MEEDGNAPPPFWLQPSNSLHELDNHRRRHRLSRASSFLLNSSAFLVVLLVIVLCFIWIVI 60
Query: 61 PKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFEDRSVSSRRRM 120
PKFVQF SQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRN G+D+R SFEDRSVSSRR +
Sbjct: 61 PKFVQFGSQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNAGEDSRDSFEDRSVSSRRTI 120
Query: 121 KSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNH 180
KSNP PR+WDGY+DHRP H+TVNRMRSSSSYPDLRLQESSLDAGD++WR YDDTHV N+
Sbjct: 121 KSNPRNPRQWDGYADHRPIHYTVNRMRSSSSYPDLRLQESSLDAGDQQWRSYDDTHVPNN 180
Query: 181 RFASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPR 240
RF SSDQLH RREARPELEREDS KSIGFDRSE+REDVYSQ IPSPPRSPPP+VSPPR
Sbjct: 181 RFPSSDQLHRRREARPELEREDSDVKSIGFDRSEMREDVYSQMPIPSPPRSPPPQVSPPR 240
Query: 241 SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL 300
SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTP G IDQ KN DSDVA+F+RI LPPL
Sbjct: 241 SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPAGEIDQHNKNGDSDVAEFQRIPLPPL 300
Query: 301 SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSL 360
SPP FYRESEQKS KN+KKRGGAPKEIWSALRRR+KKQRQKS+ESFE I+ASQ STSSL
Sbjct: 301 SPPLFYRESEQKSVKNDKKRGGAPKEIWSALRRRRKKQRQKSIESFEDIVASQRPSTSSL 360
Query: 361 PPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----PPSIVSSEPKPEIEDQNHLLK 420
PPPSPPPPPPLP PSVLQ LF+SKKGK KKVQSTP PPSI S EPKP IEDQNHLLK
Sbjct: 361 PPPSPPPPPPLPSPSVLQVLFTSKKGKGKKVQSTPSPESPPSIASPEPKPIIEDQNHLLK 420
Query: 421 PHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
PH+PP+EL RLSSLNDEEY+TRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA
Sbjct: 421 PHEPPVELARLSSLNDEEYSTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
Query: 481 ISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIK 540
+SPD+ ESEADG PAAGE KL+KDSTIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIK
Sbjct: 481 VSPDMGESEADGQPAAGERKLVKDSTIPMFCSSPDVNSKADKFIARFRADLKLQKMNSIK 540
Query: 541 EKTARKRSNLGRTPGPGPK 555
EKTARKRSNLGRTPGPGP+
Sbjct: 541 EKTARKRSNLGRTPGPGPR 559
BLAST of Tan0008072 vs. NCBI nr
Match:
KAG6575459.1 (hypothetical protein SDJN03_26098, partial [Cucurbita argyrosperma subsp. sororia] >KAG7014003.1 hypothetical protein SDJN02_24173, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 910.2 bits (2351), Expect = 8.6e-261
Identity = 482/559 (86.23%), Postives = 512/559 (91.59%), Query Frame = 0
Query: 1 MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVI 60
MEEDGNAPPPFWLQ S S+ ++D +RRR RLSRASSFLLNSSAFL+VLLVIVLCFI IVI
Sbjct: 1 MEEDGNAPPPFWLQPSNSLHELDDHRRRHRLSRASSFLLNSSAFLVVLLVIVLCFIWIVI 60
Query: 61 PKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFEDRSVSSRRRM 120
PKFVQF SQLIRPQS+KKSWDSLNLVLVLFAIVCGFLSRN GDD+R SFEDRSVSSRR +
Sbjct: 61 PKFVQFGSQLIRPQSMKKSWDSLNLVLVLFAIVCGFLSRNAGDDSRDSFEDRSVSSRRII 120
Query: 121 KSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNH 180
KSNP PR+WDGY+DHRP H+TVNRMRSSSSYPDLRLQESSLDAGD+RWR YDDTHV N+
Sbjct: 121 KSNPRNPRQWDGYADHRPIHYTVNRMRSSSSYPDLRLQESSLDAGDQRWRSYDDTHVPNN 180
Query: 181 RFASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPR 240
RF SSDQLH RREARPELEREDS KSIGFDRSEIREDVYSQ IPSPPRSPPP+VSPPR
Sbjct: 181 RFPSSDQLHRRREARPELEREDSDVKSIGFDRSEIREDVYSQLPIPSPPRSPPPQVSPPR 240
Query: 241 SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL 300
SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTP G IDQ KN DSDVA+F+RI LPPL
Sbjct: 241 SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPAGEIDQHNKNGDSDVAEFQRIPLPPL 300
Query: 301 SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSL 360
SPP FYRESEQKS KNEKKRGGAPKEIWSALRRR+KKQRQKS+ESFEAI+ASQ STSSL
Sbjct: 301 SPPLFYRESEQKSVKNEKKRGGAPKEIWSALRRRRKKQRQKSIESFEAIVASQRPSTSSL 360
Query: 361 PPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----PPSIVSSEPKPEIEDQNHLLK 420
PPPSPPPPPPL PSVLQ LF+SKKG+ KKVQSTP PPSI SSEPKP IEDQNHLLK
Sbjct: 361 PPPSPPPPPPLSSPSVLQVLFTSKKGRGKKVQSTPSPESPPSIASSEPKPIIEDQNHLLK 420
Query: 421 PHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
PH+PP+EL RL+SLNDEEY+TRIGGES FHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA
Sbjct: 421 PHEPPVELARLNSLNDEEYSTRIGGESSFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
Query: 481 ISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIK 540
+SPD+DESEADG PAAGE K +K+STIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIK
Sbjct: 481 VSPDMDESEADGKPAAGERKPVKNSTIPMFCSSPDVNSKADKFIARFRADLKLQKMNSIK 540
Query: 541 EKTARKRSNLGRTPGPGPK 555
EKTARKRSNLGRTPGPGP+
Sbjct: 541 EKTARKRSNLGRTPGPGPR 559
BLAST of Tan0008072 vs. NCBI nr
Match:
XP_022953834.1 (protein enabled homolog [Cucurbita moschata])
HSP 1 Score: 907.5 bits (2344), Expect = 5.6e-260
Identity = 481/559 (86.05%), Postives = 512/559 (91.59%), Query Frame = 0
Query: 1 MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVI 60
MEEDGNAPPPFWLQ S S+ ++D +RRR RLSRASSFLLNSSAFL+VLLVIVLCFI IVI
Sbjct: 1 MEEDGNAPPPFWLQPSNSLHELDNHRRRHRLSRASSFLLNSSAFLVVLLVIVLCFIWIVI 60
Query: 61 PKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFEDRSVSSRRRM 120
PKFVQF SQLIRPQS+KKSWDSLNLVLVLFAIVCGFLSRN GDD+R SFEDRSVSSRR +
Sbjct: 61 PKFVQFGSQLIRPQSMKKSWDSLNLVLVLFAIVCGFLSRNAGDDSRDSFEDRSVSSRRTI 120
Query: 121 KSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNH 180
K+NP PR+WDGY+DHRP H+TVNRMRSSSSYPDLRLQESSL AGD+R R YDDTHV N+
Sbjct: 121 KTNPRNPRQWDGYADHRPIHYTVNRMRSSSSYPDLRLQESSLVAGDQRRRSYDDTHVPNN 180
Query: 181 RFASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPR 240
RF SDQL+ RREARPELEREDS KSIGFDRSEIREDVYSQ IPSPPRSPPP+VSPPR
Sbjct: 181 RFPYSDQLYRRREARPELEREDSDVKSIGFDRSEIREDVYSQLPIPSPPRSPPPQVSPPR 240
Query: 241 SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL 300
SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTP G IDQ KN DSDVA+F+RI LPPL
Sbjct: 241 SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPAGEIDQHNKNGDSDVAEFQRIPLPPL 300
Query: 301 SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSL 360
SPP FYRESEQKS KNEKKRGGAPKEIWSALRRR+KKQRQKS+ESFEAI+ASQ STSSL
Sbjct: 301 SPPLFYRESEQKSVKNEKKRGGAPKEIWSALRRRRKKQRQKSIESFEAIVASQRPSTSSL 360
Query: 361 PPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----PPSIVSSEPKPEIEDQNHLLK 420
PPPSPPPPPPLP PSVLQ LF+SKKG+ KKVQSTP PPSI SSEPKP IEDQNHLLK
Sbjct: 361 PPPSPPPPPPLPSPSVLQVLFTSKKGRGKKVQSTPSPESPPSIASSEPKPIIEDQNHLLK 420
Query: 421 PHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
PH+PP+EL RL+SLNDEEY+TRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA
Sbjct: 421 PHEPPVELARLNSLNDEEYSTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
Query: 481 ISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIK 540
+SPD+DESEADG PAAGE KL+KDSTIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIK
Sbjct: 481 VSPDMDESEADGKPAAGERKLVKDSTIPMFCSSPDVNSKADKFIARFRADLKLQKMNSIK 540
Query: 541 EKTARKRSNLGRTPGPGPK 555
EKTARKRSNLGRTPGPGP+
Sbjct: 541 EKTARKRSNLGRTPGPGPR 559
BLAST of Tan0008072 vs. NCBI nr
Match:
KAG6593534.1 (hypothetical protein SDJN03_13010, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 904.0 bits (2335), Expect = 6.2e-259
Identity = 483/556 (86.87%), Postives = 509/556 (91.55%), Query Frame = 0
Query: 1 MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIP 60
ME DGNA PPFWLQSS+S QV YNRRRRLSRASSFLLNSSAFLIVLLVIVLCF+LIVIP
Sbjct: 1 MEGDGNASPPFWLQSSSSFQQVHYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFVLIVIP 60
Query: 61 KFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFEDRSVSSRRRMK 120
K VQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRG FEDRSVSSRRR+K
Sbjct: 61 KCVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRG-FEDRSVSSRRRLK 120
Query: 121 SNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHR 180
SNPTTPR+WDGYSDHRPN +TVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHV NHR
Sbjct: 121 SNPTTPRQWDGYSDHRPNQYTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVRNHR 180
Query: 181 FASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRS 240
FASSDQLH R +ARPELEREDS AKS GFDRSE+REDVYSQPAIPSPPR PPPR
Sbjct: 181 FASSDQLHRRHQARPELEREDSSAKSTGFDRSEVREDVYSQPAIPSPPRVPPPRF----- 240
Query: 241 PSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLS 300
PSPPPT PA+TTPKV KRRPKRTH VHSHTPDGAIDQQQKNDDSDVADF+RI LPPLS
Sbjct: 241 PSPPPTLQTPASTTPKVAKRRPKRTHNVHSHTPDGAIDQQQKNDDSDVADFQRIHLPPLS 300
Query: 301 PPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLP 360
PPSFY+ESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAI + +++SSLP
Sbjct: 301 PPSFYQESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIATLRASTSSSLP 360
Query: 361 PPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPP 420
PSPPPPPPLPPP VLQNLF SKKGKAKKVQS PPP+IV+SEPKPEIE QNHLLKP+DPP
Sbjct: 361 LPSPPPPPPLPPP-VLQNLFPSKKGKAKKVQSEPPPTIVTSEPKPEIEHQNHLLKPNDPP 420
Query: 421 MELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPP--FRMHGDFDSVGSNSSTPRAISP 480
MELERLSSLNDEEYNTRIG +SPFH IPPPPPPPPP FRMHGDFDS GSNSSTPRAISP
Sbjct: 421 MELERLSSLNDEEYNTRIGCDSPFHLIPPPPPPPPPPLFRMHGDFDSAGSNSSTPRAISP 480
Query: 481 DIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKT 540
+I ESE DGPPAAG+MK+ + ST P+FCSSPDVNSKADNFIARF+ADLKLQKMNSIKE++
Sbjct: 481 EIYESEGDGPPAAGKMKVKQVSTTPIFCSSPDVNSKADNFIARFKADLKLQKMNSIKERS 540
Query: 541 ARKRSNLGRTPGPGPK 555
ARKRSNLGR GPGPK
Sbjct: 541 ARKRSNLGRAAGPGPK 549
BLAST of Tan0008072 vs. ExPASy TrEMBL
Match:
A0A6J1GQY1 (protein enabled homolog OS=Cucurbita moschata OX=3662 GN=LOC111456249 PE=4 SV=1)
HSP 1 Score: 907.5 bits (2344), Expect = 2.7e-260
Identity = 481/559 (86.05%), Postives = 512/559 (91.59%), Query Frame = 0
Query: 1 MEEDGNAPPPFWLQSSTSMDQVDYNRRR-RLSRASSFLLNSSAFLIVLLVIVLCFILIVI 60
MEEDGNAPPPFWLQ S S+ ++D +RRR RLSRASSFLLNSSAFL+VLLVIVLCFI IVI
Sbjct: 1 MEEDGNAPPPFWLQPSNSLHELDNHRRRHRLSRASSFLLNSSAFLVVLLVIVLCFIWIVI 60
Query: 61 PKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFEDRSVSSRRRM 120
PKFVQF SQLIRPQS+KKSWDSLNLVLVLFAIVCGFLSRN GDD+R SFEDRSVSSRR +
Sbjct: 61 PKFVQFGSQLIRPQSMKKSWDSLNLVLVLFAIVCGFLSRNAGDDSRDSFEDRSVSSRRTI 120
Query: 121 KSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNH 180
K+NP PR+WDGY+DHRP H+TVNRMRSSSSYPDLRLQESSL AGD+R R YDDTHV N+
Sbjct: 121 KTNPRNPRQWDGYADHRPIHYTVNRMRSSSSYPDLRLQESSLVAGDQRRRSYDDTHVPNN 180
Query: 181 RFASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPR 240
RF SDQL+ RREARPELEREDS KSIGFDRSEIREDVYSQ IPSPPRSPPP+VSPPR
Sbjct: 181 RFPYSDQLYRRREARPELEREDSDVKSIGFDRSEIREDVYSQLPIPSPPRSPPPQVSPPR 240
Query: 241 SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPL 300
SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTP G IDQ KN DSDVA+F+RI LPPL
Sbjct: 241 SPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPAGEIDQHNKNGDSDVAEFQRIPLPPL 300
Query: 301 SPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSL 360
SPP FYRESEQKS KNEKKRGGAPKEIWSALRRR+KKQRQKS+ESFEAI+ASQ STSSL
Sbjct: 301 SPPLFYRESEQKSVKNEKKRGGAPKEIWSALRRRRKKQRQKSIESFEAIVASQRPSTSSL 360
Query: 361 PPPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTP----PPSIVSSEPKPEIEDQNHLLK 420
PPPSPPPPPPLP PSVLQ LF+SKKG+ KKVQSTP PPSI SSEPKP IEDQNHLLK
Sbjct: 361 PPPSPPPPPPLPSPSVLQVLFTSKKGRGKKVQSTPSPESPPSIASSEPKPIIEDQNHLLK 420
Query: 421 PHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
PH+PP+EL RL+SLNDEEY+TRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA
Sbjct: 421 PHEPPVELARLNSLNDEEYSTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNSSTPRA 480
Query: 481 ISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIK 540
+SPD+DESEADG PAAGE KL+KDSTIPMFCSSPDVNSKAD FIARFRADLKLQKMNSIK
Sbjct: 481 VSPDMDESEADGKPAAGERKLVKDSTIPMFCSSPDVNSKADKFIARFRADLKLQKMNSIK 540
Query: 541 EKTARKRSNLGRTPGPGPK 555
EKTARKRSNLGRTPGPGP+
Sbjct: 541 EKTARKRSNLGRTPGPGPR 559
BLAST of Tan0008072 vs. ExPASy TrEMBL
Match:
A0A6J1HGU6 (serine/arginine repetitive matrix protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111464215 PE=4 SV=1)
HSP 1 Score: 903.7 bits (2334), Expect = 3.9e-259
Identity = 483/556 (86.87%), Postives = 509/556 (91.55%), Query Frame = 0
Query: 1 MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIP 60
ME DGNA PPFWLQSS+S QV YNRRRRLSRASSFLLNSSAFLIVLLVIVLCF+LIVIP
Sbjct: 1 MEGDGNASPPFWLQSSSSFQQVHYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFVLIVIP 60
Query: 61 KFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFEDRSVSSRRRMK 120
K VQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRG FEDRSVSSRRR+K
Sbjct: 61 KCVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRG-FEDRSVSSRRRLK 120
Query: 121 SNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHR 180
SNPTTPR+WDGYSDHRPN +TVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHV NHR
Sbjct: 121 SNPTTPRQWDGYSDHRPNQYTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVRNHR 180
Query: 181 FASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRS 240
FASSDQLH R +ARPELEREDS AKS GFDRSE+REDVYSQPAIPSPPR P PPRS
Sbjct: 181 FASSDQLHRRHQARPELEREDSSAKSTGFDRSEVREDVYSQPAIPSPPRVP-----PPRS 240
Query: 241 PSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLS 300
PSPPPT PA+TTPKVVKRRPKRTH VHSHTPDGAIDQQQKNDDSDVADF+RI LPPLS
Sbjct: 241 PSPPPTLQTPASTTPKVVKRRPKRTHNVHSHTPDGAIDQQQKNDDSDVADFQRIHLPPLS 300
Query: 301 PPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLP 360
PPSFY+ESEQKSGKNEKKRGGAPKEIWS LRRRKKKQRQKSVESFEAI + +++SSLP
Sbjct: 301 PPSFYQESEQKSGKNEKKRGGAPKEIWSTLRRRKKKQRQKSVESFEAIATLRASTSSSLP 360
Query: 361 PPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPP 420
PSPPPPPPLPPP VLQNLF SKKGKAKKVQS PPP+IV+SEPKPEIE QNHLLKP+DPP
Sbjct: 361 LPSPPPPPPLPPP-VLQNLFPSKKGKAKKVQSEPPPTIVTSEPKPEIEHQNHLLKPNDPP 420
Query: 421 MELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPP--FRMHGDFDSVGSNSSTPRAISP 480
MELERLSSLNDEEYNTRIG +SPFH IPPPPPPPPP FRMHGDFDS GSNS TPRAISP
Sbjct: 421 MELERLSSLNDEEYNTRIGCDSPFHLIPPPPPPPPPPLFRMHGDFDSAGSNSCTPRAISP 480
Query: 481 DIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKT 540
+I ESE DGPPAAG+MK+ + ST P+FCSSPDVNSKADNFIARF+ADLKLQKMNSIKE++
Sbjct: 481 EIYESEGDGPPAAGKMKVKQVSTTPIFCSSPDVNSKADNFIARFKADLKLQKMNSIKERS 540
Query: 541 ARKRSNLGRTPGPGPK 555
ARKRSNLGR GPGPK
Sbjct: 541 ARKRSNLGRAAGPGPK 549
BLAST of Tan0008072 vs. ExPASy TrEMBL
Match:
A0A6J1KKJ2 (serine/arginine repetitive matrix protein 1-like OS=Cucurbita maxima OX=3661 GN=LOC111494880 PE=4 SV=1)
HSP 1 Score: 899.0 bits (2322), Expect = 9.6e-258
Identity = 487/561 (86.81%), Postives = 510/561 (90.91%), Query Frame = 0
Query: 1 MEEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIP 60
ME DGNA PPFWLQSS S QV YNRRRRLSRASSFLLNSSAFL VLLVIVLCF+LIVIP
Sbjct: 1 MEGDGNASPPFWLQSSNSFQQVHYNRRRRLSRASSFLLNSSAFLFVLLVIVLCFVLIVIP 60
Query: 61 KFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRGSFEDRSVSSRRRMK 120
K VQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRG FEDRSVSSRRR+K
Sbjct: 61 KCVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDDNRG-FEDRSVSSRRRLK 120
Query: 121 SNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVHNHR 180
SNPTTPR+WDGY DHRPNH+TVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHV NHR
Sbjct: 121 SNPTTPRQWDGYPDHRPNHYTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVRNHR 180
Query: 181 FASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSPPRS 240
FASSDQLH R +ARPELEREDSGAKS GFDRSE+ EDVYSQPAIPSPPR P PPRS
Sbjct: 181 FASSDQLHRRHQARPELEREDSGAKSTGFDRSEVHEDVYSQPAIPSPPRVP-----PPRS 240
Query: 241 PSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLS 300
PSPPPT PA+TTPKVVKRRPKRTH VHSHTPDGAIDQQQKNDDSDVADF+RI LPPLS
Sbjct: 241 PSPPPTLQTPASTTPKVVKRRPKRTHNVHSHTPDGAIDQQQKNDDSDVADFQRIHLPPLS 300
Query: 301 PPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLP 360
PPSFY+ESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEA IA+ ASTSSLP
Sbjct: 301 PPSFYQESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEA-IATLRASTSSLP 360
Query: 361 PPSPPPPPPLPPPSVLQNLFSSKKGKAKKVQS-----TPPPSIVSSEPKPEIEDQNHLLK 420
SPPPPPPLPPP VLQNLF SKKGKAKKVQS +PPP+IV+SEPKPEIE QNH LK
Sbjct: 361 LASPPPPPPLPPP-VLQNLFPSKKGKAKKVQSEPPPESPPPTIVTSEPKPEIELQNHHLK 420
Query: 421 PHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPP--FRMHGDFDSVGSNSSTP 480
P+DPPMELERLSSLNDEEYNTRIG +SPFH IPPPPPPPPP FRMHGDFDS GSNSSTP
Sbjct: 421 PNDPPMELERLSSLNDEEYNTRIGCDSPFHLIPPPPPPPPPPLFRMHGDFDSAGSNSSTP 480
Query: 481 RAISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNS 540
RAISP+IDESE DGPPAAG+MK+ + ST P+FCSSPDVNSKAD FIARF+ADLKLQKMNS
Sbjct: 481 RAISPEIDESEGDGPPAAGKMKVKQVSTTPIFCSSPDVNSKADKFIARFKADLKLQKMNS 540
Query: 541 IKEKTARKRSNLGRTPGPGPK 555
IKE++ARKRSNLGRT GPGPK
Sbjct: 541 IKERSARKRSNLGRTAGPGPK 553
BLAST of Tan0008072 vs. ExPASy TrEMBL
Match:
A0A1S3CII2 (LOW QUALITY PROTEIN: serine/arginine repetitive matrix protein 1-like OS=Cucumis melo OX=3656 GN=LOC103500804 PE=4 SV=1)
HSP 1 Score: 897.5 bits (2318), Expect = 2.8e-257
Identity = 482/563 (85.61%), Postives = 512/563 (90.94%), Query Frame = 0
Query: 1 MEEDGNA-PPPFWLQSS-TSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60
MEEDGNA PPFWLQSS +S+ ++ Y+RRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV
Sbjct: 1 MEEDGNAHSPPFWLQSSNSSLHELHYSRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60
Query: 61 IPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT-GDDNRGSFEDRSVSSRR 120
IPKFVQF SQLIRPQSVKKSWDSLNL+LVLFAIVCGFL RN GDD+RGSFEDRSVSSRR
Sbjct: 61 IPKFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLGRNAGGDDSRGSFEDRSVSSRR 120
Query: 121 RMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVH 180
MKSNPTTPRRWDGY+DHRPNHFT+NRMRSSSSYPDLRLQESS DAGD RWRFYDDTHV
Sbjct: 121 SMKSNPTTPRRWDGYTDHRPNHFTLNRMRSSSSYPDLRLQESSFDAGDHRWRFYDDTHVT 180
Query: 181 NHRFASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSP 240
NHR++SSDQLH RRE +PELER+DS AKSI FDRSEIR DVYS+P IPSPPRSPPP+VSP
Sbjct: 181 NHRYSSSDQLHRRRETQPELERQDSEAKSIVFDRSEIR-DVYSEPVIPSPPRSPPPQVSP 240
Query: 241 PRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLP 300
PR PSPPPTPPPPANT PK+VKRRPKRTHKVHSHTP+ I+QQ +N DSDVA+F+RIQLP
Sbjct: 241 PRPPSPPPTPPPPANTIPKMVKRRPKRTHKVHSHTPEEEINQQHENGDSDVANFQRIQLP 300
Query: 301 PLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTS 360
PLSPP FYRESEQKS KNEKKR GA KEIWSALRRRKKKQRQKSVESFEAIIASQ ASTS
Sbjct: 301 PLSPPLFYRESEQKSSKNEKKRTGASKEIWSALRRRKKKQRQKSVESFEAIIASQRASTS 360
Query: 361 SLPPPS--PPPPPPLPPPSVLQNLFSSKKGKAKKVQST-----PPPSIVSSEPKPEIEDQ 420
SLPPPS PPPPPPLP PSVLQNLFSS+KGK KKVQST PPPSI SSEPKP+ EDQ
Sbjct: 361 SLPPPSPPPPPPPPLPSPSVLQNLFSSRKGKHKKVQSTSLPDPPPPSIASSEPKPKTEDQ 420
Query: 421 NHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNS 480
N +LKP DPPMEL+RLSSLNDEEY+TRIGGESP+HPIPPPPPPPPPFRMHGDFDSVGSNS
Sbjct: 421 NQILKPQDPPMELDRLSSLNDEEYHTRIGGESPYHPIPPPPPPPPPFRMHGDFDSVGSNS 480
Query: 481 STPRAISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQK 540
STPRAISP++DESEAD PPA E KL+KD TIPMFCSSPDVNSKAD FIARFRADLKLQK
Sbjct: 481 STPRAISPEMDESEADAPPATSERKLVKDPTIPMFCSSPDVNSKADKFIARFRADLKLQK 540
Query: 541 MNSIKEKTARKRSNLGRTPGPGP 554
MNSIKEKT RKRSNLGRT GPGP
Sbjct: 541 MNSIKEKTTRKRSNLGRTSGPGP 562
BLAST of Tan0008072 vs. ExPASy TrEMBL
Match:
A0A5A7V0Q3 (Serine/arginine repetitive matrix protein 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G001070 PE=4 SV=1)
HSP 1 Score: 895.6 bits (2313), Expect = 1.1e-256
Identity = 481/563 (85.44%), Postives = 512/563 (90.94%), Query Frame = 0
Query: 1 MEEDGNA-PPPFWLQSS-TSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60
MEEDGNA PPFWLQSS +S+ ++ Y+RRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV
Sbjct: 1 MEEDGNAHSPPFWLQSSNSSLHELRYSRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIV 60
Query: 61 IPKFVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNT-GDDNRGSFEDRSVSSRR 120
IPKFVQF SQLIRPQSVKKSWDSLNL+LVLFAIVCGFL RN GDD+RGSFEDRSVSSRR
Sbjct: 61 IPKFVQFTSQLIRPQSVKKSWDSLNLLLVLFAIVCGFLGRNAGGDDSRGSFEDRSVSSRR 120
Query: 121 RMKSNPTTPRRWDGYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDERWRFYDDTHVH 180
MKSNPTTPRRWDGY+DHRPNHFT+NRMRSSSSYPDLRLQESS DAGD +WRFYDDTHV
Sbjct: 121 SMKSNPTTPRRWDGYTDHRPNHFTLNRMRSSSSYPDLRLQESSFDAGDHQWRFYDDTHVT 180
Query: 181 NHRFASSDQLHHRREARPELEREDSGAKSIGFDRSEIREDVYSQPAIPSPPRSPPPRVSP 240
NHR++SSDQLH RRE +PELER+DS AKSI FDRSEIR DVYS+P IPSPPRSPPP+VSP
Sbjct: 181 NHRYSSSDQLHRRRETQPELERQDSEAKSIVFDRSEIR-DVYSEPVIPSPPRSPPPQVSP 240
Query: 241 PRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLP 300
PR PSPPPTPPPPANT PK+VKRRPKRTHKVHSHTP+ I+QQ +N DSDVA+F+RIQLP
Sbjct: 241 PRPPSPPPTPPPPANTIPKMVKRRPKRTHKVHSHTPEEEINQQHENGDSDVANFQRIQLP 300
Query: 301 PLSPPSFYRESEQKSGKNEKKRGGAPKEIWSALRRRKKKQRQKSVESFEAIIASQNASTS 360
PLSPP FYRESEQKS KNEKKR GA KEIWSALRRRKKKQRQKSVESFEAIIASQ ASTS
Sbjct: 301 PLSPPLFYRESEQKSSKNEKKRTGASKEIWSALRRRKKKQRQKSVESFEAIIASQRASTS 360
Query: 361 SLPPPS--PPPPPPLPPPSVLQNLFSSKKGKAKKVQST-----PPPSIVSSEPKPEIEDQ 420
SLPPPS PPPPPPLP PSVLQNLFSS+KGK KKVQST PPPSI SSEPKP+ EDQ
Sbjct: 361 SLPPPSPPPPPPPPLPSPSVLQNLFSSRKGKHKKVQSTSLPDPPPPSIASSEPKPKAEDQ 420
Query: 421 NHLLKPHDPPMELERLSSLNDEEYNTRIGGESPFHPIPPPPPPPPPFRMHGDFDSVGSNS 480
N +LKP DPPMEL+RLSSLNDEEY+TRIGGESP+HPIPPPPPPPPPFRMHGDFDSVGSNS
Sbjct: 421 NQILKPQDPPMELDRLSSLNDEEYHTRIGGESPYHPIPPPPPPPPPFRMHGDFDSVGSNS 480
Query: 481 STPRAISPDIDESEADGPPAAGEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQK 540
STPRAISP++DESEAD PPA E KL+KD TIPMFCSSPDVNSKAD FIARFRADLKLQK
Sbjct: 481 STPRAISPEMDESEADAPPATSERKLVKDPTIPMFCSSPDVNSKADKFIARFRADLKLQK 540
Query: 541 MNSIKEKTARKRSNLGRTPGPGP 554
MNSIKEKT RKRSNLGRT GPGP
Sbjct: 541 MNSIKEKTTRKRSNLGRTSGPGP 562
BLAST of Tan0008072 vs. TAIR 10
Match:
AT1G72790.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 262.3 bits (669), Expect = 8.8e-70
Identity = 226/599 (37.73%), Postives = 302/599 (50.42%), Query Frame = 0
Query: 2 EEDGNAPPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPK 61
E+DG+A PFWLQS + + R L ++ + F ++++ FI IP
Sbjct: 3 EDDGDASTPFWLQS--RRNNTYFRRTASLGGRTTTIATQIFFAGTAAILIVVFI---IPP 62
Query: 62 FVQFASQLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGDD-----------NRGSFED 121
F SQ+ RP V+KSWD LN VLVLFA++CGFLSRNT +D N+ S
Sbjct: 63 FFSSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTSP 122
Query: 122 RSVSSRRRMKSNPTTPRRWD----GYSDHRPNHFTVNRMRSSSSYPDLRLQESSLDAGDE 181
+ R R+ ++ TTPR W+ G + + +R+RS SSYPDLRL+E DE
Sbjct: 123 SIIDRRSRVSNSGTTPRYWNDDRGGGGGDQTVYKRFSRLRSVSSYPDLRLREYE---ADE 182
Query: 182 RWRFYDDTHVHNHRFASSDQLHHRR-------EARPELEREDSGAKSIGFDRSEIRE--- 241
RWRFYDDT V R+ D ++ + E +P E D + S++R
Sbjct: 183 RWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHEEGKPPPEDVDQTEDGDNGEGSKVRNGGS 242
Query: 242 -------------DVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPK 301
+V + +PS +PP PSPPP+PP P K+ +
Sbjct: 243 ETEKVEVVATAEAEVVEELKVPS---------APPYIPSPPPSPP-----RPPPAKQAKR 302
Query: 302 RTHKVHSHTPDGAIDQQQKNDDSDVADFRRIQLPPLSPPSFYRESEQKSGKNEKKRGGAP 361
+T++V+ D + +++K D VA P+ PP+ QKS K EKK+GGA
Sbjct: 303 KTNRVYQ---DVSPQEEKKERDDFVA-----TTTPIPPPA---TVYQKSNKQEKKKGGAT 362
Query: 362 KEIWSALRRRKKKQRQKSVESFEAIIASQNASTSSLPPP---SPPPPPPLPPPSVLQNLF 421
K+ ALRR+KKKQRQ+S++ + + S PP SPPPPPP PPP Q LF
Sbjct: 363 KDFLIALRRKKKKQRQQSIDGLDLLFGSD--------PPLVYSPPPPPP-PPPPFFQGLF 422
Query: 422 SSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPPMELERLSSLNDEEYNTR--- 481
SSKKGK+KK S PPP P+ E + K P+E R S N T+
Sbjct: 423 SSKKGKSKKNNSNPPPPPPPPPPERRYESRASTSKLRKAPVE-SRTSKPNPPAKVTQYVG 482
Query: 482 IGGESPFHPIPPPPPPPP------PFRMHGDFDSVGSNSSTPRAISPDIDESEADGPPAA 541
G ESP PIPPPPPPPP F GD+ + S+ S I E D P A
Sbjct: 483 TGSESPLMPIPPPPPPPPFKMPAWKFVKRGDYVRMASDIS--------ISSDEPDDPDVA 542
Query: 542 GEMKLMKDSTIPMFCSSPDVNSKADNFIARFRADLKLQKMNSIKEKTARKRSNLGRTPG 551
+ K++ MFC SPDV++KAD+FIARFRA LKL+KMNS+K R RSNLG PG
Sbjct: 543 -QSAGSKEAAGSMFCPSPDVDTKADDFIARFRAGLKLEKMNSVK----RGRSNLGPEPG 545
BLAST of Tan0008072 vs. TAIR 10
Match:
AT5G57070.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 173.3 bits (438), Expect = 5.3e-43
Identity = 192/614 (31.27%), Postives = 272/614 (44.30%), Query Frame = 0
Query: 8 PPPFWLQSSTSMDQVDYNRRRRLSRASSFLLNSSAFLIVLLVIVLCFILIVIPKFVQFAS 67
PP W Q D Y RRR A +L + + I L F+ V+P F+ S
Sbjct: 5 PPLIWPQ----FDSTGYARRRSSIPA---ILVPAMIGVTSAAIFLVFVTFVVPTFLSVTS 64
Query: 68 QLIRPQSVKKSWDSLNLVLVLFAIVCGFLSRNTGD----DNRGSFEDRSVSS-------- 127
Q+++P SVK+ WDS+N+VLV+FAI+CG L+R D ++ E+ V
Sbjct: 65 QILQPASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGEEEEVGGGAVTNGEM 124
Query: 128 -----RRRMKSNPTTPRRW--DGYSDHR----------------PNHFTVNRMRSSSSYP 187
+ S+ T +W D Y R P V RSSSSYP
Sbjct: 125 TVGEISKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPVTGNVPLRRSSSSYP 184
Query: 188 DLRLQESSLDAGDERWRFYDDTHVHNHRFA-SSDQLHHRREARPELEREDSGAKSIGFDR 247
DLR Q + GD R+RFYDD + +R SS + ++ E+E E+S K I D
Sbjct: 185 DLR-QGVFRETGDRRFRFYDDFEIDKYRSQDSSSYQQFQNLSKTEIEEEESEPKEIQIDT 244
Query: 248 SEIREDVYSQPAIPSPPRSPPPRVSPPRSPSPPPTPPPPANTTPKVVKRRPKRTHKVHSH 307
++ SPP+ PP +PPP PPPP P V ++P+RTH+ +
Sbjct: 245 FVVKPS--------SPPQQPP--------ATPPPPPPPP----PVEVPQKPRRTHRSVRN 304
Query: 308 TPDGAIDQQQKNDDSDVADFRRIQLPPLSPPS----------FYRESEQKSGKNEKKRGG 367
Q+N F+R PP SPP +K G ++++
Sbjct: 305 R------DLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQRRKSN 364
Query: 368 APKEI-------WSALRRRKKKQRQKSVESFEAIIASQNAS-----TSSLPPPSPPPPPP 427
A KEI ++ +++KK Q+ K E E+ ++ + S +PPPSPPPPPP
Sbjct: 365 AAKEIKMVFASLYNQGKKKKKLQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPPPPPP 424
Query: 428 LPPP------SVLQNLFSSKKGKAKKVQSTPPPSIVSSEPKPEIEDQNHLLKPHDPPMEL 487
PPP SV LF KK+ S P P P P Q P PP +
Sbjct: 425 PPPPPLRSSQSVFYGLFKKGVKSNKKIHSVPAP----PPPPPPRYTQ---FDPQTPPRRV 484
Query: 488 ER---LSSLNDEEYNTRIGGE-SPFHPIPPPPPPPPPFR-------MHGDFDSVGSNSST 538
+ + +N G+ SP I PPPPPPPPFR + GDF + SN S+
Sbjct: 485 KSGRPPRPTKPKNFNEENNGQGSPLIQITPPPPPPPPFRVPPLKYVVSGDFAKIRSNQSS 544
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_038896222.1 | 4.0e-266 | 87.84 | serine/arginine repetitive matrix protein 1-like [Benincasa hispida] | [more] |
XP_023548433.1 | 7.8e-262 | 86.23 | protein enabled homolog [Cucurbita pepo subsp. pepo] | [more] |
KAG6575459.1 | 8.6e-261 | 86.23 | hypothetical protein SDJN03_26098, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022953834.1 | 5.6e-260 | 86.05 | protein enabled homolog [Cucurbita moschata] | [more] |
KAG6593534.1 | 6.2e-259 | 86.87 | hypothetical protein SDJN03_13010, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1GQY1 | 2.7e-260 | 86.05 | protein enabled homolog OS=Cucurbita moschata OX=3662 GN=LOC111456249 PE=4 SV=1 | [more] |
A0A6J1HGU6 | 3.9e-259 | 86.87 | serine/arginine repetitive matrix protein 1-like OS=Cucurbita moschata OX=3662 G... | [more] |
A0A6J1KKJ2 | 9.6e-258 | 86.81 | serine/arginine repetitive matrix protein 1-like OS=Cucurbita maxima OX=3661 GN=... | [more] |
A0A1S3CII2 | 2.8e-257 | 85.61 | LOW QUALITY PROTEIN: serine/arginine repetitive matrix protein 1-like OS=Cucumis... | [more] |
A0A5A7V0Q3 | 1.1e-256 | 85.44 | Serine/arginine repetitive matrix protein 1-like OS=Cucumis melo var. makuwa OX=... | [more] |
Match Name | E-value | Identity | Description | |
AT1G72790.1 | 8.8e-70 | 37.73 | hydroxyproline-rich glycoprotein family protein | [more] |
AT5G57070.1 | 5.3e-43 | 31.27 | hydroxyproline-rich glycoprotein family protein | [more] |