Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, CDS, polypeptide, three_prime_UTR
GCGCCAAAAGATAAAATCTGATTTATTTTTCTCGCCGTTCTGGCGAAGCGGAGTCGACGGCGAGACGAAGAAGAACCAAGACTAAAAGTGGTCGTTCATCATCTTCATGAACGTTTCCTCCAAGCTTTCCTTCTGCAGGAACGTGGTTGGGATTGTGGAGAACAGAAATGCTTTGTGTTTTGAGCCTTTTGGCAATCGTCGCGCCTGAAGGAGATGATCAACGCCGTCGCCGGTTCTCTTTTTCTTAAGTTTCCATCGCCGTCGTGGCCAAGGTTGCCGTTGCTGTGCACTGTGCAGTGAGAGGGAGGAGAAAGAAGTGGAAATTAGGGGTTTCCTCACGAAGGGCAAAATAGTAATTTTAGGAGCTTCAGAATTCCTATTACTCAACAAAACGTAATTTCACCATTTTCCTCTCTTTCTCGACTTTGAACTTTCAGTTTCGTTGATCTTCAATCTGCTCGTTTTCCGGATCCTTGCTTCATTCTCATTCTCTTCCTGTTCGACACTTCACCATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATTTCATCAGAATCAGATCAAATTACTGAACCCAGATCAGGGCCAGAAGAACCTACAGTAGATTCAATTCCCAGTTCTGAATTACAACGAGAACGTGAATCGGAATCAGTTAGTAATGGAGTACCAGATTCGGAGCCGGAGTCTCCAAGGAAACAGTTATCGGAGTCAATTCATTTACATGTAGTGACGGGTGTTACAGATCCGAGTGTTGAAGAGCATAAAGAAACTTCCACCCCATCCAACGGCAACACGGAAAACTTGCAACCTGCGTTGCGTAAAGACGAAGGAAGCCGAACGTTTACAATGAGAGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAGCGACAGCCTTAATGAATCTGAAGGCGAGAGGCCCGAGGGGAACTCCGGTTACAGGTTTGGAATCTTGCTTGGCATTGGAGATTTCTTCGTTCTCTTTTGTTTAGTTGATTAAACTGAATATGGTTGATTGGTAAAAATGAGTTCTTTCTAATTATGCAGTATAATCTCATTATTGTTACTATTATTTTTTTTTAAAAAAAGTGAATGCTGAATGATCAACCTGTTTTTTTTTTTTTATCCTGCGATCACTTCCGTTAGAGGGAGAGAGATTTTGCGACTGTTCTGTCAAACTTGAGGGTTGAATACCTATTTTTTTATTAAAAAAGTGTTATGGGTTAGTTTCTTTCGAGGAAAACTATTTAGGCATAAAATTAAATGACTCCCATTTGAACATGTCAATTTCCGAAAGTCAAAGTTGATCAGAAGTTTTCCAACGAGCTTGCTTGTTGGTGCCTGCAAGGTTTCTTCTCTCAAACTGTCTAGGTCATTTGTCTTCCTCAAAGGGGTCTGTCTTTCTTTTTCCTCCTAGAAAGGTAAGATAAACTGATTGCTTTCAACGATCGAATGATATCCTCAGACTCAGGCTTTAGTTGTTATGGTCAGACATTTTTTGTGTACGTTCAGAAAGTAAATATAATGGCTTCTCTATATGGCCTTTTGTTTTGTGTGTCAGCAAACCTGACGGACCCTATCCAATTCTATTAGACTGAGCTTGTGTGTATGTGGCTGTTCACATTCACCATTCAGTTTCATCGAATTATTTGTTAGATCTGGAAATTGCAGCAAATTGACCCAATCCAACCCATGATATGCTTAATTCTTTCCTTTATTAAATATTACCTTATCCCAAACCACGATATACTCCTATAGAGCTTTTCTGTAGTTTGGATATTTAAATATTATATATATGTATATATACTTCTAGTGGTTAGGTAGACTTAGACAGGATCCTTTGGGTTGTTGAAATGAAGTTTTACCATGGGAATTTTGGTTGGAAACAAGTTTTGTTGTTGTTGTTTTTGCTTCTGTTTTCCAATATCTCTCTCTCTCTCTCTCATATGTTGCTTAATTTCTTGCATCTGCC
TTTGCCTTTGATTTATTTCTAATCAACTTCTGTTGCTATGGAATTGGATTTTGTGGCTTCAAATTTTGGTGATTTCATTTCACCAAAATGTGCCTTTATTTCATTTTTGGATAAGACCAAATTGGCTTTTATAAGGTGGGTAGGAACTAGGAAAGTTTAGTTTGCTTTCAGTTTTGATGATTTTTGGTCGGGTAGGATTGAGTTGCCAATTCCAATTGCACTTTTCTATTAACCACATAGGTATGAACTTTTAAAAAAATCTCTGTTATCTGTTTGGCCTGCTTAATTTTCATACTGAACGTGGTTGGTTTGATTAACTTTAGGATATAAATATAACTTGCAGCTGGGTTTTGGTTTGGGTTCTAAATTAACCCAACGTGAACCAACAACACCCCTACTTTATAGTCAACATTATTTGTTTATTCAGATCTCTTACTAATGCAGTTTTATGGATTGCAATTCGGTAGAACCCACATAGAACTAAATTGATTGAAATATAAAAAGATAACAGCTACAGGATAACCCAAGTACTTGAGGTACTTGGCTACTTGGATGGAATCTACCCCCTCAGTTTATTTAGAGCCAAATACTATACCAACCTCCCTAACTTATTACCGATGCACCCTTCAATTCATGTCACGACCCTTGATTTCTTAACCCTATTATTTTCTCACAATTATAACGCATCTTTTACATTTTCTTTTTGTTCCATCCTCTACTTGTAGCTTCACTTCTAGGGGCTCTATGACTCATTTTGGAATTAATCAATTCATTTATGGGTACTGCTGTCCATTTGTTTTTGAATACTAATCAGTTAGTTTTCTTTTTCTTTCTTTCCATTTGACTCTGGTTTGAATTAAGAAAGTGCTGTTTTCAGTGGAATCTTTAGTTGCTATTCACAAGAGTAGAATTTTGATGTTACCACCTTTTGTCTTTATAGCTATAATCTTTCTTTTCTTCTTTTATTTACTGCATGAATTTGCACAGCCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGTAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGTGTCGATGAAGAGGGTCGTTCTCGCCAAAGGATTCTCACATTTGCTGCTAGGAGGTATCATTATAATACTTTTTGTACCTTGTAGTGGTCTGTCAACTACGTCTATATTTCTCTCGATAGTTCTGGTATCTTGTAATTTGAATTCCTTTGAAGTAAGGGCCGAGCTGTATTTCCCCTTTTCTTGCAATAGGAGCTTGTGCTTTAGCTCTACTTATGTCATTGTGATGCACTGATAGAATGGAAATGGGATTTTCACTGCAACCGAGAAAAAAAGAAGGAAACTCATGCATGCATAGATTATTTTTATTTTATCAATTTTGAAAACATATTTATATATCGTGTCTCTTTGCATGAGAATTATTATTATTATTTTATCAATTTTGATAAAAAAAATTAATTTTGAGGGACTAAATTTGAGATTCTTTGAAGATGGACTAAAATTGGACAAGTTTCAAAGTATAAGGATCAAAATGGTGCTTTAGACTTATTATTATTTTTTTTTTCGAAAATGTAGGCAGGGCTAACTGTAAACCTATTGGCAAAATATGTTACAATTTGCCTTTGGTCCCAATTATTGATAAAACATGTTATATAGTTATTTCATATCAGTAACAGAAAGTCCATTTCTTTCTTAAAATTTTCTTTTCGGCACTCCTACAGAGGCCCTTTGTTTCTAAACATCTATTTTCATTCAAATAATGCAGGTATGCTAGTGCAATTGAGAGAAATGGTCAAGACTATGATGCTTTGTACAATTGGGCTTTGGTCCTCCAGGTCTGAGACTTTATATTATTCATTTTGGAAACTATCTTTTTTTCTTCTGTTCCATTTCTTGATCAACAACTTTGCAAGGCAGGGAGCTCTTATTGATGAATCATTAAAAGAATTCTAGCTGAATTGAATTATGTGTTTGGTATAAAAATATATCACAACTT
AGCTGACATCTTCCAGTACAAGTGTCAATGTTTTATATAGAGAGACAACATGGATATAAAATGCAAGGAGACATTTAATTTTCTTTGCATAGACAAGTACAACTTCTTTATAATAAATATAGTACAAAATGTACAGATATTTCGTTCTGTTTCAAGCAACATCACGATAAAACTCTGTATAAATCTGGTGTTGCAGGAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCCACCCATCTGTGCCCAACACTTCATGATGTATGTAAAAAAACCTTTTTATACTTTTACTACTCAAGATGCTGTTGCGAAAAATTTGACGGTAGATTCAAAAGAAACGAAAAGAGGGACAAACTTTGAAAAGCACGTATATTGATTCTGCCAGAAATGAATTTATTCAGATTAAGAGTTCTTTTGCCATCCTTTTTCTCAAGATGGATCAAAATTAGTTTTCCTTTGGTCCATTTTTCACATCAGTAACTACTTAATCCTGCTTAGCTTAGGTTTTGATAAACCATGACTAAGAGTAGGCGTTTGTTTATACTTGTTACTGGCTGAGTTAGAAAATATTTGTATGCAACTTTCACATGAACTGCAAAACTCATCTTTGGTGCTGTCTTAATGATGTATTTGGATTGCATCCCAGGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCCGAAGAACTGTGGAAGCAGGTTTATACTTCCATCCCTAGTAGGTTATCGTCATTACACATTTATAAACAAAAACTTCAATCTCTAATGTATGGATATCATTTGGAATTTGGGAATGTCCATGACTGGAGAGGATGAGCACTATCCTTTATGGGCTGTATATGTCCACAATTTCCTCCAGGATTGGAAAAGATTTAAAATTTGGCTATTTACTGTGCTAATTTGATCACTGTGATTCCATGGACACTCTTGGCCTTCTCCATTTTGTGCTATTTTCTTCTCTTTTTTGATGAAACGTTTCTTTTACCATACATGCTTTTGTTAAATATTGTCTTCGATTTCTTGACTTGGTGAAGGTGAAGGAATCAATAGCCTTAAGCCTCCTTATGCGTTGAATGTTTGGATCTCTGCCCAAGCACTTGAAATTTTATATCTAGAAAAAAGAAAAAACTTTATTTTCTTTCACAAGCGTCATTTAGGTGGTTAAACTGTTTAATAAATAAAGTGTCTAATTAGACTTCATCATCAATATTGTAATAAAAAAATTGAAATGTTTGTAAAACACAAATAGAGGTGCCTTTATATTAGAATACTGATTAATTGGGGGACTGAATCGATCTTATTAAAATTCTATCCTAAATAGTTATTGACACCTAATCTGACTAACATATTTCAGACTTGAAAAAATTAGAATAAATAAAAATTTCAAAAGGAGAAATGAAGAATGAAAAAAGAACTGCTTGTGTTCTTTTACTACTCCCTTGTTTCCACATTCATCTAATTTTCAGTAATTATTTTTTTCCTTTTCGTCATCCCCCTCTTTTGTAATTTCATACCTTCCATAGGAAAAAAATGATTTTCTTACCAACTGATCATTGCAATTATTCTTTCAGGCTACCAAAAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGTACATCCCAAGAAACTGCTAGGAAAGCCACGTGCTTAAATTTGTCTCTGGGACTTTCTTCTTCCTTTGCATTTTGTGTCCTCTTCTGTGAAATTGATTCTAAGCCTCATTAACTTTTTTGCAAATACCTTAATTCTAACCCTTTCTGGGTCTGTTTCAGGCGTTAAATAATTGGGGGCTTGCCCTACAGGTACTTATGTTTTTCACATTAGATTAATCTTTGTATGAAAGACTCTCTTGCCCACCTCACCTATGCAACCACAAAGTGTAAAAAATTGTTTTTATTGACTTCAAGATC
CATTTCAGGAACTCAGTGCGATTGTGCCGGCACGAGAAAAGCAGACAATTGTAAAAACAGCTATCAGTAAGGTGTCAACTATTATTTTGAAACTTTTAAATACCAAATAATTCCAGCCTTTTGTTTACACGTTAAAACTTACTTTTATATTCTATGCCTTGATTACTCAGTTTTCTTCCATTGCAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTGGTGAGTCGGTTACCTGCTTTAGATATGGAAATTACATGACCTTACATCTATTTAATGCACTGTGTTTAGAATAACTTCATCAACCTGTATTGTATAGCTGTATTTAAGAGCGTCTTGAATTCTGTTTCTTTAAATGACAATGGGAAGGAAGACTTCTTGGTTTATTCAGATCAGATAACATAAATTTGCTCAAACCTAAACTCAATTCTGGCATCTACAGTATGGATTAGCTGAGGACACATTAAGAACTGGTGGATCAGGAAATGTTAAGGATGTTTCCCCCAATGAGTTATACAGCCAATCTGCTATTTATATTGCAGCTGCTCATGCTCTAAAACCAAACTACTCTGTACGTCCAGCTTCTTTTACATTCTTACCAACAATTAAACATGTATTTCTGCACAAGTTTGAAATTTTGTTTTTTGTTTTTAATAGGTTTATAGCAGCGCCTTACGGTTGGTCCGCTCCATGGTTAGTTCTGACTCCCCTCAAATAATGCGGATCACACTTATTCATGTCTTGTGTTCTATTATTTGCTCAGTATAAACCGGTTGGCTGTATTTTTCATATGGTTTAGTATAATATGCGTTGTATTGGTCTTTTAATGGGAAGATGCGAAAGATCTGTACAAAGTTCTAAAATAATTGAAGCTGGTACTGTCATAATGGATTCCATATTAGGCCTATGATGTGATCAACAGTTGGGGTATGCTTGGTTTTAAGTTTCATGATTTACTGTATCAATGAAGACTACTTTAGCAGTCAAAATGTAGAGTCATGATATGTCACGTCGTTACAACTTTAGCATATACCCCATTCAAGCATTTTACTGTATTTTTTAAATTTACAAGATATCTGTACCGTCCGAATTTGGACTTATCTTATATTCAAATGCACATGTCCTTTTGAAATGACTTGGTGGTGGCATCCCTTGATACGTAATTACTTTTTTGTGAACTTAATGGATCTTGGCTTTTTGACAGCTGCCGTTACCCTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGGCCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTATTGCAAAAGGTACAATTTTATTTATCTATTTATTTTCTTTTATCTCTTAACTTTGGAAGAAACTGACGTTAAACTGCTGCTCATGGCTAATCATCTTTCAGCTTAACATAGGAGGGGAACAAATACAAACATCCCCTAGTATTTTAGGAAGATCTGGAAGTACCTTGAATGGCGACAGGACAATCAAAGTAGAAATTCCCGATATCGTCTCTGTATCCGCATGTGCCGATCTTACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCCATTTTCTTGGTGAGCTTATTGTAGCATGATTTAAATTCTCTGTTTCAATTGAAAACAATTTAATTGAAAGTAGCTTATCCAATAGGATGGTAAAGAGGAAAAACCATTAGCAATGTAAAAGATAATGATAACAACAAAGAAGAAGGAAAAAGAACGACATTCAACCTATCAATTTTTACCTAATCTGAAAGATTATCATGTCTTATTATCTGATATGTTTGAGTCTCGAGTAGCTACTGCTAGAGTGGAGTGCACTGATCACCTTGGTTGATTCTATCAGGTTGCTGACTCATGGGACACACTCGATGGATGGCTTGATGCTATTAGATTAGTTTACACGATCTACGCTCGAGGCAAGAACGAGGTTTTGGCTG
GCATCATAACAGGTTGATTGATTACTACCAAGTACACAAAAATGTATCAATGGTGCTATCTTGATGTCTATATATTATGCTTAGTTCACAATAGTAGATTGAGTATTCATTTCTCTAGATTGAAACACACAATTTTGGGGTGCTTTTCCAGTGCTTAAATACATGTTTCTCTAATCACCTTTCCCTTTGAAGTGTAATAATATTATTTGTACATGTCGTTCATTAACAGCAAGCACAAATGGTACGGTTTCGGACCTTCAGAAATGAGAGAGCTTCAAATTAA
mRNA sequence
GCGCCAAAAGATAAAATCTGATTTATTTTTCTCGCCGTTCTGGCGAAGCGGAGTCGACGGCGAGACGAAGAAGAACCAAGACTAAAAGTGGTCGTTCATCATCTTCATGAACGTTTCCTCCAAGCTTTCCTTCTGCAGGAACGTGGTTGGGATTGTGGAGAACAGAAATGCTTTGTGTTTTGAGCCTTTTGGCAATCGTCGCGCCTGAAGGAGATGATCAACGCCGTCGCCGGTTCTCTTTTTCTTAAGTTTCCATCGCCGTCGTGGCCAAGGTTGCCGTTGCTGTGCACTGTGCAGTGAGAGGGAGGAGAAAGAAGTGGAAATTAGGGGTTTCCTCACGAAGGGCAAAATAGTAATTTTAGGAGCTTCAGAATTCCTATTACTCAACAAAACGTAATTTCACCATTTTCCTCTCTTTCTCGACTTTGAACTTTCAGTTTCGTTGATCTTCAATCTGCTCGTTTTCCGGATCCTTGCTTCATTCTCATTCTCTTCCTGTTCGACACTTCACCATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATTTCATCAGAATCAGATCAAATTACTGAACCCAGATCAGGGCCAGAAGAACCTACAGTAGATTCAATTCCCAGTTCTGAATTACAACGAGAACGTGAATCGGAATCAGTTAGTAATGGAGTACCAGATTCGGAGCCGGAGTCTCCAAGGAAACAGTTATCGGAGTCAATTCATTTACATGTAGTGACGGGTGTTACAGATCCGAGTGTTGAAGAGCATAAAGAAACTTCCACCCCATCCAACGGCAACACGGAAAACTTGCAACCTGCGTTGCGTAAAGACGAAGGAAGCCGAACGTTTACAATGAGAGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAGCGACAGCCTTAATGAATCTGAAGGCGAGAGGCCCGAGGGGAACTCCGGTTACAGCCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGTAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGTGTCGATGAAGAGGGTCGTTCTCGCCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGGTCAAGACTATGATGCTTTGTACAATTGGGCTTTGGTCCTCCAGGAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCCACCCATCTGTGCCCAACACTTCATGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACCAAAAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGCGTTAAATAATTGGGGGCTTGCCCTACAGGAACTCAGTGCGATTGTGCCGGCACGAGAAAAGCAGACAATTGTAAAAACAGCTATCAGTAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTGTATGGATTAGCTGAGGACACATTAAGAACTGGTGGATCAGGAAATGTTAAGGATGTTTCCCCCAATGAGTTATACAGCCAATCTGCTATTTATATTGCAGCTGCTCATGCTCTAAAACCAAACTACTCTGTTTATAGCAGCGCCTTACGGTTGGTCCGCTCCATGCTGCCGTTACCCTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGGCCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACATCCCCTAGTATTTTAGGAAGATCTGGAAGTACCTTGAATGGCGACAGGACAATCAAAGTAGAAATTCCCGATATCGTCTCTGTATCCGCATGTGCCGATCTTACT
TTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCCATTTTCTTGGTTGCTGACTCATGGGACACACTCGATGGATGGCTTGATGCTATTAGATTAGTTTACACGATCTACGCTCGAGGCAAGAACGAGGTTTTGGCTGGCATCATAACAGGTTGATTGATTACTACCAAGTACACAAAAATGTATCAATGGTGCTATCTTGATGTCTATATATTATGCTTAGTTCACAATAGTAGATTGAGTATTCATTTCTCTAGATTGAAACACACAATTTTGGGGTGCTTTTCCAGTGCTTAAATACATGTTTCTCTAATCACCTTTCCCTTTGAAGTGTAATAATATTATTTGTACATGTCGTTCATTAACAGCAAGCACAAATGGTACGGTTTCGGACCTTCAGAAATGAGAGAGCTTCAAATTAA
Coding sequence (CDS)
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATTTCATCAGAATCAGATCAAATTACTGAACCCAGATCAGGGCCAGAAGAACCTACAGTAGATTCAATTCCCAGTTCTGAATTACAACGAGAACGTGAATCGGAATCAGTTAGTAATGGAGTACCAGATTCGGAGCCGGAGTCTCCAAGGAAACAGTTATCGGAGTCAATTCATTTACATGTAGTGACGGGTGTTACAGATCCGAGTGTTGAAGAGCATAAAGAAACTTCCACCCCATCCAACGGCAACACGGAAAACTTGCAACCTGCGTTGCGTAAAGACGAAGGAAGCCGAACGTTTACAATGAGAGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAGCGACAGCCTTAATGAATCTGAAGGCGAGAGGCCCGAGGGGAACTCCGGTTACAGCCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGTAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGTGTCGATGAAGAGGGTCGTTCTCGCCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGGTCAAGACTATGATGCTTTGTACAATTGGGCTTTGGTCCTCCAGGAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCCACCCATCTGTGCCCAACACTTCATGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACCAAAAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGCGTTAAATAATTGGGGGCTTGCCCTACAGGAACTCAGTGCGATTGTGCCGGCACGAGAAAAGCAGACAATTGTAAAAACAGCTATCAGTAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTGTATGGATTAGCTGAGGACACATTAAGAACTGGTGGATCAGGAAATGTTAAGGATGTTTCCCCCAATGAGTTATACAGCCAATCTGCTATTTATATTGCAGCTGCTCATGCTCTAAAACCAAACTACTCTGTTTATAGCAGCGCCTTACGGTTGGTCCGCTCCATGCTGCCGTTACCCTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGGCCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACATCCCCTAGTATTTTAGGAAGATCTGGAAGTACCTTGAATGGCGACAGGACAATCAAAGTAGAAATTCCCGATATCGTCTCTGTATCCGCATGTGCCGATCTTACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCCATTTTCTTGGTTGCTGACTCATGGGACACACTCGATGGATGGCTTGATGCTATTAGATTAGTTTACACGATCTACGCTCGAGGCAAGAACGAGGTTTTGGCTGGCATCATAACAGGTTGA
Protein sequence
MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG*
Homology
BLAST of CSPI03G00840 vs. ExPASy Swiss-Prot
Match:
Q9FHY8 (Protein HLB1 OS=Arabidopsis thaliana OX=3702 GN=HLB1 PE=1 SV=1)
HSP 1 Score: 631.7 bits (1628), Expect = 7.7e-180
Identity = 358/583 (61.41%), Positives = 421/583 (72.21%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
M+ T EEP LQNG +E + I EP+ E IP E++ + E V +
Sbjct: 1 MADTVEEP-QLQNG-AAPAESETEQNPIPEPQLQTEPKLTGEIP--EIEADLTPEEVQSE 60
Query: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP-------------------SN 120
V D++PE + ++ V T VTD EE + P S
Sbjct: 61 VTDAKPEEVQSEVKPE---EVKTVVTDAKPEEAQSEVKPEEVQSVVTDTKPDLTDVDLSP 120
Query: 121 GNTENL------------QPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEG 180
G +E + L+K D+G++TFTMRELL+ LK E EG+
Sbjct: 121 GGSEEIPIRSTEVEQESTTSVLKKDDDGNKTFTMRELLSELKSE---------EGDGTPH 180
Query: 181 NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQD 240
+S +++S QP ++ AM+LIN + DEEGRSRQR+L FAAR+YASAIERN D
Sbjct: 181 SSASPFSRESASQP--AENNPAMDLINRIQVNDEEGRSRQRVLAFAARKYASAIERNPDD 240
Query: 241 YDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAIS 300
+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEAT LCPTL+DA+YNWAIAIS
Sbjct: 241 HDALYNWALILQESADNVSPDSVSPSKDDLLEEACKKYDEATRLCPTLYDAYYNWAIAIS 300
Query: 301 DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV 360
DRAK+RGRTKEAEELW+QA NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V
Sbjct: 301 DRAKIRGRTKEAEELWEQAADNYEKAVQLNWNSSQALNNWGLVLQELSQIVPAREKEKVV 360
Query: 361 KTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYI 420
+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGGSGN KD+ P ELYSQSAIYI
Sbjct: 361 RTAISKFRAAIRLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNGKDMPPGELYSQSAIYI 420
Query: 421 AAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD- 480
AAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG LAPHSDWKR++F LNH+
Sbjct: 421 AAAHSLKPSYSVYSSALRLVRSMLPLPHLKVGYLTAPPVGNSLAPHSDWKRTEFELNHER 480
Query: 481 VLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDT 540
+LQ L ++ + S + ST +T+KV I +IVSV+ CADLTLPPGAGLCIDT
Sbjct: 481 LLQVLKPEPREMGRNLSGKAETMSTNVERKTVKVNITEIVSVTPCADLTLPPGAGLCIDT 540
Query: 541 IHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG 551
IHGP+FLVADSW++LDGWLDAIRLVYTIYARGK++VLAGIITG
Sbjct: 541 IHGPVFLVADSWESLDGWLDAIRLVYTIYARGKSDVLAGIITG 565
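The identity and positive-match percentages in each HSP summary are simply the match counts divided by the alignment length (for the hit above, 358 of 583 and 421 of 583 aligned positions). A small sketch for re-deriving them from a summary line follows; the function name `check_hsp_line` and the regular expression are illustrative assumptions rather than part of any BLAST tool.

```python
import re

# Matches pairs such as "Identity = 358/583 (61.41%)".
PAIR = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_line(line):
    """Return {label: (computed_percent, reported_percent)} for one summary line."""
    results = {}
    for label, matches, length, reported in PAIR.findall(line):
        computed = round(100.0 * int(matches) / int(length), 2)
        results[label] = (computed, float(reported))
    return results


summary = "Identity = 358/583 (61.41%), Positives = 421/583 (72.21%), Query Frame = 0"
for label, (computed, reported) in check_hsp_line(summary).items():
    print(label, computed, reported)
```

The same check applies to every hit in this section, since each summary line follows the same count/length (percent) layout.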
BLAST of CSPI03G00840 vs. ExPASy TrEMBL
Match:
A0A0A0L688 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G002900 PE=4 SV=1)
HSP 1 Score: 1083.9 bits (2802), Expect = 0.0e+00
Identity = 550/550 (100.00%), Positives = 550/550 (100.00%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
Query: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF 120
VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF
Sbjct: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF 120
Query: 121 TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV
Sbjct: 121 TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
Query: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE
Sbjct: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
Query: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Sbjct: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
Query: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA
Sbjct: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
Query: 361 EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG
Sbjct: 361 EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
Query: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIK 480
YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIK
Sbjct: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIK 480
Query: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGK 540
VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGK
Sbjct: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGK 540
Query: 541 NEVLAGIITG 551
NEVLAGIITG
Sbjct: 541 NEVLAGIITG 550
BLAST of CSPI03G00840 vs. ExPASy TrEMBL
Match:
A0A1S3BJC9 (uncharacterized protein LOC103490705 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490705 PE=4 SV=1)
HSP 1 Score: 1057.7 bits (2734), Expect = 1.6e-305
Identity = 536/550 (97.45%), Positives = 542/550 (98.55%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
MSPTPEEPNNLQNGIEIQPHISSESDQI+EPRS EEPT DSIPSSELQ+ERESESVSNG
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNG 60
Query: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF 120
V DSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP NGNTENLQPALRKDEGSRTF
Sbjct: 61 VADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTF 120
Query: 121 TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
TMRELLNGLKGEDGSD LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGV
Sbjct: 121 TMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGV 180
Query: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE
Sbjct: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
Query: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Sbjct: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
Query: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA
Sbjct: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
Query: 361 EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
EDTLRTGG+GN+KDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG
Sbjct: 361 EDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
Query: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIK 480
YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNGDRTIK
Sbjct: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIK 480
Query: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGK 540
VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWD LDGWLDAIRLVYTIYARGK
Sbjct: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGK 540
Query: 541 NEVLAGIITG 551
NEVLAGIITG
Sbjct: 541 NEVLAGIITG 550
BLAST of CSPI03G00840 vs. ExPASy TrEMBL
Match:
A0A6J1HJU5 (protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV=1)
HSP 1 Score: 964.9 bits (2493), Expect = 1.4e-277
Identity = 502/554 (90.61%), Positives = 517/554 (93.32%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
MSP PEEPNNLQNGIEI+PHIS ES+QI E +S PE T D IP++ELQ+ERESESV NG
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPES-TADVIPTAELQQERESESV-NG 60
Query: 61 VPDSEP----ESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEG 120
V DSEP +SPRKQLSESI L VVT VTDP EE K TS SNG EN QPALRKDEG
Sbjct: 61 VADSEPQSELDSPRKQLSESIELQVVTDVTDPRFEEPKGTSISSNG-AENSQPALRKDEG 120
Query: 121 SRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINS 180
SRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINS
Sbjct: 121 SRTFTMRELLNGLKVEDGNDSLNESEGEKPEANSGYSLNQDSPHQPYSEQSRAAMELINS 180
Query: 181 VTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKD 240
VTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKD
Sbjct: 181 VTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKD 240
Query: 241 ALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ 300
ALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQ
Sbjct: 241 ALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATRNYEKAVQ 300
Query: 301 LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVL 360
LNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVL
Sbjct: 301 LNWNSPQALNNWGLALQELSAIVPAREKPTIVKTAISKFRAAIQLQFDFHRAIYNLGTVL 360
Query: 361 YGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPY 420
YGLAEDTLRTGG+G VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPY
Sbjct: 361 YGLAEDTLRTGGTGTVKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPY 420
Query: 421 LKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGD 480
LKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSGSTLNGD
Sbjct: 421 LKVGYLTAPPVGRPFAPHGDWKRSQFFLNHDVLQKLNIGGEQIQTSPTLLGRSGSTLNGD 480
Query: 481 RTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIY 540
RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWD LDGWLDAIRLVYTIY
Sbjct: 481 RTMKVEIPDIVSVSACADLTLPPGAGLCIDTIHGQIFLVADSWDALDGWLDAIRLVYTIY 540
Query: 541 ARGKNEVLAGIITG 551
ARGKNEVLAGII G
Sbjct: 541 ARGKNEVLAGIIAG 551
BLAST of CSPI03G00840 vs. ExPASy TrEMBL
Match:
A0A6J1HL68 (protein HLB1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV=1)
HSP 1 Score: 954.5 bits (2466), Expect = 1.9e-274
Identity = 501/581 (86.23%), Positives = 518/581 (89.16%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVS-- 60
MSP PEEPNNLQNGIEI+PHIS ES+QI E +S PE T D +P++ELQ+ERESESV+
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPES-TADVVPTAELQQERESESVNGV 60
Query: 61 -------------------------NGVPDSEP----ESPRKQLSESIHLHVVTGVTDPS 120
NGV DSEP +SPRKQLSESI L VVT VTDP
Sbjct: 61 ADLEPQLEMVIPTAELQQERESESVNGVADSEPQSELDSPRKQLSESIELQVVTDVTDPR 120
Query: 121 VEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGN 180
EE K TS SNG EN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE N
Sbjct: 121 FEEPKGTSISSNG-AENSQPALRKDEGSRTFTMRELLNGLKVEDGNDSLNESEGEKPEAN 180
Query: 181 SGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDY 240
SGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDY
Sbjct: 181 SGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDY 240
Query: 241 DALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD 300
DALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISD
Sbjct: 241 DALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISD 300
Query: 301 RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVK 360
RAKMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVK
Sbjct: 301 RAKMRGRTKEAEELWKQATRNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKPTIVK 360
Query: 361 TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIA 420
TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+G VKDVSPNELYSQSAIYIA
Sbjct: 361 TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGTVKDVSPNELYSQSAIYIA 420
Query: 421 AAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVL 480
AAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVL
Sbjct: 421 AAHALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPFAPHGDWKRSQFFLNHDVL 480
Query: 481 QKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIH 540
QKLNIGGEQIQTSP++LGRSGSTLNGDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIH
Sbjct: 481 QKLNIGGEQIQTSPTLLGRSGSTLNGDRTMKVEIPDIVSVSACADLTLPPGAGLCIDTIH 540
Query: 541 GPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG 551
G IFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Sbjct: 541 GQIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG 579
BLAST of CSPI03G00840 vs. ExPASy TrEMBL
Match:
A0A6J1EA05 (protein HLB1-like OS=Cucurbita moschata OX=3662 GN=LOC111431218 PE=4 SV=1)
HSP 1 Score: 951.4 bits (2458), Expect = 1.6e-273
Identity = 500/582 (85.91%), Positives = 515/582 (88.49%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
MSPTPEEPNNLQNGIEI+PHIS ES+QI E +S PE T D +P++ELQ+ERE ESV NG
Sbjct: 1 MSPTPEEPNNLQNGIEIEPHISVESNQIGESKSEPES-TADVVPTAELQQERELESV-NG 60
Query: 61 VPDSEP--------------------------------ESPRKQLSESIHLHVVTGVTDP 120
V D EP +SPRKQLSESI L V T V DP
Sbjct: 61 VEDLEPQSELVIPTAELQQERESESVNGVADSELQSELDSPRKQLSESIQLQVATDVADP 120
Query: 121 SVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEG 180
EE K TS SNG TEN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE
Sbjct: 121 RFEEPKGTSISSNG-TENSQPALRKDEGSRTFTMRELLNGLKVEDGNDSLNESEGEKPEA 180
Query: 181 NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQD 240
NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQD
Sbjct: 181 NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQD 240
Query: 241 YDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAIS 300
YDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAIS
Sbjct: 241 YDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAIS 300
Query: 301 DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV 360
DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV
Sbjct: 301 DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV 360
Query: 361 KTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYI 420
KTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+G VKDVSPNELYSQSAIYI
Sbjct: 361 KTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGTVKDVSPNELYSQSAIYI 420
Query: 421 AAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDV 480
AAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDV
Sbjct: 421 AAAHALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPFAPHGDWKRSQFFLNHDV 480
Query: 481 LQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTI 540
LQKLNIGGEQ QTSP++LGRSGSTLNGDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTI
Sbjct: 481 LQKLNIGGEQTQTSPTLLGRSGSTLNGDRTMKVEIPDIVSVSACADLTLPPGAGLCIDTI 540
Query: 541 HGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG 551
HG IFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Sbjct: 541 HGQIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG 579
BLAST of CSPI03G00840 vs. NCBI nr
Match:
XP_004146133.1 (protein HLB1 isoform X1 [Cucumis sativus] >KGN55671.1 hypothetical protein Csa_010331 [Cucumis sativus])
HSP 1 Score: 1083.9 bits (2802), Expect = 0.0e+00
Identity = 550/550 (100.00%), Positives = 550/550 (100.00%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
Query: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF 120
VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF
Sbjct: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF 120
Query: 121 TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV
Sbjct: 121 TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
Query: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE
Sbjct: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
Query: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Sbjct: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
Query: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA
Sbjct: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
Query: 361 EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG
Sbjct: 361 EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
Query: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIK 480
YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIK
Sbjct: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIK 480
Query: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGK 540
VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGK
Sbjct: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGK 540
Query: 541 NEVLAGIITG 551
NEVLAGIITG
Sbjct: 541 NEVLAGIITG 550
BLAST of CSPI03G00840 vs. NCBI nr
Match:
XP_008448563.1 (PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo])
HSP 1 Score: 1057.7 bits (2734), Expect = 3.3e-305
Identity = 536/550 (97.45%), Positives = 542/550 (98.55%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
MSPTPEEPNNLQNGIEIQPHISSESDQI+EPRS EEPT DSIPSSELQ+ERESESVSNG
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNG 60
Query: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF 120
V DSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP NGNTENLQPALRKDEGSRTF
Sbjct: 61 VADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTF 120
Query: 121 TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
TMRELLNGLKGEDGSD LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGV
Sbjct: 121 TMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGV 180
Query: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE
Sbjct: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
Query: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Sbjct: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
Query: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA
Sbjct: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
Query: 361 EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
EDTLRTGG+GN+KDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG
Sbjct: 361 EDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
Query: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIK 480
YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNGDRTIK
Sbjct: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIK 480
Query: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGK 540
VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWD LDGWLDAIRLVYTIYARGK
Sbjct: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGK 540
Query: 541 NEVLAGIITG 551
NEVLAGIITG
Sbjct: 541 NEVLAGIITG 550
BLAST of CSPI03G00840 vs. NCBI nr
Match:
XP_038876586.1 (protein HLB1 [Benincasa hispida])
HSP 1 Score: 1010.7 bits (2612), Expect = 4.6e-291
Identity = 517/550 (94.00%), Positives = 526/550 (95.64%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
MSPTPEEPNNLQNGIEIQPHIS ESDQ +EPRS P EPT D+I SSEL +ERESESV+NG
Sbjct: 1 MSPTPEEPNNLQNGIEIQPHISPESDQTSEPRSEP-EPTADAILSSELHQERESESVNNG 60
Query: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTF 120
V DSEP S RKQL ESIHL V T V DP EEHKETS PSNGNTEN +PALRKDEGSRTF
Sbjct: 61 VADSEPVSRRKQLPESIHLQVETDVADPRFEEHKETSIPSNGNTENSKPALRKDEGSRTF 120
Query: 121 TMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGV 180
TMRELLNGLKGEDG+DSLNESEGERPEGN GYSLNQDSPHQPYSEQSRAAMELI+SVTGV
Sbjct: 121 TMRELLNGLKGEDGNDSLNESEGERPEGNPGYSLNQDSPHQPYSEQSRAAMELISSVTGV 180
Query: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE
Sbjct: 181 DEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLE 240
Query: 241 EACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
EACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Sbjct: 241 EACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN 300
Query: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA
Sbjct: 301 SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLA 360
Query: 361 EDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
EDTLRTGG+GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG
Sbjct: 361 EDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVG 420
Query: 421 YLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIK 480
YLTAPPVGRPLAPH DWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGD TIK
Sbjct: 421 YLTAPPVGRPLAPHGDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDWTIK 480
Query: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGK 540
VEIPDIVSVSACADLTLPPGAGLCIDTIHGP+FLVADSWD LDGWLDAIRLVYTIYARGK
Sbjct: 481 VEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGK 540
Query: 541 NEVLAGIITG 551
NEVLAGIITG
Sbjct: 541 NEVLAGIITG 549
BLAST of CSPI03G00840 vs. NCBI nr
Match:
XP_022965252.1 (protein HLB1-like isoform X2 [Cucurbita maxima])
HSP 1 Score: 964.9 bits (2493), Expect = 2.9e-277
Identity = 502/554 (90.61%), Positives = 517/554 (93.32%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
MSP PEEPNNLQNGIEI+PHIS ES+QI E +S PE T D IP++ELQ+ERESESV NG
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPES-TADVIPTAELQQERESESV-NG 60
Query: 61 VPDSEP----ESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEG 120
V DSEP +SPRKQLSESI L VVT VTDP EE K TS SNG EN QPALRKDEG
Sbjct: 61 VADSEPQSELDSPRKQLSESIELQVVTDVTDPRFEEPKGTSISSNG-AENSQPALRKDEG 120
Query: 121 SRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINS 180
SRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINS
Sbjct: 121 SRTFTMRELLNGLKVEDGNDSLNESEGEKPEANSGYSLNQDSPHQPYSEQSRAAMELINS 180
Query: 181 VTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKD 240
VTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKD
Sbjct: 181 VTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKD 240
Query: 241 ALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ 300
ALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQ
Sbjct: 241 ALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATRNYEKAVQ 300
Query: 301 LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVL 360
LNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVL
Sbjct: 301 LNWNSPQALNNWGLALQELSAIVPAREKPTIVKTAISKFRAAIQLQFDFHRAIYNLGTVL 360
Query: 361 YGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPY 420
YGLAEDTLRTGG+G VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPY
Sbjct: 361 YGLAEDTLRTGGTGTVKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPY 420
Query: 421 LKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGD 480
LKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSGSTLNGD
Sbjct: 421 LKVGYLTAPPVGRPFAPHGDWKRSQFFLNHDVLQKLNIGGEQIQTSPTLLGRSGSTLNGD 480
Query: 481 RTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIY 540
RT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWD LDGWLDAIRLVYTIY
Sbjct: 481 RTMKVEIPDIVSVSACADLTLPPGAGLCIDTIHGQIFLVADSWDALDGWLDAIRLVYTIY 540
Query: 541 ARGKNEVLAGIITG 551
ARGKNEVLAGII G
Sbjct: 541 ARGKNEVLAGIIAG 551
BLAST of CSPI03G00840 vs. NCBI nr
Match:
XP_023552571.1 (protein HLB1-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 954.5 bits (2466), Expect = 3.9e-274
Identity = 501/581 (86.23%), Positives = 518/581 (89.16%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVS-- 60
MSPTPEEPNNLQNGIEI+ HIS ES+QI E +S PE T D +P++ELQ+ER+SESV+
Sbjct: 1 MSPTPEEPNNLQNGIEIESHISVESNQIGESKSEPES-TADVVPTAELQQERQSESVNGV 60
Query: 61 -------------------------NGVPDSEP----ESPRKQLSESIHLHVVTGVTDPS 120
NG DSEP +SPRKQLSESI L VVT VTDP
Sbjct: 61 AGLEPQSELVIPTAELQQERESESFNGAADSEPQSELDSPRKQLSESIELQVVTDVTDPR 120
Query: 121 VEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGN 180
EE K TS SNG TEN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE N
Sbjct: 121 FEEPKGTSISSNG-TENSQPALRKDEGSRTFTMRELLNGLKVEDGNDSLNESEGEKPEAN 180
Query: 181 SGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDY 240
SGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDY
Sbjct: 181 SGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDY 240
Query: 241 DALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD 300
DALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISD
Sbjct: 241 DALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISD 300
Query: 301 RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVK 360
RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVK
Sbjct: 301 RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVK 360
Query: 361 TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIA 420
TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+G VKDVSPNELYSQSAIYIA
Sbjct: 361 TAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGTVKDVSPNELYSQSAIYIA 420
Query: 421 AAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVL 480
AAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVL
Sbjct: 421 AAHALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPFAPHGDWKRSQFFLNHDVL 480
Query: 481 QKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIH 540
QKLNIGGEQ QTSP++LGRSGSTLNGDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIH
Sbjct: 481 QKLNIGGEQTQTSPTLLGRSGSTLNGDRTMKVEIPDIVSVSACADLTLPPGAGLCIDTIH 540
Query: 541 GPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG 551
G IFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Sbjct: 541 GQIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG 579
BLAST of CSPI03G00840 vs. TAIR 10
Match:
AT5G41950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 631.7 bits (1628), Expect = 5.5e-181
Identity = 358/583 (61.41%), Positives = 421/583 (72.21%), Query Frame = 0
Query: 1 MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNG 60
M+ T EEP LQNG +E + I EP+ E IP E++ + E V +
Sbjct: 1 MADTVEEP-QLQNG-AAPAESETEQNPIPEPQLQTEPKLTGEIP--EIEADLTPEEVQSE 60
Query: 61 VPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP-------------------SN 120
V D++PE + ++ V T VTD EE + P S
Sbjct: 61 VTDAKPEEVQSEVKPE---EVKTVVTDAKPEEAQSEVKPEEVQSVVTDTKPDLTDVDLSP 120
Query: 121 GNTENL------------QPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEG 180
G +E + L+K D+G++TFTMRELL+ LK E EG+
Sbjct: 121 GGSEEIPIRSTEVEQESTTSVLKKDDDGNKTFTMRELLSELKSE---------EGDGTPH 180
Query: 181 NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQD 240
+S +++S QP ++ AM+LIN + DEEGRSRQR+L FAAR+YASAIERN D
Sbjct: 181 SSASPFSRESASQP--AENNPAMDLINRIQVNDEEGRSRQRVLAFAARKYASAIERNPDD 240
Query: 241 YDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAIS 300
+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEAT LCPTL+DA+YNWAIAIS
Sbjct: 241 HDALYNWALILQESADNVSPDSVSPSKDDLLEEACKKYDEATRLCPTLYDAYYNWAIAIS 300
Query: 301 DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIV 360
DRAK+RGRTKEAEELW+QA NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V
Sbjct: 301 DRAKIRGRTKEAEELWEQAADNYEKAVQLNWNSSQALNNWGLVLQELSQIVPAREKEKVV 360
Query: 361 KTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYI 420
+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGGSGN KD+ P ELYSQSAIYI
Sbjct: 361 RTAISKFRAAIRLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNGKDMPPGELYSQSAIYI 420
Query: 421 AAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD- 480
AAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG LAPHSDWKR++F LNH+
Sbjct: 421 AAAHSLKPSYSVYSSALRLVRSMLPLPHLKVGYLTAPPVGNSLAPHSDWKRTEFELNHER 480
Query: 481 VLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDT 540
+LQ L ++ + S + ST +T+KV I +IVSV+ CADLTLPPGAGLCIDT
Sbjct: 481 LLQVLKPEPREMGRNLSGKAETMSTNVERKTVKVNITEIVSVTPCADLTLPPGAGLCIDT 540
Query: 541 IHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG 551
IHGP+FLVADSW++LDGWLDAIRLVYTIYARGK++VLAGIITG
Sbjct: 541 IHGPVFLVADSWESLDGWLDAIRLVYTIYARGKSDVLAGIITG 565
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9FHY8 | 7.7e-180 | 61.41 | Protein HLB1 OS=Arabidopsis thaliana OX=3702 GN=HLB1 PE=1 SV=1
Match Name | E-value | Identity | Description
A0A0A0L688 | 0.0e+00 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G002900 PE=4 SV=1
A0A1S3BJC9 | 1.6e-305 | 97.45 | uncharacterized protein LOC103490705 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1HJU5 | 1.4e-277 | 90.61 | protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV...
A0A6J1HL68 | 1.9e-274 | 86.23 | protein HLB1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV...
A0A6J1EA05 | 1.6e-273 | 85.91 | protein HLB1-like OS=Cucurbita moschata OX=3662 GN=LOC111431218 PE=4 SV=1
Match Name | E-value | Identity | Description
AT5G41950.1 | 5.5e-181 | 61.41 | Tetratricopeptide repeat (TPR)-like superfamily protein
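The summary rows above are pipe-delimited, so they can be loaded for filtering or sorting with a few lines of code. This is a minimal sketch, assuming the four columns shown (match name, E-value, identity, description); `parse_row` is a hypothetical helper, and it skips empty cells or a trailing link cell such as `[more]` if one is present.

```python
def parse_row(line):
    """Split one pipe-delimited summary row into typed fields."""
    cells = [c.strip() for c in line.split("|")]
    # Drop empty cells and any trailing link cell left over from the web page.
    cells = [c for c in cells if c and c != "[more]"]
    name, evalue, identity, description = cells[:4]
    return {
        "match": name,
        "evalue": float(evalue),
        "identity": float(identity),
        "description": description,
    }


rows = [
    "Q9FHY8 | 7.7e-180 | 61.41 | Protein HLB1 OS=Arabidopsis thaliana OX=3702 GN=HLB1 PE=1 SV=1",
    "A0A0A0L688 | 0.0e+00 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G002900 PE=4 SV=1",
    "AT5G41950.1 | 5.5e-181 | 61.41 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |",
]
hits = sorted((parse_row(r) for r in rows), key=lambda h: h["identity"], reverse=True)
for hit in hits:
    print(hit["match"], hit["identity"], hit["evalue"])
```

Header rows (`Match Name | E-value | ...`) are not numeric and would need to be skipped before calling `parse_row`.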