Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGACTACTCCCGAGGAACCTAATAATTTGCAAAACGGAATCGTAACTGAGCCACAAATTTCGTCAGAATCAGAGCAAACCGATGAGTCCAGATCAGAGCCAGAACGCATAGCAGACGCAATTCCCAAAGCTGAATCACAGCTAGAACGAGAATCAGAATCAGAATCAGTTTATGTAGAAGCAGAAGCAGAAGCAGAATCAGAGCTGGCGTCTCGAAGGAAACAGTTATCGGAGTCACTGCCATTACAGGTAGTGACGAATGTTTCAGATCCGAAATTTGATGAGTCTAAAGGAACCTCGATCCCGTCCAACGGCATCGAGAACTCGCAGCCTACGCTGCGTAAAGATGAAGGAAGCCGGACGTTTACAATGAGAGAGTTGCTGAATGGATTGAAAGGTGAAGATGGCAACGACAGCGTTAACGAATCTGAAGGCGAGAAGCCCGACGGTTACAGGTTTGTTGTGTACTTGAGAATACTGAATATGGTTGATTGGTAATCGAATGCTTTCTAATAATGCATTCTAATCTCCAATTATTTGAAATTGAATGGTGAATGGTGACTGTTCAACCTGATTTTTACGACAGTTCTTATTAAGCTTGAAGGTTGAAAGCCGAATTTTCATTAAAAATAGTAATGGGGTAGTAGTTTTCTTTCAATGACGAATGGTTTAGGCGTAAAAATCAAGGAACTCCCATTTGAAAGTCAATATCTGAAAGTCAAAGTTCATCAAAAGTTCTCTAACCAGCTTGCTCGGCGTCTGCGAGTTTTCGTCTCTTTAACGATGGTTTTCACCATCTGATTGACCCCCTGCCAATTCTTTATAGCCGATTCTCTCAAACTCTCTAGAGCAGTTTTTCTTCCTAGAAGGGTAAGATACAATGATTGCTTTGAACGATTGAATACTGTCAGGGACTATTCATTGTTGTTACGGTTAGTATAATGGCTTCTCTAGGAGAGTTTTTTGTTTTGCATGTTAGCAAATCCGATGATCCTATTGAATTCTATAGGGCAGACCTTGTGTGTAGAATCTGGTCGTTCGGATTTACCATTCAGTTGACCCGAAGTTTTTGTTAAATCTGAAAATTGCTTCAACCTGACCCAATTCAACCCATGAACAGACTAAATTCTGTCCTTCATTACATTATCCCAAAACAGGATATGTTCCCATAAAACGTTTCTGAGGTTTGGATGTAATCATTTTAGTGCTAAAAGAGCTAGAATCCTTCGGCTTATTGAAACGAAGTCTTTCCTTGGGACTTTGGTTTTGAAACAAGTTTTATTATTATTATTATTTGGCTTTGAAACTGGCTTTGCCTTGGGTTTGATTTCTTCTCAACTTCTGTTGCTATCGAATTGGATTTTGTGGCTTCAAATTTTGGTTATTTCATTTCTAACCAAATAGGCTTTTATTGAAGTGATAGGAAAGTGTAGTTGGTTTCAATTTAGTTACTTCTGGTCAAATTGGATTGTATGGCCTATTCTAATTGCACATCGCCCATTATCTCCTTGGGTATTAACTTAACAAACATCCCTATTAAGAGTTTTAATTTTGTTTGTGTTTCATGTACCTTGGACCAACCTTGAAACTGGTTAACTGAAATGATAAACTGACACGTTATCTGTTTGGCCTGCTCTTTTTTTATTAGCCTGAAAAGAAATATATCTTGCAGCTTGGTTTTGGTTCGTGCTCAAATTAACCCAACATGAACCACTAACACTCCTAATAATTGTCAATTAGTCAATTCATCTTAATCATACCCCGTTTTGGAACTAGTCAATTCATTTATGGATACTCCTATCCGTTCCTTTTTGCATACTGATCTATTAGTTCTCTCTCCCTCTTTCTCTTCATTTGATTTTGGTTTGGATTAAGAAAATTCTCTGTATTATCAGTGGAATCTTTAATTGCAGTAGAGTTTTGCTGTTACCACCTTTTGTCTCAATAGCTATATTGTTTCTTTACTGTATGAATTTGCACAGTCTTAATCAAGATAGCCCACAACAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAGCAGTGTTACAGGTGTTGATGAAGAGGGCCGTTCTCGCCAACGGATTCTCACATTTGCTGCTAAGAGGTATCATCATAATACTTTTTGTCCTTCGTAATTGCATCTACTTGGTTTATGTTTCTCATAATAGGTCTTGTAACTTGGACTTTGAATTCCTATTGAAATCGGGCCCCAGTTGTACTTCCCCATTTCATCCAATAGGATCTTGGGTTGTGTCAGAGCCTAGTTGTGTCATTGTGATGCATTGAAAGAATGAGAATTGGATTTTCACTGCAACCAAAAAGGAAAAAAAAGTGCATACTTTGGGGGTTAGTAAAGGAGACAATAGATAATACTATATGTAGCATGGTTGTATTTATATAAATAATCATGGGTATTTGATTAGCTTTCATGCAAAGCAGTAAAGGATTAACTGATTTTATAACATGTAGAGTTTACAAACTTATTAAGGAATATCTGTATTCTAGTTAGGTGGGCGCATTGTTTTCCAAAGGGGTATGCCTAAGTGCAAACCACCTGTAGCATAGGTGAGATGCGAGGAAATCACCTCAAGATATTTCGGGTATGTCTCCTTGGAGATAATTGTAGGAGGGTAGATCTCACTAAGGCTCATATTTTCCCCCCTCGAATTGTTCTTAGGTTAGAAAGATTATTACGCCTCAGTGTACTTTGGGCTTAGAAGAACCATGGGTAGCCACCAAAATTAAATTATGATGTCCAAGACATATGAGTGGTGCCCAAAAAGTATATATAAAAGGAAGACTGAGAACCCTTCTTTTTTTCTGCCTTTTGGTTATGATTATTGATAAAAATAGATATTATTTAGTTATTTCTCATCAGAAATAGAATCCATTTTATTCTTTAAAAAATGGCTACTCTTATAGAGATCCTTTGTTTATAAACATTATATTTTCAATTTAAAAATTGCAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCTTTGGTCCTCCAGGTCTGAGATTTTATATGATCTTCATCAATAATGTTGTAAAGTGCAGAGAGATATTATTGACGGGTCACTAAAACAACTCCAGATCAATTGAATTCTGTGTTTGGTATAAAAATATAACACAACTTACCTGACACCTTCCAGTACCTAGTGTCAATAATTTTTTGTTAAAAGACAACAAGGATATGAAATACGATGAACATTTACTTTTCTTTGATAATGCAAGTAAAAAATCATTGTAATAAGACACCGTAAAACGAACAGATCTCTTCATTCTGTTATAAGCAATATCACAATAAACTCTATTAATCTGATGTTGCAGGAGAGTGCAGATAATGTTAGTCCAGATTCCACTTCACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCAACCCGTCTTTGCCCAACACTTCACGATGTATGTAAAAAAAAAAGAAACCGTTTTGTACTTTTACTAGTTAGGATGCTGTTGCCAAAAAATGTGAACAGAAGATGGGGAAATGAAGGAAAATATGGACAAACTTTGAAGAGTTTCGAATATTGATTCTGCCAGAAATGAACTTATTTAGATTATAGAGTTCTTTTTTTTTTTCTTTTCTTTTTGCCAGATGGATCAAAATGAGTTTTCCTTGGCCCCATTTTTCACATAGTAATTATTTATCCCTGCTTAGGTTTTGATAAACCATGGCTGAAAGTAGGCCTTCTTTTATACTTGGCATTGGCCGATTTAGAAAATACTCGTATGCATGTTTCACCTGACCTGCAAAATTCCGTTCGTGCTGTCTTAATGATTGAATGTATTTGGATGCATCCCAGGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCTGAAGAACTGTGGAAGCAGGTTTGTACTTCCATCTCTAATGGGTTTTTGTCATTACGCATTTGAAAAATCTCCCATCCCCAGTGTATCTATCGTTCTGGTAAAAAGGGGGGGCTCTATCATTTTCTGGGTTATCTACGTTTTCTCTGGCGATAGAGAAGGATTTAAAATTTTTTCTATTTACAGTGCTAATGTCATCACTTTAATTCAATGCAACTCCTTCAGTTCTTATTATCTTCTTCTCTTTTTGGTGAAATGTTTGATTACTTGGTGAAGGATGCCTCATTAACACGTTGAAAGTTTGGATCTCCACCCTCCTTATAATTTTATATTTTTCTTAAAAAGAAACTTGCCTTTTCTTTCAACGGTATCATAGGCTGTTAACTGTGTAAAAGGTTTAATTATACTTCATCATCAACTCCGTGATGAAAATTTGCTAGTGTCGGTAAAACACATAAAATACAGGTGCTAATTTTCAGGAATTATTTTTCCTTTTCATCTGAACTAATCATTTCAATTATTCTATCATTCAGGCTACCAAAAATTATGAGAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGTACATCTCAAGAAATTGCTAGTACCTACATGTGCTTAACTTTTTCTCTTGGATTTTCTTCTTCGTTTGCATTATGTTCTCTTTGTGAAGTTGATTCTATGACAAGCCTCCATAACATTAAAAGAATTGTTTGCATTTATATTTTTGCAAATACCTTAAGGCTAACTATTTTTGGATCTATTTCAGGCGCTAAATAATTGGGGGCTTGCTCTACAGGTACTTATGTTTTTCACATTAGATTAATCTCAATTTGAAATAGTCTCTCTTGCCCACCTCCACGTACTCAAGCATAAGCAATATGGGTAAGATATTTGTTTTTATTGAATATATACGTATATCATGATCCATTTCAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGGTGACAATAACACTTATTTGGAAACATCTAAATACCAAATAGTTGCAGCCTTTTGTTTAACACTTACTGTTACATTCTGAGCGTTGTTGGTAACTCAGTTGTCTTCCAATTGCAGTTTCGTGCAGCTATACAGTTGCAATTTGATTTTCATCGAGCAATTTACAATCTTGGTACTGTTTTGGTGAGTCTGTTGCCTGCCCCATATATGGAAATTACATGACCATTCATCTGCTTAACGCGCAGTGTTTAGAATGACTCATCAACTTGTGTTGTAGGGATGTGTATAGAAGATTCTTAAGTATTGTTTATATAAATGCCAAGCAATGGAAAACTTATGTCTTGATTGATTTAGATCAGATAATTTAAATTTGCTCAAATCTTAACTCATTTCTGACCTCTACAGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACAGGAAACTTTAAGGATGTTTCCCCCAATGAGTTGTACAGCCAATCTGCAATTTATATTGCAGCTGCTCATGCTTTAAAACCAAGTTACTCTGTAAGCCCAGTCCTTTCGTTTCGAATAATTTCCGTGTTGTTGTGCCTGATGTAGAAACCAACTATTATGTTCCGTTCAACGTCTTTTTCATTTTTTACCAAGACTTGACACGTATTTGTGCACAAGTACGATAGTTTTTTAAGCCAACTTCTAGTTGTTCGTGGAAAAGGATACAGAAACATTTTAAATATTATAACCGAAGCAAAGTTTCAATTTTTCTTCCATTTCTCTTGTTTTCCGATTAGTCATTTATTTCTTCACATACAAGCTTCAATTGAAGTTTTGATGTTTGTTTTTAATAGGTTTACAGCAGTGCCTTGCGATTGGTTCGTTCAATGGTTAGTCTGACTTCTCAAATAATATGCATCACACTTGACTACGTTTTATTTTCTATTATTTGCTCAATACTTGTAGACCAGAAAGCTTTCTGTAGTGTTTATATGGTTTAGTTTAATATGCGCTGTATTGGTCTTTTAATGGGATGAGAAAGATCTGTACAAAATTCTAAAATAAATGAAGTTGTCAATGTCATAATACATACTGTATTACGTCTATAATGTCATCAACAGTTGGGGCATGTCATGTTTTTAAGTTTCGTGTTGTGAGATCCCACATCGGTTGGAGAGGGGAACAAAACATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGTAGATACGTTTTAAAACCATGAGAAACCGTGAGGCTGACGGCGATACGTAATGGGTCAAATCGGACAATATCTGCTACGTTGGGTTTGAGTTATTACAAATGGTATCAGAGCCAGACACCAGGCGATAATGTGACAGCGAGGACCAAGGGGGTGGGTTGTGAGATCCCACCTCGGTTGGGGAAGGGAATGAAACATTTCTTATAAGGGTATGGAAACCTCTCTCTAGTAAACGCATTTTAAAACCGTGAGGTTGACGGCAATACGTAACAGGCCAAAGCGGACAATATCTACTACAGGTAGGTTTGGGCTGTTACACTTGTATTGCTTTCGGTTCAAAGTCTGAAAGGCTGTGTCTATCAATGAAGATTACTATAGCAGTCAAAAGTATGGCCATGATGTGTCATATCGTTATAACTTTAGCATATACCCATTAAAGCATTTTACTATATTTTTTAAGATTCCAAGACATCTGTAACCCGTAAATTCAGACTCATCATATATTCTGATTTTACTTTCTGGTGACATCCCTTGATCTGGAATTATACTTTGGTGCAATGAACTTTCTGGAACTTGCCTTTTTTACAGCTGCCGTTACCCTATCTAAAAGTTGGGTACCTGACTGCACCTCCTGTAGGGAAACCACTTGCTCCACACAGTGATTGGAAACGTTCACAGTATTTCCTAAATCATGATGTTTTGCAAAAGGTAAAGTTCTATCTGTTTTTATGGTTTAGATTTTATCAGTCAGCCGTGGGAGATGGGTGCGTTTATGTGATTTGGTTGAAACTGATATTGAATTACTTATGGTCAATCCTCTTTCAGCTTAAAATAGGGGGGGAACAAATACAAACATCCCCTAATGCTTTAGGAAGATCTGGAAGTACCTTGAATGGCGATATGCCAATCAAAGTAGAAATTCCAGATATTGTATCTGTATCAGCATGTGCAGATCTAACTTTACCGCCGGGCGCTGGACTTTGCATTGACACAATCCATGGACCAGTTTTCTTGGTGAGCTTGAGCTTTTACGTCTTTTTTTTTTTTTTTTTTTAATAAATAAGAAACCATTTCATTTATGGTAGGAAATTGTAGAGGAGAAGATGCCCAATTCTAGGGAGTTTTTTTAAAAAAACCTCTCCAGTTGGATCGTAAAGAGGAAACGGTATGAATTGGCTTAATGATCGAGAAATTGATGTAAATAATAAAGTTCTCCAAGTAATGAGTTTAAGCCATGACAACCACCTACCTAGAATTTAATAACTTAGGAAGTTACCTTGGCAACTTAACGTAGTTGACTCAAAAGATTGTTTCATGAGAATAGTTGACTACAAATAGGTATGCTCAAGCTAACTTGGACACTCATGATAAAATCGAATGGAGTTGGCTGAAAAGATTGTTCCATAAAAATAGTCGAGGTATGTCCAAGCTAACCTAGACATTCACGAAAAAACCGAATGTAGTTGGCTGAAAAGATTGCTTCATGAAAATAGTTGAGATATGCCCAAGCTAACCTAGACACTCATACTCATGAAAAAACCGAATGTAGTTGGCTGAAAAGATTGTTCCATGAAAATAGTCAAGGTATGCCCAAACTAACCTAGACACTCATGAAAAAACCGAATCGAGCTCATCAATTTTACAATATCTGAAAAATTATCATGTCTGATATGTTCGAGTAGCAAACTGTGCTATTAGAGTGAGTCCATTGATCACATGGAAACTCCCATAGTTTCTTTTGGTTACTTAACTTACACTATTATGTCCAACAATTATGAACTTATATGATCACCTTATCTATTATATATCAGGTTGCTGACTCGTGGGATGCGCTCGATGGATGGCTTGATGCGGTTAGATTAGTTTATACAATCTATGCTCGAGGCAAGAACGACGTTTTGGCTGGCATTGCAACGGGTTGATGATTATTATCAAGTATGAAAATTTATTACTTATATTACCTTGATGTTTATATTATGCTTGCTTATAGTAGATTTAGTATTCATTTCTCCAAATTGGAACTCAATTTTTGGGGTGCTTTTCCAGTGCTTAATACATGATATTCTCTAGCCAGATTTTCTTTTCATGTATAAGAATCTACGTACATGTTTCTGGGCTCTTTGAAAAGAGCTTCGAATTAATTTCATGGAAACTTTAATTGAATTATTTGATA
mRNA sequence
ATGTCGACTACTCCCGAGGAACCTAATAATTTGCAAAACGGAATCGTAACTGAGCCACAAATTTCGTCAGAATCAGAGCAAACCGATGAGTCCAGATCAGAGCCAGAACGCATAGCAGACGCAATTCCCAAAGCTGAATCACAGCTAGAACGAGAATCAGAATCAGAATCAGTTTATGTAGAAGCAGAAGCAGAAGCAGAATCAGAGCTGGCGTCTCGAAGGAAACAGTTATCGGAGTCACTGCCATTACAGGTAGTGACGAATGTTTCAGATCCGAAATTTGATGAGTCTAAAGGAACCTCGATCCCGTCCAACGGCATCGAGAACTCGCAGCCTACGCTGCGTAAAGATGAAGGAAGCCGGACGTTTACAATGAGAGAGTTGCTGAATGGATTGAAAGGTGAAGATGGCAACGACAGCGTTAACGAATCTGAAGGCGAGAAGCCCGACGGTTACAGGTTTGTTGTGTACTTGAGAATACTGAATATGGTTGATTGTCTTAATCAAGATAGCCCACAACAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAGCAGTGTTACAGGTGTTGATGAAGAGGGCCGTTCTCGCCAACGGATTCTCACATTTGCTGCTAAGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCTTTGGTCCTCCAGGAGAGTGCAGATAATGTTAGTCCAGATTCCACTTCACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCAACCCGTCTTTGCCCAACACTTCACGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCTGAAGAACTGTGGAAGCAGGCTACCAAAAATTATGAGAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGCGCTAAATAATTGGGGGCTTGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGTTTCGTGCAGCTATACAGTTGCAATTTGATTTTCATCGAGCAATTTACAATCTTGGTACTGTTTTGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACAGGAAACTTTAAGGATGTTTCCCCCAATGAGTTGTACAGCCAATCTGCAATTTATATTGCAGCTGCTCATGCTTTAAAACCAAGTTACTCTGTTTACAGCAGTGCCTTGCGATTGGTTCGTTCAATGCTGCCGTTACCCTATCTAAAAGTTGGGTACCTGACTGCACCTCCTGTAGGGAAACCACTTGCTCCACACAGTGATTGGAAACGTTCACAGTATTTCCTAAATCATGATGTTTTGCAAAAGCTTAAAATAGGGGGGGAACAAATACAAACATCCCCTAATGCTTTAGGAAGATCTGGAAGTACCTTGAATGGCGATATGCCAATCAAAGTAGAAATTCCAGATATTGTATCTGTATCAGCATGTGCAGATCTAACTTTACCGCCGGGCGCTGGACTTTGCATTGACACAATCCATGGACCAGTTTTCTTGGTTGCTGACTCGTGGGATGCGCTCGATGGATGGCTTGATGCGGTTAGATTAGTTTATACAATCTATGCTCGAGGCAAGAACGACGTTTTGGCTGGCATTGCAACGGGTTGATGATTATTATCAAGTATGAAAATTTATTACTTATATTACCTTGATGTTTATATTATGCTTGCTTATAGTAGATTTAGTATTCATTTCTCCAAATTGGAACTCAATTTTTGGGGTGCTTTTCCAGTGCTTAATACATGATATTCTCTAGCCAGATTTTCTTTTCATGTATAAGAATCTACGTACATGTTTCTGGGCTCTTTGAAAAGAGCTTCGAATTAATTTCATGGAAACTTTAATTGAATTATTTGATA
Coding sequence (CDS)
ATGTCGACTACTCCCGAGGAACCTAATAATTTGCAAAACGGAATCGTAACTGAGCCACAAATTTCGTCAGAATCAGAGCAAACCGATGAGTCCAGATCAGAGCCAGAACGCATAGCAGACGCAATTCCCAAAGCTGAATCACAGCTAGAACGAGAATCAGAATCAGAATCAGTTTATGTAGAAGCAGAAGCAGAAGCAGAATCAGAGCTGGCGTCTCGAAGGAAACAGTTATCGGAGTCACTGCCATTACAGGTAGTGACGAATGTTTCAGATCCGAAATTTGATGAGTCTAAAGGAACCTCGATCCCGTCCAACGGCATCGAGAACTCGCAGCCTACGCTGCGTAAAGATGAAGGAAGCCGGACGTTTACAATGAGAGAGTTGCTGAATGGATTGAAAGGTGAAGATGGCAACGACAGCGTTAACGAATCTGAAGGCGAGAAGCCCGACGGTTACAGGTTTGTTGTGTACTTGAGAATACTGAATATGGTTGATTGTCTTAATCAAGATAGCCCACAACAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAGCAGTGTTACAGGTGTTGATGAAGAGGGCCGTTCTCGCCAACGGATTCTCACATTTGCTGCTAAGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCTTTGGTCCTCCAGGAGAGTGCAGATAATGTTAGTCCAGATTCCACTTCACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCAACCCGTCTTTGCCCAACACTTCACGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCTGAAGAACTGTGGAAGCAGGCTACCAAAAATTATGAGAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGCGCTAAATAATTGGGGGCTTGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAGAACAGCTATCAGTAAGTTTCGTGCAGCTATACAGTTGCAATTTGATTTTCATCGAGCAATTTACAATCTTGGTACTGTTTTGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACAGGAAACTTTAAGGATGTTTCCCCCAATGAGTTGTACAGCCAATCTGCAATTTATATTGCAGCTGCTCATGCTTTAAAACCAAGTTACTCTGTTTACAGCAGTGCCTTGCGATTGGTTCGTTCAATGCTGCCGTTACCCTATCTAAAAGTTGGGTACCTGACTGCACCTCCTGTAGGGAAACCACTTGCTCCACACAGTGATTGGAAACGTTCACAGTATTTCCTAAATCATGATGTTTTGCAAAAGCTTAAAATAGGGGGGGAACAAATACAAACATCCCCTAATGCTTTAGGAAGATCTGGAAGTACCTTGAATGGCGATATGCCAATCAAAGTAGAAATTCCAGATATTGTATCTGTATCAGCATGTGCAGATCTAACTTTACCGCCGGGCGCTGGACTTTGCATTGACACAATCCATGGACCAGTTTTCTTGGTTGCTGACTCGTGGGATGCGCTCGATGGATGGCTTGATGCGGTTAGATTAGTTTATACAATCTATGCTCGAGGCAAGAACGACGTTTTGGCTGGCATTGCAACGGGTTGA
Protein sequence
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYVEAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQSRAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALGRSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAVRLVYTIYARGKNDVLAGIATG
Homology
BLAST of CmaCh06G005880 vs. ExPASy Swiss-Prot
Match:
Q9FHY8 (Protein HLB1 OS=Arabidopsis thaliana OX=3702 GN=HLB1 PE=1 SV=1)
HSP 1 Score: 629.0 bits (1621), Expect = 5.1e-179
Identity = 361/590 (61.19%), Postives = 427/590 (72.37%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGI-----------VTEPQISSESEQTDESRSEPERIADAIPKAESQL 60
M+ T EEP LQNG + EPQ+ +E + T E PE AD P+
Sbjct: 1 MADTVEEP-QLQNGAAPAESETEQNPIPEPQLQTEPKLTGEI---PEIEADLTPEEVQSE 60
Query: 61 ERESESESVYVEAEAEAESEL---ASRRKQLSESLPLQVVTNVSDPK-----FDESKGTS 120
+++ E V E + E + A + SE P +V + V+D K D S G S
Sbjct: 61 VTDAKPEEVQSEVKPEEVKTVVTDAKPEEAQSEVKPEEVQSVVTDTKPDLTDVDLSPGGS 120
Query: 121 ----IPSNGIENSQPT--LRK-DEGSRTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRF 180
I S +E T L+K D+G++TFTMRELL+ LK E+G+ + + S
Sbjct: 121 EEIPIRSTEVEQESTTSVLKKDDDGNKTFTMRELLSELKSEEGDGTPHSSASP------- 180
Query: 181 VVYLRILNMVDCLNQDSPQQPYSEQSRAAMELISSVTGVDEEGRSRQRILTFAAKRYASA 240
+++S QP ++ AM+LI+ + DEEGRSRQR+L FAA++YASA
Sbjct: 181 ------------FSRESASQP--AENNPAMDLINRIQVNDEEGRSRQRVLAFAARKYASA 240
Query: 241 IERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFY 300
IERN D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA+Y
Sbjct: 241 IERNPDDHDALYNWALILQESADNVSPDSVSPSKDDLLEEACKKYDEATRLCPTLYDAYY 300
Query: 301 NWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPA 360
NWAIAISDRAK+RGRTKEAEELW+QA NYEKAVQLNWNS QALNNWGL LQELS IVPA
Sbjct: 301 NWAIAISDRAKIRGRTKEAEELWEQAADNYEKAVQLNWNSSQALNNWGLVLQELSQIVPA 360
Query: 361 REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELY 420
REK+ +VRTAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD+ P ELY
Sbjct: 361 REKEKVVRTAISKFRAAIRLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNGKDMPPGELY 420
Query: 421 SQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQ 480
SQSAIYIAAAH+LKPSYSVYSSALRLVRSMLPLP+LKVGYLTAPPVG LAPHSDWKR++
Sbjct: 421 SQSAIYIAAAHSLKPSYSVYSSALRLVRSMLPLPHLKVGYLTAPPVGNSLAPHSDWKRTE 480
Query: 481 YFLNHD-VLQKLKIGGEQIQTSPNALGRSGSTLNGDMPIKVEIPDIVSVSACADLTLPPG 540
+ LNH+ +LQ LK ++ + + + ST +KV I +IVSV+ CADLTLPPG
Sbjct: 481 FELNHERLLQVLKPEPREMGRNLSGKAETMSTNVERKTVKVNITEIVSVTPCADLTLPPG 540
Query: 541 AGLCIDTIHGPVFLVADSWDALDGWLDAVRLVYTIYARGKNDVLAGIATG 564
AGLCIDTIHGPVFLVADSW++LDGWLDA+RLVYTIYARGK+DVLAGI TG
Sbjct: 541 AGLCIDTIHGPVFLVADSWESLDGWLDAIRLVYTIYARGKSDVLAGIITG 565
BLAST of CmaCh06G005880 vs. ExPASy TrEMBL
Match:
A0A6J1KVY8 (protein HLB1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111498676 PE=4 SV=1)
HSP 1 Score: 1059.7 bits (2739), Expect = 4.3e-306
Identity = 549/563 (97.51%), Postives = 549/563 (97.51%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS
Sbjct: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY LNQDSPQQPYSEQS
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY-------------SLNQDSPQQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR
Sbjct: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD
Sbjct: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
Query: 541 AVRLVYTIYARGKNDVLAGIATG 564
AVRLVYTIYARGKNDVLAGIATG
Sbjct: 541 AVRLVYTIYARGKNDVLAGIATG 550
BLAST of CmaCh06G005880 vs. ExPASy TrEMBL
Match:
A0A6J1G2N6 (protein HLB1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111450247 PE=4 SV=1)
HSP 1 Score: 1055.8 bits (2729), Expect = 6.2e-305
Identity = 546/563 (96.98%), Postives = 548/563 (97.34%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS
Sbjct: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY LNQDSPQQPYSEQS
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY-------------SLNQDSPQQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERN+QDYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNSQDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR
Sbjct: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
RSGSTLNGDMPIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD
Sbjct: 481 RSGSTLNGDMPIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
Query: 541 AVRLVYTIYARGKNDVLAGIATG 564
AVRLVYTIYARGKNDVLAGI TG
Sbjct: 541 AVRLVYTIYARGKNDVLAGIVTG 550
BLAST of CmaCh06G005880 vs. ExPASy TrEMBL
Match:
A0A6J1KU40 (protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111498676 PE=4 SV=1)
HSP 1 Score: 985.3 bits (2546), Expect = 1.0e-283
Identity = 513/527 (97.34%), Postives = 513/527 (97.34%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS
Sbjct: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY LNQDSPQQPYSEQS
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY-------------SLNQDSPQQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR
Sbjct: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFL 528
RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFL
Sbjct: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFL 514
BLAST of CmaCh06G005880 vs. ExPASy TrEMBL
Match:
A0A6J1G2L1 (protein HLB1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111450247 PE=4 SV=1)
HSP 1 Score: 983.0 bits (2540), Expect = 5.1e-283
Identity = 511/527 (96.96%), Postives = 513/527 (97.34%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS
Sbjct: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY LNQDSPQQPYSEQS
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY-------------SLNQDSPQQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERN+QDYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNSQDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR
Sbjct: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFL 528
RSGSTLNGDMPIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGPVFL
Sbjct: 481 RSGSTLNGDMPIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFL 514
BLAST of CmaCh06G005880 vs. ExPASy TrEMBL
Match:
A0A6J1HJU5 (protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV=1)
HSP 1 Score: 926.0 bits (2392), Expect = 7.4e-266
Identity = 482/563 (85.61%), Postives = 505/563 (89.70%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MS PEEPNNLQNGI EP IS ES Q ES+SEPE AD IP AE L++E ESESV
Sbjct: 1 MSPIPEEPNNLQNGIEIEPHISVESNQIGESKSEPESTADVIPTAE--LQQERESESVNG 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
A++E +SEL S RKQLSES+ LQVVT+V+DP+F+E KGTSI SNG ENSQP LRKDEGS
Sbjct: 61 VADSEPQSELDSPRKQLSESIELQVVTDVTDPRFEEPKGTSISSNGAENSQPALRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLK EDGNDS+NESEGEKP+ N LNQDSP QPYSEQS
Sbjct: 121 RTFTMRELLNGLKVEDGNDSLNESEGEKPEA----------NSGYSLNQDSPHQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELI+SVTGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
T+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIV+TAISKFRAAIQLQFDFHR
Sbjct: 301 TRNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKPTIVKTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTG KDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGTVKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVG+P APH DWKRSQ+FLNHDVLQKL IGGEQIQTSP LG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGRPFAPHGDWKRSQFFLNHDVLQKLNIGGEQIQTSPTLLG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
RSGSTLNGD +KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLD
Sbjct: 481 RSGSTLNGDRTMKVEIPDIVSVSACADLTLPPGAGLCIDTIHGQIFLVADSWDALDGWLD 540
Query: 541 AVRLVYTIYARGKNDVLAGIATG 564
A+RLVYTIYARGKN+VLAGI G
Sbjct: 541 AIRLVYTIYARGKNEVLAGIIAG 551
BLAST of CmaCh06G005880 vs. NCBI nr
Match:
XP_023005781.1 (protein HLB1-like isoform X1 [Cucurbita maxima])
HSP 1 Score: 1059.7 bits (2739), Expect = 8.9e-306
Identity = 549/563 (97.51%), Postives = 549/563 (97.51%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS
Sbjct: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY LNQDSPQQPYSEQS
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY-------------SLNQDSPQQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR
Sbjct: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD
Sbjct: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
Query: 541 AVRLVYTIYARGKNDVLAGIATG 564
AVRLVYTIYARGKNDVLAGIATG
Sbjct: 541 AVRLVYTIYARGKNDVLAGIATG 550
BLAST of CmaCh06G005880 vs. NCBI nr
Match:
XP_022946039.1 (protein HLB1-like isoform X1 [Cucurbita moschata])
HSP 1 Score: 1055.8 bits (2729), Expect = 1.3e-304
Identity = 546/563 (96.98%), Postives = 548/563 (97.34%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS
Sbjct: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY LNQDSPQQPYSEQS
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY-------------SLNQDSPQQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERN+QDYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNSQDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR
Sbjct: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
RSGSTLNGDMPIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD
Sbjct: 481 RSGSTLNGDMPIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
Query: 541 AVRLVYTIYARGKNDVLAGIATG 564
AVRLVYTIYARGKNDVLAGI TG
Sbjct: 541 AVRLVYTIYARGKNDVLAGIVTG 550
BLAST of CmaCh06G005880 vs. NCBI nr
Match:
XP_023540563.1 (protein HLB1-like isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1049.3 bits (2712), Expect = 1.2e-302
Identity = 545/563 (96.80%), Postives = 546/563 (96.98%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
EAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS
Sbjct: 61 --EAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY LNQDSPQQPYSEQS
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY-------------SLNQDSPQQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERN+QDYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNSQDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR
Sbjct: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD
Sbjct: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
Query: 541 AVRLVYTIYARGKNDVLAGIATG 564
AVRLVYTIYARGKNDVLAGI TG
Sbjct: 541 AVRLVYTIYARGKNDVLAGIVTG 548
BLAST of CmaCh06G005880 vs. NCBI nr
Match:
KAG7028212.1 (Protein HLB1, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1046.6 bits (2705), Expect = 7.8e-302
Identity = 543/563 (96.45%), Postives = 545/563 (96.80%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
EAEAESELASRRKQLSESLPLQ VTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS
Sbjct: 61 --EAEAESELASRRKQLSESLPLQAVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY LNQDSPQQPYSEQS
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY-------------SLNQDSPQQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERN+QDYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNSQDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR
Sbjct: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
RSGSTLNGDMPIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD
Sbjct: 481 RSGSTLNGDMPIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
Query: 541 AVRLVYTIYARGKNDVLAGIATG 564
AVRLVYTIYARGKNDVLAGI TG
Sbjct: 541 AVRLVYTIYARGKNDVLAGIVTG 548
BLAST of CmaCh06G005880 vs. NCBI nr
Match:
KAG6596677.1 (Protein HLB1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1044.6 bits (2700), Expect = 3.0e-301
Identity = 542/563 (96.27%), Postives = 544/563 (96.63%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV
Sbjct: 1 MSTTPEEPNNLQNGIVTEPQISSESEQTDESRSEPERIADAIPKAESQLERESESESVYV 60
Query: 61 EAEAEAESELASRRKQLSESLPLQVVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
EAEAESELASRRKQLSESLPLQ VTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS
Sbjct: 61 --EAEAESELASRRKQLSESLPLQAVTNVSDPKFDESKGTSIPSNGIENSQPTLRKDEGS 120
Query: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRFVVYLRILNMVDCLNQDSPQQPYSEQS 180
RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY LNQDSPQQPYSEQS
Sbjct: 121 RTFTMRELLNGLKGEDGNDSVNESEGEKPDGY-------------SLNQDSPQQPYSEQS 180
Query: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVS 240
RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERN+ DYDALYNWALVLQESADNVS
Sbjct: 181 RAAMELISSVTGVDEEGRSRQRILTFAAKRYASAIERNSHDYDALYNWALVLQESADNVS 240
Query: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA
Sbjct: 241 PDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQA 300
Query: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR
Sbjct: 301 TKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVRTAISKFRAAIQLQFDFHR 360
Query: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL
Sbjct: 361 AIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELYSQSAIYIAAAHALKPSYSVYSSALRL 420
Query: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG
Sbjct: 421 VRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQYFLNHDVLQKLKIGGEQIQTSPNALG 480
Query: 481 RSGSTLNGDMPIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
RSGSTLNGDMPIKV+IPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD
Sbjct: 481 RSGSTLNGDMPIKVDIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLD 540
Query: 541 AVRLVYTIYARGKNDVLAGIATG 564
AVRLVYTIYARGKNDVLAGI TG
Sbjct: 541 AVRLVYTIYARGKNDVLAGIVTG 548
BLAST of CmaCh06G005880 vs. TAIR 10
Match:
AT5G41950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 629.0 bits (1621), Expect = 3.6e-180
Identity = 361/590 (61.19%), Postives = 427/590 (72.37%), Query Frame = 0
Query: 1 MSTTPEEPNNLQNGI-----------VTEPQISSESEQTDESRSEPERIADAIPKAESQL 60
M+ T EEP LQNG + EPQ+ +E + T E PE AD P+
Sbjct: 1 MADTVEEP-QLQNGAAPAESETEQNPIPEPQLQTEPKLTGEI---PEIEADLTPEEVQSE 60
Query: 61 ERESESESVYVEAEAEAESEL---ASRRKQLSESLPLQVVTNVSDPK-----FDESKGTS 120
+++ E V E + E + A + SE P +V + V+D K D S G S
Sbjct: 61 VTDAKPEEVQSEVKPEEVKTVVTDAKPEEAQSEVKPEEVQSVVTDTKPDLTDVDLSPGGS 120
Query: 121 ----IPSNGIENSQPT--LRK-DEGSRTFTMRELLNGLKGEDGNDSVNESEGEKPDGYRF 180
I S +E T L+K D+G++TFTMRELL+ LK E+G+ + + S
Sbjct: 121 EEIPIRSTEVEQESTTSVLKKDDDGNKTFTMRELLSELKSEEGDGTPHSSASP------- 180
Query: 181 VVYLRILNMVDCLNQDSPQQPYSEQSRAAMELISSVTGVDEEGRSRQRILTFAAKRYASA 240
+++S QP ++ AM+LI+ + DEEGRSRQR+L FAA++YASA
Sbjct: 181 ------------FSRESASQP--AENNPAMDLINRIQVNDEEGRSRQRVLAFAARKYASA 240
Query: 241 IERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFY 300
IERN D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA+Y
Sbjct: 241 IERNPDDHDALYNWALILQESADNVSPDSVSPSKDDLLEEACKKYDEATRLCPTLYDAYY 300
Query: 301 NWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPA 360
NWAIAISDRAK+RGRTKEAEELW+QA NYEKAVQLNWNS QALNNWGL LQELS IVPA
Sbjct: 301 NWAIAISDRAKIRGRTKEAEELWEQAADNYEKAVQLNWNSSQALNNWGLVLQELSQIVPA 360
Query: 361 REKQTIVRTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNFKDVSPNELY 420
REK+ +VRTAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD+ P ELY
Sbjct: 361 REKEKVVRTAISKFRAAIRLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNGKDMPPGELY 420
Query: 421 SQSAIYIAAAHALKPSYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGKPLAPHSDWKRSQ 480
SQSAIYIAAAH+LKPSYSVYSSALRLVRSMLPLP+LKVGYLTAPPVG LAPHSDWKR++
Sbjct: 421 SQSAIYIAAAHSLKPSYSVYSSALRLVRSMLPLPHLKVGYLTAPPVGNSLAPHSDWKRTE 480
Query: 481 YFLNHD-VLQKLKIGGEQIQTSPNALGRSGSTLNGDMPIKVEIPDIVSVSACADLTLPPG 540
+ LNH+ +LQ LK ++ + + + ST +KV I +IVSV+ CADLTLPPG
Sbjct: 481 FELNHERLLQVLKPEPREMGRNLSGKAETMSTNVERKTVKVNITEIVSVTPCADLTLPPG 540
Query: 541 AGLCIDTIHGPVFLVADSWDALDGWLDAVRLVYTIYARGKNDVLAGIATG 564
AGLCIDTIHGPVFLVADSW++LDGWLDA+RLVYTIYARGK+DVLAGI TG
Sbjct: 541 AGLCIDTIHGPVFLVADSWESLDGWLDAIRLVYTIYARGKSDVLAGIITG 565
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FHY8 | 5.1e-179 | 61.19 | Protein HLB1 OS=Arabidopsis thaliana OX=3702 GN=HLB1 PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1KVY8 | 4.3e-306 | 97.51 | protein HLB1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111498676 PE=4 SV... | [more] |
A0A6J1G2N6 | 6.2e-305 | 96.98 | protein HLB1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111450247 PE=4 ... | [more] |
A0A6J1KU40 | 1.0e-283 | 97.34 | protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111498676 PE=4 SV... | [more] |
A0A6J1G2L1 | 5.1e-283 | 96.96 | protein HLB1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111450247 PE=4 ... | [more] |
A0A6J1HJU5 | 7.4e-266 | 85.61 | protein HLB1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465172 PE=4 SV... | [more] |
Match Name | E-value | Identity | Description | |
XP_023005781.1 | 8.9e-306 | 97.51 | protein HLB1-like isoform X1 [Cucurbita maxima] | [more] |
XP_022946039.1 | 1.3e-304 | 96.98 | protein HLB1-like isoform X1 [Cucurbita moschata] | [more] |
XP_023540563.1 | 1.2e-302 | 96.80 | protein HLB1-like isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
KAG7028212.1 | 7.8e-302 | 96.45 | Protein HLB1, partial [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
KAG6596677.1 | 3.0e-301 | 96.27 | Protein HLB1, partial [Cucurbita argyrosperma subsp. sororia] | [more] |
Match Name | E-value | Identity | Description | |
AT5G41950.1 | 3.6e-180 | 61.19 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |