Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
CAACAATTTTCTAATAATTAAATAAATAAATAAACAATAAGGGAACCCACGCCACATCTCTCTCCATCGCACTGCCGCAAGCCGTCCACCCTCGCGCCGCCGTCATTCTGCCGCCACCTCCGAGGAGCCTCTGTTCGTCGCCTCCGAGCGCCTCTGTTCGTCGCCTCCGAGCGCCTCTCTTCTTTTATTTCATTCTTCAACGATTTCTCTCCACTATTACTGTTTAAGTAATCCTCGAATATTGGACTAATTAGGTGGTCGAAGCCAAAATGTGGAACAAATTAAGAAAAATCACATTAAAACAAATTAAGTGATCTTCTGAGCTTCTTCTCGATCAGAGCCATGGGGTTTTTGGGTTTTTTCAGGAGGTTTAATGGCCTTCAGAAATTTGATGCAACCCTTAGAGCAATACCAGTTGGGTACGTTCAGAGTAGATGCTTATCAAATTCCAAGCTTTTCCATGGCGGCAAAGAAACAATGGTTCCGGTTTTAATTGTTGGCGCAGGACCTGTTGGTCTCGTCCTTGCTATTCTTCTCACTAAATTAGGTATATATTGTCATTTTGCTCTTAATTTTCTCGACCATGGAAAATTGGGATTTTTATAACTTGAAACAGAACATGAAACTTTCAAGTTGGTATTAGAAATGTTATTATATAGATAATTATAATCTTAGTGTTTATGCTAATTCCTTTCATTTTGGGGAATATGCATTTTTCCCCTCCTGTAGGGGTTAAATGTGCAGTTGTGGAGAAGAACAGAAGTTTTTCTAAACATCCGCAAGCTCACTTCATAAATAACAGATCCATGGAGGTTGATTGAATTTCTCTTAATGAAGATGCCATTCCAATGACTGAGTTAAACATTCAAAGTAATGTGTTTATACAGAATTGATTGATAGGTATTTCGCAAATTGGATGGACTAGCAGAGGAGATACAATTACATCAACCACCTGTCGAGTCATGGAGAAAGTTCATATATTGTACTTCACTGAATGGTACAATTCTCGGATCTGTAGACCATATGCAACCACAAGGTACAATTTTATCCATGCTCTGGTTTTCTAGTTTTATGTACTTCAAGATGGGTTCACGATTTTTTTAGTTGATACTGGGATACTTATTTGAGTCAACTTTATTGCTCAATCACTAATTTCATTGCTATTTATAGTCTGATACTTTGTACAGAAGATACTAGTAAGGCATTCTATTCAGAAATTTATAGTTAGTTTGCTTGCTAGTTCGGCTTTATAATGATCATTGTGGATATAGTAAAAGTAAATGATTGTTCCTGCCTCATTGCAAATGAATTCTTTAAGGATTGGCAGAGGAGAAAAAGGTTTTGTAATCAAGTTGGCCTGTGCGCTAAAGGCTTCAGCTGTTGTTCGACTCATTGGATTTAGGTTTGTCTCTTTAGTGCTAACTTCTCATCATAAATATCAAGTTTAGATGTATGGTTCTTTGTTCTTTTAGAGGGATTTCTCTCTCCTTTTGGTGGGATTGGTTTTTATATGCCTTGTTTTCTTTCATTTTTATCTCAATGAAAGTCATCGGTGGTAGGTTCTATGCTGCTAGAAGTATCAGGCAAGGATCCCCTTTTCTCCTCCTTATATGTGATACCTTTCCTATTGCCTTGGTAGGCTTCTCCCTAAAGGTGCTCATATAGGCTTTCATACTGCCGTTTGTAATTTCATTTTATCAATGAAACCAGTTTGATCATTTCTAATTAAAAAAAATCGTTTCTCATAAGAAAAAAAAGAAGCGGGAAGGAGTGATCTCTCAATATAATAAACTATCTGCAAGTTGTTGATGACACTGGGACTCCCTGTTTCGGATGGTTAACATTTTCAATTAGGCGTTTGGTTTAAGTTCCAATCTTTGCTAAACCATGATTGGTGGAAAAACTTTGAAGATTCTATAGTCCAAAAGGAGAATGACCCATTCTTTTACAGTAGATTTATCCTAAACCAGCAAAATTTGCCTAATTTGCCCTCTCATTTT
ACATGAGTTCTTCCCATGCCTTTGACTGTGGCTCACGCTTTGGAAATTAACGAGGGGTTTTTGTGGAAGGAACAGTCATGTGGTTCAAAATAGGAAAGAAGATACTTTTGATTGTGGCACATACTATTAGAAGCCTCTTATTTACTACGAATAGCAGAAATTCTTTCAGAAAGGAGAAATACTTCGGATTTGGTTTATTAGAGTCTTTCTGGAGGTCCATTGGCTAAAAAGTAAGGCAAGTATTTTCTAATCTTTTGTCTGTTTCGGGAAAATTTTACTTCTGTCACCTTCTTTTGATAAAAGCTTCATATTCCTTTGAATAACTGTTGTCTAACTCTCTTTGTGCTAATGGAAAGAGTCCTTCCGTAAGTTATCTGGCTTGGTTAGTGTGTGTGTGTGTGTGTGTGTTTTTCCCTTTTTTTTTTAAGAGGCTTGGTTAGTGTTTGCCTTCCCTCTGTTTTGTAATTTCATTTGTTCAATGAAATTTGTTTATTTATTTTAAATAAAAATGGATGAATATTAGAAGATTTAGATTTCAGGAACATCTGGTTTGAGTGATATTAAAAATTGTGGATTTTTGTGCATATAAAAATATTTGGTTTTTAATAGTGTGCTGCCAAGTGCTACCTTCTGGTTTCTCTTTTTAAAGGATATTTAGTCCTTCGGTTTTAGTTTTTTTTTTTTTTTTTTTTTTTTTTAATTGGGGAGGGGGTTTGAAGTTTGTGATAATATTGAATACAATTTTTGTTTTTTCTTAAAAAAAAAAAACAGATTTTGAGCACATCACCAGCCCAGTTTCTGTTGCACATTTCTCCCAGTACAAATTAAACAGGTTACTACTTAAGCAACTTCAAAATCTTGGGTTTCAAGTCTGTACACCGGAAAGCTTGGAAGGCCCCTGTGTAGTAAGAGAAAAGAAAATACTTCTGGGGCATGAGTGTGTTTCTATTGATGCTACTGATGAGACAATAACCATGACTGCATCTTATCTCAAGGAAGGGACACATATTAAGAGGAGGAATATCAGCTGTAATGTCCTTGTTGGTGCAGACGGTGCTGGAAGTTCTGTCCGGAGACTAGTAGGCATAGAAATGAAGGGTGAAAATGACTTACAAAAGCTTGTAAGCGTCCATTTCTTTAGTAGAGAGCTTGGTGAGTATTTGCTAAAAGAGAGGCCTGGTATGCTATATTTCGTCTTTAACACTGAAGCTATTGGGGTTCTTGTTGCTCATGATCTCAAGCAAGGCGAATTTATATTGCAGGTACTGCCATCATCTTGATACAATCTTCTTCATCTGCATTCTGCATCTTACACTTAAGATATTACGTGTTATCTTACAGGTACCATTCTATCCTCCTCAACAAAATATTGGAGATTTCTGCCCTAAGGTAATTTTGTCCACCATCTTGTGTTGAGTATGTTGAAGTTGAAGAAAAAACAGTTTACAGTTGAGTGCTAGCTTTAGAGTGGGATATACAATTCAAGTTATAGTGCATATCCCATTACTTATGATTGAATTATCAAATGATTGACCTGCAAAAATAATATGTATCAATGCCAATCATACATTCTTTCTGTAGTAAATGGATTCAATTTCTTGTAGATGTGTGAGGAGTTAATCTTCAAATTGGTTGGTCAAAACCTTTGTGACATAGATGTGCAAGATGTAAAACCTTGGATTATGCATGCTGAAGTTGCCGAGAAGTTCATATGCTGTCAAAGTCGTGTATTGCTTGCTGGTGATGCTGCTCATCGATTTCCTCCAGCTGGTGGATTTGGTTAGTATTAATTTTTTATACTTTAAAATCATTTGTTAGGATTTATAAATTATAATACTACACATTGTTTTGGCATAGCATTTCTATTGTTAATGTATCTTGGCCAATTTAGAAAAATGGACTGTTCAAGTAAACAGTCATTAATCTTAAAACAAATATACATATGGAAAACTTTATTGTCTGTCTGTTACATGGAATTAGTTTCAATCTCCACATATTAG
CTGGGGACAGGAAGCTTGTTCTTCTTAGATTTCAGAAACCAAATTTAAATCAATTGTTTATGAAAAGTTCCAAGATGGAATTGAAGAATCCATGCCATGCTTATAATATTGGTGAATTAAATCCCATAAAATAGGCACACTGATGCATGGAATATGAGTGTTGTACAATACAAGCACACATCATTTTCTTAAAAAATAGACCGACAATTTTGGACATGTCCAAAAATTGTGTTTGAAAATTTTGTTACATAAAGGATTTGGAATTATGTTCTTGTAAATGTAACTTCATCCTTTTTTAAAAAAAGAAAAAAAATTGATTATTACAAAGTAAACTAAAGACCTCCCTAAGAACACAAAGTGAAATAAACTCTCGTCTTTTGAAGAAAAGAGCTTATATCTCAAACATTGTGTGAAATTATGTTCTCGTAAATGTATCTTCTTTCTGAATATTGTTGGATTTTGTAAGGTAAAATCTACTTATTCTCTTCAATTTATTTGTTTATTTATTTATTTACTTCTTAGTTCTTGATGTTAATTTTCTATATCATCTAAAAAAGATTGTCTTCCCTACAAAATATGTTTAAGAGGCGTTCCAAGTGTTTTTCAATGTCTCTCACTGTCACAGATGTCCCGGATGTGTTATGAATTATGTAACATCCAAATGTTCTTGAAATGCGTCTAAGTTATTCTTGTTCAATGCTTAGTACTGACATTATGCTCAGAATAGTGTCCAAGTTACCAAGATTGAAGCAAAATTGGTCAAATCAAAGAGAAGAGAAAGTCAATGCACATATATATATAGATGTATGTATATATGTATTTTCATTTGGTTTTAAAAATGTAATGAAAACAAGAACTTGGAAGTTTTTCGTACACATTATGACAGGTAAAAACTTCCATAAATGTACAATTGTGCCATCATTTGTTGATGTATTCATGCAGGAATGAATACTGGAATTCAAGACGTCCATAATCTTGCCTGGAAATTAGCTGCAGTGCTACAAGATATTGCATCACCTTCAATATTAAATACTTATGAAATGGAAAGGAGGCCGGTAATCTTATAATTTGTTGATGTTCTTACTCTTTCAATAATTGAATTTTGTTGTGTGAGTTGCCTGAGAATGTGATGCACGTTGACACTGCTTGGCTATTACAAATAAGAATGCTAAAGGAAAGACAAAAAAGACCATAAAATATCTTATCGTTTCTCTTTCAATGACCGTAAAATATCTTTTCTTGTCATGACAAATGATTGATTTTCTCAACTAATTTGAGGTAGTTGAGTCACTTTTGGTTAAACGACTGTGGGGATGAAACTTGAGCATAAATGTGGTCCTTTTCACCCCCAACAATTTAGCCATTTTGTTTTGAGAATAGAGTTGAGTGAGTGGTCTGGAGTCTGGAGGGAAATGTGCACATACATTTAAGTCGTGTAGTATTTCTTGTATTCTGTAGGCAGTACCAAGGTGAAGCAGGTCTCGACCTCTTCATTCAGCAACAAATTTTCTAATTGATTCTCGACCTCTTCTGTCACTCCCTTGTCAGATAGCACTATTCAATACGGCCCTTAGTGTTAAGAACTTCAAAGCAGCCATGGAAGTGCCTGCAGCTCTTGGTCTGGATCCAAAGATTGCAAACTCTGGTAAATAGCAGCTTCATCTAGTTTGAATGTCACTCTTTAAAACAATCTCACCTAAAGTTCACCATATTGGTTGTAATTGTAATATCTAAGTCTTTGCAATACAGAGAATGAATTCCTTTTTCCTTAACGTGCAGTGCACCGAGTAGTTAACCATGGCCTTGGTTCTATTTTATCATCCTCACAACAGAGCGCAGTTTTGGATGGAATTTTTAAGATAGGTCGTTTGCAGCTCTCAGATTCATTTCTGAATGATGGAAATCCCGTTGGATCTTCAAGACTCGCAAAACTGAGACAGATATTTGATGAAGGGAAGAGCCTTCAACTTCAGTTCCCTGCGGAGGATCTTGGTTTCAGG
TTTAACTTCTATTCAAACTCAATGCATGAGCTAATTTTACTCTTCTTTAGATGTTTTAACTTTCATCCCATCCACTGGTGTTTCTCGACTTAACAACTGGCAATGTTTTAAAAAGCCTTACCGGACACGTGCCTAGGCTCAAGGCACAGGTTTGGCTACTCACCTTGAAAAGGTGAGGCTCACAAAATAAGGCATGCTTGAAGCTTGCGCCTTTTGTGAAGTCTCATAGCTCAAAGCCCTGAGTCTTGGGACTTTTTCATTATTTAAAAAAAAATCTTTACAATTAGGGTTTTTCCTTCTTTAAAATCTTAAAAAATCAAATTTACTAAGCCTAAATGCGAAAAAATTTGTGTTTAGGGTTTTTTTTTCTCACTTCAACTATGCTTTCTCTTCTTTTTTCTTTATGTAGTACTTTTATATATAACGCGCCTCACAAAAAATATTCTGCGCATAAGTCCTAGAAGACTATTGCACTTTATCATGCCTTGAGCTTTAAAAAACACTGGCAACTGGTGTATCAGACTATAAACAAAAGTAAAGCTCTTTCATTGGGGGATCTGGTTTAATGATAGCTTAAACATATTTATCTCAATCTTATCAATGGGATTTAGGTACTCCGAAGGGGCAATAATTCCTGATGATTCTCTCCTTGGTGGTCGAGAAGAACCTACAGGTCGTCGGAGACAGTACGTCCCTTCTGCAGATCCAGGATCAAGGCTGCCTCATATGAATGTGAGGGTTTTGGCCAGTGAGGTATTAGAATTGATCGACTGTTCACTCTGCATATTAAATGGCCACATGAATATAAAAAATAACAATTGGTTTCAAACATCATTTAGTGCACTAAACGATTTCTTTAGGTGTGCTTTTATTGCATTGCAAGACACTTAAGCTGCAGCTAGATAGTTGAGTGATATTTTAAATACCTTTTTTGTCCCTAAATTTTGGGTCTAGTTTCCATTTGGTCCCTAGATTTCAAAATGTTACACATTTAGTCCTTTTTAAGTTTTGAGTTTGATTTCATTTCAGTTCCTAAGTTCCATAATACTACAATTTACCAATTACATTTGTGTTTTATTCAATTTGGTCCCTAGGTTTCAAGATTTACACTTTTAATCTCGATTTTTTTTTTTATTAAATAGTCACTTTTAGTCACTAGCGTTAATGTCTATTAATTAATTTAAAATAATTATGAAGTAAAATTTAAAATTTTATTTTAATAGTGATGAAAAAAGAAGCTTAACTAATTATAATTCTTTTACTTCTTTAAAATTAATTAATACACATTGACAGAAAGTGAGTATTTAATGAAAAATTGAGGTTAAAAGTATAAATCATTCAATTTAAGGATCAAATTGAACTAAAACTCAAATCTTGAGGGTAAGACTGTAACATTTTGAAATCTAAAGACAATTGAAATCGAACTCAAAGCTTAAGAAACCAAACTAAGATCTTTCAATGTGTAACATTTTGAAACTTAGGGACCAAATAGAAACTAGATTCAAAATGTAGGGACCAAAAATGTATTTTCCCAATTTTCATTTAAGTCAACTAAAGACGGGAAATATGTGTGTGCAGGAGATTTTTTCCACGCTCGATCTTGTATCTGGGGATGTAGTCCAATTCCTTCTCATAATAGGTCCGCGACCGGAGTCTTACTGTCTTGCTCACGCTACTCTCAAGGTAGCAGAGGAATTCAAAATTTCTGTTAGGGTATGCATTTTATGGTCTGCTGATACCACCAAGATTCAGTCAAGCAGCAAGGAAGAACTAACACCTTGGAAGAACTACATTGATGTTCAAGAAATTCGGCAATGGTCAACTTCACCGTCATGGTGGGACGTTTGTCAGATGACCGACAAAGGAGCAATCCTAGTCCGGCCTGACGAGCATATTGCTTGGAGGGTGAAGTCAGGTATTTCTGGGGATCCAAACACAGAACTGACGAGAGTTTTTACTACACTTTTGAAGTGATATGTTTGGCTTTACTTTGTTGTGTT
TCACAACATGAGTGGTTTTAGAGATTTCACGTAGAAAAGTGCAGTACTTGTAGCCATTGGCATATATTATTGGGAACTTGGGGAACATTGCAGAATGAATTGTATTACTCTTCATAAAGTTTAAAATTGTACCATTTTAACTTTTAAAGTAAAATAAAGATTATTTACCGTACGATTTGTAGACTCTATATTGACGATAATTGATTT
mRNA sequence
CAACAATTTTCTAATAATTAAATAAATAAATAAACAATAAGGGAACCCACGCCACATCTCTCTCCATCGCACTGCCGCAAGCCGTCCACCCTCGCGCCGCCGTCATTCTGCCGCCACCTCCGAGGAGCCTCTGTGGTCGAAGCCAAAATGTGGAACAAATTAAGAAAAATCACATTAAAACAAATTAAGTGATCTTCTGAGCTTCTTCTCGATCAGAGCCATGGGGTTTTTGGGTTTTTTCAGGAGGTTTAATGGCCTTCAGAAATTTGATGCAACCCTTAGAGCAATACCAGTTGGGTACGTTCAGAGTAGATGCTTATCAAATTCCAAGCTTTTCCATGGCGGCAAAGAAACAATGGTTCCGGTTTTAATTGTTGGCGCAGGACCTGTTGGTCTCGTCCTTGCTATTCTTCTCACTAAATTAGGGGTTAAATGTGCAGTTGTGGAGAAGAACAGAAGTTTTTCTAAACATCCGCAAGCTCACTTCATAAATAACAGATCCATGGAGGTATTTCGCAAATTGGATGGACTAGCAGAGGAGATACAATTACATCAACCACCTGTCGAGTCATGGAGAAAGTTCATATATTGTACTTCACTGAATGGTACAATTCTCGGATCTGTAGACCATATGCAACCACAAGATTTTGAGCACATCACCAGCCCAGTTTCTGTTGCACATTTCTCCCAGTACAAATTAAACAGGTTACTACTTAAGCAACTTCAAAATCTTGGGTTTCAAGTCTGTACACCGGAAAGCTTGGAAGGCCCCTGTGTAGTAAGAGAAAAGAAAATACTTCTGGGGCATGAGTGTGTTTCTATTGATGCTACTGATGAGACAATAACCATGACTGCATCTTATCTCAAGGAAGGGACACATATTAAGAGGAGGAATATCAGCTGTAATGTCCTTGTTGGTGCAGACGGTGCTGGAAGTTCTGTCCGGAGACTAGTAGGCATAGAAATGAAGGGTGAAAATGACTTACAAAAGCTTGTAAGCGTCCATTTCTTTAGTAGAGAGCTTGGTGAGTATTTGCTAAAAGAGAGGCCTGGTATGCTATATTTCGTCTTTAACACTGAAGCTATTGGGGTTCTTGTTGCTCATGATCTCAAGCAAGGCGAATTTATATTGCAGGTACCATTCTATCCTCCTCAACAAAATATTGGAGATTTCTGCCCTAAGATGTGTGAGGAGTTAATCTTCAAATTGGTTGGTCAAAACCTTTGTGACATAGATGTGCAAGATGTAAAACCTTGGATTATGCATGCTGAAGTTGCCGAGAAGTTCATATGCTGTCAAAGTCGTGTATTGCTTGCTGGTGATGCTGCTCATCGATTTCCTCCAGCTGGTGGATTTGGAATGAATACTGGAATTCAAGACGTCCATAATCTTGCCTGGAAATTAGCTGCAGTGCTACAAGATATTGCATCACCTTCAATATTAAATACTTATGAAATGGAAAGGAGGCCGATAGCACTATTCAATACGGCCCTTAGTGTTAAGAACTTCAAAGCAGCCATGGAAGTGCCTGCAGCTCTTGGTCTGGATCCAAAGATTGCAAACTCTGTGCACCGAGTAGTTAACCATGGCCTTGGTTCTATTTTATCATCCTCACAACAGAGCGCAGTTTTGGATGGAATTTTTAAGATAGGTCGTTTGCAGCTCTCAGATTCATTTCTGAATGATGGAAATCCCGTTGGATCTTCAAGACTCGCAAAACTGAGACAGATATTTGATGAAGGGAAGAGCCTTCAACTTCAGTTCCCTGCGGAGGATCTTGGTTTCAGGTACTCCGAAGGGGCAATAATTCCTGATGATTCTCTCCTTGGTGGTCGAGAAGAACCTACAGGTCGTCGGAGACAGTACGTCCCTTCTGCAGATCCAGGATCAAGGCTGCCTCATATGAATGTGAGGGTTTTGGCCAGTGAGGAGATTTTTTCCACGCTCGATCTTGTATCTGGGGATGTAGTCCAATTCCTTCTCATAATAGGTCCGCGAC
CGGAGTCTTACTGTCTTGCTCACGCTACTCTCAAGGTAGCAGAGGAATTCAAAATTTCTGTTAGGGTATGCATTTTATGGTCTGCTGATACCACCAAGATTCAGTCAAGCAGCAAGGAAGAACTAACACCTTGGAAGAACTACATTGATGTTCAAGAAATTCGGCAATGGTCAACTTCACCGTCATGGTGGGACGTTTGTCAGATGACCGACAAAGGAGCAATCCTAGTCCGGCCTGACGAGCATATTGCTTGGAGGGTGAAGTCAGGTATTTCTGGGGATCCAAACACAGAACTGACGAGAGTTTTTACTACACTTTTGAAGTGATATGTTTGGCTTTACTTTGTTGTGTTTCACAACATGAGTGGTTTTAGAGATTTCACGTAGAAAAGTGCAGTACTTGTAGCCATTGGCATATATTATTGGGAACTTGGGGAACATTGCAGAATGAATTGTATTACTCTTCATAAAGTTTAAAATTGTACCATTTTAACTTTTAAAGTAAAATAAAGATTATTTACCGTACGATTTGTAGACTCTATATTGACGATAATTGATTT
Coding sequence (CDS)
ATGGGGTTTTTGGGTTTTTTCAGGAGGTTTAATGGCCTTCAGAAATTTGATGCAACCCTTAGAGCAATACCAGTTGGGTACGTTCAGAGTAGATGCTTATCAAATTCCAAGCTTTTCCATGGCGGCAAAGAAACAATGGTTCCGGTTTTAATTGTTGGCGCAGGACCTGTTGGTCTCGTCCTTGCTATTCTTCTCACTAAATTAGGGGTTAAATGTGCAGTTGTGGAGAAGAACAGAAGTTTTTCTAAACATCCGCAAGCTCACTTCATAAATAACAGATCCATGGAGGTATTTCGCAAATTGGATGGACTAGCAGAGGAGATACAATTACATCAACCACCTGTCGAGTCATGGAGAAAGTTCATATATTGTACTTCACTGAATGGTACAATTCTCGGATCTGTAGACCATATGCAACCACAAGATTTTGAGCACATCACCAGCCCAGTTTCTGTTGCACATTTCTCCCAGTACAAATTAAACAGGTTACTACTTAAGCAACTTCAAAATCTTGGGTTTCAAGTCTGTACACCGGAAAGCTTGGAAGGCCCCTGTGTAGTAAGAGAAAAGAAAATACTTCTGGGGCATGAGTGTGTTTCTATTGATGCTACTGATGAGACAATAACCATGACTGCATCTTATCTCAAGGAAGGGACACATATTAAGAGGAGGAATATCAGCTGTAATGTCCTTGTTGGTGCAGACGGTGCTGGAAGTTCTGTCCGGAGACTAGTAGGCATAGAAATGAAGGGTGAAAATGACTTACAAAAGCTTGTAAGCGTCCATTTCTTTAGTAGAGAGCTTGGTGAGTATTTGCTAAAAGAGAGGCCTGGTATGCTATATTTCGTCTTTAACACTGAAGCTATTGGGGTTCTTGTTGCTCATGATCTCAAGCAAGGCGAATTTATATTGCAGGTACCATTCTATCCTCCTCAACAAAATATTGGAGATTTCTGCCCTAAGATGTGTGAGGAGTTAATCTTCAAATTGGTTGGTCAAAACCTTTGTGACATAGATGTGCAAGATGTAAAACCTTGGATTATGCATGCTGAAGTTGCCGAGAAGTTCATATGCTGTCAAAGTCGTGTATTGCTTGCTGGTGATGCTGCTCATCGATTTCCTCCAGCTGGTGGATTTGGAATGAATACTGGAATTCAAGACGTCCATAATCTTGCCTGGAAATTAGCTGCAGTGCTACAAGATATTGCATCACCTTCAATATTAAATACTTATGAAATGGAAAGGAGGCCGATAGCACTATTCAATACGGCCCTTAGTGTTAAGAACTTCAAAGCAGCCATGGAAGTGCCTGCAGCTCTTGGTCTGGATCCAAAGATTGCAAACTCTGTGCACCGAGTAGTTAACCATGGCCTTGGTTCTATTTTATCATCCTCACAACAGAGCGCAGTTTTGGATGGAATTTTTAAGATAGGTCGTTTGCAGCTCTCAGATTCATTTCTGAATGATGGAAATCCCGTTGGATCTTCAAGACTCGCAAAACTGAGACAGATATTTGATGAAGGGAAGAGCCTTCAACTTCAGTTCCCTGCGGAGGATCTTGGTTTCAGGTACTCCGAAGGGGCAATAATTCCTGATGATTCTCTCCTTGGTGGTCGAGAAGAACCTACAGGTCGTCGGAGACAGTACGTCCCTTCTGCAGATCCAGGATCAAGGCTGCCTCATATGAATGTGAGGGTTTTGGCCAGTGAGGAGATTTTTTCCACGCTCGATCTTGTATCTGGGGATGTAGTCCAATTCCTTCTCATAATAGGTCCGCGACCGGAGTCTTACTGTCTTGCTCACGCTACTCTCAAGGTAGCAGAGGAATTCAAAATTTCTGTTAGGGTATGCATTTTATGGTCTGCTGATACCACCAAGATTCAGTCAAGCAGCAAGGAAGAACTAACACCTTGGAAGAACTACATTGATGTTCAAGAAATTCGGCAATGGTCAACTTCACCGTCATGGTGGGACGTTTGTCAGATGACCGACAAAGGAGC
AATCCTAGTCCGGCCTGACGAGCATATTGCTTGGAGGGTGAAGTCAGGTATTTCTGGGGATCCAAACACAGAACTGACGAGAGTTTTTACTACACTTTTGAAGTGA
Protein sequence
MGFLGFFRRFNGLQKFDATLRAIPVGYVQSRCLSNSKLFHGGKETMVPVLIVGAGPVGLVLAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEIQLHQPPVESWRKFIYCTSLNGTILGSVDHMQPQDFEHITSPVSVAHFSQYKLNRLLLKQLQNLGFQVCTPESLEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKRRNISCNVLVGADGAGSSVRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYFVFNTEAIGVLVAHDLKQGEFILQVPFYPPQQNIGDFCPKMCEELIFKLVGQNLCDIDVQDVKPWIMHAEVAEKFICCQSRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIALFNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSILSSSQQSAVLDGIFKIGRLQLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAEDLGFRYSEGAIIPDDSLLGGREEPTGRRRQYVPSADPGSRLPHMNVRVLASEEIFSTLDLVSGDVVQFLLIIGPRPESYCLAHATLKVAEEFKISVRVCILWSADTTKIQSSSKEELTPWKNYIDVQEIRQWSTSPSWWDVCQMTDKGAILVRPDEHIAWRVKSGISGDPNTELTRVFTTLLK
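The protein sequence above is consistent with a translation of the CDS under the standard genetic code. A minimal sketch of that translation follows, assuming the standard code (NCBI translation table 1); the names `translate` and `CODON_TABLE` are illustrative and not part of any database tooling. Only the first and last few codons of the CDS are used as inputs here.

```python
# Standard genetic code (NCBI translation table 1), with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon to its one-letter amino acid; "*" marks a stop codon.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 12 codons of the CDS listed above.
cds_start = "ATGGGGTTTTTGGGTTTTTTCAGGAGGTTTAATGGC"
# Last 6 codons of the CDS listed above, including the TGA stop codon.
cds_end = "ACTACACTTTTGAAGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first twelve and last five residues of the protein sequence shown above. Passing the full CDS string to `translate` would be the natural next check.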
Homology
BLAST of Bhi07G000229 vs. TAIR 10
Match:
AT1G24340.1 (FAD/NAD(P)-binding oxidoreductase family protein)
HSP 1 Score: 933.7 bits (2412), Expect = 8.6e-272
Identity = 453/708 (63.98%), Positives = 560/708 (79.10%), Query Frame = 0
Query: 1 MGFLGFFRRFNGLQKFDATLRAIPVGYVQSRCLSNSKLFHGGKETMVPVLIVGAGPVGLV 60
M LG +R + ++ +R PV Y Q + LS++ LF+G +PVLIVGAGPVGLV
Sbjct: 1 MAILGLIKRVTRITVNNSRVRVYPVRYFQRKDLSSTNLFNGEDAAKLPVLIVGAGPVGLV 60
Query: 61 LAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEIQLHQPPVESWRK 120
L+ILLTKLGVKCAVV+K SFSKHPQAHFINNRSME+FR+LDGLAEEI+ QPPV+ WRK
Sbjct: 61 LSILLTKLGVKCAVVDKATSFSKHPQAHFINNRSMEIFRELDGLAEEIERSQPPVDLWRK 120
Query: 121 FIYCTSLNGTILGSVDHMQPQDFEHITSPVSVAHFSQYKLNRLLLKQLQNLGFQV---CT 180
FIYCTSL+G+ LG+VDHMQPQDFE + SP SVAHFSQYKL LLLK+L++LGF V
Sbjct: 121 FIYCTSLSGSTLGTVDHMQPQDFEKVVSPASVAHFSQYKLTNLLLKRLEDLGFHVRGSKE 180
Query: 181 PESLEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKRRNISCNVLVGADGA 240
+ LE VV ++IL+GHECV IDA ++IT T S+LK G H+K RNI C++LVGADGA
Sbjct: 181 SDGLEADSVV-ARQILMGHECVGIDANKDSITATVSFLKGGKHMK-RNIQCSLLVGADGA 240
Query: 241 GSSVRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYFVFNTEAIGVLVAHDL 300
GS+VR+L IEM+GE DLQKLVSVHF SRELGEYL+ RPGML+F+FNT+ IGVLVAHDL
Sbjct: 241 GSAVRKLTVIEMRGERDLQKLVSVHFMSRELGEYLISNRPGMLFFIFNTDGIGVLVAHDL 300
Query: 301 KQGEFILQVPFYPPQQNIGDFCPKMCEELIFKLVGQNLCDIDVQDVKPWIMHAEVAEKFI 360
QGEF+LQ+P+YPPQQ++ DF P+MC+ LIF LVG L D+DV D+KPW+MHAEVAEKF+
Sbjct: 301 LQGEFVLQIPYYPPQQSLSDFSPEMCKMLIFNLVGHELSDLDVADIKPWVMHAEVAEKFM 360
Query: 361 CCQSRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRP 420
CC++RV+LAGDAAHRFPPAGGFGMNTGIQD HNLAWK+AA++Q A+ SIL TYE ERRP
Sbjct: 361 CCENRVILAGDAAHRFPPAGGFGMNTGIQDAHNLAWKIAALVQGSANSSILKTYETERRP 420
Query: 421 IALFNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSILSSSQQSAVLDGIFKI 480
IAL NT+LSV+NF+AAM VP+ALGLDP +ANSVHR +N +GSIL + Q A+LD +F +
Sbjct: 421 IALSNTSLSVQNFRAAMSVPSALGLDPTVANSVHRFINKTVGSILPTGLQKAILDNVFAL 480
Query: 481 GRLQLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAEDLGFRYSEGAIIPD-DSLL 540
GR QLS+S LN+ NP+G+ RL++L+ IF+ GKSLQLQFPAEDLGFRY EGAI+PD +S
Sbjct: 481 GRAQLSESLLNESNPLGNQRLSRLKSIFEGGKSLQLQFPAEDLGFRYLEGAIVPDNESEA 540
Query: 541 GGREEPTGRRRQYVPSADPGSRLPHMNVRVLAS---EEIFSTLDLVSGDVVQFLLIIGPR 600
G E P+GRRR YVP A+PGSRLPHM V++L+ E I STLDLVS + V+FLLII P
Sbjct: 541 GDPEVPSGRRRDYVPCAEPGSRLPHMYVKILSDSTREVIVSTLDLVSTEKVEFLLIISPL 600
Query: 601 PESYCLAHATLKVAEEFKISVRVCILWSADTTKIQSSSKEELTPWKNYIDVQEI-RQWST 660
ESY LAHAT KVA+EF SV+VC++W + ++ S L PW+NY+DV E+ +Q
Sbjct: 601 QESYELAHATFKVAKEFMASVKVCVVWPSSDDGLERKSNSALAPWENYVDVMEVKKQNGE 660
Query: 661 SPSWWDVCQMTDKGAILVRPDEHIAWRVKSGISGDPNTELTRVFTTLL 701
SWW +C+M+++G+ILVRPD+HIAWR KSGI+ DP + VFT +L
Sbjct: 661 GTSWWSICKMSERGSILVRPDQHIAWRAKSGITLDPTLHMRDVFTIIL 706
BLAST of Bhi07G000229 vs. ExPASy Swiss-Prot
Match:
P42534 (Putative polyketide hydroxylase OS=Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) OX=100226 GN=SCO5321 PE=3 SV=2)
HSP 1 Score: 196.1 bits (497), Expect = 1.4e-48
Identity = 194/678 (28.61%), Positives = 289/678 (42.63%), Query Frame = 0
Query: 41 GGKETMVPVLIVGAGPVGLVLAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRK 100
G + VPVL+VG VGL ++ L +LGV+ +VE++ S HP+ N R+ME+FR
Sbjct: 15 GDRTHRVPVLVVGGSLVGLSTSVFLGRLGVRHTLVERHAGTSIHPRGRGNNVRTMEIFR- 74
Query: 101 LDGLAEEIQLHQPPVESWRKFIYCTSLNGT----ILGSVDHMQPQDFEHITSPVSVAHFS 160
+ G +I+ + + +L G + +D P SP S S
Sbjct: 75 VAGTEPDIRRAAATLADNHGILQAPTLAGDAGEWLFKQID---PGGGLARFSPSSWCLCS 134
Query: 161 QYKLNRLLLKQLQNLGFQVCTPESLEGPCVVREKKILLGHECVSIDATDETITMTASYLK 220
Q L LL NLG + G E +S +A E +T +
Sbjct: 135 QNDLEPELLTHATNLG-----------------GDLRFGTELLSFEADTEGVTAIVKSRE 194
Query: 221 EGTHIKRRNISCNVLVGADGAGSSVRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKER 280
G H I + LV ADG S VR +GI G DL VS+ F SR L + ++ +R
Sbjct: 195 TGEH---TTIRADYLVAADGPRSPVREQLGIGQSGPGDLFHNVSITFRSRRLAD-VVGDR 254
Query: 281 PGMLYFVFNTEAIGVLVAHDLKQGEFILQVPFYPPQ-QNIGDFCPKMCEELIFKLVGQNL 340
++ ++ + A G L+ D ++ ++ P++P Q + + DF + C I + +G
Sbjct: 255 RFIVCYLTDENADGALLPVDNRE-NWVFHAPWHPEQGETVEDFTDERCAAHIRRAIGDPD 314
Query: 341 CDIDVQDVKPWIMHAEVAEKFICCQSRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKL 400
D+++ PW VA + RVLLAGD+AH P G FG NTGIQD HNLAWKL
Sbjct: 315 LDVEITGKAPWHAAQRVARSY--RSGRVLLAGDSAHEMSPTGAFGSNTGIQDAHNLAWKL 374
Query: 401 AAVLQDIASPSILNTYEMERRPIALFNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVN 460
AAVL+ A ++L+TY+ ERRP+A A S + ++E P +A
Sbjct: 375 AAVLEGWAGEALLDTYDTERRPVA---EATSARAAHRSVEHSHPGFAPPPVAGG----GG 434
Query: 461 HGLGSILSSSQQSAVLDGIFKIGRLQLSDSFLNDGNPVGSSRLAKLRQIFDEGKS----- 520
G G+ + + + G G L G P G G +
Sbjct: 435 PGAGTPGGAGRGTGGPGGPGGPGGLGGPGGPGGTGGPGGPGGPGGPDGPRGAGGAPGGGP 494
Query: 521 --------LQLQFPAEDLGFRYSEGAIIPDDSLLGGREEPTGRRRQYVPSADPGSRLPHM 580
Q LG+RY GA++ D E + PGSR PH+
Sbjct: 495 GGGPGGGGPQRGILNVALGYRYPRGAVVGADPATPVVPEGLDL------TGAPGSRAPHL 554
Query: 581 NVRVLASEEIFSTLDLVSGDVVQFLLIIGPRPESYCLAHATLKVAEEFKISVRVCILWSA 640
VR ++ STLDL +V LL +P + A VA ++ ++
Sbjct: 555 WVR--RGQDRLSTLDLYEDSLV--LLSDAAQPTGW--HEAAAGVAAGMRVPLK------- 614
Query: 641 DTTKIQSSSKEELTPWKNYIDVQEIRQWSTSPSWWDVCQMTDKGAILVRPDEHIAWRVKS 700
+ ++ S +L P D +E W +T GA+LVRPD +AWR
Sbjct: 615 -SYRVGGSPGADLNP-----DDEE-------TDWARAHGVTRGGAVLVRPDGFVAWR-SP 624
BLAST of Bhi07G000229 vs. ExPASy Swiss-Prot
Match:
Q8KN28 (2,4-dichlorophenol 6-monooxygenase OS=Delftia acidovorans OX=80866 GN=tfdB PE=1 SV=3)
HSP 1 Score: 191.8 bits (486), Expect = 2.6e-47
Identity = 184/664 (27.71%), Positives = 269/664 (40.51%), Query Frame = 0
Query: 49 VLIVGAGPVGLVLAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEI 108
VL+VG+GP G +LL GVK V K + S+ P++H N R+MEV R L GL E
Sbjct: 12 VLVVGSGPAGAASTLLLATYGVKTLCVSKYATTSRTPRSHITNQRTMEVMRDL-GLELEC 71
Query: 109 QLHQPPVESWRKFIYCTSLNGTILGSV----DHMQPQDFEHITSPVSVAHFSQYKLNRLL 168
+ P E + +YCTSL G LG V H Q + + SP + Q L ++
Sbjct: 72 EAMASPAELMGENVYCTSLVGDELGRVLTWGTHPQRRADYELASPTHMCDLPQNLLEPIM 131
Query: 169 LKQLQNLGFQVCTPESLEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKRR 228
+ G V E VS+ + +T T +++ ++
Sbjct: 132 INHAARRGADV-----------------RFHTEFVSLKQDETGVTAT---VRDHLLDRQY 191
Query: 229 NISCNVLVGADGAGSSVRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYFVF 288
+I L+GADGA S V VG+ M+G+ + ++V F +L +Y + RP +LY+V
Sbjct: 192 DIRAKYLIGADGANSQVVDQVGLPMEGKMGVSGSINV-VFEADLTKY-VGHRPSVLYWVI 251
Query: 289 ----NTEAIGVLVAHDLKQGEFILQVPFYPPQQNIGDFCPKMCEELIFKLVGQNLCDIDV 348
+ +G+ V ++ L + Y D +++ L+G + + +
Sbjct: 252 QPGSSVGGLGIGVIRMVRPWNKWLCIWGYDIAGGPPDLNEAHARQIVHSLLGDSTIPVKI 311
Query: 349 QDVKPWIMHAEVAEKFICCQSRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQ 408
+ W ++ A + +RV GDA HR PP G G NT IQD NL WKL+ VLQ
Sbjct: 312 ESTSTWTVNDMYATRLF--DNRVFCMGDAVHRHPPTNGLGSNTSIQDAFNLCWKLSHVLQ 371
Query: 409 DIASPSILNTYEMERRPIALFNTALSVKNFKAAMEVPAALGL-DPKIANSVHRVVNHGLG 468
A P +L TY ER P+A + K+ + AALGL D K + R
Sbjct: 372 GKAGPELLATYNEERAPVARQVVQRANKSLGDFPPILAALGLFDTKDPEQMQR------- 431
Query: 469 SILSSSQQSAVLDGIFKIGRLQLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAED 528
I RL+ + +P + A LR D G +
Sbjct: 432 ----------------NIARLK-------EQSPEAQEQRAALRAAID-GTQYVYNAHGVE 491
Query: 529 LGFRYSEGAIIPDDSLLGGREEPTGRRRQ---YVPSADPGSRLPHMNVRVLASEEIFSTL 588
+ RY AI+PD G +P RR + S PG+ +PH V V STL
Sbjct: 492 MNQRYQSAAIVPD-----GTPDPGFRRDSELYHAHSGRPGAPVPH--VWVTRHGRRVSTL 551
Query: 589 DLVSGDVVQFLLIIGPRPESYCLAHATLKVAEEFKISVRVCILWSADTTKIQSSSKEELT 648
DL L I P HA AE I + V I+ E+L
Sbjct: 552 DLCGKGRFSLLSGIAGSPWVEAAVHA----AESLGIDLDVHIIG-------PGQELEDL- 584
Query: 649 PWKNYIDVQEIRQWSTSPSWWDVCQMTDKGAILVRPDEHIAWRV---KSGISGDPNTELT 698
Y D +R ++ + GA+LVRPD I WR + G + L
Sbjct: 612 ----YGDFARVR------------EIEESGALLVRPDNFICWRAMRWQEGSGDELRAALK 584
BLAST of Bhi07G000229 vs. ExPASy Swiss-Prot
Match:
Q05355 (Putative polyketide hydroxylase OS=Streptomyces halstedii OX=1944 GN=schC PE=3 SV=1)
HSP 1 Score: 186.4 bits (472), Expect = 1.1e-45
Identity = 186/660 (28.18%), Positives = 273/660 (41.36%), Query Frame = 0
Query: 47 VPVLIVGAGPVGLVLAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAE 106
VPVL+VG VGL ++ L +LGV+ +VE++ S HP+ N R+MEV+R G+ +
Sbjct: 15 VPVLVVGGSLVGLSTSVFLGRLGVRHMLVERHAGTSVHPRGRGNNVRTMEVYRAA-GVEQ 74
Query: 107 EIQLHQPPVESWRKFIYCTSLNGT----ILGSVDHMQPQDFEHITSPVSVAHFSQYKLNR 166
I+ + + SL G +L +D P SP S SQ L
Sbjct: 75 GIRRAAATLAGNHGILQTPSLVGDEGEWLLRDID---PGGGLARFSPSSWCLCSQNDLEP 134
Query: 167 LLLKQLQNLGFQVCTPESLEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIK 226
+LL LG +I E S + +T + G H
Sbjct: 135 VLLDHAVELG-----------------GEIRFSTELQSFEQDPAGVTAVIKSRRSGEH-- 194
Query: 227 RRNISCNVLVGADGAGSSVRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYF 286
+ + LV ADG S VR +GI G DL VSV F SR L E+ + +R ++ +
Sbjct: 195 -TTVRADYLVAADGPRSPVREQLGIGQSGPGDLFHNVSVTFRSRRLAEF-VGDRHFIVCY 254
Query: 287 VFNTEAIGVLVAHDLKQGEFILQVPFYP-PQQNIGDFCPKMCEELIFKLVGQNLCDIDVQ 346
+ N EA G L+ D ++ ++ P+YP + + DF + C + I + VG D+++
Sbjct: 255 LTNPEADGALLPVDNRE-NWVFHAPWYPRAARPLEDFTDERCADHIRRAVGVPDLDVEIT 314
Query: 347 DVKPWIMHAEVAEKFICCQSRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQD 406
PW VA ++ RV LAGD+AH P G FG NTGIQD HNLAWKLAAVL
Sbjct: 315 GKAPWHAAQRVARQYRA--GRVFLAGDSAHEMSPTGAFGSNTGIQDAHNLAWKLAAVLGG 374
Query: 407 IASPSILNTYEMERRPIALFNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSI 466
A +L+TY+ ERRP+A TA + A + G P S
Sbjct: 375 WAGDGLLDTYDAERRPVAEATTARAA----ARSAEHSHPGFAPPPGTS------------ 434
Query: 467 LSSSQQSAVLDGIFKIGRLQLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAEDLG 526
G P +G L + LG
Sbjct: 435 ----------------------------GGP--------------QGGILNVA-----LG 494
Query: 527 FRYSEGAIIPDDSLLGGREEPTGRRRQYVPSADPGSRLPHMNVRVLASEEIFSTLDLVSG 586
+RY GA++ D E + +PGSR PH+ + E STLDL
Sbjct: 495 YRYPGGAVLGADPATPVVPE------ALTLAGEPGSRAPHL--WMSRRGERLSTLDLYER 552
Query: 587 DVVQFLLIIGPRPESYCLAHATLKVAEEFKISVRVCILWSADTTKIQSSSKEELTPWKNY 646
V P+++ + +++AEE + + + ++ S+ +LTP
Sbjct: 555 SPVLLSDADAGAPDAW--HESAVRLAEELSVPL--------TSYRVGRSAGADLTPED-- 552
Query: 647 IDVQEIRQWSTSPSWWDVCQMTDKGAILVRPDEHIAWRVKSGI-SGDPNTELTRVFTTLL 701
DV + T P GA+LVRPD +AWR + + + + L V TT+L
Sbjct: 615 -DVNWTARHGTPPG----------GAVLVRPDGFVAWRSQEPVPAEETEPTLRHVLTTVL 552
BLAST of Bhi07G000229 vs. ExPASy Swiss-Prot
Match:
P27138 (2,4-dichlorophenol 6-monooxygenase OS=Cupriavidus pinatubonensis (strain JMP 134 / LMG 1197) OX=264198 GN=tfdB PE=3 SV=1)
HSP 1 Score: 175.6 bits (444), Expect = 1.9e-42
Identity = 167/645 (25.89%), Positives = 269/645 (41.71%), Query Frame = 0
Query: 49 VLIVGAGPVGLVLAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEI 108
VL+VG GP G LL + GV+ ++ K + P+AH N R+ME+ R L GL E
Sbjct: 9 VLVVGTGPAGASAGALLARYGVRTMLINKYNWTAPTPRAHITNQRTMEILRDL-GLEAEA 68
Query: 109 QLHQPPVESWRKFIYCTSLNGTILGSV-----DHMQPQDFEHITSPVSVAHFSQYKLNRL 168
+L+ P + + C SL G G + D + D++ SP S+ Q L +
Sbjct: 69 RLYAAPNDLMGENTICASLAGEEFGRIRTWGTDVRRRADYDE-CSPTSMCDLPQNYLEPI 128
Query: 169 LLKQLQNLGFQVCTPESLEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKR 228
L+K +L+G C VR LGHE + +S L++ + +
Sbjct: 129 LVKS-----------AALDG-CKVRFDTEYLGHE--------QDADGVSSRLRDRLNGEE 188
Query: 229 RNISCNVLVGADGAGSSVRRLVGIEMKGENDLQKLVSVH-FFSRELGEYLLKERPGMLYF 288
+ L+GADGA S V + +++ E + K S++ F +L Y + RP +LY+
Sbjct: 189 FTVRSKYLIGADGANSRV--VSDLDLPLEGTMGKSGSINLLFEADLDRY-VAHRPSVLYW 248
Query: 289 VF----NTEAIGVLVAHDLKQGEFILQVPFYPPQQNIGDFCPKMCEELIFKLVGQNLCDI 348
V + +G+ V ++ L + Y +Q + ++ L+G + +
Sbjct: 249 VIQPGSDIGGLGIGVVRMVRPWNKWLAIWGYDVEQGPPEISESFARRIVHNLIGDDSVPL 308
Query: 349 DVQDVKPWIMHAEVAEKFICCQSRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAV 408
++ + W ++ A + Q RV AGDA HR PP G G NT IQD NLAWK+A V
Sbjct: 309 KIEGISTWTVNDMYATRL--QQGRVFCAGDAVHRHPPTNGLGSNTSIQDSFNLAWKIAMV 368
Query: 409 LQDIASPSILNTYEMERRPIALFNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGL 468
L A S+L+TY +ER PIA + K+ + + ALGL P+ ++
Sbjct: 369 LNGTADESLLDTYTIERAPIAKQVVCRANKSLEDFPPIAMALGL-PQAKSA--------- 428
Query: 469 GSILSSSQQSAVLDGIFKIGRLQLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAE 528
++ + + + P ++ +LR+ G +
Sbjct: 429 -------------------DEMKSNMARRKEPGPEAQAQRTRLREAI-AGTNYVYNAHGV 488
Query: 529 DLGFRYSEGAIIPDDSLLGGREEPTGRRRQYVPSADPGSRLPHMNVRVLASEEIFSTLDL 588
++ RY AI+ D+S E + S PG+ +PH+ V ST DL
Sbjct: 489 EMNQRYDSPAIVADNS---PDEVFRDVELYHQASTRPGAPMPHVWVYASGDGHRISTKDL 548
Query: 589 V-SGDVVQFLLIIGPRPESYCLAHATLKVAEEFKISVRVCILWSADTTKIQSSSKEELTP 648
G+ F I G + A V+ + ++V V I+ +
Sbjct: 549 CGKGNFTLFTGIGGAAWQD-----AAAAVSRQLGVAVTVRIIGPGQAYE----------- 564
Query: 649 WKNYIDVQEIRQWSTSPSWWDVCQMTDKGAILVRPDEHIAWRVKS 683
+Y D I ++ D GAILVRPD H+A+R S
Sbjct: 609 -DHYGDFARI------------SEIIDTGAILVRPDFHVAYRATS 564
BLAST of Bhi07G000229 vs. ExPASy Swiss-Prot
Match:
P31020 (Phenol 2-monooxygenase OS=Pseudomonas sp. (strain EST1001) OX=69012 GN=pheA PE=3 SV=1)
HSP 1 Score: 156.4 bits (394), Expect = 1.2e-36
Identity = 168/664 (25.30%), Positives = 265/664 (39.91%), Query Frame = 0
Query: 49 VLIVGAGPVGLVLAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEI 108
VLIVG+GP G A+ L+ G+ ++ K R + P+AH N R+ME+ R G+ +++
Sbjct: 38 VLIVGSGPAGSSAAMFLSTQGISNIMITKYRWTANTPRAHITNQRTMEILRDA-GIEDQV 97
Query: 109 QLHQPPVESWRKFIYCTSLNGTILG-----SVDHMQPQDFEHITSPVSVAHFSQYKLNRL 168
P E +YC S+ G +G + D+E + SP Q L +
Sbjct: 98 LAEAVPHELMGDTVYCESMAGEEIGRRPTWGTRPDRRADYE-LASPAMPCDIPQTLLEPI 157
Query: 169 LLKQLQNLGFQVCTPESLEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKR 228
+LK +R + E +S D+ +++ G +
Sbjct: 158 MLKN-----------------ATMRGTQTQFSTEYLSHTQDDKGVSVQVLNRLTG---QE 217
Query: 229 RNISCNVLVGADGAGSSVRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYFV 288
I L+GADGA S V +G M N K H+ L L P + +
Sbjct: 218 YTIRAKYLIGADGARSKVAADIGGSM---NITFKADLSHWRPSALDPVL--GLPPRIEYR 277
Query: 289 FNTEAIGVLVAHDLKQGEFILQVPF----YPPQQNIGDFCPKMCEELIFKLVGQNLCDID 348
+ +V E+++ F PP+ N + +++ LVG D++
Sbjct: 278 WPRRWFDRMVR---PWNEWLVVWGFDINQEPPKLNDDE-----AIQIVRNLVGIEDLDVE 337
Query: 349 VQDVKPWIMHAEVAEKFICCQSRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVL 408
+ W + + A + RV AGDA H+ PP+ G G NT IQD +NL WKLA VL
Sbjct: 338 ILGYSLWGNNDQYATHL--QKGRVCCAGDAIHKHPPSHGLGSNTSIQDSYNLCWKLACVL 397
Query: 409 QDIASPSILNTYEMERRPIALFNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLG 468
+ A P +L TY ER PIA ++V G
Sbjct: 398 KGQAGPELLETYSTERAPIA-------------------------------KQIVTRANG 457
Query: 469 SILSSSQQSAVLDGIFKIGRLQLSDSFL------NDGNPVGSSRLAKLRQIFDEGKSLQL 528
SSS+ + D + + +D F+ + +P G+ R A LR D K +
Sbjct: 458 ---SSSEYKPIFDAL-GVTDATTNDEFVEKLALRKENSPEGARRRAALRAALD-NKDYEF 517
Query: 529 QFPAEDLGFRYSEGAIIPDDSLLGGREEPTGRRRQYVPSADPGSRLPHMNVRVLASEEIF 588
++G Y A+I D E Q S PG RLPH + ++E +
Sbjct: 518 NAQGTEIGQFYDSSAVITDGQKRPAMTEDPMLHHQ--KSTFPGLRLPH--AWLGDAKEKY 577
Query: 589 STLDLVSGDVVQFLLIIGPRPESYCLAHATLKVAEEFKISVRVCILWSADTTKIQSSSKE 648
ST D+ G +F + G +++ A A ++VAE I ++ ++
Sbjct: 578 STHDIAEG--TRFTIFTGITGQAW--ADAAVRVAERLGIDLKAVVIGEGQ---------- 595
Query: 649 ELTPWKNYIDVQEIRQWSTSPSWWDVCQMTDKGAILVRPDEHIAWRVKSGISGDPNTELT 698
VQ++ W ++ + G ILVRPD+HI WR +S ++ DP T L
Sbjct: 638 ---------PVQDL-----YGDWLRQREVDEDGVILVRPDKHIGWRAQSMVA-DPETALF 595
BLAST of Bhi07G000229 vs. ExPASy TrEMBL
Match:
A0A1S3BZD5 (putative polyketide hydroxylase OS=Cucumis melo OX=3656 GN=LOC103495067 PE=4 SV=1)
HSP 1 Score: 1301.2 bits (3366), Expect = 0.0e+00
Identity = 633/701 (90.30%), Positives = 671/701 (95.72%), Query Frame = 0
Query: 1 MGFLGFFRRFNGLQKFDATLRAIPVGYVQSRCLSNSKLFHGGKETMVPVLIVGAGPVGLV 60
MGFLGFFRRFNGLQKFDAT RAIP+GYVQ R SNSKLFHGG ETMVPVLIVGAGPVGLV
Sbjct: 9 MGFLGFFRRFNGLQKFDATPRAIPLGYVQCRGSSNSKLFHGGDETMVPVLIVGAGPVGLV 68
Query: 61 LAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEIQLHQPPVESWRK 120
LAILLTKLG+KCA+VEKNRSFSKHPQAHFINNR+MEVFRKLDGLAE+IQL+QPPVESWRK
Sbjct: 69 LAILLTKLGIKCAIVEKNRSFSKHPQAHFINNRTMEVFRKLDGLAEKIQLYQPPVESWRK 128
Query: 121 FIYCTSLNGTILGSVDHMQPQDFEHITSPVSVAHFSQYKLNRLLLKQLQNLGFQVCTPES 180
FIYCTSLNGTILGSVDHMQPQDFEHI SPVSVAHFSQYKLNRLLLKQLQNLGFQV +P+S
Sbjct: 129 FIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVGSPDS 188
Query: 181 LEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKRRNISCNVLVGADGAGSS 240
LEGPCVVREKKILLGHECVSIDATDE++ MTASYLKEG H++RRNISCN+LVGADGAGS+
Sbjct: 189 LEGPCVVREKKILLGHECVSIDATDESVNMTASYLKEGKHVERRNISCNILVGADGAGST 248
Query: 241 VRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYFVFNTEAIGVLVAHDLKQG 300
VRRLVG+EMKGENDLQKLVS+HFFSRELGEYLLK+RPGMLYF+FNTEAIGVLVAHDLKQG
Sbjct: 249 VRRLVGVEMKGENDLQKLVSIHFFSRELGEYLLKDRPGMLYFIFNTEAIGVLVAHDLKQG 308
Query: 301 EFILQVPFYPPQQNIGDFCPKMCEELIFKLVGQNLCDIDVQDVKPWIMHAEVAEKFICCQ 360
EFILQVPFYPPQQNI DFCP MC ELIFKLVGQNLCDIDVQDVKPWIMHAEVAEKFICC+
Sbjct: 309 EFILQVPFYPPQQNIEDFCPAMCNELIFKLVGQNLCDIDVQDVKPWIMHAEVAEKFICCR 368
Query: 361 SRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAL 420
+ VLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAL
Sbjct: 369 NHVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAL 428
Query: 421 FNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSILSSSQQSAVLDGIFKIGRL 480
FNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVN+GLGSILSSS QSAVLDGIFKIGRL
Sbjct: 429 FNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNNGLGSILSSSLQSAVLDGIFKIGRL 488
Query: 481 QLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAEDLGFRYSEGAIIPDDSLLGGRE 540
QLSD FLN NP+GSSRLAKLR IFDEGKSLQLQFPAEDLGFRYS+GAIIPD++LLGGRE
Sbjct: 489 QLSDIFLNVKNPIGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSDGAIIPDNTLLGGRE 548
Query: 541 EPTGRRRQYVPSADPGSRLPHMNVRVLASEEIFSTLDLVSGDVVQFLLIIGPRPESYCLA 600
EPTGRRRQY+PSADPGSRLPHMNVRVLASE+I STLDLVSGD ++FLLII PR ESY LA
Sbjct: 549 EPTGRRRQYIPSADPGSRLPHMNVRVLASEDIISTLDLVSGDKIEFLLIIAPRSESYHLA 608
Query: 601 HATLKVAEEFKISVRVCILWSADTTKIQSSSKEELTPWKNYIDVQEIRQWSTSPSWWDVC 660
HA KVAEEFK SV+VCILWSA TTKI+SSSK+ LTPW+NY+DV+EIRQ +TSPSWWD+C
Sbjct: 609 HAGFKVAEEFKTSVKVCILWSASTTKIESSSKDLLTPWENYVDVEEIRQSTTSPSWWDIC 668
Query: 661 QMTDKGAILVRPDEHIAWRVKSGISGDPNTELTRVFTTLLK 702
+MTDKGAILVRPDEHIAWRVKSGISGDPNTEL RVFTTLLK
Sbjct: 669 KMTDKGAILVRPDEHIAWRVKSGISGDPNTELMRVFTTLLK 709
BLAST of Bhi07G000229 vs. ExPASy TrEMBL
Match:
A0A0A0KLD9 (FAD_binding_3 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G056620 PE=4 SV=1)
HSP 1 Score: 1285.4 bits (3325), Expect = 0.0e+00
Identity = 629/701 (89.73%), Positives = 668/701 (95.29%), Query Frame = 0
Query: 1 MGFLGFFRRFNGLQKFDATLRAIPVGYVQSRCLSNSKLFHGGKETMVPVLIVGAGPVGLV 60
MGFLGFF+RFNGLQKFDA LR P+ +Q R SNSK+FHGG ETMVPVLIVGAGPVGLV
Sbjct: 1 MGFLGFFKRFNGLQKFDAMLRTKPLRNIQCRGSSNSKIFHGGDETMVPVLIVGAGPVGLV 60
Query: 61 LAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEIQLHQPPVESWRK 120
LAILLTKLGVKCA+VEKN+SFSKHPQAHFINNR+MEVFRKLDGLAE+IQL+QPPVESWRK
Sbjct: 61 LAILLTKLGVKCAIVEKNKSFSKHPQAHFINNRTMEVFRKLDGLAEKIQLYQPPVESWRK 120
Query: 121 FIYCTSLNGTILGSVDHMQPQDFEHITSPVSVAHFSQYKLNRLLLKQLQNLGFQVCTPES 180
FIYCTSLNGTILGSVDHMQPQDFEHI SPVSVAHFSQYKLN LLLKQLQNLGFQVC+P+S
Sbjct: 121 FIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNGLLLKQLQNLGFQVCSPDS 180
Query: 181 LEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKRRNISCNVLVGADGAGSS 240
LEGPCVVREKKILLGHECVSIDATDE++ MTASYLKEG H++RRNISCN+LVGADGAGS+
Sbjct: 181 LEGPCVVREKKILLGHECVSIDATDESVNMTASYLKEGKHVERRNISCNILVGADGAGST 240
Query: 241 VRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYFVFNTEAIGVLVAHDLKQG 300
VRRLVGIEMKGENDLQKLVS+HFFSRELGEYLLK+RPGMLYF+FNTEAIGVLVAHDLKQG
Sbjct: 241 VRRLVGIEMKGENDLQKLVSIHFFSRELGEYLLKDRPGMLYFIFNTEAIGVLVAHDLKQG 300
Query: 301 EFILQVPFYPPQQNIGDFCPKMCEELIFKLVGQNLCDIDVQDVKPWIMHAEVAEKFICCQ 360
EFILQVPFYPPQQNI DF P+MCEELIFKLVG+NLCDIDV+DVKPWIMHAEVAEKFIC Q
Sbjct: 301 EFILQVPFYPPQQNIEDFFPQMCEELIFKLVGRNLCDIDVRDVKPWIMHAEVAEKFICRQ 360
Query: 361 SRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAL 420
+ VLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAL
Sbjct: 361 NHVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAL 420
Query: 421 FNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSILSSSQQSAVLDGIFKIGRL 480
FNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSILSSS QSAVLDGIFKIGRL
Sbjct: 421 FNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSILSSSLQSAVLDGIFKIGRL 480
Query: 481 QLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAEDLGFRYSEGAIIPDDSLLGGRE 540
QLSD+FLN NP+GSSRLAKLR IFDEGKSLQLQFPAEDLGFRYSEGAII D++LLGGRE
Sbjct: 481 QLSDTFLNVENPIGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGAIIRDNNLLGGRE 540
Query: 541 EPTGRRRQYVPSADPGSRLPHMNVRVLASEEIFSTLDLVSGDVVQFLLIIGPRPESYCLA 600
EPTGRRRQY+PSADPGSRLPHMNVRVLASEEI STLDLVSGD ++FLLII PR ESY LA
Sbjct: 541 EPTGRRRQYLPSADPGSRLPHMNVRVLASEEIISTLDLVSGDKIEFLLIIAPRSESYRLA 600
Query: 601 HATLKVAEEFKISVRVCILWSADTTKIQSSSKEELTPWKNYIDVQEIRQWSTSPSWWDVC 660
HA LKVAEEFK SV+VCILWSA+TTKI+SSSK++LTPW+NYI+VQEIRQ TSPSWWDVC
Sbjct: 601 HAALKVAEEFKTSVKVCILWSANTTKIESSSKDQLTPWENYIEVQEIRQSITSPSWWDVC 660
Query: 661 QMTDKGAILVRPDEHIAWRVKSGISGDPNTELTRVFTTLLK 702
+MTDKGAILVRPDEHIAWRVKSGISGDPNTEL VFTTLLK
Sbjct: 661 KMTDKGAILVRPDEHIAWRVKSGISGDPNTELIGVFTTLLK 701
BLAST of Bhi07G000229 vs. ExPASy TrEMBL
Match:
A0A6J1G1R0 (uncharacterized protein LOC111449881 OS=Cucurbita moschata OX=3662 GN=LOC111449881 PE=4 SV=1)
HSP 1 Score: 1253.8 bits (3243), Expect = 0.0e+00
Identity = 612/702 (87.18%), Positives = 652/702 (92.88%), Query Frame = 0
Query: 1 MGFLGFFRRFNGLQKFDATLRAIPVGYVQSRCLSNSKLFHGGKETMVPVLIVGAGPVGLV 60
MGFLGFF+RFNGLQKF+A+LRAIP+GYVQSR LSNSKLFHGG+ET VPVLIVGAGPVGLV
Sbjct: 34 MGFLGFFKRFNGLQKFNASLRAIPLGYVQSRGLSNSKLFHGGEETAVPVLIVGAGPVGLV 93
Query: 61 LAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEIQLHQPPVESWRK 120
LAILLTKLGVKCA++EKN SFS HPQAHFINNRSMEVFRKLDGLAEEIQL QPPV+SWRK
Sbjct: 94 LAILLTKLGVKCAILEKNTSFSNHPQAHFINNRSMEVFRKLDGLAEEIQLRQPPVDSWRK 153
Query: 121 FIYCTSLNGTILGSVDHMQPQDFEHITSPVSVAHFSQYKLNRLLLKQLQNLGFQVCTPES 180
FIYCTSL GTILGSVDHMQPQDFEHI SPVSVAHFSQYKLNRLLLKQLQNL FQVC+P+S
Sbjct: 154 FIYCTSLKGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLEFQVCSPDS 213
Query: 181 LEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKRRNISCNVLVGADGAGSS 240
EGPC+VREK+IL+GHECVSI TD+ +TMTASYLKEG HI+RRNI N+LVGADGAGS+
Sbjct: 214 SEGPCIVREKQILMGHECVSIGVTDDAVTMTASYLKEGKHIERRNICSNILVGADGAGST 273
Query: 241 VRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYFVFNTEAIGVLVAHDLKQG 300
VRRLVGIEMKGE DLQKLVS+HFFSRELGEYLL ERPGMLYF+FNTEAIGVLVAHDLKQG
Sbjct: 274 VRRLVGIEMKGEKDLQKLVSIHFFSRELGEYLLTERPGMLYFIFNTEAIGVLVAHDLKQG 333
Query: 301 EFILQVPFYPPQQNIGDFCPKMCEELIFKLVGQNLCDIDVQDVKPWIMHAEVAEKFICCQ 360
EFILQVPFYPPQQNI DFCPKMCEE+IF LVG NLCD+DVQDVKPWIMHAEVAEKFI CQ
Sbjct: 334 EFILQVPFYPPQQNIEDFCPKMCEEIIFDLVGLNLCDVDVQDVKPWIMHAEVAEKFISCQ 393
Query: 361 SRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAL 420
RVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMER+PIAL
Sbjct: 394 GRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERKPIAL 453
Query: 421 FNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSILSSSQQSAVLDGIFKIGRL 480
NTALSVKNFKAAMEVPAALGLDPKIANSVH+ VNHGLGS+LSSS Q VLDGIFKIGRL
Sbjct: 454 LNTALSVKNFKAAMEVPAALGLDPKIANSVHQAVNHGLGSVLSSSLQREVLDGIFKIGRL 513
Query: 481 QLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAEDLGFRYSEGAIIPDDSLLGGRE 540
QLSD+ LND NPVGSSRLAKL IFDEGKSLQLQFPAEDLGFRYSEGA++PD++ GGRE
Sbjct: 514 QLSDTLLNDKNPVGSSRLAKLSHIFDEGKSLQLQFPAEDLGFRYSEGALLPDNNQPGGRE 573
Query: 541 EPTGRRRQYVPSADPGSRLPHMNVRVLASEEIFSTLDLVSGDVVQFLLIIGPRPESYCLA 600
EPTGRRR+Y+PSADPGS+LPHMNVR LAS E+ STLDLVSGD V+FLLII P PESY LA
Sbjct: 574 EPTGRRRRYIPSADPGSKLPHMNVRALASMEVISTLDLVSGDKVEFLLIIAPLPESYRLA 633
Query: 601 HATLKVAEEFKISVRVCILWSADTTKIQSSSKEELTPWKNYIDVQEIRQWSTS-PSWWDV 660
A L VAEEFK SVRVCILWSAD T+I+SSSKEELTPW+NYIDVQEIRQ STS SWWDV
Sbjct: 634 RAALMVAEEFKTSVRVCILWSADITRIESSSKEELTPWENYIDVQEIRQPSTSTASWWDV 693
Query: 661 CQMTDKGAILVRPDEHIAWRVKSGISGDPNTELTRVFTTLLK 702
CQMTDKGAILVRPDEH+ WRVKSG+SGDPNTEL RVFTTLLK
Sbjct: 694 CQMTDKGAILVRPDEHVGWRVKSGVSGDPNTELRRVFTTLLK 735
BLAST of Bhi07G000229 vs. ExPASy TrEMBL
Match:
A0A6J1HUM9 (uncharacterized protein LOC111466353 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466353 PE=4 SV=1)
HSP 1 Score: 1247.3 bits (3226), Expect = 0.0e+00
Identity = 607/702 (86.47%), Positives = 653/702 (93.02%), Query Frame = 0
Query: 1 MGFLGFFRRFNGLQKFDATLRAIPVGYVQSRCLSNSKLFHGGKETMVPVLIVGAGPVGLV 60
MGFLGFF+RFNGL++F+A+LRAIP+GYVQSR LSNSKLFHGG+ET VPVLIVGAGPVGLV
Sbjct: 1 MGFLGFFKRFNGLRRFNASLRAIPLGYVQSRGLSNSKLFHGGEETAVPVLIVGAGPVGLV 60
Query: 61 LAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEIQLHQPPVESWRK 120
LAILLTKLGVKCA++EKN SFS HPQAHFINNRSMEVFRKLDGLAEEIQL QPPV+SWRK
Sbjct: 61 LAILLTKLGVKCAILEKNTSFSNHPQAHFINNRSMEVFRKLDGLAEEIQLCQPPVDSWRK 120
Query: 121 FIYCTSLNGTILGSVDHMQPQDFEHITSPVSVAHFSQYKLNRLLLKQLQNLGFQVCTPES 180
F+YCTSL GTILGSVDHMQPQDFEHI SPVSVAHFSQYKLNRLLLKQLQNLGFQVC+P+S
Sbjct: 121 FLYCTSLKGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDS 180
Query: 181 LEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKRRNISCNVLVGADGAGSS 240
EGPC+VREK+IL+GHECVSI TD+ +TMTASYLKEG HI+RRNI N+LVGADGAGS+
Sbjct: 181 SEGPCIVREKQILMGHECVSIGVTDDAVTMTASYLKEGKHIERRNICSNILVGADGAGST 240
Query: 241 VRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYFVFNTEAIGVLVAHDLKQG 300
VRRLVGIEMKGE +LQKLVS+HFFSRELGEYLL ERPGMLYF+FN EAIGVLVAHDLKQG
Sbjct: 241 VRRLVGIEMKGEKNLQKLVSIHFFSRELGEYLLTERPGMLYFIFNIEAIGVLVAHDLKQG 300
Query: 301 EFILQVPFYPPQQNIGDFCPKMCEELIFKLVGQNLCDIDVQDVKPWIMHAEVAEKFICCQ 360
EFILQVPFYPPQQNI DFCPKMCEE+IF LVG NLCD+DVQDVKPWIMHAEVAEKFI CQ
Sbjct: 301 EFILQVPFYPPQQNIEDFCPKMCEEIIFNLVGLNLCDVDVQDVKPWIMHAEVAEKFIACQ 360
Query: 361 SRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAL 420
+RVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMER+PIAL
Sbjct: 361 NRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERKPIAL 420
Query: 421 FNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSILSSSQQSAVLDGIFKIGRL 480
NTALSVKNFKAAMEVPAALGLDPKIANSVH+ VNHGLGS+LSSS Q VLDGIFKIGRL
Sbjct: 421 LNTALSVKNFKAAMEVPAALGLDPKIANSVHQAVNHGLGSVLSSSLQRTVLDGIFKIGRL 480
Query: 481 QLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAEDLGFRYSEGAIIPDDSLLGGRE 540
QLSD+ LN+ NPVGSSRLAKL IFDEGKSLQLQFPAEDLGFRYSEGA+IPD++ LGGRE
Sbjct: 481 QLSDTLLNEKNPVGSSRLAKLSHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNNQLGGRE 540
Query: 541 EPTGRRRQYVPSADPGSRLPHMNVRVLASEEIFSTLDLVSGDVVQFLLIIGPRPESYCLA 600
EPTGRRR+Y+PSADPGS+LPHMNVR LAS E+ STLDLVSGD V+FLLII P PE Y LA
Sbjct: 541 EPTGRRRRYIPSADPGSKLPHMNVRALASMEVISTLDLVSGDKVEFLLIIAPLPEFYRLA 600
Query: 601 HATLKVAEEFKISVRVCILWSADTTKIQSSSKEELTPWKNYIDVQEIRQWSTS-PSWWDV 660
A L VAEEFK SVRVCILWSAD T+I+SSSKEEL PW+NYIDVQEIRQ STS SWWDV
Sbjct: 601 RAALMVAEEFKTSVRVCILWSADITRIESSSKEELAPWENYIDVQEIRQPSTSTASWWDV 660
Query: 661 CQMTDKGAILVRPDEHIAWRVKSGISGDPNTELTRVFTTLLK 702
CQMTDKGAILVRPDEH+AWRVKSG+SGDPNTEL RVFT+LLK
Sbjct: 661 CQMTDKGAILVRPDEHVAWRVKSGVSGDPNTELRRVFTSLLK 702
BLAST of Bhi07G000229 vs. ExPASy TrEMBL
Match:
A0A6J1DCK1 (uncharacterized protein LOC111019789 OS=Momordica charantia OX=3673 GN=LOC111019789 PE=4 SV=1)
HSP 1 Score: 1231.5 bits (3185), Expect = 0.0e+00
Identity = 600/702 (85.47%), Positives = 651/702 (92.74%), Query Frame = 0
Query: 1 MGFLGFFRRFNGLQKFDATLRAIPVGYVQSRCLSNSKLFHGGKETMVPVLIVGAGPVGLV 60
MG LGF RFNGL+KFDA LRA+P+G VQSR LSNSK+FHGG++TMVPVLIVGAGPVGLV
Sbjct: 21 MGLLGFVNRFNGLRKFDAKLRALPLGLVQSRGLSNSKIFHGGEDTMVPVLIVGAGPVGLV 80
Query: 61 LAILLTKLGVKCAVVEKNRSFSKHPQAHFINNRSMEVFRKLDGLAEEIQLHQPPVESWRK 120
LAILLTKLGVKCAVVEKNRSFS HPQAHFINNRSMEVFRKL GLAEEIQL QPPV+SWRK
Sbjct: 81 LAILLTKLGVKCAVVEKNRSFSNHPQAHFINNRSMEVFRKLHGLAEEIQLCQPPVDSWRK 140
Query: 121 FIYCTSLNGTILGSVDHMQPQDFEHITSPVSVAHFSQYKLNRLLLKQLQNLGFQVCTPES 180
FIYCTSLNG ILGSVDHMQPQDF +I SPVSVAHFSQYKLNRLLLK+LQNLGFQVC+P+S
Sbjct: 141 FIYCTSLNGAILGSVDHMQPQDFGNIISPVSVAHFSQYKLNRLLLKKLQNLGFQVCSPDS 200
Query: 181 LEGPCVVREKKILLGHECVSIDATDETITMTASYLKEGTHIKRRNISCNVLVGADGAGSS 240
LE C+VREK+ILLGHECVSIDATD+++T TASYLKEG H +RRNI N+LVG DGAGS+
Sbjct: 201 LEDTCIVREKQILLGHECVSIDATDDSVTTTASYLKEGKHTERRNICSNILVGTDGAGST 260
Query: 241 VRRLVGIEMKGENDLQKLVSVHFFSRELGEYLLKERPGMLYFVFNTEAIGVLVAHDLKQG 300
VRRLVGIE+KGENDLQKLVSVHFFSRELGEYLL ERPGMLYF+FNTEAIGVLVAHDLKQG
Sbjct: 261 VRRLVGIEIKGENDLQKLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG 320
Query: 301 EFILQVPFYPPQQNIGDFCPKMCEELIFKLVGQNLCDIDVQDVKPWIMHAEVAEKFICCQ 360
EFILQVPFYPPQQ+I DF PKMCEELIFKLVG NLCDIDVQDVKPWIMHAEVAEKFICC
Sbjct: 321 EFILQVPFYPPQQSIEDFSPKMCEELIFKLVGLNLCDIDVQDVKPWIMHAEVAEKFICCH 380
Query: 361 SRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAL 420
+RVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIA
Sbjct: 381 NRVLLAGDAAHRFPPAGGFGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQ 440
Query: 421 FNTALSVKNFKAAMEVPAALGLDPKIANSVHRVVNHGLGSILSSSQQSAVLDGIFKIGRL 480
FNTALSVKNFKAAMEVPAALGLDP+IANSVH+ VNHGLGS+L SS QS+VLDGIFKIGR+
Sbjct: 441 FNTALSVKNFKAAMEVPAALGLDPRIANSVHQAVNHGLGSVLPSSLQSSVLDGIFKIGRM 500
Query: 481 QLSDSFLNDGNPVGSSRLAKLRQIFDEGKSLQLQFPAEDLGFRYSEGAIIPDDSLLGGRE 540
QLS+S LND NP+GSSRLAKLRQIFDEGKSLQLQFPAEDLGFRYSEGA+IPD++LL G E
Sbjct: 501 QLSESLLNDKNPIGSSRLAKLRQIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTLLSGPE 560
Query: 541 EPTGRRRQYVPSADPGSRLPHMNVRVLASEEIFSTLDLVSGDVVQFLLIIGPRPESYCLA 600
EPTGRRRQYVPSADPGSRLPHMNVR LASE++ STLDLVSGD V+FLLII P PESY LA
Sbjct: 561 EPTGRRRQYVPSADPGSRLPHMNVRTLASEDMISTLDLVSGDKVEFLLIIAPLPESYHLA 620
Query: 601 HATLKV-AEEFKISVRVCILWSADTTKIQSSSKEELTPWKNYIDVQEIRQWSTSPSWWDV 660
+ LK+ AEEFK SV+VC+LWSAD +++S S++ELTPW+NY+DVQEIRQ STSPSWWDV
Sbjct: 621 RSALKIAAEEFKTSVKVCVLWSADIPRVESRSQQELTPWENYMDVQEIRQSSTSPSWWDV 680
Query: 661 CQMTDKGAILVRPDEHIAWRVKSGISGDPNTELTRVFTTLLK 702
C+MTD GAILVRPDEHIAWRVKSG+SGD NT++ RVFT LLK
Sbjct: 681 CRMTDNGAILVRPDEHIAWRVKSGVSGDLNTQMKRVFTALLK 722
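The percentages in each HSP header follow directly from the reported fractions (for example, 629/701 is about 89.73%). As a minimal sketch, assuming the header line keeps the "label = matched/length (percent%)" layout shown above, the Python below parses such a line and recomputes the percentages. The function names and the tolerance value are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches fields such as "Identity = 629/701 (89.73%)".
# "Query Frame = 0" has no fraction and is deliberately not matched.
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matched, length, reported_percent)} for one HSP header line."""
    stats = {}
    for label, matched, length, pct in STAT.findall(line):
        stats[label] = (int(matched), int(length), float(pct))
    return stats


def is_consistent(matched, length, reported_pct, tol=0.006):
    """Check that the reported percentage agrees with matched/length.

    tol is an assumed tolerance covering rounding to two decimal places.
    """
    return abs(100.0 * matched / length - reported_pct) <= tol


# Header line copied from the Cucumis sativus (A0A0A0KLD9) hit above.
line = "Identity = 629/701 (89.73%), Positives = 668/701 (95.29%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, (matched, length, pct) in stats.items():
    print(label, matched, length, pct, is_consistent(matched, length, pct))
```

The same check can be run on the other HSP headers in this section; the regular expression is generic over the field label, so it does not depend on how the second field is spelled.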
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
AT1G24340.1 | 8.6e-272 | 63.98 | FAD/NAD(P)-binding oxidoreductase family protein

Match Name | E-value | Identity (%) | Description
P42534 | 1.4e-48 | 28.61 | Putative polyketide hydroxylase OS=Streptomyces coelicolor (strain ATCC BAA-471 ...
Q8KN28 | 2.6e-47 | 27.71 | 2,4-dichlorophenol 6-monooxygenase OS=Delftia acidovorans OX=80866 GN=tfdB PE=1 ...
Q05355 | 1.1e-45 | 28.18 | Putative polyketide hydroxylase OS=Streptomyces halstedii OX=1944 GN=schC PE=3 S...
P27138 | 1.9e-42 | 25.89 | 2,4-dichlorophenol 6-monooxygenase OS=Cupriavidus pinatubonensis (strain JMP 134...
P31020 | 1.2e-36 | 25.30 | Phenol 2-monooxygenase OS=Pseudomonas sp. (strain EST1001) OX=69012 GN=pheA PE=3...

Match Name | E-value | Identity (%) | Description
A0A1S3BZD5 | 0.0e+00 | 90.30 | putative polyketide hydroxylase OS=Cucumis melo OX=3656 GN=LOC103495067 PE=4 SV=...
A0A0A0KLD9 | 0.0e+00 | 89.73 | FAD_binding_3 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G0566...
A0A6J1G1R0 | 0.0e+00 | 87.18 | uncharacterized protein LOC111449881 OS=Cucurbita moschata OX=3662 GN=LOC1114498...
A0A6J1HUM9 | 0.0e+00 | 86.47 | uncharacterized protein LOC111466353 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1DCK1 | 0.0e+00 | 85.47 | uncharacterized protein LOC111019789 OS=Momordica charantia OX=3673 GN=LOC111019...