Sequences
The following sequences are available for this feature:
Gene sequence (with intron) Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR Hold the cursor over a type above to highlight its positions in the sequence below. CCCACCAACATTCTCTCTCTAAAATTCCAAAAAATTTTATCTTATTTTTATTTTTCTTCTCTCTCCTTCTCTCTATGATTTCTCGTAGAGCTTCGAACTTGAATTCTTTTCATCGGCGACTGTCTAATTCCGCCGTACAGTTTCAACAATGCTCCGACCTAACGCCGAACGACGCCGTTTTCCGATAACTTTCAGTAACATCTTCTTCTTTTTCACTCTTAATTTTAATTTCCATCTGTTTCATTTTTATTTTTTTATTTTCATAGTTTCAGTTTCAGCTTGAGCTTAAATATTTAGGAATTTCAGCTCCGTTTCTACCGTCTATATGTTTTTATTATTTTTATTTTAATAAAAAATTAAATTCACTTATTATCGCGATATTTTATGTATTGTTTTGGATATTTCTCTTGGTAATTTCTGCTGATTTGTTAATTTGTTAAATGTTCTTAAATTTGTAATGTCATAAAACCCTGGTCCTATATTTAGTGTGTAAATTGTTAAATGACCGTTAAACCAAAAGAAATTAATAATTTATATGTGTGTATATATATTCTTAACACAAATGTTTCAAAATTAGGGTTTTCTTTCGTGCTACAGTATTAGAGACTGTTATTTTATGTGCCAAATCTGAGATTTAGGTCTTTTTTATTTTGTTTTTTTAATTTTTATTTACGGTATATGTTAATGTTTGTCACGATAATTAACTGATGGGTATGACTATATATATTTATAGGTAGTCTTACATTTCTTGAAGGGGGACATTGTGTATAAATGATGCATAGAATTAATGTGATGGAAGGGAATAATCATCATGATGGGACTCATTCCAAGCCTGCAAGAAAATTCATTCAGATTGACTCTATATATATTGATCTATTTAGCTCCAATCATAAATGTGATGACCAGTGTGAACTTTTCTCCATCCGGTAAGCGTCTTTACGACTTATGTTGAGAAGATTGAGATGTCTTGATTATTGAGTTATGTATGTTCAAGTTGGCTTGTTATTTGTAGTTGTTTCGCAAGCTCAATTGGGGTTACTTTTTCTTTTTCTTTTTTTCCCATTGTGAAGGAGGGAACTCGAACCTAGATGCGGGGTCTCGAAATGTTGTGAGAGAGGGGAAGTAGCAATAGCCAATAGATGTCTTTTTAAACTGACGACCTTTATACCTAAGTATAACTTAACTAGTTAAAGACATTTAAAATTTTGACCAAGAGATTAGAGATTTGAATTCTTAAATATGTATTGTTGAATGCCAAAAAAAAAAAAAATACTGTATGGTGACACTCAAGTCTGCTTTTACTACTCAACATAACTTGACTCCAAAGTGTTAATGCATCATATAATAAGGACCTTGACAACGAATATTTATCCAAGTTAAGTGTGCGGGTGTAGCTACCTATTTAGGGAAACTGGAAACACACAGCTTTTGTTTTCCTTATTATTGAATCTATATCCTTGTAACAGTGGTTATGTATCTGATATGCGCAAAAAAGATTGGAAGATATGTTGGCCGTTTTCTGATATTGAGAATGGCCATAAGTTGGATGATCCTATACTCTTGGTCCCGCCTGTATTTGATCCGAGTTTCAACCCGCAGCGAGGCAAAAGTCATTGGCAAGAGAGTTCTGACAAAGCTGCAGATAAAGGTTTCCACTTTGATAGCTGTCACAACCTTGGAAAAATTTCAAATTCTTCCCCAAAAGCTCCAAAACAAGATGTAATCAATGGAAGAACAATGGCTGATAATGCTTCTATTTCGGGTCGCCAACCCTCAAATTGTGATCAGAAGGAAAAGAAACTTGATGTTGCAGATAGAGATAACTGTACTGG
TAGGTTTTTTCCTTTTCTTTTTCCCTTCTATTTTTATTTTTTTGGCATATCTTTGTTTTGTACACATCATATGAGACTGGAATTTTGGCTACTCCTGATATAATGATTCACAGTGTCTTATTCATTTGCACAATGTTGAGAATTTGTGGCTCTGTTGGGACTAACTAATGATTGTTTCATGGATTCAAAATGTATTTATTCTACTTTAGTAAGATTGTTTATCCCTTTGACAAGGGATCCCTTTGTCCCTAGTATATCATCTTTGCAAAGTCAATCGCAATCTATAGCTCCTCGATATATGCTGTGAAATGTGTCTTTTATGTTGCATTTGTACCTAATAGTACTTTTTCCTAATCAGGCATTTGTGTTTTTCCATAATCAGATTATTTTTGGTATCTGAAACAGTTGCTCTTATATCACAAAGTGAGCCAGGTTGTGCAAGTCATGGAGTTACTGAGATTGAGCCTGTTAGTGGAAAGCTCATTCCCAAAGCAACTGAGGAAAGCCCTGCAGCACTTCAGGATGGAAAACAAACTCATGCAGATCGTCTTAATGGACAATTAACCTTGGTATCAGAGAATGACAGTACGGTAGACGTACCCCGAGGACATTATACTGTTACATTTCAAGAAAATGGAGATGCGTCCATGGAATCAAACCAAAGCACGGATTCATTATCTGAAAGTGCTGAAACAGTTGGAAACAGTCCTCATCATTGTCATCTAGGAAAGTTACATCGTCGAAGAACCCCAAAGGTTCGTCTATTGACTGATTTGCTAGGAGACAATGGAAATATGATAGCTAAACATGTCGAAAGTTCTCCATCCGATGGGTCTCCTGAGGCATCTGTGCAGGCAGATGTGAGGTATGCTCCCAAATGTCAGGTAACTATAGAGGAAGATGTTTGGCATTCAGATCATAGACGGGAAAGAAGGTTGCCTCGGAATGGAAAGTGTAGGCATCAGGAGATTCCCTCTTCTTCCAGTGTGGATAAGAAAATTCAAACATGGAGGGGGCAGATAGAAAGCTCTGTTTCTAGTTTAGGAAATGAAAATGCTCATTCAGGAATAAAACAGACCATGAAAGGCCCTTGGAGCAGCTACAAAATGGATGGAAACAATAGTTTAAGAAGGAAAAAAAGTAAAAAGTTTCCAGTGGTTGATCCATACTCCGTGCCCTTAGTGCCATCTAAAGTTAAAGATCAATGTGAAGTTCAGGCAATAACCGAAAATAGAAGTGAGGTTGCTGTGGATAGTGCTGCTATCTTAGCATATCACAATGATTTTTCTAGCAGAACTCCACACTCAACATCATTGAATGCCATGGAATCTAAATCTGGCACATCTAAGAACCCAAATTCAAGCAAGGAGCCTGTGATTTTTGAAGGGCCCACTAATGTATTTGCATGGAACAATGGAATGCTTTGGAGGGGATCAGTTACACAGAAAGATGTGGAAACCATGAAAAGTAGGTCTGTAGCTAATCCTCTTCCAAGTTACAGAAACAATGAAAGAGAATTGCATCCTTCTCATAATAACTATTCAGAGCCACAAAGGGACCACAAAGGAATCCATCATCGAGGAGAAAACGAGCTGGCCACTTTTTTGCCTGAGCTAGAGGACACTTCCAAAGTAAGGATTAATATTGAAACGAGTAATCTTGGATATCCAAATCACCCTCATCAAGCTTCAGATGTATTTTATGGACAAGGAGTGCGTAGTGTGCTGAACAGTAAAATGGCCAACTTGAGAATGCCTCTTCCAAGACAAAATGCAGATCCTCACACAGATAATAGTTGGTCGCAGCTGCAGAATAAGGTATATTCTTCAATTTTTATGGTTTTTGCCGAAGGAAAGAAAAAGAAAAAGACTTCGCCATAAATTATAATGAACATGCATCAGATTTCTTTCATACAATCGTAAATATACTAAAGGTAATTGCATTATCAGTATTGGGAAATTCAACCTGTCTTGACTTTCTTTGATTTGGGAAATGTCTGC
ATAATTTGAACTATTTGAACTATTTGAACATATATCATATATGTGTGAAGTCTATAAAGCAAGTACATTTAACATGCAAAGTAGATCAGCTTTGGAAGCTGGATTTGGTGTCTGTTTGTATTAAATACATACCTGTTGATTTTGTAAAATAAACGTCATTATTGAATTAACTGTCATGAATTCTTATCCACGAAATGGCACAGCCATACCTCTAAGGAAGTGATAATCCAGTACTGGTCCTCGCCTCCTTAGCTCTCCCACACCTCCAAGAAAAGGAGAGAGTGAAAATATCTTTTCTCTGGAGTCAACAAGAAGTGAAGAACTCGATTTGTTTTCAACTTGTACAAGAAGAATCTCCATTCATTATTTTTATTGAGTTTGTGTTGGGTGTGCTATTTAATGAGCTGAGATTACTTTGAGCTGAACTATGAAACATTCCCCCATTTTTTTCTTTTTTTTTTTTTCATTTTGTGCGTATGCCCGTAGTCTATGATGAGTGCTTCACAGAATTCCAATTCTTCCTTGAAATTTATGAACAGGATTTATACAGAAGAGGCAATGGGAAAAGAACTATTGAAGCTCAGGAACCTTTGGCCCTAAATAAGAGACAGATTAACCAAAAAATGGACCAGGCATCTGATCATGGGACTTCCGACGACATCCCCATGGAAATCGTCGAACTAATGGCAAAGAATCAGTATGAAAGACGTCTTCCTGATGCCGAGAATAATAATAAACACGTTTCAGAAACAGGCAAATTCTCAAGGGCTGTTCAAGTGAATAATTACGGCGATGTATACAGAAATGGGAGAGAATTATTACAAAAGCCTGAAAATCTTCAACAAAATGCTCAGGCAAGGAATGGAGGAAAAGTTGTGGAAACCAGGAAACAGAAGTCAGCAGATTATTTCTCAAATATTAGGGAATCTCACTTTGATACAAACCATCCGCAGCAGAATCATATGCTCGGGTGTAATGGTTCAATTCATTCTCTAGTGGAGCCATCAAATGGTATTCAATATTCTTCCATTGGATCTAAAAGAAAAAGTTGTACTGAGATTAGAAAATGTAACGGAATTACAGTGGAAGGTCTCTACAACTCCAAAGTACAATCTTCTGAAGGATGCATGGATCATTTACCTGTTTCAGAACAGAATATAGAAGCAGCTTACGTATGGTCTTCTTCTTCTTTGATGCCAGATCATCTGTCCAATGGATATCAGAAATTTCCAGCTCATTCGACCAACAGCAGGAAAATCTCAAGTCCGAGATCATTTCAGATGGGAAACACAAATGCACAGAATCATCATATTCATCACCATACCAACCTAGAAAGGCATGGTAGGCATAACAACAATTCTGAAGCATACGGCCAGAGATTTGCAGAGAGTTCATTTTGTCACTGTCCTAATGTGGCTGAGCTTCACCATAATCCAGTTGGTTCATTGGAGTTGTACTCTAACGAAACCATATCGGCGATGCACTTGCTTAGCCTCATGGACGCCAGAATGCAATCTAATGCACCCATGACTGCAGGTGAGAAGCATAAGTCATCCAAGAAATCTCCTGTTCCTCGTCCTCGAAAAGCTAAAGAATTTTCCACAACGAACATTTGTTTCAATAAGACCATCCAGGACATAAACCAATTTTCATCTGCTTTCCATGACGAAGTTTGTATTTCAGCAACCAATGCATCTGCTAGTACATTCCAGAATATTAGAGGATTTGGAACCAATTCCAATTTTTCCGGCCAGGCTGTCTTTAGGCCTCAATATGGAGCAAAAATGAAATGCTCAGATCCATCTTCGTGGAGTAAAGACCAAACACTATCGAAGTCTCAGTTCAGAAGTGGTGATCTGCGCACTGATGATAGAGCATTTCCTGTTAATGGTATAGAGAAAGGTGTGGTAAATGCAACTAATTCCGAAGTGTTGTTGGTGCATCACATTGAAAGAAGCTCTGAGGAATGCAAATTGGTAGCTCATACTAGAACTCTGCAA
AACAAGAAAAGCACTTCTGAGACTGAAATATGTAGTGTCAACAAAAATCCTGCTGACTTTAGCTTGCCTGAAGCAGGAAATATATACATGATTGGAGCTGAAGAATTCAATTTTGGAAGAACTCTTTTTTCTAAGAACAGATCTAGCTCTATTTGTTTCAATGATCGGTACAAACAGCAGAGGATCGTGTAGCATGATATCGAAAAACTACATGGACCGAATCGACACATAAATCTTGTGCAATTCCTTCTGGTATTGACTTGAAACCTTCTACTATTCTCGAACCACATGATTAACTTTATAAAGTAAGCCTCAAGCATTTGTCAAAGATACTCTCTTGTTTCTTTCAGAAAAATCCTTGATCATCACCCAAGGGCGCTATTCCTTAATCCAATATTTTGAAGGATCACATGGAATCGAAGCTGCATACAAAAAGTCGTGCTTAGCGCAAGAAAAGGTCGGTGTACAGATCTTAAAATTCTCAGTACTATGTATATGAAGCATTCATATTCCAAAGGTAATGCTGTAAATTGTTGTATTCCATATCATTATAGCGCCATGAACAATGCCATTGAAAGGCTTGCACCAACTTTTTGGTCTCTGCTAAATTATCAGCATTTAAGTGCTAGTATTTTGACAAACAAAACTCAAAAGAATTTACTAATAAGCTGGTTTCTATTGATGAATGGTTCAGTTTGTACTTGAATTTTAGGGCTTGGTCTATCTCAGCCCTCTCCTTGGCTTGATTGATAATGAAGAATGTTTGGACA mRNA sequence CCCACCAACATTCTCTCTCTAAAATTCCAAAAAATTTTATCTTATTTTTATTTTTCTTCTCTCTCCTTCTCTCTATGATTTCTCGTAGAGCTTCGAACTTGAATTCTTTTCATCGGCGACTGTCTAATTCCGCCGTACAGTTTCAACAATGCTCCGACCTAACGCCGAACGACGCCGTTTTCCGATAACTTTCAGTAGTCTTACATTTCTTGAAGGGGGACATTGTGTATAAATGATGCATAGAATTAATGTGATGGAAGGGAATAATCATCATGATGGGACTCATTCCAAGCCTGCAAGAAAATTCATTCAGATTGACTCTATATATATTGATCTATTTAGCTCCAATCATAAATGTGATGACCAGTGTGAACTTTTCTCCATCCGTGGTTATGTATCTGATATGCGCAAAAAAGATTGGAAGATATGTTGGCCGTTTTCTGATATTGAGAATGGCCATAAGTTGGATGATCCTATACTCTTGGTCCCGCCTGTATTTGATCCGAGTTTCAACCCGCAGCGAGGCAAAAGTCATTGGCAAGAGAGTTCTGACAAAGCTGCAGATAAAGGTTTCCACTTTGATAGCTGTCACAACCTTGGAAAAATTTCAAATTCTTCCCCAAAAGCTCCAAAACAAGATGTAATCAATGGAAGAACAATGGCTGATAATGCTTCTATTTCGGGTCGCCAACCCTCAAATTGTGATCAGAAGGAAAAGAAACTTGATGTTGCAGATAGAGATAACTGTACTGTTGCTCTTATATCACAAAGTGAGCCAGGTTGTGCAAGTCATGGAGTTACTGAGATTGAGCCTGTTAGTGGAAAGCTCATTCCCAAAGCAACTGAGGAAAGCCCTGCAGCACTTCAGGATGGAAAACAAACTCATGCAGATCGTCTTAATGGACAATTAACCTTGGTATCAGAGAATGACAGTACGGTAGACGTACCCCGAGGACATTATACTGTTACATTTCAAGAAAATGGAGATGCGTCCATGGAATCAAACCAAAGCACGGATTCATTATCTGAAAGTGCTGAAACAGTTGGAAACAGTCCTCATCATTGTCATCTAGGAAAGTTACATCGTCGAAGAACCCCAAAGGTTCGTCTATTGACTGATTTGCTAGGAGACAATGGAAATATGATAGCTAAACATGTCGAAAGTTCTCCATCCGATGGGTCTCCTGAGGCATCTGTGCAGGCAGATGTGAGGTA
TGCTCCCAAATGTCAGGTAACTATAGAGGAAGATGTTTGGCATTCAGATCATAGACGGGAAAGAAGGTTGCCTCGGAATGGAAAGTGTAGGCATCAGGAGATTCCCTCTTCTTCCAGTGTGGATAAGAAAATTCAAACATGGAGGGGGCAGATAGAAAGCTCTGTTTCTAGTTTAGGAAATGAAAATGCTCATTCAGGAATAAAACAGACCATGAAAGGCCCTTGGAGCAGCTACAAAATGGATGGAAACAATAGTTTAAGAAGGAAAAAAAGTAAAAAGTTTCCAGTGGTTGATCCATACTCCGTGCCCTTAGTGCCATCTAAAGTTAAAGATCAATGTGAAGTTCAGGCAATAACCGAAAATAGAAGTGAGGTTGCTGTGGATAGTGCTGCTATCTTAGCATATCACAATGATTTTTCTAGCAGAACTCCACACTCAACATCATTGAATGCCATGGAATCTAAATCTGGCACATCTAAGAACCCAAATTCAAGCAAGGAGCCTGTGATTTTTGAAGGGCCCACTAATGTATTTGCATGGAACAATGGAATGCTTTGGAGGGGATCAGTTACACAGAAAGATGTGGAAACCATGAAAAGTAGGTCTGTAGCTAATCCTCTTCCAAGTTACAGAAACAATGAAAGAGAATTGCATCCTTCTCATAATAACTATTCAGAGCCACAAAGGGACCACAAAGGAATCCATCATCGAGGAGAAAACGAGCTGGCCACTTTTTTGCCTGAGCTAGAGGACACTTCCAAAGTAAGGATTAATATTGAAACGAGTAATCTTGGATATCCAAATCACCCTCATCAAGCTTCAGATGTATTTTATGGACAAGGAGTGCGTAGTGTGCTGAACAGTAAAATGGCCAACTTGAGAATGCCTCTTCCAAGACAAAATGCAGATCCTCACACAGATAATAGTTGGTCGCAGCTGCAGAATAAGGATTTATACAGAAGAGGCAATGGGAAAAGAACTATTGAAGCTCAGGAACCTTTGGCCCTAAATAAGAGACAGATTAACCAAAAAATGGACCAGGCATCTGATCATGGGACTTCCGACGACATCCCCATGGAAATCGTCGAACTAATGGCAAAGAATCAGTATGAAAGACGTCTTCCTGATGCCGAGAATAATAATAAACACGTTTCAGAAACAGGCAAATTCTCAAGGGCTGTTCAAGTGAATAATTACGGCGATGTATACAGAAATGGGAGAGAATTATTACAAAAGCCTGAAAATCTTCAACAAAATGCTCAGGCAAGGAATGGAGGAAAAGTTGTGGAAACCAGGAAACAGAAGTCAGCAGATTATTTCTCAAATATTAGGGAATCTCACTTTGATACAAACCATCCGCAGCAGAATCATATGCTCGGGTGTAATGGTTCAATTCATTCTCTAGTGGAGCCATCAAATGGTATTCAATATTCTTCCATTGGATCTAAAAGAAAAAGTTGTACTGAGATTAGAAAATGTAACGGAATTACAGTGGAAGGTCTCTACAACTCCAAAGTACAATCTTCTGAAGGATGCATGGATCATTTACCTGTTTCAGAACAGAATATAGAAGCAGCTTACGTATGGTCTTCTTCTTCTTTGATGCCAGATCATCTGTCCAATGGATATCAGAAATTTCCAGCTCATTCGACCAACAGCAGGAAAATCTCAAGTCCGAGATCATTTCAGATGGGAAACACAAATGCACAGAATCATCATATTCATCACCATACCAACCTAGAAAGGCATGGTAGGCATAACAACAATTCTGAAGCATACGGCCAGAGATTTGCAGAGAGTTCATTTTGTCACTGTCCTAATGTGGCTGAGCTTCACCATAATCCAGTTGGTTCATTGGAGTTGTACTCTAACGAAACCATATCGGCGATGCACTTGCTTAGCCTCATGGACGCCAGAATGCAATCTAATGCACCCATGACTGCAGGTGAGAAGCATAAGTCATCCAAGAAATCTCCTGTTCCTCGTCCTCGAAAAGCTA
AAGAATTTTCCACAACGAACATTTGTTTCAATAAGACCATCCAGGACATAAACCAATTTTCATCTGCTTTCCATGACGAAGTTTGTATTTCAGCAACCAATGCATCTGCTAGTACATTCCAGAATATTAGAGGATTTGGAACCAATTCCAATTTTTCCGGCCAGGCTGTCTTTAGGCCTCAATATGGAGCAAAAATGAAATGCTCAGATCCATCTTCGTGGAGTAAAGACCAAACACTATCGAAGTCTCAGTTCAGAAGTGGTGATCTGCGCACTGATGATAGAGCATTTCCTGTTAATGGTATAGAGAAAGGTGTGGTAAATGCAACTAATTCCGAAGTGTTGTTGGTGCATCACATTGAAAGAAGCTCTGAGGAATGCAAATTGGTAGCTCATACTAGAACTCTGCAAAACAAGAAAAGCACTTCTGAGACTGAAATATGTAGTGTCAACAAAAATCCTGCTGACTTTAGCTTGCCTGAAGCAGGAAATATATACATGATTGGAGCTGAAGAATTCAATTTTGGAAGAACTCTTTTTTCTAAGAACAGATCTAGCTCTATTTGTTTCAATGATCGGTACAAACAGCAGAGGATCGTGTAGCATGATATCGAAAAACTACATGGACCGAATCGACACATAAATCTTGTGCAATTCCTTCTGAAAAATCCTTGATCATCACCCAAGGGCGCTATTCCTTAATCCAATATTTTGAAGGATCACATGGAATCGAAGCTGCATACAAAAAGTCGTGCTTAGCGCAAGAAAAGGTCGGTGTACAGATCTTAAAATTCTCAGTACTATGTATATGAAGCATTCATATTCCAAAGGTAATGCTGTAAATTGTTGTATTCCATATCATTATAGCGCCATGAACAATGCCATTGAAAGGCTTGCACCAACTTTTTGGTCTCTGCTAAATTATCAGCATTTAAGTGCTAGTATTTTGACAAACAAAACTCAAAAGAATTTACTAATAAGCTGGTTTCTATTGATGAATGGTTCAGTTTGTACTTGAATTTTAGGGCTTGGTCTATCTCAGCCCTCTCCTTGGCTTGATTGATAATGAAGAATGTTTGGACA Coding sequence (CDS) 
ATGATGCATAGAATTAATGTGATGGAAGGGAATAATCATCATGATGGGACTCATTCCAAGCCTGCAAGAAAATTCATTCAGATTGACTCTATATATATTGATCTATTTAGCTCCAATCATAAATGTGATGACCAGTGTGAACTTTTCTCCATCCGTGGTTATGTATCTGATATGCGCAAAAAAGATTGGAAGATATGTTGGCCGTTTTCTGATATTGAGAATGGCCATAAGTTGGATGATCCTATACTCTTGGTCCCGCCTGTATTTGATCCGAGTTTCAACCCGCAGCGAGGCAAAAGTCATTGGCAAGAGAGTTCTGACAAAGCTGCAGATAAAGGTTTCCACTTTGATAGCTGTCACAACCTTGGAAAAATTTCAAATTCTTCCCCAAAAGCTCCAAAACAAGATGTAATCAATGGAAGAACAATGGCTGATAATGCTTCTATTTCGGGTCGCCAACCCTCAAATTGTGATCAGAAGGAAAAGAAACTTGATGTTGCAGATAGAGATAACTGTACTGTTGCTCTTATATCACAAAGTGAGCCAGGTTGTGCAAGTCATGGAGTTACTGAGATTGAGCCTGTTAGTGGAAAGCTCATTCCCAAAGCAACTGAGGAAAGCCCTGCAGCACTTCAGGATGGAAAACAAACTCATGCAGATCGTCTTAATGGACAATTAACCTTGGTATCAGAGAATGACAGTACGGTAGACGTACCCCGAGGACATTATACTGTTACATTTCAAGAAAATGGAGATGCGTCCATGGAATCAAACCAAAGCACGGATTCATTATCTGAAAGTGCTGAAACAGTTGGAAACAGTCCTCATCATTGTCATCTAGGAAAGTTACATCGTCGAAGAACCCCAAAGGTTCGTCTATTGACTGATTTGCTAGGAGACAATGGAAATATGATAGCTAAACATGTCGAAAGTTCTCCATCCGATGGGTCTCCTGAGGCATCTGTGCAGGCAGATGTGAGGTATGCTCCCAAATGTCAGGTAACTATAGAGGAAGATGTTTGGCATTCAGATCATAGACGGGAAAGAAGGTTGCCTCGGAATGGAAAGTGTAGGCATCAGGAGATTCCCTCTTCTTCCAGTGTGGATAAGAAAATTCAAACATGGAGGGGGCAGATAGAAAGCTCTGTTTCTAGTTTAGGAAATGAAAATGCTCATTCAGGAATAAAACAGACCATGAAAGGCCCTTGGAGCAGCTACAAAATGGATGGAAACAATAGTTTAAGAAGGAAAAAAAGTAAAAAGTTTCCAGTGGTTGATCCATACTCCGTGCCCTTAGTGCCATCTAAAGTTAAAGATCAATGTGAAGTTCAGGCAATAACCGAAAATAGAAGTGAGGTTGCTGTGGATAGTGCTGCTATCTTAGCATATCACAATGATTTTTCTAGCAGAACTCCACACTCAACATCATTGAATGCCATGGAATCTAAATCTGGCACATCTAAGAACCCAAATTCAAGCAAGGAGCCTGTGATTTTTGAAGGGCCCACTAATGTATTTGCATGGAACAATGGAATGCTTTGGAGGGGATCAGTTACACAGAAAGATGTGGAAACCATGAAAAGTAGGTCTGTAGCTAATCCTCTTCCAAGTTACAGAAACAATGAAAGAGAATTGCATCCTTCTCATAATAACTATTCAGAGCCACAAAGGGACCACAAAGGAATCCATCATCGAGGAGAAAACGAGCTGGCCACTTTTTTGCCTGAGCTAGAGGACACTTCCAAAGTAAGGATTAATATTGAAACGAGTAATCTTGGATATCCAAATCACCCTCATCAAGCTTCAGATGTATTTTATGGACAAGGAGTGCGTAGTGTGCTGAACAGTAAAATGGCCAACTTGAGAATGCCTCTTCCAAGACAAAATGCAGATCCTCACACAGATAATAGTTGGTCGCAGCTGCAGAATAAGGATTTATACAGAAGAGGCAATGGGAAAAGAACTATTGAAGCTCAGGAACCTTTGGCCCTAAATAAGAG
ACAGATTAACCAAAAAATGGACCAGGCATCTGATCATGGGACTTCCGACGACATCCCCATGGAAATCGTCGAACTAATGGCAAAGAATCAGTATGAAAGACGTCTTCCTGATGCCGAGAATAATAATAAACACGTTTCAGAAACAGGCAAATTCTCAAGGGCTGTTCAAGTGAATAATTACGGCGATGTATACAGAAATGGGAGAGAATTATTACAAAAGCCTGAAAATCTTCAACAAAATGCTCAGGCAAGGAATGGAGGAAAAGTTGTGGAAACCAGGAAACAGAAGTCAGCAGATTATTTCTCAAATATTAGGGAATCTCACTTTGATACAAACCATCCGCAGCAGAATCATATGCTCGGGTGTAATGGTTCAATTCATTCTCTAGTGGAGCCATCAAATGGTATTCAATATTCTTCCATTGGATCTAAAAGAAAAAGTTGTACTGAGATTAGAAAATGTAACGGAATTACAGTGGAAGGTCTCTACAACTCCAAAGTACAATCTTCTGAAGGATGCATGGATCATTTACCTGTTTCAGAACAGAATATAGAAGCAGCTTACGTATGGTCTTCTTCTTCTTTGATGCCAGATCATCTGTCCAATGGATATCAGAAATTTCCAGCTCATTCGACCAACAGCAGGAAAATCTCAAGTCCGAGATCATTTCAGATGGGAAACACAAATGCACAGAATCATCATATTCATCACCATACCAACCTAGAAAGGCATGGTAGGCATAACAACAATTCTGAAGCATACGGCCAGAGATTTGCAGAGAGTTCATTTTGTCACTGTCCTAATGTGGCTGAGCTTCACCATAATCCAGTTGGTTCATTGGAGTTGTACTCTAACGAAACCATATCGGCGATGCACTTGCTTAGCCTCATGGACGCCAGAATGCAATCTAATGCACCCATGACTGCAGGTGAGAAGCATAAGTCATCCAAGAAATCTCCTGTTCCTCGTCCTCGAAAAGCTAAAGAATTTTCCACAACGAACATTTGTTTCAATAAGACCATCCAGGACATAAACCAATTTTCATCTGCTTTCCATGACGAAGTTTGTATTTCAGCAACCAATGCATCTGCTAGTACATTCCAGAATATTAGAGGATTTGGAACCAATTCCAATTTTTCCGGCCAGGCTGTCTTTAGGCCTCAATATGGAGCAAAAATGAAATGCTCAGATCCATCTTCGTGGAGTAAAGACCAAACACTATCGAAGTCTCAGTTCAGAAGTGGTGATCTGCGCACTGATGATAGAGCATTTCCTGTTAATGGTATAGAGAAAGGTGTGGTAAATGCAACTAATTCCGAAGTGTTGTTGGTGCATCACATTGAAAGAAGCTCTGAGGAATGCAAATTGGTAGCTCATACTAGAACTCTGCAAAACAAGAAAAGCACTTCTGAGACTGAAATATGTAGTGTCAACAAAAATCCTGCTGACTTTAGCTTGCCTGAAGCAGGAAATATATACATGATTGGAGCTGAAGAATTCAATTTTGGAAGAACTCTTTTTTCTAAGAACAGATCTAGCTCTATTTGTTTCAATGATCGGTACAAACAGCAGAGGATCGTGTAG Protein sequence 
MMHRINVMEGNNHHDGTHSKPARKFIQIDSIYIDLFSSNHKCDDQCELFSIRGYVSDMRKKDWKICWPFSDIENGHKLDDPILLVPPVFDPSFNPQRGKSHWQESSDKAADKGFHFDSCHNLGKISNSSPKAPKQDVINGRTMADNASISGRQPSNCDQKEKKLDVADRDNCTVALISQSEPGCASHGVTEIEPVSGKLIPKATEESPAALQDGKQTHADRLNGQLTLVSENDSTVDVPRGHYTVTFQENGDASMESNQSTDSLSESAETVGNSPHHCHLGKLHRRRTPKVRLLTDLLGDNGNMIAKHVESSPSDGSPEASVQADVRYAPKCQVTIEEDVWHSDHRRERRLPRNGKCRHQEIPSSSSVDKKIQTWRGQIESSVSSLGNENAHSGIKQTMKGPWSSYKMDGNNSLRRKKSKKFPVVDPYSVPLVPSKVKDQCEVQAITENRSEVAVDSAAILAYHNDFSSRTPHSTSLNAMESKSGTSKNPNSSKEPVIFEGPTNVFAWNNGMLWRGSVTQKDVETMKSRSVANPLPSYRNNERELHPSHNNYSEPQRDHKGIHHRGENELATFLPELEDTSKVRINIETSNLGYPNHPHQASDVFYGQGVRSVLNSKMANLRMPLPRQNADPHTDNSWSQLQNKDLYRRGNGKRTIEAQEPLALNKRQINQKMDQASDHGTSDDIPMEIVELMAKNQYERRLPDAENNNKHVSETGKFSRAVQVNNYGDVYRNGRELLQKPENLQQNAQARNGGKVVETRKQKSADYFSNIRESHFDTNHPQQNHMLGCNGSIHSLVEPSNGIQYSSIGSKRKSCTEIRKCNGITVEGLYNSKVQSSEGCMDHLPVSEQNIEAAYVWSSSSLMPDHLSNGYQKFPAHSTNSRKISSPRSFQMGNTNAQNHHIHHHTNLERHGRHNNNSEAYGQRFAESSFCHCPNVAELHHNPVGSLELYSNETISAMHLLSLMDARMQSNAPMTAGEKHKSSKKSPVPRPRKAKEFSTTNICFNKTIQDINQFSSAFHDEVCISATNASASTFQNIRGFGTNSNFSGQAVFRPQYGAKMKCSDPSSWSKDQTLSKSQFRSGDLRTDDRAFPVNGIEKGVVNATNSEVLLVHHIERSSEECKLVAHTRTLQNKKSTSETEICSVNKNPADFSLPEAGNIYMIGAEEFNFGRTLFSKNRSSSICFNDRYKQQRIV
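The protein sequence above follows from the coding sequence by reading it codon by codon. The sketch below is illustrative only and is not the database's own pipeline: it assumes the standard genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
# Minimal sketch: translate a CDS into protein, assuming the standard genetic code.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 1); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # 'X' for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGATGCATAGAATTAATGTGATGGAAGGG"))  # MMHRINVMEG
```

The first ten codons of the CDS give `MMHRINVMEG`, which matches the start of the protein sequence, and the final codons `CAG CAG AGG ATC GTG TAG` give `QQRIV` followed by a stop.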
Homology
BLAST of Bhi04G000687 vs. TAIR 10
Match: AT5G11530.1 (embryonic flower 1 (EMF1))
HSP 1 Score: 125.6 bits (314), Expect = 2.8e-28 Identity = 274/1231 (22.26%), Positives = 474/1231 (38.51%), Query Frame = 0
Query: 26 IQIDSIYIDLFSSNHKCD-DQCELFSIRGYVSDMRKKDWKICWPFSDIENGHKLDDPILL 85 I+I+SI IDL + ++ D +C+ FS+RG+V++ R++D + CWPFS+ E+ +D Sbjct: 5 IKINSISIDLAGAANEIDMVKCDHFSMRGFVAETRERDLRKCWPFSE-ESVSLVDQQSYT 64
Query: 86 VPPVFDPSFNPQRGKSHWQESSDKAADKGFHFD---SCHNLGKISNSSPKAPKQDVINGR 145 +P + P F W D H H+ K +S + N Sbjct: 65 LPTLSVPKF-------RWWHCMSCIKDIDAHGPKDCGLHSNSKAIGNSSVIESKSKFNSL 124
Query: 146 TMADNASISGRQPSNCDQKEKKLDVAD------------RDNCTVALISQSEPGCASHGV 205 T+ D+ +KEKK D+AD D+ T + G G Sbjct: 125 TIIDH------------EKEKKTDIADNAIEEKVGVNCENDDQTATTFLKKARG-RPMGA 184
Query: 206 TEIEPVSGKLIPKATEESPAALQDGKQTHADRLNGQLTLVSENDSTVDVPRGHYTVTFQE 265 + + S KL+ SP Q G ++LN +S +V + T E Sbjct: 185 SNVRSKSRKLV------SPE--QVGNNRSKEKLNKPSMDISSWKEKQNVDQAVTTFGSSE 244
Query: 266 NG----DASMESNQSTDSLSESAETVGNSPHHCHLGK--LHRRRTPKVRLLTDLLGDNGN 325 D ++ ++ + E S +L L RR++ KVRLL++LLG+ Sbjct: 245 IAGVVEDTPPKATKNHKGIRGLMECDNGSSESINLAMSGLQRRKSRKVRLLSELLGN--- 304
Query: 326 MIAKHVESSPSDGSPEASVQADVRYAPKCQVTIEEDVWHSDHRRERRLPRNGKCRHQEIP 385 + S GS + EE + R R + + +P Sbjct: 305 -------TKTSGGS---------------NIRKEESALKKESVRGR--------KRKLLP 364
Query: 386 SSSSVDKKIQTWRGQIESSVSSLGNENAHSGIKQTMKGPWSSYKMDGNNSLRRKKSKKFP 445 ++ V + + T E++ S ++ +S + T G D ++++++F Sbjct: 365 ENNYVSRILSTMGATSENASKSCDSDQGNS--ESTDSG------FDRTPFKGKQRNRRFQ 424
Query: 446 VVDPYSVPLVPSKVKDQCEVQAITENRSEVAVDSAAILAYHNDFSSRTPHSTSLNAMESK 505 VVD + VP +P + + I E+ ++ + S H+ F+ ++ Sbjct: 425 VVDEF-VPSLPCETSQ----EGIKEHDADPSKRSTPA---HSLFTGNDSVPCPPGTQRTE 484
Query: 506 SGTSKNPNSSKEPVIFEGPTNVFAWNNGMLWRGSVTQKDVETMKSRSVANPLPSYRNNER 565 S +K+PVI G + V +++NG+ +Q + T S + + N +R Sbjct: 485 RKLSLPKKKTKKPVIDNGKSTVISFSNGI----DGSQVNSHTGPSMNTVSQTRDLLNGKR 544
Query: 566 ELHPSHNNYSEPQRDHKGIHHRGENELATFLPELEDTSKVRINIETSNL--GYPNHPHQA 625 N + K + + + + L+D VR N + + + Sbjct: 545 VGGLFDNRLASDGYFRKYLSQVNDKPITSL--HLQDNDYVRSRDAEPNCLRDFSSSSKSS 604
Query: 626 SDVFYGQGV---------RSVLNSKMANLRMPLPRQNADPHTDNSWSQLQNKDLYRRGNG 685 S + GV + S +NL++ P + + S++ KD Sbjct: 605 SGGWLRTGVDIVDFRNNNHNTNRSSFSNLKLRYPPSSTEV---ADLSRVLQKDASGADRK 664
Query: 686 KRTIEAQEPLALNKRQINQKMDQASDHGTSDDIPMEIVELMAKNQYERRLPDAE---NNN 745 +T+ QE + Q + + + ++ +DDIPMEIVELMAKNQYER LPD E +N Sbjct: 665 GKTVMVQEHHGAPRSQSHDRKETTTEEQNNDDIPMEIVELMAKNQYERCLPDKEEDVSNK 724
Query: 746 KHVSETGKFSRAVQVNNYGDVYRNGREL-----LQKPENLQQNAQARNGGKVVETRKQKS 805 + ET S+ + + + Y NG L + P+ NA+ Sbjct: 725 QPSQETAHKSKNALLIDLNETYDNGISLEDNNTSRPPKPCSSNAR--------------- 784
Query: 806 ADYFSNIRESHFDTNHPQQNH-----MLGCNGSIHSLVEPSNGIQYSSIGSKRKSCTEIR 865 RE HF Q +H S + P+ + SSI +C + Sbjct: 785 -------REEHFPMGRQQNSHDFFPISQPYVPSPFGIFPPTQENRASSIRFSGHNCQWLG 844
Query: 866 KCNGITVEGLYNSKVQSSEGCMDHLPVSEQNIEAAY-VWSSSSLMPDHLSNGYQKFPAHS 925 + + S + C V Q EA++ +W SS + P ++ S Sbjct: 845 NLPTVGNQNPSPSSFRVLRACDTCQSVPNQYREASHPIWPSSMIPPQ------SQYKPVS 904
Query: 926 TNSRKISSPRSFQMGNTNAQNHHIHHHTNLERHGRHNNNSEAYGQRFAESSFCHCPNVAE 985 N + ++P + + N +++ + N ++G + H V+ Sbjct: 905 LNINQSTNPGTLSQASNNENTWNLNFVAANGKQKCGPNPEFSFGCK-------HAAGVSS 964
Query: 986 LHHNPVGSLELYSNETISAMHLLSLMDARMQSNAPMTAGEKHKSSK--KSPVPRPRKAKE 1045 P+ + S +I A+HLLSL+D R++S P ++H ++K K P ++KE Sbjct: 965 SSSRPIDNFS--SESSIPALHLLSLLDPRLRSTTP---ADQHGNTKFTKRHFPPANQSKE 1024
Query: 1046 F------STTNICFNKTIQDINQFSSAFHDEVCISATNASASTFQNIRGFGTNSNFSGQA 1105 F ++ ++ + +S F E S +F GT+S A Sbjct: 1025 FIELQTGDSSKSAYSTKQIPFDLYSKRFTQE-------PSRKSFPITPPIGTSSLSFQNA 1084
Query: 1106 VFRPQYGAKMKCSDPSSWSKDQTLSKSQFRSGDLRTDDRAFPVNGIEKGVVNATNSEVL- 1165 + P + K + T K F S + D F + G A+NS +L Sbjct: 1085 SWSPHHQEKKTKRKDTFAPVYNTHEKPVFASSN---DQAKFQLLG-------ASNSMMLP 1086
Query: 1166 LVHHI-------ERSSEECKLVAHTRTLQNKKSTSETEICSVNKNPADFSLPEAGNIYMI 1193 L H+ +R +E C A ++N +S +CSVN+NPADF++PE GN+YM+ Sbjct: 1145 LKFHMTDKEKKQKRKAESCNNNASAGPVKN---SSGPIVCSVNRNPADFTIPEPGNVYML 1086
BLAST of Bhi04G000687 vs. ExPASy Swiss-Prot
Match: Q9LYD9 (Protein EMBRYONIC FLOWER 1 OS=Arabidopsis thaliana OX=3702 GN=EMF1 PE=1 SV=1)
HSP 1 Score: 125.6 bits (314), Expect = 3.9e-27 Identity = 274/1231 (22.26%), Positives = 474/1231 (38.51%), Query Frame = 0
Query: 26 IQIDSIYIDLFSSNHKCD-DQCELFSIRGYVSDMRKKDWKICWPFSDIENGHKLDDPILL 85 I+I+SI IDL + ++ D +C+ FS+RG+V++ R++D + CWPFS+ E+ +D Sbjct: 5 IKINSISIDLAGAANEIDMVKCDHFSMRGFVAETRERDLRKCWPFSE-ESVSLVDQQSYT 64
Query: 86 VPPVFDPSFNPQRGKSHWQESSDKAADKGFHFD---SCHNLGKISNSSPKAPKQDVINGR 145 +P + P F W D H H+ K +S + N Sbjct: 65 LPTLSVPKF-------RWWHCMSCIKDIDAHGPKDCGLHSNSKAIGNSSVIESKSKFNSL 124
Query: 146 TMADNASISGRQPSNCDQKEKKLDVAD------------RDNCTVALISQSEPGCASHGV 205 T+ D+ +KEKK D+AD D+ T + G G Sbjct: 125 TIIDH------------EKEKKTDIADNAIEEKVGVNCENDDQTATTFLKKARG-RPMGA 184
Query: 206 TEIEPVSGKLIPKATEESPAALQDGKQTHADRLNGQLTLVSENDSTVDVPRGHYTVTFQE 265 + + S KL+ SP Q G ++LN +S +V + T E Sbjct: 185 SNVRSKSRKLV------SPE--QVGNNRSKEKLNKPSMDISSWKEKQNVDQAVTTFGSSE 244
Query: 266 NG----DASMESNQSTDSLSESAETVGNSPHHCHLGK--LHRRRTPKVRLLTDLLGDNGN 325 D ++ ++ + E S +L L RR++ KVRLL++LLG+ Sbjct: 245 IAGVVEDTPPKATKNHKGIRGLMECDNGSSESINLAMSGLQRRKSRKVRLLSELLGN--- 304
Query: 326 MIAKHVESSPSDGSPEASVQADVRYAPKCQVTIEEDVWHSDHRRERRLPRNGKCRHQEIP 385 + S GS + EE + R R + + +P Sbjct: 305 -------TKTSGGS---------------NIRKEESALKKESVRGR--------KRKLLP 364
Query: 386 SSSSVDKKIQTWRGQIESSVSSLGNENAHSGIKQTMKGPWSSYKMDGNNSLRRKKSKKFP 445 ++ V + + T E++ S ++ +S + T G D ++++++F Sbjct: 365 ENNYVSRILSTMGATSENASKSCDSDQGNS--ESTDSG------FDRTPFKGKQRNRRFQ 424
Query: 446 VVDPYSVPLVPSKVKDQCEVQAITENRSEVAVDSAAILAYHNDFSSRTPHSTSLNAMESK 505 VVD + VP +P + + I E+ ++ + S H+ F+ ++ Sbjct: 425 VVDEF-VPSLPCETSQ----EGIKEHDADPSKRSTPA---HSLFTGNDSVPCPPGTQRTE 484
Query: 506 SGTSKNPNSSKEPVIFEGPTNVFAWNNGMLWRGSVTQKDVETMKSRSVANPLPSYRNNER 565 S +K+PVI G + V +++NG+ +Q + T S + + N +R Sbjct: 485 RKLSLPKKKTKKPVIDNGKSTVISFSNGI----DGSQVNSHTGPSMNTVSQTRDLLNGKR 544
Query: 566 ELHPSHNNYSEPQRDHKGIHHRGENELATFLPELEDTSKVRINIETSNL--GYPNHPHQA 625 N + K + + + + L+D VR N + + + Sbjct: 545 VGGLFDNRLASDGYFRKYLSQVNDKPITSL--HLQDNDYVRSRDAEPNCLRDFSSSSKSS 604
Query: 626 SDVFYGQGV---------RSVLNSKMANLRMPLPRQNADPHTDNSWSQLQNKDLYRRGNG 685 S + GV + S +NL++ P + + S++ KD Sbjct: 605 SGGWLRTGVDIVDFRNNNHNTNRSSFSNLKLRYPPSSTEV---ADLSRVLQKDASGADRK 664
Query: 686 KRTIEAQEPLALNKRQINQKMDQASDHGTSDDIPMEIVELMAKNQYERRLPDAE---NNN 745 +T+ QE + Q + + + ++ +DDIPMEIVELMAKNQYER LPD E +N Sbjct: 665 GKTVMVQEHHGAPRSQSHDRKETTTEEQNNDDIPMEIVELMAKNQYERCLPDKEEDVSNK 724
Query: 746 KHVSETGKFSRAVQVNNYGDVYRNGREL-----LQKPENLQQNAQARNGGKVVETRKQKS 805 + ET S+ + + + Y NG L + P+ NA+ Sbjct: 725 QPSQETAHKSKNALLIDLNETYDNGISLEDNNTSRPPKPCSSNAR--------------- 784
Query: 806 ADYFSNIRESHFDTNHPQQNH-----MLGCNGSIHSLVEPSNGIQYSSIGSKRKSCTEIR 865 RE HF Q +H S + P+ + SSI +C + Sbjct: 785 -------REEHFPMGRQQNSHDFFPISQPYVPSPFGIFPPTQENRASSIRFSGHNCQWLG 844
Query: 866 KCNGITVEGLYNSKVQSSEGCMDHLPVSEQNIEAAY-VWSSSSLMPDHLSNGYQKFPAHS 925 + + S + C V Q EA++ +W SS + P ++ S Sbjct: 845 NLPTVGNQNPSPSSFRVLRACDTCQSVPNQYREASHPIWPSSMIPPQ------SQYKPVS 904
Query: 926 TNSRKISSPRSFQMGNTNAQNHHIHHHTNLERHGRHNNNSEAYGQRFAESSFCHCPNVAE 985 N + ++P + + N +++ + N ++G + H V+ Sbjct: 905 LNINQSTNPGTLSQASNNENTWNLNFVAANGKQKCGPNPEFSFGCK-------HAAGVSS 964
Query: 986 LHHNPVGSLELYSNETISAMHLLSLMDARMQSNAPMTAGEKHKSSK--KSPVPRPRKAKE 1045 P+ + S +I A+HLLSL+D R++S P ++H ++K K P ++KE Sbjct: 965 SSSRPIDNFS--SESSIPALHLLSLLDPRLRSTTP---ADQHGNTKFTKRHFPPANQSKE 1024
Query: 1046 F------STTNICFNKTIQDINQFSSAFHDEVCISATNASASTFQNIRGFGTNSNFSGQA 1105 F ++ ++ + +S F E S +F GT+S A Sbjct: 1025 FIELQTGDSSKSAYSTKQIPFDLYSKRFTQE-------PSRKSFPITPPIGTSSLSFQNA 1084
Query: 1106 VFRPQYGAKMKCSDPSSWSKDQTLSKSQFRSGDLRTDDRAFPVNGIEKGVVNATNSEVL- 1165 + P + K + T K F S + D F + G A+NS +L Sbjct: 1085 SWSPHHQEKKTKRKDTFAPVYNTHEKPVFASSN---DQAKFQLLG-------ASNSMMLP 1086
Query: 1166 LVHHI-------ERSSEECKLVAHTRTLQNKKSTSETEICSVNKNPADFSLPEAGNIYMI 1193 L H+ +R +E C A ++N +S +CSVN+NPADF++PE GN+YM+ Sbjct: 1145 LKFHMTDKEKKQKRKAESCNNNASAGPVKN---SSGPIVCSVNRNPADFTIPEPGNVYML 1086
BLAST of Bhi04G000687 vs. ExPASy TrEMBL
Match: A0A1S3BB95 (protein EMBRYONIC FLOWER 1-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103488193 PE=4 SV=1)
HSP 1 Score: 1765.0 bits (4570), Expect = 0.0e+00 Identity = 933/1209 (77.17%), Positives = 1016/1209 (84.04%), Query Frame = 0
Query: 2 MHRINVMEGNNHHDGTHSKPARKFIQIDSIYIDLFSSNHKCDDQ-CELFSIRGYVSDMRK 61 MHRINVME NNHHDGT ++PARKF+QIDSIYIDLFSS+HKCD Q CELFSIRGYVSDM K Sbjct: 1 MHRINVMEENNHHDGTDTRPARKFVQIDSIYIDLFSSDHKCDGQNCELFSIRGYVSDMHK 60
Query: 62 KDWKICWPFSDI-ENGHKLDDPILLVPPVFDPSFNPQRGKSHWQESSDKAADKGFHFDSC 121 KDWKICWPFSDI +NGHK ++PI LVP VFDPSF+ +GK HWQE+SDKAAD+GF FDSC Sbjct: 61 KDWKICWPFSDIMDNGHKSNEPIPLVPSVFDPSFDAYQGKIHWQETSDKAADQGFLFDSC 120
Query: 122 HNLGKISNSSPKAPKQDVINGRT-MADNASISGRQPSNCDQKEKKLDVADR-DNCTVALI 181 NLGKISNSSP A KQDVI+GRT MADN S S+CDQKEK L+VADR DNCTVALI Sbjct: 121 QNLGKISNSSPNASKQDVISGRTIMADNVS-----NSSCDQKEKTLNVADRSDNCTVALI 180
Query: 182 SQSEPGCASHGVTEIEPVSGKLIPKATEESPAALQDGKQTHADRLNGQLT-LVSENDSTV 241 SQSEPGCASHGVTEIEPVS L KATEES AALQDG+QT AD LNGQLT LVSE D V Sbjct: 181 SQSEPGCASHGVTEIEPVSRNLTLKATEESLAALQDGQQTPADCLNGQLTLLVSEKDDMV 240
Query: 242 DVPRGHYTVTFQENGDASMESNQSTDSLSESAETVGNSPHHCHLGKLHRRRTPKVRLLTD 301 DV GH+TV Q NGDASMESN ST S SESAETVGNSPH+CHLG+LHRRRTPK+RLLTD Sbjct: 241 DVAHGHHTVKVQGNGDASMESNDSTVSSSESAETVGNSPHNCHLGRLHRRRTPKIRLLTD 300
Query: 302 LLGDNGNMIAKHVESSPSDGSPEASVQADVRYAPKCQVTIEEDVWHSDHRRERRLPRNGK 361 LLGDNGNM+ KHVESS SDGSPEAS QADVR+ KCQV IEED HSDH+RERRL RNGK Sbjct: 301 LLGDNGNMVVKHVESSLSDGSPEASEQADVRFTSKCQVIIEEDASHSDHKRERRLARNGK 360
Query: 362 CRHQEIPSSSSVDKKIQTWRGQIESSVSSLGNENAHSGIKQTMKGPWSSYKMDGNNSLRR 421 CRHQEIPSSSSVDK+IQTW G+IESSVS LG ENA SG+K+T+KGPW SYKMDGN+SLRR Sbjct: 361 CRHQEIPSSSSVDKQIQTWMGEIESSVSCLGTENALSGMKKTIKGPWCSYKMDGNSSLRR 420
Query: 422 KKSKKFPVVDPYSVPLVPSKVKDQCEVQAITENRSEVAVDSAAILAYHNDFSSRTPHSTS 481 KKS+KFPVVDPYS+ L+PSK KDQCE+ ENRSEVAVDS AI A+HN+FS R PHS S Sbjct: 421 KKSRKFPVVDPYSMSLLPSKAKDQCEIWERNENRSEVAVDSVAIFAHHNEFSCRIPHSLS 480
Query: 482 LNAMESKSGTSKNPNSSKEPVIFEGPTNVFAWNNGMLWRGSVTQKDVETMKSRSVANPLP 541 NA+ESK TS NPNSS EPV+FEGPTNVF WNN +LWRGSVTQKDVETM SR ANP Sbjct: 481 SNAIESKPSTSGNPNSSNEPVVFEGPTNVFPWNNRILWRGSVTQKDVETMNSRPAANPST 540
Query: 542 SYRNNERELHPSHNNYSEPQRDHKGIHHRGENELATFLPELEDTSKV-RIN-IETSNLGY 601 +Y+ NERELHPS +NYS PQ+DHKGI GENEL+TF+PE ++TSKV ++N T N Sbjct: 541 NYKKNERELHPSLDNYSSPQKDHKGIRCHGENELSTFVPEQDNTSKVSQLNGNRTGNHRD 600
Query: 602 PNHPHQASDVFYGQGVRSVLNSKMANLRMPLPRQNADPHTDNSWSQLQNKDLYRRGNGKR 661 PN+P QASDV G GV +VLNSKM NLRMPLPR DP TDNS SQLQNKDL+ RGNGKR Sbjct: 601 PNYPPQASDVICGNGVETVLNSKMTNLRMPLPR---DPQTDNSRSQLQNKDLHTRGNGKR 660
Query: 662 TIEAQEPLALNKRQINQKMDQASDHGTSDDIPMEIVELMAKNQYERRLPDAENNNKHVSE 721 TIEAQEPL L KRQINQ+ DQ SD GTSDDIPMEIVELMAKNQYERRLPDAENN KHVSE Sbjct: 661 TIEAQEPLTLKKRQINQRTDQPSDRGTSDDIPMEIVELMAKNQYERRLPDAENNYKHVSE 720
Query: 722 TGKFSRAVQVNNYGDVYRNGRELLQKPENLQQNAQARNGG-------KVVETRKQKSADY 781 TGKFSRAVQ NNYG VYRNGRELLQKPENL+QNAQ RNGG +VVE R Q SA+Y Sbjct: 721 TGKFSRAVQANNYGYVYRNGRELLQKPENLKQNAQERNGGNGSICAREVVEARTQTSANY 780
Query: 782 FSNIRESHFDTNHPQQNHMLGCNGSIHSLVEPSNGIQYSSIGSKRKSCTEIRKCNGITVE 841 FSNI ES F NH QQNHML CNGS HS EPS G+QYSSIGSKRK +EIRKCNG TVE Sbjct: 781 FSNIGESQFGMNHLQQNHMLRCNGSTHSFEEPSTGMQYSSIGSKRKIRSEIRKCNGTTVE 840
Query: 842 -GLYNSKVQSSEGCMDHLPVSEQNIEAAYVWSSSSLMPDHLSNGYQKFPAHSTNSRKISS 901 G YNSKVQ SEG +DHLPVSEQNIEAAY+W S+ L+PDHLSNGYQ FPAHST+SRKISS Sbjct: 841 SGPYNSKVQYSEGFIDHLPVSEQNIEAAYIW-STPLIPDHLSNGYQNFPAHSTDSRKISS 900
Query: 902 PRSFQMGNTNAQNHHIHHHTNLERHGRHNNNSEAYGQRFAESSFCHCPNVAELHHNPVGS 961 PRSFQMGNTNAQNH HH TNLERHGR ++EAY QRFAESSFC PNV ELHHNPVGS Sbjct: 901 PRSFQMGNTNAQNHRNHHPTNLERHGR-QKSTEAYSQRFAESSFCRHPNVVELHHNPVGS 960
Query: 962 LELYSNETISAMHLLSLMDARMQSNAPMTAGEKHKSSKKSPVPRPRKAKEFSTTNICFNK 1021 LELYSNE ISA+HLLSLMDARMQSNAP TAGEKHK SKK PVPRP+KA+EFS T+ICFNK Sbjct: 961 LELYSNEAISALHLLSLMDARMQSNAPTTAGEKHKPSKKPPVPRPQKAEEFSATDICFNK 1020
Query: 1022 TIQDINQFSSAFHDEVCISATNASASTFQNIRGFGTNSNFSGQAVFRPQYGAKMKCSDPS 1081 TIQDI+QFSSAFHDE+C S T+AS STFQ+ RGFG+ +NFS Q VFR Q GAKMKCSD S Sbjct: 1021 TIQDISQFSSAFHDELCSSPTDASTSTFQHSRGFGSGTNFSSQVVFRSQNGAKMKCSDSS 1080
Query: 1082 SWSKDQTLSKSQFRSGDLRTDDRAFPVNGIEKGVVNATNSEVL-LVHHIERSSEECKLVA 1141 S SKDQ LSKS+F SG DDR FPVNGIEKG+VNA+NSE L HH++R+SEECKLVA Sbjct: 1081 SGSKDQKLSKSRFISG----DDRTFPVNGIEKGLVNASNSEAFALAHHMKRNSEECKLVA 1140
Query: 1142 HTRTLQNKKSTSETEICSVNKNPADFSLPEAGNIYMIGAEEFNFGRTLFSKNRSSSICFN 1195 T+TLQN+KSTSETEIC VNKNPADFSLPEAGNIYMIGAEEFNFGRT KNRS SICFN Sbjct: 1141 PTQTLQNEKSTSETEICRVNKNPADFSLPEAGNIYMIGAEEFNFGRTFLPKNRSGSICFN 1195
BLAST of Bhi04G000687 vs. ExPASy TrEMBL
Match: A0A0A0LPT5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G375180 PE=4 SV=1)
HSP 1 Score: 1760.3 bits (4558), Expect = 0.0e+00 Identity = 935/1211 (77.21%), Positives = 1015/1211 (83.82%), Query Frame = 0
Query: 1 MMHRINVMEGNNHHDGTHSKPARKFIQIDSIYIDLFSSNHKCDDQ-CELFSIRGYVSDMR 60 MMHRINVME NNHHDGT S+PAR F+QIDSIYIDLFSS+H CDDQ CELFSIRGYVSDM Sbjct: 1 MMHRINVMEENNHHDGTDSRPARNFVQIDSIYIDLFSSDHICDDQKCELFSIRGYVSDMH 60
Query: 61 KKDWKICWPFSD-IENGHKLDDPILLVPPVFDPSFNPQRGKSHWQESSDKAADKGFHFDS 120 KKDWKIC PFSD I+NGHKL++PI VP V DPSF+ +GK HWQE+SDK AD+GF FD Sbjct: 61 KKDWKICSPFSDIIDNGHKLNEPIASVPSVLDPSFDAYQGKIHWQETSDKDADQGFLFD- 120
Query: 121 CHNLGKISNSSPKAPKQDVINGRT-MADNASISGRQPSNCDQKEKKLDVADR-DNCTVAL 180 HNLGK SNSSP A KQDVI+GRT MADN S S DQKEKKL+VADR DNCTVAL Sbjct: 121 -HNLGKFSNSSPNASKQDVISGRTIMADNVS-----NSYYDQKEKKLNVADRSDNCTVAL 180
Query: 181 ISQSEPGCASHGVTEIEPVSGKLIPKATEESPAALQDGKQTHADRLNGQLT-LVSENDST 240 ISQSEPGCASHGVTEIE VS L KA EES AALQDGKQT AD LNGQLT LVSE D Sbjct: 181 ISQSEPGCASHGVTEIELVSRNLTLKAAEESLAALQDGKQTPADCLNGQLTLLVSEKDDM 240
Query: 241 VDVPRGHYTVTFQENGDASMESNQSTDSLSESAETVGNSPHHCHLGKLHRRRTPKVRLLT 300 VDV GH+TV Q NGDASMESN+ST S SESAETVGNSPH+CHLG+LHRRRTPK+RLLT Sbjct: 241 VDVVHGHHTVKVQGNGDASMESNESTVSSSESAETVGNSPHNCHLGRLHRRRTPKIRLLT 300
Query: 301 DLLGDNGNMIAKHV-ESSPSDGSPEASVQADVRYAPKCQVTIEEDVWHSDHRRERRLPRN 360 DLLGDNGNM+ KHV +SSPSDGSPEAS QADVR+ KCQVTIEED H DH+RERRL RN Sbjct: 301 DLLGDNGNMVVKHVDQSSPSDGSPEASEQADVRFTSKCQVTIEEDASHPDHKRERRLARN 360
Query: 361 GKCRHQEIPSSSSVDKKIQTWRGQIESSVSSLGNENAHSGIKQTMKGPWSSYKMDGNNSL 420 GKCRHQEIPSSSSVDK+IQTWRG+IESSVS LG ENA SG+K TMKGPW SYKMDGN+SL Sbjct: 361 GKCRHQEIPSSSSVDKQIQTWRGEIESSVSCLGTENAPSGMKSTMKGPWCSYKMDGNSSL 420
Query: 421 RRKKSKKFPVVDPYSVPLVPSKVKDQCEVQAITENRSEVAVDSAAILAYHNDFSSRTPHS 480 RRKKSKKFPVVDPYS+ L PS+VKDQCE+ I ENRSEVAVDS AI A+HN+FS R PHS Sbjct: 421 RRKKSKKFPVVDPYSMSLTPSEVKDQCEIWEINENRSEVAVDSVAIFAHHNEFSCRIPHS 480
Query: 481 TSLNAMESKSGTSKNPNSSKEPVIFEGPTNVFAWNNGMLWRGSVTQKDVETMKSRSVANP 540 S N +ESK GTS NPNSSKEPV+FEGPTNV WNN +LWRGSVTQKDVETM ANP Sbjct: 481 ISSNVIESKPGTSGNPNSSKEPVVFEGPTNVVPWNNRILWRGSVTQKDVETMNGNPAANP 540
Query: 541 LPSYRNNERELHPSHNNYSEPQRDHKGIHHRGENELATFLPELEDTSKV-RIN-IETSNL 600 P+++ NERE HPS NNYS Q+DHKGI RGENEL+TF+PE +DTSKV ++N T + Sbjct: 541 FPNFKKNEREWHPSLNNYSSLQKDHKGIRCRGENELSTFVPEQDDTSKVSQLNGNRTGSH 600
Query: 601 GYPNHPHQASDVFYGQGVRSVLNSKMANLRMPLPRQNADPHTDNSWSQLQNKDLYRRGNG 660 PN+PHQASDV G GV +V+NSKM NL+M LPR DP TDNS SQLQNKDL RRGNG Sbjct: 601 RDPNYPHQASDVICGHGVDTVMNSKMTNLKMSLPR---DPQTDNSQSQLQNKDLLRRGNG 660
Query: 661 KRTIEAQEPLALNKRQINQKMDQASDHGTSDDIPMEIVELMAKNQYERRLPDAENNNKHV 720 KRTIEAQEPLAL KRQINQ+ DQ SD GTSDDIPMEIVELMAKNQYERRLPDAENN KHV Sbjct: 661 KRTIEAQEPLALKKRQINQRTDQPSDRGTSDDIPMEIVELMAKNQYERRLPDAENNYKHV 720
Query: 721 SETGKFSRAVQVNNYGDVYRNGRELLQKPENLQQNAQARNGG-------KVVETRKQKSA 780 SETGKFSRAVQVNNY VYRNGRELLQKP NL+QNAQ RNGG +VVE R A Sbjct: 721 SETGKFSRAVQVNNYDYVYRNGRELLQKPGNLKQNAQERNGGNGLICAREVVEARTHTPA 780
Query: 781 DYFSNIRESHFDTNHPQQNHMLGCNGSIHSLVEPSNGIQYSSIGSKRKSCTEIRKCNGIT 840 +YFSNI ES F +H QQNHML CN SIHSL EPSNG+QYSSIGSKRK +EIRKCNG T Sbjct: 781 NYFSNIGESQFGISHLQQNHMLRCNDSIHSLEEPSNGMQYSSIGSKRKIRSEIRKCNGTT 840
Query: 841 VE-GLYNSKVQSSEGCMDHLPVSEQNIEAAYVWSSSSLMPDHLSNGYQKFPAHSTNSRKI 900 VE G YNSKVQ SEGC+DHLPVSEQNIEAAY+WS+SSLMPDH+SNGYQ FPAHST+SRKI Sbjct: 841 VESGPYNSKVQYSEGCIDHLPVSEQNIEAAYLWSTSSLMPDHMSNGYQNFPAHSTDSRKI 900
Query: 901 SSPRSFQMGNTNAQNHHIHHHTNLERHGRHNNNSEAYGQRFAESSFCHCPNVAELHHNPV 960 SSPR+FQMGNTNAQNHH HH TNLERHGR ++EAY QRFAESSFC PNV EL HNPV Sbjct: 901 SSPRTFQMGNTNAQNHHNHHPTNLERHGR-QKSTEAYSQRFAESSFCRHPNVVELQHNPV 960
Query: 961 GSLELYSNETISAMHLLSLMDARMQSNAPMTAGEKHKSSKKSPVPRPRKAKEFSTTNICF 1020 GSLELYSNE ISAMHLLSLMDARMQSNAP TAGEKH+ SKK PVPR +KA+EFS T+ICF Sbjct: 961 GSLELYSNEAISAMHLLSLMDARMQSNAPTTAGEKHRPSKKPPVPRTQKAEEFSATDICF 1020
Query: 1021 NKTIQDINQFSSAFHDEVCISATNASASTFQNIRGFGTNSNFSGQAVFRPQYGAKMKCSD 1080 NKTIQD++QFSSAFHDEVC SATNAS STFQ+ RGFG+ +NFS QAVFR Q GAKMKCSD Sbjct: 1021 NKTIQDMSQFSSAFHDEVCSSATNASTSTFQHSRGFGSGTNFSSQAVFRSQNGAKMKCSD 1080
Query: 1081 PSSWSKDQTLSKSQFRSGDLRTDDRAFPVNGIEKGVVNATNSEV-LLVHHIERSSEECKL 1140 SSWSKDQ LSKS F SG DDR FPVNGIEKG+VNA+NSEV +L HH++R+SEECKL Sbjct: 1081 SSSWSKDQKLSKSHFISG----DDRTFPVNGIEKGLVNASNSEVFVLAHHMKRNSEECKL 1140
Query: 1141 VAHTRTLQNKKSTSETEICSVNKNPADFSLPEAGNIYMIGAEEFNFGRTLFSKNRSSSIC 1195 VAHTRTLQN+KSTSETEIC VNKNPADFSLPEAGN YMIGAE+FNFGRT KNRS SIC Sbjct: 1141 VAHTRTLQNEKSTSETEICCVNKNPADFSLPEAGNRYMIGAEDFNFGRTFLPKNRSGSIC 1196
BLAST of Bhi04G000687 vs. ExPASy TrEMBL
Match: A0A5A7VH13 (Protein EMBRYONIC FLOWER 1-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003580 PE=4 SV=1) HSP 1 Score: 1635.5 bits (4234), Expect = 0.0e+00 Identity = 870/1137 (76.52%), Positives = 950/1137 (83.55%), Query Frame = 0
Query: 72 IENGHKLDDPILLVPPVFDPSFNPQRGKSHWQESSDKAADKGFHFDSCHNLGKISNSSPK 131 ++NGHK ++PI LVP VFDPSF+ +GK HWQE+SDKAAD+GF FDSC NLGKISNSSP Sbjct: 1 MDNGHKSNEPIPLVPSVFDPSFDAYQGKIHWQETSDKAADQGFLFDSCQNLGKISNSSPN 60
Query: 132 APKQDVINGRT-MADNASISGRQPSNCDQKEKKLDVADR-DNCTVALISQSEPGCASHGV 191 A KQDVI+GRT MADN S S+CDQKEK L+VADR DNCTVALISQSEPGCASHGV Sbjct: 61 ASKQDVISGRTIMADNVS-----NSSCDQKEKTLNVADRSDNCTVALISQSEPGCASHGV 120
Query: 192 TEIEPVSGKLIPKATEESPAALQDGKQTHADRLNGQLT-LVSENDSTVDVPRGHYTVTFQ 251 TEIEPVS L KATEES AALQDG+QT AD LNGQLT LVSE D VDV GH+TV Q Sbjct: 121 TEIEPVSRNLTLKATEESLAALQDGQQTPADCLNGQLTLLVSEKDDMVDVAHGHHTVKVQ 180
Query: 252 ENGDASMESNQSTDSLSESAETVGNSPHHCHLGKLHRRRTPKVRLLTDLLGDNGNMIAKH 311 NGDASMESN ST S SESAETVGNSPH+CHLG+LHRRRTPK+RLLTDLLGDNGNM+ KH Sbjct: 181 GNGDASMESNDSTVSSSESAETVGNSPHNCHLGRLHRRRTPKIRLLTDLLGDNGNMVVKH 240
Query: 312 VESSPSDGSPEASVQADVRYAPKCQVTIEEDVWHSDHRRERRLPRNGKCRHQEIPSSSSV 371 VESS SDGSPEAS QADVR+ KCQV IEED HSDH+RERRL RNGKCRHQEIPSSSSV Sbjct: 241 VESSLSDGSPEASEQADVRFTSKCQVIIEEDASHSDHKRERRLARNGKCRHQEIPSSSSV 300
Query: 372 DKKIQTWRGQIESSVSSLGNENAHSGIKQTMKGPWSSYKMDGNNSLRRKKSKKFPVVDPY 431 DK+IQTW G+IESSVS LG ENA SG+K+T+KGPW SYKMDGN+SLRRKKS+KFPVVDPY Sbjct: 301 DKQIQTWMGEIESSVSCLGTENALSGMKKTIKGPWCSYKMDGNSSLRRKKSRKFPVVDPY 360
Query: 432 SVPLVPSKVKDQCEVQAITENRSEVAVDSAAILAYHNDFSSRTPHSTSLNAMESKSGTSK 491 S+ L+PSK KDQCE+ ENRSEVAVDS AI A+HN+FS R PHS S NA+ESK TS Sbjct: 361 SMSLLPSKAKDQCEIWERNENRSEVAVDSVAIFAHHNEFSCRIPHSLSSNAIESKPSTSG 420
Query: 492 NPNSSKEPVIFEGPTNVFAWNNGMLWRGSVTQKDVETMKSRSVANPLPSYRNNERELHPS 551 NPNSS EPV+FEGPTNVF WNN +LWRGSVTQKDVETM SR ANP +Y+ NERELHPS Sbjct: 421 NPNSSNEPVVFEGPTNVFPWNNRILWRGSVTQKDVETMNSRPAANPSTNYKKNERELHPS 480
Query: 552 HNNYSEPQRDHKGIHHRGENELATFLPELEDTSKV-RIN-IETSNLGYPNHPHQASDVFY 611 +NYS PQ+DHKGI GENEL+TF+PE ++TSKV ++N T N PN+P QASDV Sbjct: 481 LDNYSSPQKDHKGIRCHGENELSTFVPEQDNTSKVSQLNGNRTGNHRDPNYPPQASDVIC 540
Query: 612 GQGVRSVLNSKMANLRMPLPRQNADPHTDNSWSQLQNKDLYRRGNGKRTIEAQEPLALNK 671 G GV +VLNSKM NLRMPLPR DP TDNS SQLQNKDL+ RGNGKRTIEAQEPL L K Sbjct: 541 GNGVETVLNSKMTNLRMPLPR---DPQTDNSRSQLQNKDLHTRGNGKRTIEAQEPLTLKK 600
Query: 672 RQINQKMDQASDHGTSDDIPMEIVELMAKNQYERRLPDAENNNKHVSETGKFSRAVQVNN 731 RQINQ+ DQ SD GTSDDIPMEIVELMAKNQYERRLPDAENN KHVSETGKFSRAVQ NN Sbjct: 601 RQINQRTDQPSDRGTSDDIPMEIVELMAKNQYERRLPDAENNYKHVSETGKFSRAVQANN 660
Query: 732 YGDVYRNGRELLQKPENLQQNAQARNGG-------KVVETRKQKSADYFSNIRESHFDTN 791 YG VYRNGRELLQKPENL+QNAQ RNGG +VVE R Q SA+YFSNI ES F N Sbjct: 661 YGYVYRNGRELLQKPENLKQNAQERNGGNGSICAREVVEARTQTSANYFSNIGESQFGMN 720
Query: 792 HPQQNHMLGCNGSIHSLVEPSNGIQYSSIGSKRKSCTEIRKCNGITVE-GLYNSKVQSSE 851 H QQNHML CNGS HS EPS G+QYSSIGSKRK +EIRKCNG TVE G YNSKVQ SE Sbjct: 721 HLQQNHMLRCNGSTHSFEEPSTGMQYSSIGSKRKIRSEIRKCNGTTVESGPYNSKVQYSE 780
Query: 852 GCMDHLPVSEQNIEAAYVWSSSSLMPDHLSNGYQKFPAHSTNSRKISSPRSFQMGNTNAQ 911 G +DHLPVSEQNIEAAY+W S+ L+PDHLSNGYQ FPAHST+SRKISSPRSFQMGNTNAQ Sbjct: 781 GFIDHLPVSEQNIEAAYIW-STPLIPDHLSNGYQNFPAHSTDSRKISSPRSFQMGNTNAQ 840
Query: 912 NHHIHHHTNLERHGRHNNNSEAYGQRFAESSFCHCPNVAELHHNPVGSLELYSNETISAM 971 NH HH TNLERHGR ++EAY QRFAESSFC PNV ELHHNPVGSLELYSNE ISA+ Sbjct: 841 NHRNHHPTNLERHGR-QKSTEAYSQRFAESSFCRHPNVVELHHNPVGSLELYSNEAISAL 900
Query: 972 HLLSLMDARMQSNAPMTAGEKHKSSKKSPVPRPRKAKEFSTTNICFNKTIQDINQFSSAF 1031 HLLSLMDARMQSNAP TAGEKHK SKK PVPRP+KA+EFS T+ICFNKTIQDI+QFSSAF Sbjct: 901 HLLSLMDARMQSNAPTTAGEKHKPSKKPPVPRPQKAEEFSATDICFNKTIQDISQFSSAF 960
Query: 1032 HDEVCISATNASASTFQNIRGFGTNSNFSGQAVFRPQYGAKMKCSDPSSWSKDQTLSKSQ 1091 HDE+C S T+AS STFQ+ RGFG+ +NFS Q VFR Q GAKMKCSD SS SKDQ LSKS+ Sbjct: 961 HDELCSSPTDASTSTFQHSRGFGSGTNFSSQVVFRSQNGAKMKCSDSSSGSKDQKLSKSR 1020
Query: 1092 FRSGDLRTDDRAFPVNGIEKGVVNATNSEVL-LVHHIERSSEECKLVAHTRTLQNKKSTS 1151 F SG DDR FPVNGIEKG+VNA+NSE L HH++R+SEECKLVA T+TLQN+KSTS Sbjct: 1021 FISG----DDRTFPVNGIEKGLVNASNSEAFALAHHMKRNSEECKLVAPTQTLQNEKSTS 1080
Query: 1152 ETEICSVNKNPADFSLPEAGNIYMIGAEEFNFGRTLFSKNRSSSICFNDRYKQQRIV 1195 ETEIC VNKNPADFSLPEAGNIYMIGAEEFNFGRT KNRS SICFN+RYKQQ + Sbjct: 1081 ETEICRVNKNPADFSLPEAGNIYMIGAEEFNFGRTFLPKNRSGSICFNNRYKQQTFI 1123
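The identity and positives percentages in the HSP header above are the reported match counts divided by the alignment length. A minimal Python sketch of that arithmetic follows. The helper name `percent` is hypothetical; the three input numbers are taken from the A0A5A7VH13 header.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts reported for HSP 1 of A0A5A7VH13: identical residues,
# positive-scoring residues, and the total alignment length.
identities, positives, length = 870, 950, 1137

identity_pct = percent(identities, length)
positive_pct = percent(positives, length)
print(f"Identity = {identity_pct}%, Positives = {positive_pct}%")
```

The same function applied to the other two HSP headers (746/970 and 795/1218) should give the identity values listed in the summary table for those matches.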
BLAST of Bhi04G000687 vs. ExPASy TrEMBL
Match: A0A1S4DV99 (protein EMBRYONIC FLOWER 1-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103488193 PE=4 SV=1) HSP 1 Score: 1416.7 bits (3666), Expect = 0.0e+00 Identity = 746/970 (76.91%), Positives = 814/970 (83.92%), Query Frame = 0
Query: 236 VDVPRGHYTVTFQENGDASMESNQSTDSLSESAETVGNSPHHCHLGKLHRRRTPKVRLLT 295 VDV GH+TV Q NGDASMESN ST S SESAETVGNSPH+CHLG+LHRRRTPK+RLLT Sbjct: 2 VDVAHGHHTVKVQGNGDASMESNDSTVSSSESAETVGNSPHNCHLGRLHRRRTPKIRLLT 61
Query: 296 DLLGDNGNMIAKHVESSPSDGSPEASVQADVRYAPKCQVTIEEDVWHSDHRRERRLPRNG 355 DLLGDNGNM+ KHVESS SDGSPEAS QADVR+ KCQV IEED HSDH+RERRL RNG Sbjct: 62 DLLGDNGNMVVKHVESSLSDGSPEASEQADVRFTSKCQVIIEEDASHSDHKRERRLARNG 121
Query: 356 KCRHQEIPSSSSVDKKIQTWRGQIESSVSSLGNENAHSGIKQTMKGPWSSYKMDGNNSLR 415 KCRHQEIPSSSSVDK+IQTW G+IESSVS LG ENA SG+K+T+KGPW SYKMDGN+SLR Sbjct: 122 KCRHQEIPSSSSVDKQIQTWMGEIESSVSCLGTENALSGMKKTIKGPWCSYKMDGNSSLR 181
Query: 416 RKKSKKFPVVDPYSVPLVPSKVKDQCEVQAITENRSEVAVDSAAILAYHNDFSSRTPHST 475 RKKS+KFPVVDPYS+ L+PSK KDQCE+ ENRSEVAVDS AI A+HN+FS R PHS Sbjct: 182 RKKSRKFPVVDPYSMSLLPSKAKDQCEIWERNENRSEVAVDSVAIFAHHNEFSCRIPHSL 241
Query: 476 SLNAMESKSGTSKNPNSSKEPVIFEGPTNVFAWNNGMLWRGSVTQKDVETMKSRSVANPL 535 S NA+ESK TS NPNSS EPV+FEGPTNVF WNN +LWRGSVTQKDVETM SR ANP Sbjct: 242 SSNAIESKPSTSGNPNSSNEPVVFEGPTNVFPWNNRILWRGSVTQKDVETMNSRPAANPS 301
Query: 536 PSYRNNERELHPSHNNYSEPQRDHKGIHHRGENELATFLPELEDTSKV-RIN-IETSNLG 595 +Y+ NERELHPS +NYS PQ+DHKGI GENEL+TF+PE ++TSKV ++N T N Sbjct: 302 TNYKKNERELHPSLDNYSSPQKDHKGIRCHGENELSTFVPEQDNTSKVSQLNGNRTGNHR 361
Query: 596 YPNHPHQASDVFYGQGVRSVLNSKMANLRMPLPRQNADPHTDNSWSQLQNKDLYRRGNGK 655 PN+P QASDV G GV +VLNSKM NLRMPLPR DP TDNS SQLQNKDL+ RGNGK Sbjct: 362 DPNYPPQASDVICGNGVETVLNSKMTNLRMPLPR---DPQTDNSRSQLQNKDLHTRGNGK 421
Query: 656 RTIEAQEPLALNKRQINQKMDQASDHGTSDDIPMEIVELMAKNQYERRLPDAENNNKHVS 715 RTIEAQEPL L KRQINQ+ DQ SD GTSDDIPMEIVELMAKNQYERRLPDAENN KHVS Sbjct: 422 RTIEAQEPLTLKKRQINQRTDQPSDRGTSDDIPMEIVELMAKNQYERRLPDAENNYKHVS 481
Query: 716 ETGKFSRAVQVNNYGDVYRNGRELLQKPENLQQNAQARNGG-------KVVETRKQKSAD 775 ETGKFSRAVQ NNYG VYRNGRELLQKPENL+QNAQ RNGG +VVE R Q SA+ Sbjct: 482 ETGKFSRAVQANNYGYVYRNGRELLQKPENLKQNAQERNGGNGSICAREVVEARTQTSAN 541
Query: 776 YFSNIRESHFDTNHPQQNHMLGCNGSIHSLVEPSNGIQYSSIGSKRKSCTEIRKCNGITV 835 YFSNI ES F NH QQNHML CNGS HS EPS G+QYSSIGSKRK +EIRKCNG TV Sbjct: 542 YFSNIGESQFGMNHLQQNHMLRCNGSTHSFEEPSTGMQYSSIGSKRKIRSEIRKCNGTTV 601
Query: 836 E-GLYNSKVQSSEGCMDHLPVSEQNIEAAYVWSSSSLMPDHLSNGYQKFPAHSTNSRKIS 895 E G YNSKVQ SEG +DHLPVSEQNIEAAY+W S+ L+PDHLSNGYQ FPAHST+SRKIS Sbjct: 602 ESGPYNSKVQYSEGFIDHLPVSEQNIEAAYIW-STPLIPDHLSNGYQNFPAHSTDSRKIS 661
Query: 896 SPRSFQMGNTNAQNHHIHHHTNLERHGRHNNNSEAYGQRFAESSFCHCPNVAELHHNPVG 955 SPRSFQMGNTNAQNH HH TNLERHGR ++EAY QRFAESSFC PNV ELHHNPVG Sbjct: 662 SPRSFQMGNTNAQNHRNHHPTNLERHGR-QKSTEAYSQRFAESSFCRHPNVVELHHNPVG 721
Query: 956 SLELYSNETISAMHLLSLMDARMQSNAPMTAGEKHKSSKKSPVPRPRKAKEFSTTNICFN 1015 SLELYSNE ISA+HLLSLMDARMQSNAP TAGEKHK SKK PVPRP+KA+EFS T+ICFN Sbjct: 722 SLELYSNEAISALHLLSLMDARMQSNAPTTAGEKHKPSKKPPVPRPQKAEEFSATDICFN 781
Query: 1016 KTIQDINQFSSAFHDEVCISATNASASTFQNIRGFGTNSNFSGQAVFRPQYGAKMKCSDP 1075 KTIQDI+QFSSAFHDE+C S T+AS STFQ+ RGFG+ +NFS Q VFR Q GAKMKCSD Sbjct: 782 KTIQDISQFSSAFHDELCSSPTDASTSTFQHSRGFGSGTNFSSQVVFRSQNGAKMKCSDS 841
Query: 1076 SSWSKDQTLSKSQFRSGDLRTDDRAFPVNGIEKGVVNATNSEVL-LVHHIERSSEECKLV 1135 SS SKDQ LSKS+F SG DDR FPVNGIEKG+VNA+NSE L HH++R+SEECKLV Sbjct: 842 SSGSKDQKLSKSRFISG----DDRTFPVNGIEKGLVNASNSEAFALAHHMKRNSEECKLV 901
Query: 1136 AHTRTLQNKKSTSETEICSVNKNPADFSLPEAGNIYMIGAEEFNFGRTLFSKNRSSSICF 1195 A T+TLQN+KSTSETEIC VNKNPADFSLPEAGNIYMIGAEEFNFGRT KNRS SICF Sbjct: 902 APTQTLQNEKSTSETEICRVNKNPADFSLPEAGNIYMIGAEEFNFGRTFLPKNRSGSICF 961
BLAST of Bhi04G000687 vs. ExPASy TrEMBL
Match: A0A6J1BSA9 (protein EMBRYONIC FLOWER 1-like OS=Momordica charantia OX=3673 GN=LOC111004929 PE=4 SV=1) HSP 1 Score: 1416.0 bits (3664), Expect = 0.0e+00 Identity = 795/1218 (65.27%), Positives = 927/1218 (76.11%), Query Frame = 0
Query: 13 HHDGTHSKPARKFIQIDSIYIDLFSSN--HKCDDQCELFSIRGYVSDMRKKDWKICWPFS 72 +H GT SKPA KFIQIDSI+IDLFSS+ D +CE FSIRGYVSDM KKDWKICWPFS Sbjct: 4 NHRGTDSKPAEKFIQIDSIFIDLFSSSDGESDDPKCERFSIRGYVSDMHKKDWKICWPFS 63
Query: 73 DIENGHKLDDPILLVPPVFDPSFNPQRGKSHWQESSDKAADKGFHFDSCHNLGKISNSSP 132 D ++ HKLD IL + PV DPSF+ + + H +E+S+K A +GF +DSCHNL ++SP Sbjct: 64 DFDDVHKLDKLILRLSPVHDPSFDWRDVRIHREENSNKGAAEGFVYDSCHNLRSFLSASP 123
Query: 133 KAPKQDVINGRTMADNASISGRQPSNCDQKEKKLDVADRDNCTVALISQSEPGCASHGVT 192 +A K VINGRTM +NAS QPS+C +KE+KL+VA DN TVALISQSEPGCASH VT Sbjct: 124 RALKHVVINGRTMVENASNFSCQPSSCGEKERKLEVA--DNSTVALISQSEPGCASHEVT 183
Query: 193 EIEPVSGKLIPKATEESPAA-LQDGKQTHADRLNGQLT-LVSENDSTVDVPRGHYTVTFQ 252 +IEPV+ L + TEESPA L GKQT AD L QLT LV ENDSTVDV R ++ FQ Sbjct: 184 DIEPVNRNL--RVTEESPAENLLTGKQTPADHLKEQLTLLVLENDSTVDVDRAYHVTKFQ 243
Query: 253 ENGDASMESNQSTDSLSESA-ETVGNSPHHCHLGKLHRRRTPKVRLLTDLLGDNGNMIA- 312 E+ D SMESN+ST SESA +TVG+S HHCHL KL RRRTPK+RLLT+LLG +GNM Sbjct: 244 ESTDISMESNESTFESSESADDTVGSSLHHCHLEKLPRRRTPKMRLLTELLGGHGNMKKD 303
Query: 313 KHVESSPSDGSPEASVQADVRYAPKCQVTIEEDVWHSDHRRERRLPRNGKCRHQEIPSSS 372 KHVESSPS G+PE+S +AD RYA KCQ+T++E+VWHS ++ERR PRNGKC+HQEIP SS Sbjct: 304 KHVESSPSVGTPESSAEADARYASKCQITLQENVWHSGRKKERRFPRNGKCKHQEIPYSS 363
Query: 373 SVDKKIQTWRGQIESSVSSLGNENAHSGIKQTMKGPWSSYKMDGNNSLRRKKSKKFPVVD 432 SVDK+IQTWR + E+SVSSL ENA SG QT KG WSSYKMDGNN+L +KKSKKFPVVD Sbjct: 364 SVDKQIQTWREETENSVSSLETENALSGTIQTKKGLWSSYKMDGNNTLAKKKSKKFPVVD 423
Query: 433 PYSVPLVPSKVKDQCEVQA--ITENRSE-VAVDSAAILAYHNDFSSRTPHSTSLNAMESK 492 PYSV L+P K KDQ E A T+ RS+ A+DSAA++A+ N+ SSRTPH SLNAMESK Sbjct: 424 PYSVSLLPPKGKDQNETWATPTTKYRSDKEALDSAAVIAHRNELSSRTPHPISLNAMESK 483
Query: 493 SGTSKNPNSSKEPVIFEGPTNVFAWNNGMLWRGSVTQKDVETMKSRSVANPL--PSYRNN 552 S T+KNPNSSKEP+I EG VF W+ GM+ + SVTQKD++T VAN + RNN Sbjct: 484 SSTTKNPNSSKEPMIVEGSGTVFPWDGGMINKSSVTQKDMQT-----VANTFQYANSRNN 543
Query: 553 ERELHPSHNNYSEPQRDHKGIHHRGENELATFLPELEDTSKV----RINIETSNLGYPNH 612 ERELH S NNY PQRDHKGI RGENEL T LPE ED S+V R +I+ ++LG N Sbjct: 544 ERELHLSPNNYFNPQRDHKGISRRGENELPTSLPEQEDPSRVIKFRRKDIKRNHLGDLNP 603
Query: 613 PHQASDVFYGQGVRSVLNSKMANLRMPLPRQNADPHTDNSWSQLQNKDLYRRGNGKRTIE 672 P++ASDVFYGQGV SVLNSK+ANLRMPLPRQN +P TDN WSQLQ KD+Y N K+TIE Sbjct: 604 PYEASDVFYGQGVYSVLNSKIANLRMPLPRQNVEPDTDNGWSQLQQKDIYSGSNSKKTIE 663
Query: 673 AQEPLALNKRQINQKMDQASDHGTSDDIPMEIVELMAKNQYERRLPDAENNNKHVSETGK 732 AQEPLA KRQINQ++ +ASD GT DDIPMEIVELMAKNQYER L DAE NNKH+ ET Sbjct: 664 AQEPLASMKRQINQRV-EASDSGTCDDIPMEIVELMAKNQYERCLHDAE-NNKHLLETSN 723
Query: 733 FSRAVQVNNYGDVYRNGRELLQKPENLQQNAQARNG-------GKVVETRKQKSADYFSN 792 FSR QVNNYGD+YRNGR LQK EN +Q AQARNG GKV+E +KQK ADYFSN Sbjct: 724 FSRTGQVNNYGDIYRNGRGSLQKSENHKQKAQARNGGNAAICAGKVLEAKKQKPADYFSN 783
Query: 793 IRESHFDTNHPQQNHMLGCNGSIHSLVEPSNGIQYSSIGSKRKSCTEIRKCNGITVEGL- 852 I ESHF+TNH QQ MLG N SIHS +PS+GIQ+SSIGSKR+S TE RKCNG +E + Sbjct: 784 IGESHFNTNHLQQTCMLGHNASIHSQEKPSSGIQFSSIGSKRQSSTESRKCNGTILESVP 843
Query: 853 YNSKVQSSEGCMDHLPVSEQNIEAAYVWSSSSLMPDHLSNGYQKFPAHSTNSRKISSPRS 912 YNSKVQS GC+D+ PVSEQN+EA + WSSS +MPDHL +GYQ+FPA ST+ KISSPRS Sbjct: 844 YNSKVQSFGGCIDYPPVSEQNMEAPHRWSSSPMMPDHLPHGYQRFPAQSTDREKISSPRS 903
Query: 913 FQMGNTNAQNHHIHHHTNLERHGRHNNNSEAYGQRFAESSFCHCPNVAELHHNPVGSLEL 972 +GN QN+HIHH TNLE+HGRH NSEAY Q FAE SFC PNV ELH N VGSLEL Sbjct: 904 LPIGNAITQNYHIHHPTNLEKHGRH-YNSEAYSQNFAEGSFCCHPNVVELHQNLVGSLEL 963
Query: 973 YSNETISAMHLLSLMDARMQSNAPMTAGEKHKSSKKSPVPRPRKAKEFSTTNICFNKTIQ 1032 YSNETI AMHLLSLMDA MQSNA +TA KHK SKK +P P K KEFS +I ++T+Q Sbjct: 964 YSNETIPAMHLLSLMDAGMQSNASITASGKHKFSKKPRIPHPLKGKEFSGMDIRLDETVQ 1023
Query: 1033 DINQFSSAFHDEV---------CISATNASASTFQNIRGFGTNSNFSGQAVFRPQYGAKM 1092 IN SS FH EV ASA TFQ+ RGFG+N++F+GQAVF+ + K+ Sbjct: 1024 AINYSSSVFHGEVPSKSHFRSPAAPVIGASACTFQDSRGFGSNTHFAGQAVFKSRNRGKI 1083
Query: 1093 KCSDPSSWSKDQTLSKSQFRSGDLRTDDRAFPVNGIEKGVVNATNSEVL-LVHHIERSSE 1152 KCSD S+W K Q L KS FRSG L TDDR FPVNGI+KGVV A+NSEVL L HH+ER+SE Sbjct: 1084 KCSDQSTWRKGQKLPKSLFRSGGLGTDDRTFPVNGIQKGVVCASNSEVLELAHHMERNSE 1143
Query: 1153 ECKLVAHTRT---LQNKKSTSETEICSVNKNPADFSLPEAGNIYMIGAEEFNFGRTLFSK 1195 E +L+A T+T LQ++KST ETEICSVNKNPADFSLPEAGNIYMIGAE+F+FGR L SK Sbjct: 1144 ESELIARTKTLQDLQDQKSTFETEICSVNKNPADFSLPEAGNIYMIGAEDFSFGRALHSK 1203
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description |
AT5G11530.1 | 2.8e-28 | 22.26 | embryonic flower 1 (EMF1) |
Match Name | E-value | Identity (%) | Description |
Q9LYD9 | 3.9e-27 | 22.26 | Protein EMBRYONIC FLOWER 1 OS=Arabidopsis thaliana OX=3702 GN=EMF1 PE=1 SV=1 |
Match Name | E-value | Identity (%) | Description |
A0A1S3BB95 | 0.0e+00 | 77.17 | protein EMBRYONIC FLOWER 1-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC1034881... |
A0A0A0LPT5 | 0.0e+00 | 77.21 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G375180 PE=4 SV=1 |
A0A5A7VH13 | 0.0e+00 | 76.52 | Protein EMBRYONIC FLOWER 1-like isoform X1 OS=Cucumis melo var. makuwa OX=119469... |
A0A1S4DV99 | 0.0e+00 | 76.91 | protein EMBRYONIC FLOWER 1-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC1034881... |
A0A6J1BSA9 | 0.0e+00 | 65.27 | protein EMBRYONIC FLOWER 1-like OS=Momordica charantia OX=3673 GN=LOC111004929 P... |
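The summary rows above are pipe-delimited, so they can be read into records for sorting or filtering. The sketch below is one possible way to do that in Python. The `Hit` type, the `parse_hit` helper, and the 1e-50 cutoff are illustrative assumptions, not part of the database export. Only the first four cells of each row are used, and truncated descriptions are kept exactly as they appear.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity as listed in the table
    description: str


def parse_hit(row: str) -> Hit:
    """Split one pipe-delimited summary row into a typed record."""
    cells = [cell.strip() for cell in row.split("|")]
    name, evalue, identity, description = cells[:4]
    return Hit(name, float(evalue), float(identity), description)


# Three rows copied from the summary tables above.
rows = [
    "AT5G11530.1 | 2.8e-28 | 22.26 | embryonic flower 1 (EMF1) |",
    "A0A0A0LPT5 | 0.0e+00 | 77.21 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G375180 PE=4 SV=1 |",
    "A0A6J1BSA9 | 0.0e+00 | 65.27 | protein EMBRYONIC FLOWER 1-like OS=Momordica charantia OX=3673 GN=LOC111004929 P... |",
]

hits = [parse_hit(row) for row in rows]

# Hit with the highest percent identity.
best = max(hits, key=lambda h: h.identity)

# Names of hits below an arbitrary, illustrative E-value cutoff.
strong = [h.name for h in hits if h.evalue < 1e-50]

print(best.name, strong)
```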
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment |
None | No IPR available | COILS | Coil | Coil | coord: 695..715 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 247..284 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 524..546 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 247..272 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 468..497 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 384..404 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 971..995 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..20 |
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 978..992 |
IPR034583 | Protein EMBRYONIC FLOWER 1 | PANTHER | PTHR35504 | PROTEIN EMBRYONIC FLOWER 1 | coord: 6..1192 |
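Several of the MOBIDB_LITE rows above overlap (for example 247..284 and 247..272), so the number of residues predicted as disordered is smaller than the sum of the row lengths. The following Python sketch merges the overlapping ranges. The helpers `parse_coord` and `merge_intervals` are hypothetical names, and the coordinates are assumed to be 1-based and inclusive.

```python
def parse_coord(text: str) -> tuple[int, int]:
    """Turn a cell such as 'coord: 247..284' into a (start, end) pair."""
    start, end = text.replace("coord:", "").strip().split("..")
    return int(start), int(end)


def merge_intervals(intervals: list[tuple[int, int]]) -> list[tuple[int, int]]:
    """Merge overlapping inclusive intervals; the result is sorted by start."""
    merged: list[tuple[int, int]] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous interval: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Alignment cells of the MOBIDB_LITE rows in the table above.
cells = [
    "coord: 247..284", "coord: 524..546", "coord: 247..272", "coord: 468..497",
    "coord: 384..404", "coord: 971..995", "coord: 1..20", "coord: 978..992",
]

regions = merge_intervals([parse_coord(c) for c in cells])

# Inclusive coordinates, so each region covers end - start + 1 residues.
disordered_residues = sum(end - start + 1 for start, end in regions)

print(regions, disordered_residues)
```

Under these assumptions the eight rows collapse to six distinct regions.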
Relationships
The following mRNA feature(s) are a part of this gene:
GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name |
biological_process | GO:0009910 | negative regulation of flower development |
biological_process | GO:0045892 | negative regulation of transcription, DNA-templated |
biological_process | GO:0048367 | shoot system development |