Spg021589 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg021589
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein DETOXIFICATION
Location: scaffold9: 5361023 .. 5384101 (-)
RNA-Seq Expression: Spg021589
Synteny: Spg021589
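
The Location field packs the scaffold, the coordinates, and the strand into one string. A minimal sketch of how it could be split into parts is given here; the names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed form of a 'scaffold: start .. end (strand)' string."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches e.g. "scaffold9: 5361023 .. 5384101 (-)"
_LOCATION_RE = re.compile(r"^\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into scaffold, start, end and strand."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


if __name__ == "__main__":
    loc = parse_location("scaffold9: 5361023 .. 5384101 (-)")
    print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 23079 bp on the minus strand of scaffold9.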
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCTAACCTCGGCCTTTCTGTGTTTCTCTTCAATTTGGTTTCGTAAGTCTGTCCTTACCGCACGACTACTGTTCTGGGTTGATTCCGAGCTCCCTAGCACCATATAACCTCTAGATTTCATCCCCTTGACTCTGTTGGTTGCGTTGTCAATCTTCTTTTTCTTCAATCTTCATTCTGACTCTGCAGTGTACTCCACTGGTTCCTTCTCATTCCTTGGAACCCTAACTAGCATTTTCTCTTGAACTTTGGAGTCAGAACCAGATTTATCCAAGATACATAGTATCTTAGACAAATTTTTATCCATTATGCTCACCTTGTCCTCAAGGGTGGCAATTTTGGCATCGACCTGCCTCTCTTGGGAAAGAGGTCGAGGAGACCAAGGTCAAGGCTGAGGCCAAGGAGATCGGGTCCTAGGTCGAGCATGACTCTCTTCCTACGACCTCAGCTGGTGGTGGTCGCTTTTTCGAAGTTTCTGCTCCTGGATTCTTCTTTTCTTCTCCATCATCTTACCCGTTAGGTAATCGATGTGCAAACTGCCCTGGCAAGTCTTCCCCATAGATGGCGCCAATTGTTGATGTCACAATCCGGGTAGGAAGAACTCACCTGATCTTCATGTGACACCAACGGTAAATTTATTTGAAGGGGGGAAGACGTGGCCTGCAAAACAAGGAAACTTGCACACCGATGTGGTGTTTTCCACACCGCCTCCAATGCTTAAGTGAGCAAGGAGAAAGATGAGAAGCTAGAGTAAGAGTGGAGAGAGTTTAGAGAATAGAGTTTCAGATCCCTTCTTCAAGGGTTTATATACCTTCTCTGGTATTTAGGGTTTCTAGGCATCTAAAGACAGATTCGGGTTGAACCAAGTCGAACCGGACTGACCAAGGACACCAAGAACCGAACAAAGGGAGGGAAACTCAGCATGAGATTAGGCTGGCCTAATCCCTTCGGCCTACTTCTCCAGGGTCCGCTTCCTGGTTGTGTCTTTGGATTGGTGTTGTGATATCCTCGTTAGCTCCTTACGTTTCAGAGTGGTCCTGAATTACCTATAACAGAACTTTAAAAAGAATACAGAATAATAAAATTTTGTAAATCTAAGTCAATTTGATCAAAAAAATTAAAAGAGTAAGTAATATTATTAAATTTCAGATTCATTAACTCGATTTCAACTAGACATCTAGAATGATTAAAAGAGAATAGAGGATAAGACAATTTTCTTTCTTAAAGATCGATGGATGGATCTCACCCTTTTAATTGTTGAAACTCAGGATAAAAAGAAAAGAAGAATAATCGAAAAAGAGAGTAAAGTTCTCTTTGCTAAATGCTTTAAATTATACTCAGACAAACAAAAAGGATGCTTTAAGTTAAATACAGGATAAAATTTTAGTAATAAGAACTAAATGCTCTTATGACTCCAATATAAATTACTTCCAAAAGAAAGAAAAAGAAAAGGTTTTGCTCAAGAAAAACATTTAATATTTATTCTTAGAATAAAATTCGATGTTGTCCTCAAAATTTGTATGTTATTCAATTTTTTTTCTCTAATCTTTAATTAATATTCTATGTTTAAAAATTACTTTGTATAATAATAAATAACTTTTTAAAATATAAGATTGATGTTCCATTTTTGTGGTTTTATAGAATAGAATAGCTTAAGAATTTAGACACAAGTTTTTGCCCAAGATCTGAAATCTACGTGCATAAGTATTAAAAGCTATGCCAACATGAACACATAGGTCAACTGGCATAAAATTGTACGAACGACCAAGAGGTCTATGGTACAAATCCTCACTCTTCACATGTTTTCAATTTTCAATAAAAAAAATGCTATTATAATGAAATCAATAACAAGGACCATGTTTTTTCTAATATAATGCAGATTTTTTAGTGAAAAAATTAGTAAATTTAAATGAAGAGAAGTTGCCAATTTTAGGGACTAAAATAAAATATTAAAAAGTTTAAGGACCAAATCAAGAGAGATTGAATTTAGGAAAGTTTC
AACTTTTACCTTTTTATTTTTAACTCAATAAAGATGAGATAGTTGTTTTTTTCAGTTCAACAACATTTGAGGGAATGAGATTCAAATCTCTTGCCTCTGAAATTTTTTTAAAAAATTAGTCTTACGTGCATTGAGTGGATTGTAAAAATAGAAGCTTGCTATGAAATTAAGGTGACTTTACAAAAAGGACAACAACACAAGGATTGAATTTTTGTTCGGATGGTGAACTTTAGGAGATTTATTAAATGGTGAATCAATTTGTTAGAGTTTGATTTTTATTGAAAAGGGTGGGTTTTAGGAGATTTTTACTTGTGTCCATCTTAAAATTTTTGTAAATTCAGTATTGTAATGTCTCAGACCAAAGATTTGGAATCTAGATTCGACCCTTGATGACCCACCATAATTGTAGTTGTCTTAGGGTGTGTTTGGTTTAACTTTTCAAGTGTTTAATTTTGAAAATAAATCATTTTGAAAAAATTTAGTGTTTGGCAACCACTAAAAATGACTTATTAGAAAATGAGATTGATAAATAAACCATTTTGAGGGAAACACTTGAAACCAACTTTTTAAAAAAGACTTCTTGAGTAAATTACAATAACTTGTGTAGAAAAACATTTGTGAAGAAATGCAACCAAACATATGAATGTTTAAATTCTAAACAATTTTTATACAAAATGTTTTTTAAAAAATGCATTCTTGAAAACCATTTAAATTAAAATGTATTCCAAAACAGACCCTTAAATAGCTTTTAAGAGTAAAGATCGTTCCCACCAACATGGATCCTTTTAGGATGTTCTATCCTCACTCACATGTTTTCGAGTAAAATTTTAAGTTCTTATAATATTTTTTTTTTTAGTTTAACAACATTTTGGGGTGGAGGGATGCAAGCCTTTAACCTCTTGGTTGATGGTACAGGTCAATTACTGTTGAGTTATGCTTGATTTGACCAAGTTCCTATGATTAAGCCATTGAAAAAAATAGTTTTATATCTTCAAAGATTCGAACCACAGTTCTCTTGATTATTAATACAAACTTTATATTAATTGAACTATGTTGGTGTTCACAATTAAAGAATATAATTAAGTCTAAAGTTCAGTTTGGTGAAACTAAAATAATCTTATAATAAAAATTAGTTTCACCATAGTGGGAAGAATATAACTGGTAATTTTACGATGTTGAACATTATTATATAGTGAATTTAACAGCCTGAAAGGGCCACGGTGAATAACTTTTCCAAATTTACAAGGAGGGGTAAAAACTAAATGACCTTTATTTGCTCTTGTGTATGCCCTCTAGAACTGAAGAGTGCTGGGGAAGTTGTTTTGAAGAGGGAAAAAAAATGAGAGAAATTAAAATGGAGAGTTTATGAAAAGGAGCTGAAGAGAAGGTGAGCTTTTTAGCAGTGCCCATCGTCATAGAGTTTGTTCTTCAATACCTTTTGCAGGTAGTGATTGTCATCATCGTTGGCCATCTCGGCGACGAGCTTTTGCTTTCTGGAGTCTCCATTGCCACTTCCTTCGCTCGCGCCACTGGCTTTAGCCTCCTCGTAATCCTTGTTTTTCTTTGTTTTAAGACCATGTCATTTTGTTTTTGAGCTTCGCGTGTTTTTCCTCCTTTTTTTTTATCCTAACAAAATACCTTGCTTTTTGTGACGCCCGAACATATATAACAGATACATGAATGAACTGAGACCATGTTTGAATGAGAGAAAGTCCGAACATATAAAGTATGGTTAAGATAAACTTAAAAAATTTATAATAACTAATGTTCACACATAAATTCAAAGATGAGACACGTGAAATAATTTCGATTTATATTGATGATGTATGCGTGTGAGTGTGCACTCTCTAATACATAATCCTCTGATTCTCTCTCTTGTGTGTGTTTCACTCGAGGACTGGAATGCTTGTTTTTGAATTTGAAGTCGGCTCAAGGGCTTTGCAGCTTGTTCTTAAGATTGAAGTTGTCTCAAGAGCTTTGCAACTTGTTCTTGAGATTGA
AGTTGTATCAAGGGCTTCGTGAGTTCAAGGAGTCTTTTGATTCTGGAAGTCTTCAAGAAGCTTTAGTCTTTCAGAGTCTTGAAGGTTTGTTCTTCAGAAGGATTTGAGTTCTTCTAGATGGCTTGAGGTAAACTTTAATCTTCAAGTCTTCAAGAGGCCCTGAGTTTCAATCAGTTTTGCAGTAAGATTTAAAGAGGATCTGAGCTTCGATCTTCAGTTTTGCAGCAGGAGTTGAAGAGGATATGAGCTTTGTTCTTTGGTTCTACAGTGAGATTTGAGGAGGATCTGAGGTTTGATCTTCGATTTTGCAGCGGGATTTGAAGGGGATCTGAGCTTTTTTCTTCAGTTCTATAGCAGGATTTGAAGAGGATCTGAGCTTCAATCCTTGGTTCTGAAGCGGGATTTGAAGAGAATCTAAGCTTCATTCTTCATTTCAACAGCTAGATTTGAAGAAGATTGGAGCTTCAATCTTCGGTTTTGCAGGAGGAGCTTCAATCTTCAGAGTCTTCTGAGAGCTTCCTAATCTTCCTCTTCTTCCTTTCTTTCTGCCCCCTCCAAATGAGAGGCAAGACCTCCTATTTATAGAGTTTTCAAAAGGCCTTCGTGGGCTTGGACTTGATTGCATTTGGGCCCATCCACGGGGTTGGGCTTGATTGCATTTGAGCCCATTCATGGGTTTGGGCTTGAAATTTTGGGCTTCAGGCCCAATTACCTTAATGAATTTGGATTTATATTAAACCTAATTTGATGAACTTAGTACATTTTAATTTCATGAACTTAGAAAATTACGTGAAAATTTTCTATACCAACAAATACCTATACCAACATGGTGCATCTTCCTTTTCGGTGGCTCAATTATAGAAACTTATTCAGAGTGAAGTGTGTTTGACTGGGGAATGGTGACCTCCTAAGAATTTTCCTAGGAAGCATGTGGTGAGGAGGACAAAACGTCGTAAAAGAACCTTGTGTTAATCTGTGTGGACAATCTTCACTAAGAAATTGAAGATAAGTATGGTGACGTTGAAGAGTCCTGGTCCAAATCTTGAACATGGAATGTTACAATTTCATACTTTTGAAGTTTGTTAGCTGGTTTAGAAGAAGGAGGATTAGATATTTATTATATTTATCTAAACATAAGGAGACTTGGTATTTATAGGAAGTTCTAAGTTTTTGGATTTGGTGATAGTTTTACGGCGGAGAGAAAAAAGGCGAAGAAGATAAGATACTTGCTTGCTTGTCTTATCATCTACTAAGTGAAAAGCTTAAGTCAATAGGTTTAGAATGAGAAGTTGGGAAGCTTTTATTTTCTTCTCTTTCTTATAGTTTTTCTTTTGTGTTACCTTGGGATACTTGTATTTGAAAACTTAAACTAACATGATAATTGTCGTTTTCTTTCCCCTCTCTCCATATATTTAATATTTTATGAAAAATATATATTTGATATACTACTCAGTTTGAAATTTCCTCTCCTCACCACTAAATACAGACAAAAAATTTGGTGATGGATAACAAGAACTTTTTGTTGTTTAGTCGATAAAGAAGGATCATTCATCTTTTATTGATATTCTTTATCATTTCTCTCTCACTTGTGGGTTTCAAAATTAGCACAAGTTTAATGAGTTACAACACAATTTTAACAAGAAATAAAATTATGTTAAATTATATCATTGGAACTCAAGAACTTGAGCTAAAAGGTCAATCTTTTCTTTATATTCTTTACCTAGGTTGCTTTTTTGTTGCAGTTGGGAATGGCTGGAGCTTTGGAAACTCTATGTGGGCAAGCATATGGGGCAGAACAATATCAAAAGCTTGGAGTTTATACTTATAGTTGCAAAATTTCTCTCATTTTGGTGTGTTTTCCAATCTCCATATTGTGGCTCTTCACAGATAAGTTACTAATTTCCATTGGTCAAGACCCTTCCATTTCTTATGTGGCTAGAAAATACTCAACTTTTCTCATTCCAAACCTCTTTTCCTATGCAATACTTCAGTCTCTTATG
CGCTATCTCCTCACTCAAAGCTTGATCATTCCCTTGCTAGTTTGCTCTTTTGTCACCCTCTCTTTGCATATTCCCATTTCTTTGGCTTCTTGTATTCCATTTTAACCTCAAGGTTGTAGGAGCTGCTCTGGCTCTTGGCATATCCTCTTGTCTGAATGTCATTTTGTTAGGGCTCTACGTCTTCTTCTCTCCATCCTGCAAGAAGACTCGTGCTCCGTTCTCAAGGGAGGCCATCTTGAGCATTCGTGAGTTCTTTCGGCTCGCCGTTCCCTCCGCTGTGATGACTTGGTAAGACGGTCGATTTGGTCATTTGATTCGATTTGATTTGGTTTGATTTGTTTCGTTAAAACGTGTTTTTGTTTTGTGTTTTTGTAGCCTTGAGTGGTGGTCATATGAGGTCATTCTTTTGCTTTCTGGGCTTTTACCGAATCCTAAGGTGGAGACTTCTGTGCTTTCTATATGGTATATTATTTGTGCATTTTAATTTTCTCAAGCATTGAGAACTTAAATTTTTCAAAAGATAAGTCCAAACCTTTAACTCTTTTTTGGTACTTTTTGGACAGTTTCTCAATCACTTATTTGCATTTTTTCATACCATATGGGTTGGCGGCCACAGTAAGGTAAGTTCTAATCATTTCTCATTTTTGTTGTGGTCAAAGATATATAATTTTTCTTTTTGTTTTTTTTTTATAAATGAGAAAAAATTTAAAAAAAAAAACATTATCAAATAGTTCCAGCCCTTTTGAGTTACAATAATATGAGTTATAATCTTCGTTTTTGCATACATGGTCTTAGTAATATTATTGGAAAGGAAAAGTTCGAGGATGCCACTATCTACATTACGGTTGAATCCATAGACATGTAAAAATGTAGATAGAAATGATGTGGCGAGAGATTTATTACTTGGATGTACATATAGATTTTGAGTAATTTGAATGATTATATGTATAGCACAAGGGTTTCAAATGAACTAAGAGCTGGAAATCCAGAGGCAGCTAAGGTGGCAGTGAGGGTAGTGGGAATTCTTGGCATCATTGAATCAACGATTGTGAGTGTGACTCTCTTTGGGTGTCACAATATCTTGTGATATGCATTCACAAGTGACAACCAAATTGCCAAGCATATTGCTTCTATGTGGCCTTTAATTTGTCTTTCCATTCTCATTGATAGTTTCCTTGCTATTCTTACAGGTATGTATCTCGGTATCTCTATATTCTAACTTTTAAAACCACTTATAGATTTCCTAAATTTTTGTTAATCACAATTTTGTTTCTATATTTTAAGTTTAGGTTTAATTGGATTATCATAGTTTAAAAAGTTTCAATATTGGATTTTATGATTTGGTTTAAAACTTTGTTCTTATAGTTATAAGTCGATAAGATATTGAATAGGTGACACTTTTCTATTTATGTGACAATAAAGTAATGCATTCAATTTGGTTAACTATGTTTAGTTATACGGTATATATTTTAGGGTTTAAGACAATGTGTATAAATATATATATATATCTATAATCTATATCTATACTATATTAAAAAGTGAGGATCTTCAAAAAACTTTTTTGGACACTTTTGTCCTTTCTTTAATTTACTAATTTACAATTTTACCATTAAAGAATAATTATAAATAGTAAATTAATTAATTAATAATTAATTATCAATAATAAATGTGTAGGATTTACTAAAAGGTTGGAATGCAAAAAGTTTTTCCTACAACCTTTTTAAATTATAAGTCATAATGAAATTTTATAGGTTATAGAATTTAAAAGGTTTTGTAACTCTCATCTTCTACTACTATAAAATGGAGTTTTTTTCACCATTTTCTTCACCAACACTTAGTTTCATTCTTTCTTTATTTTTGTTCTCTTTGTTTTCTTTGCTCTTCAAAGTCTCTTCAATCTTCAAAACATCTCGAGATGTTGATGTGGTGTATTCAAATTGCAAAGGTTTCATGAGGGGTAGTATATTCAAATTGCAAAGGAATGCATGCATGTGTCAAAT
TTATATCTTAGATCCTAGGCTTTTGACCTTGCAACTCCAATTTGATGAAGTTGAAGAATTGTATTTGTATATCAATATTTATCAAGTGAGCAAATCTATTTTACGAAATGAGTGTATATCTAAGATTTTTTTTTGTTACTATTCATTTTTGTTATTGTTCTTATATCATTCATGTTTTTTTTATTAATAATATTTGTTGAAAAATAAATTTTAGTGCATCAATTACAAGTTTATATTTCTTTTGTTGAGAAAAAAAATGTCTAACCTTAATGCCCCACGGAATTTTCAATTAATTATAAATAGTAAATTAATTATCAATAATAAATTAAAAATTATTTTTTAGACCATTTATGTTATTTGATCTATTTTTATGTTTATTAAGCATGTCATTTAATTCTAACGTTTAATAACATTAATTAATAAGAATGTGAAAAGGTAGAATATGAGAAGCTCTATCTCATTATTTCTATAATTTATTTTTCAAAATTTATGAACTTATTTTTAATTATGTTTAATAGTGAAATTCCAAAGCTTTTGTCACTTTTATCTCCTACAACTATGTATAAAGCTCTCTTCAACAATATCTAATACAAGGTTTTGAGTTTTCTCTTATATTTAGGTTTGCTTTTATTCTTTTATTCATTGTTTTTGCTTTGCCTCATAACTTTTGTTGAACTCCTTAATCTAACATTCTTCTTGATTTTCTTTTGTCGCAATATTGATCCTTGATAATTTAAGAAAAGACTTGAAGATATGCAAAAGAGAAGCTAGAAGAAATATTATTATTGTTAGGAATATACATTTTTTTAAAAAAAAAAATTGTTGGTAGTAAATGAATAAATTCATTACAATAATTAAAAAAATGGTACTTTCTCAAATTCTTCAAATTCCACTTTTTTTTTAATCTCTCTCATGTTTATGATTATTTATTTTCTCATCCCACTTTTTGAAATGTTTTTTCAAATTCTTCACATTCCACTTTTGTTCTCTTTCTCATTCTTCTCATTATTTCTCTCTCTTTAACTCTTGAGGAAGTATTAGTTGATAATTAAAAAATTTGAATGTGAACCATTAGAACATGAAACATTCTAATATGAATAACTTGAATATGAAATTCTGTTTAAAAAAAAAAACTTGAATATGAAAAGTTTATCATTTATTTTATTAAGTAACTATACAAATGTATGAGTTTAATTATCTTAAAATTTTATGAATTTATATTTAATTACCATGAATAAGGAAATTTTAAAAGTTTTGTATTTTTGTTTATCTTCTACCACTATATAAATAACTCTCTCGTACTTTTCTTTAATACACAAACATGAAAAAGTTAGAATATTAAAACTTTTGTCAATTTATTCCTAAAAATAAATTATAAAATTTATGAAGTTCTCCTTAATTATATAATGAAAATTTAAAAGTTTATCAAAATTTTAATAATTTAAAACTTTATAAATGTGTAAAATACTTAATAAATCATTTATTATGTTGTAAATATAAATGAACCGAAGATCGGATATATGCTTAACTTTTTTTAAAACAAATAATATTAATATATAACAAAATTAAATGAAAATATACCAGGAATATGATAAAATAATGAAAACTACCACGTGCAAAGCACGTGTTAAAAAACTAGTATATCTAAAAGTACGAATGTGGAAGAAACTTTTTTTGGACAAAATTACCCTTCATTATACAAATAATTCCCAAATTGTCCTTCATCTCTCTTAACTCTCCCATTAAGTTTTCATTTTTCTCTCCCCTCACACACGTTTTTTTGCAAATTCTTTCTCACTCTCCTCCAATGCCACACAAATGTCATAAATATTGTTATTTAACAATGAGTTTTTTTTTTCCTTTTACGAAGATAATTGGGAAACTTTATGTGGGACGAAAAAAAATAAAAAATAAAAAATATATTGACAAAATGAAATCGAAAAAGTATTTTTTTTTTCCAACCAATGCTACTTTATAGTGGGTGTGAGAAAGACCTAACA
AACAACAACACATCAAATGATTTATGAAAAAAAAAATAAAAAAATAAAATTACACTAAAGACATACAAATTTGAATTTATTCATAAAATTACACTATCCTGATTAATTACTATTATTTATGAAAAAATTACAAAAATGAAAATGAATAATTTAACATAAAAACTATGAATTTTAAATTTTAAAATGTTAAAACAATGTTTTTTCCTAAAATAGATATAAAATAAGTCATTTTTTTTACAAAAAAAAAAATAAATCCTTTTATACATGTAATGTAAGAAATGGAGAAGTTAAAAAACTTTTGGTATTCCAAAATAATTATGGATGAAAACCAATAATAGTCTACTTTTACATTTATTTCATACATCAATAATTATGGATGCATTTAAAATTTCATTTATGTAATTCTGTTTTCTAACCATCCTAAAATTAATTAAAAATAAAAAATAAAAAAACAAAAAAAAGGTGAGAGTTAAAGAATGGTAGAAGCAACGACAGGGTAAGTGGATTTTTGTTTTGTTTTTTTTTTTTTTTTTTTTGAGAAAGTAAGTGGAAATTTTCATGTCATTAATTTTGATTTTTATTTAAAAAAAAAAAACCGAAAGACACTTTATTAGTATTATTATTTTTAGACAAACTAAAAGACACTTTAAGATTACAAAATAATGGCAATAATTGCAATAATATTAATATTTAGGTTAGTGGTTAGTGGAATAGTAATCATTTAGTTTTAAAATACAAACTTTAAATTTCCTACATTCAATGCACTCTACAACCCCCATATGCACCACGCCATCTCTATTGTCTTCATGCAAAATTGGGTGATTTTCATTTAAACCTCACCACTTTGTTCATGCAAGTTTTTATTCACTCATATAATCTTTCTATCCTTCACTAATACCACATTGATAATAACTTATATTAAAACCTATTTATCTTCCATAACCTTTTCATCTTCCTCAATCACTCAACTACAATGACTTTTCAAATTCTCAACTTATCTCTATATATCGATAATTTTAGGTCAGTCATATAGCGTTCATCATTCTCCTCTACTCTCTATTGTCATTAGGTTTTCTTTGTTCTATTTTCAATTTCTATTGTTATATCAAAATGTCTCTAGAAAATTATATGTTGCTCAAAAATTTGGAGCCGTTTCAAAAGGAATGGTGTGTAAAAGTCATGGTTATTCGAAAAGGGACAGTGGAGACATACACTAACCATAAAGGTCCACTCCAAATATTGAAGCTAATTCTGATTGATGAGGAGGTAAAATTTTTTTTCCAAATAGAATGAACATGTATTTTCTTTTCTTTATAATAGTAATATTCAATTAAACCTTTTGTTACAATTTTTCTTTTATAAGTTATGTTATATTATGATATTTCACTTTTACAGGGAACTCAAATCCAAGCAGTCATGTATAATGAAAATATTTTAAGATTGGAACACACTCTGAAGAAAAACGAATCATATTTCATTACCAATGGAAATGTGAAACCGGTCGATAATAGATTTCTCAATGTCAACCAAGATATTGAATTATCTATATCAACTAATACAGAAGTTCGAGAATCCAATGAAATAGTTTCAACAAAAAATGTTGTCTACAATTTCATTGAATTTAATCAAATTATCAATGCACTGGATAAAGAGAAGACAATAGGTATGTTTTATTTTTTAAATAACATTTAAAGTCTTTTTAGAGCTTTAAATTTATATTTAATATTCAATATTCTATGTTATTACAGATGTCATTGGAATTGTGACTGTCGTGAAGCCAATAATCAATATCAAGAGTCAAAGAGATGATGCTCGAAGCATAAAAAAGAGGGATATTATTTTGATCAATTCAAGGTATGAACCATTTTTTTTGGTGATATATGTTATCCAAGAGGGAAAAATAAGTTTAAAACCATATTATTAATTTCTAAAAAATAGTTTTTAATAAATTGTTATATCGAACATGTGTTATTCATGTACAGTAGCTTCACACTTTCATT
TATTCTTTTGTGTGTGTGTTTTTTTAGAGAAAAAAATATTTGTTGTTTGTGTCATAACTTTATATTGAGTCATTCCCTCCAACATTTATAATATGTGTTTTTTCAATATTCAAATAGGTTGGAAACTATGAAGATAGGCTTATGGGGAGATTTGGCAGAAAATGAAGGACAATTACTTGAAAATGAAGTTGACAAAAAACCAGTTGGAGTTTTTCTTAATGTAAAGGGGGACTTGTATGAAGGTATTATAAAATTGGTTAATATTTATATAATATCATTTTCATTATTATTACTATTGTTATTGTTGTTATTGTGTTTGTAGATGAATTCAGTTTAGCAACTACAATGATAAGTTCCATCCAAATAAATCCTGAAATTCATGAAGTTGAAGAATTACATACCTGGTATATTATGTAGTTTTAATTAATTTAAAATTTAGAATTTAAGACTATTTTATTTCATTGTAATTTCACATGAATATTATATTTTAAAAAGGTGTAAATCATCACTATGTGGAAAAAGTATTTCGAGCATTCTCCCTACATCAAAGAGGCTAAAGAAAGCTACAATGGTTGATTTGAAAGATATAATTGAAGGATAAGTGAGCCAATTGCATTGTTCATTCAAAGCAACAATAGTCAAAGTTCTCAACAAAGAACAACCATGGTATGAAAATTGCAAAACTTGTAACAAAAGAGTTTACTCATCATTGGATAGCGATACAACTTCATGTCTCCAATGTAACAATCCAAAAGCTTTCTTTGTGAGAAGATCTTTGTTAAAAATTGTTGTTTCAAATGGAGACATCGAGGCATATGTTACTCTGTTTGATGCAGCAGAATATCTTATGGGATGCACTATTATTGAATATGCCAAAGAAATGAAAAAGGTGAGAAAGATATAAAGTTTGGTTAACTTTTTCTTTCATGTCTTCATAACTATATTTATTTTTAAATTATTCATTCTTAATATTTTGTATTGTTTTAATGTTACAGGGTGAGAATAAAGAAAAATGTCAGTTTTACAAGAATTTGGTGCTGAGTCAAGGCAAAGAATACATTTTTCTTGTAAGAAATGAAAAAAAAAAAATGCAATCAATCACTCAAAGAAGCAAATCATTGCACAAGAAATACAAAAATTTGAACTTGTGGAAATTGATGGAGATGAAGAAGTTCAAGAACATGGAAGAAGCAAGTTGAAGAAAGTTAAGATTGAGAAGTAACAATCTTGTGTTCTAATATTATCAATGTTCTTCTTCAAATTCTGGAATATTCATGTCAAAGTTTACTTCATTTTAATATATTTTATCTTTTTGTCTTTCTTCAGTTTAATATATATACATATATTTTTGTGAAATTTACTAAGTTTTTAACATCATGCATTGGGTTCTTTTCATACTTTATCAACAAACATGAATTGGAATCTAATATAATATAATGGTCATTTCTTTCAAATTGTTGGTTTTAAAAAACACTTAATTATAATTATAACATTAAAAATAATTACAAAAGTAATTATCATAAACATAAAAAATTAGTATAAACATAGTAGACATAAAATTACGTATTGAATAGAAAATATGTTATATTAATTATAAAAATACAATATCATACAAACATTGTTATTTAACAATGAGTTTTTTTTTTTTTCCTTTTACGAAGATAATTGGGAAACTTTATGTGTGACGAAAAAAAATAATAAAAAATATATCGACAAAATGAAATCGAATTTTTTTTTTTTTCCAACCAATGTTATTTTATAGTGGGTGTGAGAAAGACCTAACAAACAACAACACATCAAATGATTTTTCTTCGATTATTAATTATTAAAATTTGAACAAGTTTAGATCTTAAAAAAACATAAATTTTATAAATTGAAATTAAATAAGAAGTCAAATTGTAACAAAACTATATATATTCAATGACCATAAACTTCTTTTCCAAAAAAAAAAAAAAACGTAAACTAAACACATACAAATTTGTATTTATTGATTCTTCATC
AATTTTTTTTTTCCAATTTTCATATCTTTATTTTACCATTTTTACACTATCATCAAACAATTAAATAAATAAACACTATCCACAATAAACACCATCAATAATTACTATGCACAAAACACTACCTAATATCTACAATCCACGACACACTTAAAAAAAACCAATGAATAATTGCAATAATTAAAATAATATTAACATTAAAAACCACGCACTACACTTAAACATAAATTCATATAATTACACTATCCTGATTAATTACTATTATTTATGAAAAAATTACAAAAATAAAAATGAATAATTTAACATAAAAACTATGAAATTTAAATTTTAAAATGTTAAAACGATGTTTTTCCTAAAATAGATATAAAATAAGTCCTTTTTTTTTATAAATATAAAAGAAAATAAATCCTTTTATACATGTAATGTAAGAAATGGAGAAGTTTCAAAACTTTTGGTATTCCAACCCATTCTAAATTGCATAAAGTTTATTAAATTTTTTTATACATGAAGTTAAAAACCAATGATAGTCTACTCTTACATTTATTTCATACATCAATAATTATGGATGCATTTAAAATTTCATTTATGTAATTCCATTTCCTAACCATCCTAAAATTAATTAAACATCTCTCTCAAAAAAAAAAAAAATAATAATAATAATTAAACATAAAAAAAAAAAAAAAAAAGTTAAAGAATGGTAGAAGTAACGACAAGGTATTGGTTCAAAAAAAAAAAAAAACGACAAAGTAAGTGGAAATTTTCTTCTTTTTCTTTTTTTTTTTTTTTTTTCTTTTCAGAAAGTAAGTGGAAATTTTAATGTCATTAATTTTGATTTTCATAACATTAATTTTCATTTAAAAAAAAAAACTGAAAGACACTTTAAGATTGCAAAATAATGAATATTTAGGTTAGTGGTTAGTGGAGTAGTAATCATTTAGTTTTAAAATACAACCTTTAAATTTCCTACATTCAATGCACTCTACAACACTCATATGCACCACGCCATCTCCATTGTCTTCATGTAAAATTGGGTGATTTCCATTTCAACCTCACCACTTTGTTCAAGCAAACCAATGTTTTTATTCACTCAGAAACTTTCTATCCTTCACTAATACCACTCAATCACAAAAGTTGCAAATTGGGTCATTCACATGTACATTCTTCTACATTATCAACCGATGAACATTGAAAAATGTTGTTCACAATTGTGATTATGGTAAGTCACATTCTCAAATGCTATTGTCCATCAAAATTTTCAATTTTCATTTTCTTCCACCAATCTCCTTTCTATTCAATTGTCTAATATTTATTTAAAATTTACAAAATTCATACATTTTTGGATTGAAAATTTTGTTCAAATCAAATTAAATAACTATCTCTTATTTTTTTTTCCATTTTATTGGAGATTACAGTATTATTAATCGATATTTTAAATTATATATATATATCTTTTTTTTTTGTTCGTTTTATGATGCAGTGCATGTCCACAAAATTAAATTCTGTGTACTTGCTTAAAGTTGAGCTTGGATTCTCAAGAAGATAGATTCTACGAGGTATAACAATCCTATGTTGTTTAAAATTTATGTTTTATCTTTTGATGCTTAGTTTTTGCAATATTTTAGATTATTTAAGTTACTTTTCCAGCAACTCATTACCAAATTTATGCAAGAATAAATTTATATTATATTATATAATAAACAATATATATTATACTATATATTGTTTTGCTGGAAATACAAGAATAAATTTATATTACATATGTCTAATAGTTGCTCCCCTAAACCCGAAACCCTAGAAATATTGTACACAATTGTGATTATGGTAAGTCACATTCTCAAATGCTATATTGTCCATCAAAATTTTCAATTTCCATTTTCTTCCACCAATCTCTTTTCTATTGAATTGTCTAATATTTATTTAAAATTTAAAAAATTCATACATTTTTGGATTGAAAATCTGGTTCAAATCAAATTAAATAACTATCTCTTAATTTTTTTTCTATT
TTATTGGAGATTACAGTATAATTCTATAATTTGCTTAAAAATTTTCTCATAATCTTATCTTTCAAATTATTCACAATATATATATGTGATTTTGAATAATAAATTATTCAGTTTTTTTTGGAGTTCATGATTTAAATTAACCTTTCAAATAAAATCCTTTTGTTACAAATTAATAAACTTATATTATAATGAGTTTTGAAATAAAAAAAAATATTTTCATTTTTTAAAGAAAAATGGTTATTTAAAATGAAAATTTACAATACTGCCGCTCAACTATAGGGTTTGTTACATCACATCCTTAAAATGTTAACTATACCAATCATCCACTTAAATTGAAAAATTTTTGTAATTAAACCTTTAATATTGTAATATGCATGAATAAACCACTCTTACGCAATATGTGTGTTATTTGAATCAAACTCTCAAACTATAGGATATATCCTAAAATAATAATAATAATAATAATAATTTTTAAAAACAGAATCATTAAATTTATGCTATTAATTAATAATAATAATAATAATAATAATAATAATAATAATAATATTATTATTATTATATATACTTTTAAAAAAATTGGACTAATTTAATAAATTAATTATTCAATCAATACTTTACTTTGCCTACAAGATGTCAAATTTTTATACTTCGTTCTTATATTATTAATAATTAGGGAAATTAACTTTATGAAATATAAAAACTTAATTTATCAAATATGTACATAAATTTATTCCTTTTCGGATTCACACAATTAAAAAAATATTGTCATAAAAATTCAATAAAACAATACATTTTTTTTTATGCTCAAACAATACATATTACAATATTGATTAACTTTAAATTTAAATACATTCACTGAAAAATGTACAATAAATTTAAATTCAACTTATTCTTATAAGTTTATATCTATTCAAATACATGCATATGCATAAAAAAAATATCAAACAAAAAAAATAATAAATAAAGAAGCAAACATAAAATAGGTGTGTGATTAAATATATAACTTTATTTCTATTCATATATATATATATATATATATATATATATATATATTTTAATTTTACTGGATGGAAATTTAATTTCAATGTTATAATATAGCTTTCAAGAAACTTCAACAACCAATTTGGTGATAGACATGTCTTCTAAGGAAGGTAAGAAACATAGACTCAAAATAATTTACTTTAAAATTCAACAGTTCTTACTATTATATATATTCAAAGTCAAAATTTTAATTATCAAATGTTTAAATATGCAGATATTACAAAACAACGAGAAAAAAAGAAAAACAAAACGTATGGAAAATTGAATGATACTGACAGAAGAAAAAAGATTGACAACGTGTTAGATGCAAAAAGGAAAAGGTTAAGTGGTGAAACTAGCCGTGGAAATAAGAAGACAAAAGTTGATTGGTTGCTGAGAGCCCAACAATATGTAGATCAAGAACCAGGTTTGTCAAAATTATGCATATTCCTTCACATAAGTTAGTACTATTAGCTCTAATATTTAATTTACATAATATATATATATTTGTTATAGGTCAAATTTCATATTATCCGTTATGTTCGGCATTACATGAGTTGAAACAACCATCATCTTGTGTTTACTGTGATGCTAAGAAATTTGAGTATGAACCACCTTCCTTTTGTTGCTCTATAGGCAAAGTGCAATTAGTAGAGACCAATGTTCCTGAACAACTACAATCACTATTTACTTCTGAAACAGAAGAAGCAAAAATGTTCAAGAAGAATATTAGAGCATATAATAGTGTTTTCTCATTCACCTCTTTTGGTGTTAAATTGGATAAGGAACTAGCATCTTCAAAACACGGTGTCTACACTTTTCGTGCTCAAGGACAAATATATCATGAATTACCTTCTTTAATGCCCAAAGATGGTCCAAAATACTTTCAATTGTATTTCTATGATACAACAAATGAGCTTGAAAATAGGATGAACATTTTACTAGAGGCACAATTGGATGAAAGTATTATGGAAAAGTTA
ACGAACATCCTGAGAGAAAACCCATATGCTAAGTTCTTGAGAAGATTAAAAGACATGTCATCTTTCCAAAATCTACAAATCCGTATTGTTAGCAATACAAATTTAGATCAACGAGTGTATAATACACCTACAGCAGATCAAGTTGCTGCAATATGGATTGAAGGAAATGATAGTAATAGTCCATTTGAAAGAGACATTATTGTACATGCTCATTCAGGAAATAAGCAAAGGGTGAAACACTACTTTAGTTGTTATGATTGTTTACAATATCCTTTACTATTTCCCATGGGAGAATCTGGATGGCATCAGGATATAAAAAAGAAAGAAATTGGAAATGACATTGGAAACTCAAGCATTAGTTATAAGTATCCAAGAAGGAATCCACACTTATGTGATTCAATTGATGAAGTGCTTCAAGCAGAACGTCAAGGTAAATCTTTCATTTTAGTCCACAATACATTTTCCATTCAAAACTTTATTTGGATAAGTATATTATCTTACACAAAATTATTTTAAACTATAGGTATTTCTGGACGCAATTTTGGAAATGTATCCTGTAGGGAGTATTATTGTTACAAACTACAGATAAGACCATCTCCAAAATCAATAGTTCTTTTTGCTGGAAGATTATTACAACAATATGTTGTAGACATGTACATAAAGCTTGAAACAACAAGACTTGATTTCCATCGAACACAACAATCTCTAATTAGAGCTGAATTATATCAAGGAATAGTTGATAGTGTGAACATTGGGGAAACGAGAGGCAATAAAATTGGTAAGAGAATTGTACTCCCAGTTTCTTTCATTGGAGGTCCAAGAGATATGCGACGACGTTATTTAGATGCTATGGCATTGGTGCAAAGATTTGGAAAGCCAGATTTATTCATAACTATGACATGCAACTCTGAGTGGAAAGAGATTAAGGATGAATTAAAACATGGACAACGTCCTCAGGATAGACCAGATTTGACTTCTAGAATATTTCGCAGCAAACTAGAAGATCTTAAAGACCAGATATTTAAAAAATAAATATTTGGTAAGGTTGCAGCTTATGTGTATGTTATTGAATTTCAAAAGAGGGGACTACCACATGCACATATGCTTATTATTTTAAATCGAGGCTATAAGATCACAAATGCAGATGACTATGTCAAATATGTGAGTGCAGAATTACCAAATAAAGAAAATTTTCCAACATTATATGAGACTGTTGTCCAGCATATGTTACATGGACCTTGTGGAGAAGCAAACAAGAAAAACACGTGTATGTTGAATGGAAAATGTAAATTTCGTTATCCAAGACCTTTTTGTTCGAAGACAATGCAAGGAAAAGATGCATATCCTATATACAGAAGAAGAAATGATGGTGCTCAGGTTAGTTTTTACACAATCTTATTTTGTTTTTGTGGACAATTATTTAGATATATTATAGAAAATATGATTATTCTAATACATCTATGAAATAATTTATGATAATTCATTCATAGGTCAAAGTACGAAAGGCGATGTTAGATAATAAGTGGGTAGTTCCTTATAATCCATATTTGTTGTCAAGATATAATTGTCATATCAATGTCGAGATATGTTCTGGATTGAAGGCTGTTAAGTACCTCTACAAGTATATTTATAAAGGACATGATAAAGCAGCTGTCTCGATATCATGTGATGATGAAAGTAGAATTGTTAATGAAATACAAGAGTTTCAAGATGCAAGATGGGTCTCAGCTCAAGAATCAATGTGGAGAATATTTGAATTTAAGTTGCATGAGATAAGTCCAGCAGTCATAAACCTACAGTTACACCTTCCAAACCAGCAATCAGTTACATTTTGGGAAAAACAAAACTTGCAAGTTGTTCTAAACCAAGATCATGTCTCAAAAACAATGTTGACCGAGTATTTTGAAATGTGCAAAACATACGAAGATGCAAGGAAGTTTATGTACAAAGATTTTCCAGAGCATTATGTTTGGAATAAACAGTCAAAAACATGGTCATTG
AGGAAAAAAAGACAAGTCATATCTAGAGTTAATGGAGTCAATCCAGCAGAAGGTGAAAGATACTATTTAAGATTGCTATTAAATCATGTCAAAGGACCAACATCATATGATGATTTGTTAACAGTGGATGGAACACTATATTCATCTTTTAAAGCAGCAGCACAAAAACGAGGATTGCTTGAATCAGATAATAGTATCATAGAATGTTTGGACGAAGCATCCACATTTCAAATGCCAAAAGCATTCAGAAGACTTTTTGCAACATTATTGGCCCATTGTGAACCTACAGATATAAGAAAGTTATGGGACTTGTACTTTGAAACAATGTCAGAAGACTACAGGAAATTGGAAGACACTACTTTAGAAGATCAAATTCAACTCACTTTGATAGATTTGAGAAGCTTTCTACAGTCTATGGGAAAAAACATGAGTGGTTATGACTTGCCAGCTATACACTCTAATTCATTTGAAGCAAGTTCAACTAATAGAAAAGAAATACTCGATGAGGAAGCTATAGAGATCCCAAAGGAAGATATTGATACAGTTTCAAAATTGAACTCAGAACAAAAAGTTGCATATGATCAAATAATGGGAAGAGTGACAAGCAAAAAACCAGGTGTGTTCTTTGTTGATGGTCCAGGAGGGACTGGAAAAACATTTTTATATCGTGCATTACTGGCAACAACAAGAGCAAAGGGCATGATTGCATTAGCAACTGCAACATCAGGTGTTGCTGCTGCAATAATGCCAGGAGGTCGAACAGCACATTCACGATTCAACATACCATTGCAAACAACAGAATCATCTATGTGTGTCATATCCAAACAAAGTAGTAAGGCTGAATTGCTAAGAAGGGCAAGAATTATTATATGGGATGAAGCACCAATGGCAAAGAGGTTTGTAATTGAAACAGTTGATAGAACATTTAGAGATATAATGGATAGTTTAGAACCATTTGGTGGAAAGGTTTTTGTGTTCGGTGGTGACTTTCGCCAAGTTCTTCCAGTGGTTCCACGTGCTACAAGATAACAAACCGTTAATGTTAGTCTTGTAAGGTCATATCTTTGGGAAAATATGGAAAAACTTAAGTTAACAACAAACATGAGAGCTATCTCTGATGAAAAGTTTGCAAACTTCCTATTGCGAGTTGGTAATGGAGAAGAAACAAGTATAGAAAATGATCTCATTTCCTTGCCAGATCAAATGCTAATTCCTTTCGAAAATGAAAAAGATTCTGAATTGAATTTTATCCAGAGCATCTTTCCTAATTTGATAGGCAATGCTGAAAGTTCAGAATATATAACAAGTCGAGCAATTTTGGCGACAAAAAATGAATATGTTGACCAACTGAATGAAAAGTTGATTCATAGATTTCCAGGAGAGTCAAGAGTTTTTGTGAGTTTTGATGAAGCAGTTGATGACACAAACAACTATTATCAAGAAGAGTTCCTTAATACATTGCTTCCTAACGGAATTCCCCCTCGCAAGTTAGAACTTAAAAAGAATTGTCCAATTATGCTTATGAGAAATTTAGATCCAACTAATGGATTGTGCAATGGGACAAGAATGGTATGCAAAGAATTTGACACTCATGTTATACATGCAGAAATCAGTATTGGTCAACATGCAGGTAAACAAGTTTTTCTTCCTCGCATTCCTCTAAGCCCAGCAAATGATGAAGGTTATCCGTTCAAATTCAAGAGGAAACAATTTCCTATACGACTTTGTTTTGCAATGACAATAAATAAGGCACAGGGGCAAACTATACCACATGTGGGAGTATATTTACCAGAACATGTTTTCTCTCATGGACAATTATATGTCGCATTATCTAGAGGAACTTCAATGTCAACAACTAAGGTATTCATCAAATCGGATACTCCTATTGGAAGAAATTCAACTTTTACAAAAAATGTTGTTTACAAAGAAGTACTTCTACAAGAAAGGTATTATGAATTACAATTATAAACTTTGATACATACATAAGTTTTTCCTTCTAA
ATTTAAATTTACTTTTGATGTAGGGAAGAAAATTAGTAGACAATGTACAACGATATGAGCATCTTTGCAACATATAAAAGGTATCCAATTCGATAATAAGTGTATGAGTTTTGAACGTATGATCTACATTTTCATATGTATGTTAGAAAAAAAACAATTAATCAAATTAATTAATTAATCAAGTTGTTTAAAAATACTTATGTATAATTAGCATACATCCACTGTTTCAAACCTTCAAAATGTTTAAACTGATGTATTACATTTGTTTCTTTTCCCTTGATTGAGGTTACAGGCACTATGTATTCATGGGTCAAAAGTTCGTTGCATGAAGCCAATTGACGTCCTTTGCCAGTTATTTGATTACAGATTTGTAATTTCTTCATTCATTCTATCTATACCTATATACTTTTGAATTTATTGTAACATTTCAATGTCCATCACTTTAAATAGAATTATTCAAAAAAAAAAAAAACTTTTCAAGTACTTATAGACCAAAATAAATTAAAAATATTATTACCAAAGCCACGTGCAACGCACGTGTAAAATACTAGTATATATATATATATATGTATGTATGATAATTTAGAGTGTGTGGCAATTTAGGGGTTGCAAGAGGCACTGGATGGCAGCGTTTAGGGGCATATGTGAATTTGGGATCCTATTATATTGTTGGGATTCCAATGGCAACTGTGTTGGCTTTTGTTGCACATTTAAGAGTCAAAGGACTTTGGATTGGCATAGTTTCAGGAGCAGTACTTCAAAGCTTTCTTTTTGCTCTCATTACCATTTTCACTAATTGGCATAAACAGGTTCTCTTTCTACATTTTTTCCATTTTTTATATGATTAGTGTCTTTCAACTACTAATTATTTTTGTAACTGATTTTAATTCTTCAATAACCATTTAAAAAATATATAAATCTATAGACATTTACTCCCGAGATAAAATTAATTAATGTTAAAAAATTTGTTGACTGAAATAAAATACATAATAAAATGTGAGGACTAAAATAGAATTTTTTTGGTAAATTAAGTCAAAAGACTAAAAGTAAGATGAATTGTTTTTAGTTGTAGGATTTTA

mRNA sequence

ATGGCTCTAACCTCGGCCTTTCTGTGTTTCTCTTCAATTTGGTTTCGTAAGTCTGTCCTTACCGCACGACTACTGTTCTGGGGTGGCAATTTTGGCATCGACCTGCCTCTCTTGGGAAAGAGGTCGAGGAGACCAAGGTCAAGGCTGAGGCCAAGGAGATCGGGTCCTAGGTCGAGCATGACTCTCTTCCTACGACCTCAGCTGGTAGTGATTGTCATCATCGTTGGCCATCTCGGCGACGAGCTTTTGCTTTCTGGAGTCTCCATTGCCACTTCCTTCGCTCGCGCCACTGGCTTTAGCCTCCTCGTAATCCTTTCGGCTCAAGGGCTTTGCAGCTTGTTCTTAAGATTGAAGTTGTCTCAAGAGCTTTGCAACTTGTTCTTGAGATTGAAGTTGTATCAAGGGCTTCGTGAGTTCAAGGAGTCTTTTGATTCTGGAAGTCTTCAAGAAGCTTTAGTCTTTCAGAGTCTTGAAGTGAGATTTGAGGAGGATCTGAGTTGGGAATGGCTGGAGCTTTGGAAACTCTATGTGGGCAAGCATATGGGGCAGAACAATATCAAAAGCTTGGAGTTTATACTTATAGTTGCAAAATTTCTCTCATTTTGGTGTGTTGTAGGAGCTGCTCTGGCTCTTGGCATATCCTCTTGTCTGAATGTCATTTTGTTAGGGCTCTACGTCTTCTTCTCTCCATCCTGCAAGAAGACTCGTGCTCCGTTCTCAAGGGAGGCCATCTTGAGCATTCGTGAGTTCTTTCGGCTCGCCGTTCCCTCCGCTGTGATGACTTGCCTTGAGTGGTGGTCATATGAGGTCATTCTTTTGCTTTCTGGGCTTTTACCGAATCCTAAGGTGGAGACTTCTGTGCTTTCTATATGTTTCTCAATCACTTATTTGCATTTTTTCATACCATATGGGTTGGCGGCCACAGTAAGCACAAGGGTTTCAAATGAACTAAGAGCTGGAAATCCAGAGGCAGCTAAGGTGGCAGTGAGGGTAGTGGGAATTCTTGGCATCATTGAATCAACGATTGTGAGTGTGACTCTCTTTGGGTGTCACAATATCTTGTGA

Coding sequence (CDS)

ATGGCTCTAACCTCGGCCTTTCTGTGTTTCTCTTCAATTTGGTTTCGTAAGTCTGTCCTTACCGCACGACTACTGTTCTGGGGTGGCAATTTTGGCATCGACCTGCCTCTCTTGGGAAAGAGGTCGAGGAGACCAAGGTCAAGGCTGAGGCCAAGGAGATCGGGTCCTAGGTCGAGCATGACTCTCTTCCTACGACCTCAGCTGGTAGTGATTGTCATCATCGTTGGCCATCTCGGCGACGAGCTTTTGCTTTCTGGAGTCTCCATTGCCACTTCCTTCGCTCGCGCCACTGGCTTTAGCCTCCTCGTAATCCTTTCGGCTCAAGGGCTTTGCAGCTTGTTCTTAAGATTGAAGTTGTCTCAAGAGCTTTGCAACTTGTTCTTGAGATTGAAGTTGTATCAAGGGCTTCGTGAGTTCAAGGAGTCTTTTGATTCTGGAAGTCTTCAAGAAGCTTTAGTCTTTCAGAGTCTTGAAGTGAGATTTGAGGAGGATCTGAGTTGGGAATGGCTGGAGCTTTGGAAACTCTATGTGGGCAAGCATATGGGGCAGAACAATATCAAAAGCTTGGAGTTTATACTTATAGTTGCAAAATTTCTCTCATTTTGGTGTGTTGTAGGAGCTGCTCTGGCTCTTGGCATATCCTCTTGTCTGAATGTCATTTTGTTAGGGCTCTACGTCTTCTTCTCTCCATCCTGCAAGAAGACTCGTGCTCCGTTCTCAAGGGAGGCCATCTTGAGCATTCGTGAGTTCTTTCGGCTCGCCGTTCCCTCCGCTGTGATGACTTGCCTTGAGTGGTGGTCATATGAGGTCATTCTTTTGCTTTCTGGGCTTTTACCGAATCCTAAGGTGGAGACTTCTGTGCTTTCTATATGTTTCTCAATCACTTATTTGCATTTTTTCATACCATATGGGTTGGCGGCCACAGTAAGCACAAGGGTTTCAAATGAACTAAGAGCTGGAAATCCAGAGGCAGCTAAGGTGGCAGTGAGGGTAGTGGGAATTCTTGGCATCATTGAATCAACGATTGTGAGTGTGACTCTCTTTGGGTGTCACAATATCTTGTGA

Protein sequence

MALTSAFLCFSSIWFRKSVLTARLLFWGGNFGIDLPLLGKRSRRPRSRLRPRRSGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSLFLRLKLSQELCNLFLRLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNIKSLEFILIVAKFLSFWCVVGAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNIL
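
The protein sequence should be recoverable from the CDS by reading it codon by codon. The sketch below illustrates that with the standard genetic code; the function name `translate` and its `to_stop` parameter are hypothetical, and using the standard nuclear table for this gene is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0.

    Unknown codons (e.g. containing N) become 'X'. With to_stop=True,
    translation ends at the first stop codon, which is not emitted.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore a trailing partial codon, if any.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 24 and last 15 bases of the CDS listed above.
    print(translate("ATGGCTCTAACCTCGGCCTTTCTG"))
    print(translate("CACAATATCTTGTGA"))
```

The two fragments translate to MALTSAFL and HNIL, which agree with the start and end of the listed protein sequence.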
Homology
BLAST of Spg021589 vs. NCBI nr
Match: XP_022141867.1 (protein DETOXIFICATION 1-like isoform X1 [Momordica charantia])

HSP 1 Score: 271.9 bits (694), Expect = 7.6e-69
Identity = 176/328 (53.66%), Positives = 201/328 (61.28%), Query Frame = 0

Query: 69  VVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSLFLRLKLSQELCNLFL 128
           VV V+IVGHLGDELLLSGVSIATSF R TGFSLL+ ++              + LC    
Sbjct: 69  VVTVVIVGHLGDELLLSGVSIATSFVRVTGFSLLLGMAG-----------ALETLCGQAY 128

Query: 129 RLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNI-- 188
             + YQ L         G    + +   + V F   + W + +   + +G+    +++  
Sbjct: 129 GAEQYQKL---------GVYTYSCMISLILVCFPISVLWFFTDKLLISIGQDPSISSVAR 188

Query: 189 -------------------------KSLEFILIVAKFLSF-------WC--------VVG 248
                                    +SL   L+ + F +        W         VVG
Sbjct: 189 KYSVFLIPNLFACAILQSLLRYFLTQSLILPLLFSSFATLCLHIPICWLFVFHFKLRVVG 248

Query: 249 AALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLEWW 308
           AALALGIS  LNVILL  YVFFSPSC KTRAP SREAI SI +FFRLAVPSAVM CLEWW
Sbjct: 249 AALALGISYWLNVILLASYVFFSPSCNKTRAPLSREAISSIGQFFRLAVPSAVMVCLEWW 308

Query: 309 SYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEAAK 355
           SYEVILLLSGLLPNPKVE SVLSICFSITYLH+FIPYGL ATVSTRVSNEL AGNPEAAK
Sbjct: 309 SYEVILLLSGLLPNPKVEASVLSICFSITYLHYFIPYGLGATVSTRVSNELGAGNPEAAK 368
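
Each HSP summary line reports identity and positives as a count over the alignment length, followed by a percentage. The sketch below shows how those figures could be pulled out and recomputed. The helper name `hsp_fractions` is hypothetical, and the pattern is deliberately tolerant of spelling variants of the labels.

```python
import re

# Accepts "Identity"/"Identities" and "Positives" (or the variant "Postives").
_COUNT_RE = re.compile(r"(Identit(?:y|ies)|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_fractions(line: str) -> dict:
    """Return {label: (matches, alignment_length, percent)} for one HSP line."""
    result = {}
    for label, matches, length in _COUNT_RE.findall(line):
        key = "identity" if label.startswith("Ident") else "positives"
        matches, length = int(matches), int(length)
        result[key] = (matches, length, round(100 * matches / length, 2))
    return result


if __name__ == "__main__":
    summary = "Identity = 176/328 (53.66%), Positives = 201/328 (61.28%), Query Frame = 0"
    for key, (matches, length, percent) in hsp_fractions(summary).items():
        print(f"{key}: {matches}/{length} = {percent}%")
```

Recomputing the percentages this way is a quick consistency check on the reported values; for the line above it gives 53.66% identity and 61.28% positives.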

BLAST of Spg021589 vs. NCBI nr
Match: XP_022141869.1 (protein DETOXIFICATION 1-like isoform X3 [Momordica charantia])

HSP 1 Score: 271.9 bits (694), Expect = 7.6e-69
Identity = 176/328 (53.66%), Positives = 201/328 (61.28%), Query Frame = 0

Query: 69  VVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSLFLRLKLSQELCNLFL 128
           VV V+IVGHLGDELLLSGVSIATSF R TGFSLL+ ++              + LC    
Sbjct: 69  VVTVVIVGHLGDELLLSGVSIATSFVRVTGFSLLLGMAG-----------ALETLCGQAY 128

Query: 129 RLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNI-- 188
             + YQ L         G    + +   + V F   + W + +   + +G+    +++  
Sbjct: 129 GAEQYQKL---------GVYTYSCMISLILVCFPISVLWFFTDKLLISIGQDPSISSVAR 188

Query: 189 -------------------------KSLEFILIVAKFLSF-------WC--------VVG 248
                                    +SL   L+ + F +        W         VVG
Sbjct: 189 KYSVFLIPNLFACAILQSLLRYFLTQSLILPLLFSSFATLCLHIPICWLFVFHFKLRVVG 248

Query: 249 AALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLEWW 308
           AALALGIS  LNVILL  YVFFSPSC KTRAP SREAI SI +FFRLAVPSAVM CLEWW
Sbjct: 249 AALALGISYWLNVILLASYVFFSPSCNKTRAPLSREAISSIGQFFRLAVPSAVMVCLEWW 308

Query: 309 SYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEAAK 355
           SYEVILLLSGLLPNPKVE SVLSICFSITYLH+FIPYGL ATVSTRVSNEL AGNPEAAK
Sbjct: 309 SYEVILLLSGLLPNPKVEASVLSICFSITYLHYFIPYGLGATVSTRVSNELGAGNPEAAK 368

BLAST of Spg021589 vs. NCBI nr
Match: XP_022141868.1 (protein DETOXIFICATION 1-like isoform X2 [Momordica charantia])

HSP 1 Score: 271.9 bits (694), Expect = 7.6e-69
Identity = 176/328 (53.66%), Positives = 201/328 (61.28%), Query Frame = 0

Query: 69  VVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSLFLRLKLSQELCNLFL 128
           VV V+IVGHLGDELLLSGVSIATSF R TGFSLL+ ++              + LC    
Sbjct: 69  VVTVVIVGHLGDELLLSGVSIATSFVRVTGFSLLLGMAG-----------ALETLCGQAY 128

Query: 129 RLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNI-- 188
             + YQ L         G    + +   + V F   + W + +   + +G+    +++  
Sbjct: 129 GAEQYQKL---------GVYTYSCMISLILVCFPISVLWFFTDKLLISIGQDPSISSVAR 188

Query: 189 -------------------------KSLEFILIVAKFLSF-------WC--------VVG 248
                                    +SL   L+ + F +        W         VVG
Sbjct: 189 KYSVFLIPNLFACAILQSLLRYFLTQSLILPLLFSSFATLCLHIPICWLFVFHFKLRVVG 248

Query: 249 AALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLEWW 308
           AALALGIS  LNVILL  YVFFSPSC KTRAP SREAI SI +FFRLAVPSAVM CLEWW
Sbjct: 249 AALALGISYWLNVILLASYVFFSPSCNKTRAPLSREAISSIGQFFRLAVPSAVMVCLEWW 308

Query: 309 SYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEAAK 355
           SYEVILLLSGLLPNPKVE SVLSICFSITYLH+FIPYGL ATVSTRVSNEL AGNPEAAK
Sbjct: 309 SYEVILLLSGLLPNPKVEASVLSICFSITYLHYFIPYGLGATVSTRVSNELGAGNPEAAK 368

BLAST of Spg021589 vs. NCBI nr
Match: XP_022141871.1 (protein DETOXIFICATION 8-like isoform X5 [Momordica charantia])

HSP 1 Score: 267.7 bits (683), Expect = 1.4e-67
Identity = 169/286 (59.09%), Positives = 181/286 (63.29%), Query Frame = 0

Query: 69  VVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSLFLRLKLSQELCNLFL 128
           VV V+IVGHLGDELLLSGVSIATSF R TGFSLL+ ++              + LC    
Sbjct: 69  VVTVVIVGHLGDELLLSGVSIATSFVRVTGFSLLLGMAG-----------ALETLCGQAY 128

Query: 129 RLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNIKS 188
             + YQ L  +  S                                              
Sbjct: 129 GAEQYQKLGVYTYSC--------------------------------------------M 188

Query: 189 LEFILIVAKFLSFWCVVGAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIR 248
           +  IL+         VVGAALALGIS  LNVILL  YVFFSPSC KTRAP SREAI SI 
Sbjct: 189 ISLILL--------RVVGAALALGISYWLNVILLASYVFFSPSCNKTRAPLSREAISSIG 248

Query: 249 EFFRLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAAT 308
           +FFRLAVPSAVM CLEWWSYEVILLLSGLLPNPKVE SVLSICFSITYLH+FIPYGL AT
Sbjct: 249 QFFRLAVPSAVMVCLEWWSYEVILLLSGLLPNPKVEASVLSICFSITYLHYFIPYGLGAT 291

Query: 309 VSTRVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNIL 355
           VSTRVSNEL AGNPEAAKVAV+VVG LGIIES  VSV LFGC NIL
Sbjct: 309 VSTRVSNELGAGNPEAAKVAVKVVGALGIIESITVSVILFGCRNIL 291

BLAST of Spg021589 vs. NCBI nr
Match: XP_038889426.1 (protein DETOXIFICATION 8-like isoform X2 [Benincasa hispida])

HSP 1 Score: 266.9 bits (681), Expect = 2.4e-67
Identity = 174/316 (55.06%), Positives = 205/316 (64.87%), Query Frame = 0

Query: 69  VVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSA--QGLC------SLFLRLKLS 128
           VV VI+VGHLGDELLLSGVSIA SF R TGFSLL+ ++   + LC        + +L + 
Sbjct: 34  VVTVIVVGHLGDELLLSGVSIAVSFVRVTGFSLLLGMAGALETLCGQAYGAEQYHKLGIY 93

Query: 129 QELCNLFLRLKLY----------------------QGLREFKESFDSGSLQEALVFQSLE 188
              C + L L  +                        +      F   +L    + QSL 
Sbjct: 94  TYSCMISLLLVCFPISILWFFTDKLLILIGQDPSISSVARNYSVFLIPNLFAYAILQSL- 153

Query: 189 VRFEEDLSWEWLELWKLYVGKHMGQNNIKSLEFILIVAKFLSFWCVVGAALALGISSCLN 248
           VR+   L  + L L  L+        +I     +++  KF     V+GAALALGIS  LN
Sbjct: 154 VRY---LLTQSLILPLLFCSFLTLSLHIPICWLLVLHFKFK----VMGAALALGISYWLN 213

Query: 249 VILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLEWWSYEVILLLSGLL 308
            +LL LY+FFSPSC KTRAPFS EAI SI +FFRLA+PSA+M CLEWWSYEVILLLSGLL
Sbjct: 214 AVLLALYIFFSPSCNKTRAPFSTEAISSIPKFFRLAIPSALMVCLEWWSYEVILLLSGLL 273

Query: 309 PNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEAAKVAVRVVGILGII 355
           PNPKVE SVLSICFSITYLH+FIPYGL ATVSTRVSNEL AGNPE AKVAV+VVG++G+I
Sbjct: 274 PNPKVEASVLSICFSITYLHYFIPYGLGATVSTRVSNELGAGNPERAKVAVKVVGVVGMI 333

BLAST of Spg021589 vs. ExPASy Swiss-Prot
Match: Q9FHB6 (Protein DETOXIFICATION 16 OS=Arabidopsis thaliana OX=3702 GN=DTX16 PE=2 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 7.7e-40
Identity = 133/351 (37.89%), Positives = 181/351 (51.57%), Query Frame = 0

Query: 29  GNFGIDLPLLGKRSR-RPRSRLRPRRSGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGV 88
           G   +  PL+G++S  +   + +   SGP  +++L      V+ V+ VGHLG  L LS  
Sbjct: 8   GEGDLSWPLIGEKSSVKEEVKKQLWLSGPLIAVSLLQFCLQVISVMFVGHLG-SLPLSAA 67

Query: 89  SIATSFARATGFSLLV--ILSAQGLCSLFLRLKLSQELCNLFLRLKLYQGLREFKESFDS 148
           SIATSFA  TGFS L+    +   LC      K    L     R      L     S   
Sbjct: 68  SIATSFASVTGFSFLMGTASALDTLCGQAYGAKKYGMLGIQMQRAMFVLTLASIPLSIIW 127

Query: 149 GSLQEALVF------------QSLEVRFEEDLSWEWLELWKLYVGKHMGQNNIKSLEF-- 208
            + +  LVF               +       ++  L+ +  ++     QNN+  + F  
Sbjct: 128 ANTEHLLVFFGQNKSIATLAGSYAKFMIPSIFAYGLLQCFNRFL---QAQNNVFPVVFCS 187

Query: 209 -ILIVAKFLSFWCVV--------GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSRE 268
            +      L  W +V        GAALA  IS  LNV+LL  YV FSPSC  T   FS+E
Sbjct: 188 GVTTSLHVLLCWVLVFKSGLGFQGAALANSISYWLNVVLLFCYVKFSPSCSLTWTGFSKE 247

Query: 269 AILSIREFFRLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIP 328
           A+  I  F RLAVPSA+M CLE WS+E+++LLSGLLPNP +ETSVLSIC + +   + IP
Sbjct: 248 ALRDILPFLRLAVPSALMVCLEMWSFELLVLLSGLLPNPVLETSVLSICLNTSGTMWMIP 307

Query: 329 YGLAATVSTRVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNI 354
           +GL+   STR+SNEL AGNP+ AK+AVRVV  + + ES ++   L    NI
Sbjct: 308 FGLSGAASTRISNELGAGNPKVAKLAVRVVICIAVAESIVIGSVLILIRNI 354

BLAST of Spg021589 vs. ExPASy Swiss-Prot
Match: Q9SIA5 (Protein DETOXIFICATION 1 OS=Arabidopsis thaliana OX=3702 GN=DTX1 PE=2 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 1.0e-39
Identity = 122/345 (35.36%), Positives = 176/345 (51.01%), Query Frame = 0

Query: 52  RRSGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLC 111
           R + P +++T+      V+ V++ GH G EL LSGV++A SF   TGFS+        +C
Sbjct: 33  RLAAPMATVTIAQYLLPVISVMVAGHNG-ELQLSGVALANSFTNVTGFSI--------MC 92

Query: 112 SLFLRLKLSQELCNLFLRLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLE 171
            L   L   + LC      K Y+ +         G+   + +  ++ + F   + W ++E
Sbjct: 93  GLVGAL---ETLCGQAYGAKQYEKI---------GTYAYSAIASNIPICFLISILWLYIE 152

Query: 172 LWKLYVGKHMGQNNIK-SLEFILIVAKF-------------------------------- 231
              + +G+    + I  S  F LI A F                                
Sbjct: 153 KILISLGQDPEISRIAGSYAFWLIPALFGQAIVIPLSRFLLTQGLVIPLLFTAVTTLLFH 212

Query: 232 -LSFWCVV--------GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIRE 291
            L  W +V        G A+A  +S     ++L  YV FS SC+KTR   SR+ + SI++
Sbjct: 213 VLVCWTLVFLFGLGCNGPAMATSVSFWFYAVILSCYVRFSSSCEKTRGFVSRDFVSSIKQ 272

Query: 292 FFRLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATV 351
           FF+  +PSA M CLEWW +E+++L SGLLPNPK+ETSVLSIC +I  LH+ I  G+AA V
Sbjct: 273 FFQYGIPSAAMICLEWWLFEILILCSGLLPNPKLETSVLSICLTIETLHYVISAGVAAAV 332

Query: 352 STRVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNIL 355
           STRVSN L AGNP+ A+V+V     L I+ES   S+ LF C NI+
Sbjct: 333 STRVSNNLGAGNPQVARVSVLAGLCLWIVESAFFSILLFTCRNII 356

BLAST of Spg021589 vs. ExPASy Swiss-Prot
Match: Q9SIA1 (Protein DETOXIFICATION 5 OS=Arabidopsis thaliana OX=3702 GN=DTX5 PE=3 SV=2)

HSP 1 Score: 160.2 bits (404), Expect = 4.2e-38
Identity = 125/329 (37.99%), Positives = 185/329 (56.23%), Query Frame = 0

Query: 54  SGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLL--VILSAQGLC 113
           + P +++T+      V+ V++ GH G EL LSGV++AT+FA  +GF ++  ++ + + LC
Sbjct: 38  AAPMATVTVSQYLLPVISVMVAGHCG-ELQLSGVTLATAFANVSGFGIMYGLVGALETLC 97

Query: 114 SLFLRLK---------LSQELCNL----------FLRLKLYQGLREFKE-SFDSGS---- 173
                 K          S  + N+          F   KL+  L +  + S  +GS    
Sbjct: 98  GQAYGAKQYTKIGTYTFSAIVSNVPIVVLISILWFYMDKLFVSLGQDPDISKVAGSYAVC 157

Query: 174 LQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNIKSLEFILIVAKFLSFWCVV-- 233
           L  AL+ Q+++      L  + L L  LY         I +L F + V   L +   +  
Sbjct: 158 LIPALLAQAVQQPLTRFLQTQGLVLPLLYCA-------ITTLLFHIPVCLILVYAFGLGS 217

Query: 234 -GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLE 293
            GAALA+G+S   NV++L LYV FS +C+KTR   S + +LS+++FF+  +PSA MT +E
Sbjct: 218 NGAALAIGLSYWFNVLILALYVRFSSACEKTRGFVSDDFVLSVKQFFQYGIPSAAMTTIE 277

Query: 294 WWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEA 353
           W  +E+++L SGLLPNPK+ETSVLSIC + + LH  IP G+ A  STR+SNEL AGNPE 
Sbjct: 278 WSLFELLILSSGLLPNPKLETSVLSICLTTSSLHCVIPMGIGAAGSTRISNELGAGNPEV 337

BLAST of Spg021589 vs. ExPASy Swiss-Prot
Match: Q8RWF5 (Protein DETOXIFICATION 6 OS=Arabidopsis thaliana OX=3702 GN=DTX6 PE=2 SV=1)

HSP 1 Score: 156.4 bits (394), Expect = 6.1e-37
Identity = 127/331 (38.37%), Positives = 183/331 (55.29%), Query Frame = 0

Query: 52  RRSGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSA--QG 111
           R + P +++T+      V+ V++ GH   EL LSGV++ATSF   +GFS++  L+   + 
Sbjct: 36  RMALPMATVTVAQYLLPVISVMVAGH-RSELQLSGVALATSFTNVSGFSVMFGLAGALET 95

Query: 112 LCSLFLRLK---------LSQELCNL----------FLRLKLYQGLREFKE-SFDSGS-- 171
           LC      K          S  + N+          F   KL+  L +  + S  +GS  
Sbjct: 96  LCGQAYGAKQYAKIGTYTFSAIVSNVPIVVLISILWFYMDKLFVSLGQDPDISKVAGSYA 155

Query: 172 --LQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNIKSLEFILIVAKFLSFWCVV 231
             L  AL+ Q+++      L  + L L  LY         I +L F + V   L +   +
Sbjct: 156 VCLIPALLAQAVQQPLTRFLQTQGLVLPLLYCA-------ITTLLFHIPVCLILVYAFGL 215

Query: 232 ---GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTC 291
              GAALA+G+S   NV++L LYV FS SC+KTR   S + +LS+++FF+  +PSA MT 
Sbjct: 216 GSNGAALAIGLSYWFNVLILALYVRFSSSCEKTRGFVSDDFVLSVKQFFQYGIPSAAMTT 275

Query: 292 LEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNP 351
           +EW  +E ++L SGLLPNPK+ETSVLSIC + + LH+ IP G+ A  S RVSNEL AGNP
Sbjct: 276 IEWSLFEFLILSSGLLPNPKLETSVLSICLTTSSLHYVIPMGIGAAGSIRVSNELGAGNP 335

Query: 352 EAAKVAVRVVGILGIIESTIVSVTLFGCHNI 354
           E A++AV     L  +E+TI S  LF C +I
Sbjct: 336 EVARLAVFAGIFLWFLEATICSTLLFICRDI 358

BLAST of Spg021589 vs. ExPASy Swiss-Prot
Match: Q9SIA4 (Protein DETOXIFICATION 3 OS=Arabidopsis thaliana OX=3702 GN=DTX3 PE=3 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 1.0e-36
Identity = 113/343 (32.94%), Positives = 171/343 (49.85%), Query Frame = 0

Query: 54  SGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSL 113
           + P +++T+      V+ V++ GH G EL LSGV++ATSF   +GFS+L  L+       
Sbjct: 35  AAPMAAVTIAQYLLPVISVMVAGHNG-ELQLSGVALATSFTNVSGFSILFGLAG------ 94

Query: 114 FLRLKLSQELCNLFLRLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELW 173
                  + LC      K Y+ +  +  S  + ++   ++   L         W ++E  
Sbjct: 95  -----ALETLCGQAYGAKQYEKIGTYTYSATASNIPICVLISVL---------WIYIEKL 154

Query: 174 KLYVGKHMGQNNIK------------SLEFILIVAKFL---------------------- 233
            + +G+    + +             +  F + + +FL                      
Sbjct: 155 LISLGQDPDISRVAGSYALWLIPALFAHAFFIPLTRFLLAQGLVLPLLYCTLTTLLFHIP 214

Query: 234 SFWCVV--------GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFF 293
             W  V        GAA+A+ +S    V++L  YV +S SC KTR   S + +  I++FF
Sbjct: 215 VCWAFVYAFGLGSNGAAMAISVSFWFYVVILSCYVRYSSSCDKTRVFVSSDFVSCIKQFF 274

Query: 294 RLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVST 353
              VPSA M CLEWW +E+++L SGLLPNPK+ETSVLSIC +   LH+ IP G+AA VST
Sbjct: 275 HFGVPSAAMVCLEWWLFELLILCSGLLPNPKLETSVLSICLTTASLHYVIPGGVAAAVST 334

Query: 354 RVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNIL 355
           RVSN+L AG P+ A+V+V     L ++ES   S  LF C NI+
Sbjct: 335 RVSNKLGAGIPQVARVSVLAGLCLWLVESAFFSTLLFTCRNII 356

BLAST of Spg021589 vs. ExPASy TrEMBL
Match: A0A6J1CL18 (Protein DETOXIFICATION OS=Momordica charantia OX=3673 GN=LOC111012127 PE=3 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 3.7e-69
Identity = 176/328 (53.66%), Positives = 201/328 (61.28%), Query Frame = 0

Query: 69  VVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSLFLRLKLSQELCNLFL 128
           VV V+IVGHLGDELLLSGVSIATSF R TGFSLL+ ++              + LC    
Sbjct: 69  VVTVVIVGHLGDELLLSGVSIATSFVRVTGFSLLLGMAG-----------ALETLCGQAY 128

Query: 129 RLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNI-- 188
             + YQ L         G    + +   + V F   + W + +   + +G+    +++  
Sbjct: 129 GAEQYQKL---------GVYTYSCMISLILVCFPISVLWFFTDKLLISIGQDPSISSVAR 188

Query: 189 -------------------------KSLEFILIVAKFLSF-------WC--------VVG 248
                                    +SL   L+ + F +        W         VVG
Sbjct: 189 KYSVFLIPNLFACAILQSLLRYFLTQSLILPLLFSSFATLCLHIPICWLFVFHFKLRVVG 248

Query: 249 AALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLEWW 308
           AALALGIS  LNVILL  YVFFSPSC KTRAP SREAI SI +FFRLAVPSAVM CLEWW
Sbjct: 249 AALALGISYWLNVILLASYVFFSPSCNKTRAPLSREAISSIGQFFRLAVPSAVMVCLEWW 308

Query: 309 SYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEAAK 355
           SYEVILLLSGLLPNPKVE SVLSICFSITYLH+FIPYGL ATVSTRVSNEL AGNPEAAK
Sbjct: 309 SYEVILLLSGLLPNPKVEASVLSICFSITYLHYFIPYGLGATVSTRVSNELGAGNPEAAK 368

BLAST of Spg021589 vs. ExPASy TrEMBL
Match: A0A6J1CK03 (Protein DETOXIFICATION OS=Momordica charantia OX=3673 GN=LOC111012127 PE=3 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 3.7e-69
Identity = 176/328 (53.66%), Positives = 201/328 (61.28%), Query Frame = 0

Query: 69  VVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSLFLRLKLSQELCNLFL 128
           VV V+IVGHLGDELLLSGVSIATSF R TGFSLL+ ++              + LC    
Sbjct: 69  VVTVVIVGHLGDELLLSGVSIATSFVRVTGFSLLLGMAG-----------ALETLCGQAY 128

Query: 129 RLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNI-- 188
             + YQ L         G    + +   + V F   + W + +   + +G+    +++  
Sbjct: 129 GAEQYQKL---------GVYTYSCMISLILVCFPISVLWFFTDKLLISIGQDPSISSVAR 188

Query: 189 -------------------------KSLEFILIVAKFLSF-------WC--------VVG 248
                                    +SL   L+ + F +        W         VVG
Sbjct: 189 KYSVFLIPNLFACAILQSLLRYFLTQSLILPLLFSSFATLCLHIPICWLFVFHFKLRVVG 248

Query: 249 AALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLEWW 308
           AALALGIS  LNVILL  YVFFSPSC KTRAP SREAI SI +FFRLAVPSAVM CLEWW
Sbjct: 249 AALALGISYWLNVILLASYVFFSPSCNKTRAPLSREAISSIGQFFRLAVPSAVMVCLEWW 308

Query: 309 SYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEAAK 355
           SYEVILLLSGLLPNPKVE SVLSICFSITYLH+FIPYGL ATVSTRVSNEL AGNPEAAK
Sbjct: 309 SYEVILLLSGLLPNPKVEASVLSICFSITYLHYFIPYGLGATVSTRVSNELGAGNPEAAK 368

BLAST of Spg021589 vs. ExPASy TrEMBL
Match: A0A6J1CLS8 (Protein DETOXIFICATION OS=Momordica charantia OX=3673 GN=LOC111012127 PE=3 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 3.7e-69
Identity = 176/328 (53.66%), Positives = 201/328 (61.28%), Query Frame = 0

Query: 69  VVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSLFLRLKLSQELCNLFL 128
           VV V+IVGHLGDELLLSGVSIATSF R TGFSLL+ ++              + LC    
Sbjct: 69  VVTVVIVGHLGDELLLSGVSIATSFVRVTGFSLLLGMAG-----------ALETLCGQAY 128

Query: 129 RLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNI-- 188
             + YQ L         G    + +   + V F   + W + +   + +G+    +++  
Sbjct: 129 GAEQYQKL---------GVYTYSCMISLILVCFPISVLWFFTDKLLISIGQDPSISSVAR 188

Query: 189 -------------------------KSLEFILIVAKFLSF-------WC--------VVG 248
                                    +SL   L+ + F +        W         VVG
Sbjct: 189 KYSVFLIPNLFACAILQSLLRYFLTQSLILPLLFSSFATLCLHIPICWLFVFHFKLRVVG 248

Query: 249 AALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLEWW 308
           AALALGIS  LNVILL  YVFFSPSC KTRAP SREAI SI +FFRLAVPSAVM CLEWW
Sbjct: 249 AALALGISYWLNVILLASYVFFSPSCNKTRAPLSREAISSIGQFFRLAVPSAVMVCLEWW 308

Query: 309 SYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEAAK 355
           SYEVILLLSGLLPNPKVE SVLSICFSITYLH+FIPYGL ATVSTRVSNEL AGNPEAAK
Sbjct: 309 SYEVILLLSGLLPNPKVEASVLSICFSITYLHYFIPYGLGATVSTRVSNELGAGNPEAAK 368

BLAST of Spg021589 vs. ExPASy TrEMBL
Match: A0A6J1CKI4 (protein DETOXIFICATION 8-like isoform X5 OS=Momordica charantia OX=3673 GN=LOC111012127 PE=3 SV=1)

HSP 1 Score: 267.7 bits (683), Expect = 6.9e-68
Identity = 169/286 (59.09%), Positives = 181/286 (63.29%), Query Frame = 0

Query: 69  VVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSLFLRLKLSQELCNLFL 128
           VV V+IVGHLGDELLLSGVSIATSF R TGFSLL+ ++              + LC    
Sbjct: 69  VVTVVIVGHLGDELLLSGVSIATSFVRVTGFSLLLGMAG-----------ALETLCGQAY 128

Query: 129 RLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNIKS 188
             + YQ L  +  S                                              
Sbjct: 129 GAEQYQKLGVYTYSC--------------------------------------------M 188

Query: 189 LEFILIVAKFLSFWCVVGAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIR 248
           +  IL+         VVGAALALGIS  LNVILL  YVFFSPSC KTRAP SREAI SI 
Sbjct: 189 ISLILL--------RVVGAALALGISYWLNVILLASYVFFSPSCNKTRAPLSREAISSIG 248

Query: 249 EFFRLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAAT 308
           +FFRLAVPSAVM CLEWWSYEVILLLSGLLPNPKVE SVLSICFSITYLH+FIPYGL AT
Sbjct: 249 QFFRLAVPSAVMVCLEWWSYEVILLLSGLLPNPKVEASVLSICFSITYLHYFIPYGLGAT 291

Query: 309 VSTRVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNIL 355
           VSTRVSNEL AGNPEAAKVAV+VVG LGIIES  VSV LFGC NIL
Sbjct: 309 VSTRVSNELGAGNPEAAKVAVKVVGALGIIESITVSVILFGCRNIL 291

BLAST of Spg021589 vs. ExPASy TrEMBL
Match: A0A6J1CJB5 (Protein DETOXIFICATION OS=Momordica charantia OX=3673 GN=LOC111012127 PE=3 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 1.0e-66
Identity = 171/343 (49.85%), Positives = 204/343 (59.48%), Query Frame = 0

Query: 54  SGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSL 113
           + P ++ T+      +V V++VGHLGD+LLLSG SIATSF   TG S+L+ ++       
Sbjct: 19  AAPMAASTILQYSMQIVAVMMVGHLGDQLLLSGGSIATSFVNVTGISVLLGMAG------ 78

Query: 114 FLRLKLSQELCNLFLRLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELW 173
                  + LC      + YQ L         G    + +   + V F   + W + +  
Sbjct: 79  -----ALETLCGQAYGAEQYQKL---------GVYTYSCMISLILVCFPISVLWFFTDKL 138

Query: 174 KLYVGKHMGQNNI---------------------------KSLEFILIVAKFLSF----- 233
            + +G+    +++                           +SL   L+ + F +      
Sbjct: 139 LISIGQDPSISSVARKYSVFLIPNLFACAILQSLLRYFLTQSLILPLLFSSFATLCLHIP 198

Query: 234 --WC--------VVGAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFF 293
             W         VVGAALALGIS  LNVILL  YVFFSPSC KTRAP SREAI SI +FF
Sbjct: 199 ICWLFVFHFKLRVVGAALALGISYWLNVILLASYVFFSPSCNKTRAPLSREAISSIGQFF 258

Query: 294 RLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVST 353
           RLAVPSAVM CLEWWSYEVILLLSGLLPNPKVE SVLSICFSITYLH+FIPYGL ATVST
Sbjct: 259 RLAVPSAVMVCLEWWSYEVILLLSGLLPNPKVEASVLSICFSITYLHYFIPYGLGATVST 318

Query: 354 RVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNIL 355
           RVSNEL AGNPEAAKVAV+VVG LGIIES  VSV LFGC NIL
Sbjct: 319 RVSNELGAGNPEAAKVAVKVVGALGIIESITVSVILFGCRNIL 341

BLAST of Spg021589 vs. TAIR 10
Match: AT5G52450.1 (MATE efflux family protein)

HSP 1 Score: 166.0 bits (419), Expect = 5.5e-41
Identity = 133/351 (37.89%), Positives = 181/351 (51.57%), Query Frame = 0

Query: 29  GNFGIDLPLLGKRSR-RPRSRLRPRRSGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGV 88
           G   +  PL+G++S  +   + +   SGP  +++L      V+ V+ VGHLG  L LS  
Sbjct: 8   GEGDLSWPLIGEKSSVKEEVKKQLWLSGPLIAVSLLQFCLQVISVMFVGHLG-SLPLSAA 67

Query: 89  SIATSFARATGFSLLV--ILSAQGLCSLFLRLKLSQELCNLFLRLKLYQGLREFKESFDS 148
           SIATSFA  TGFS L+    +   LC      K    L     R      L     S   
Sbjct: 68  SIATSFASVTGFSFLMGTASALDTLCGQAYGAKKYGMLGIQMQRAMFVLTLASIPLSIIW 127

Query: 149 GSLQEALVF------------QSLEVRFEEDLSWEWLELWKLYVGKHMGQNNIKSLEF-- 208
            + +  LVF               +       ++  L+ +  ++     QNN+  + F  
Sbjct: 128 ANTEHLLVFFGQNKSIATLAGSYAKFMIPSIFAYGLLQCFNRFL---QAQNNVFPVVFCS 187

Query: 209 -ILIVAKFLSFWCVV--------GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSRE 268
            +      L  W +V        GAALA  IS  LNV+LL  YV FSPSC  T   FS+E
Sbjct: 188 GVTTSLHVLLCWVLVFKSGLGFQGAALANSISYWLNVVLLFCYVKFSPSCSLTWTGFSKE 247

Query: 269 AILSIREFFRLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIP 328
           A+  I  F RLAVPSA+M CLE WS+E+++LLSGLLPNP +ETSVLSIC + +   + IP
Sbjct: 248 ALRDILPFLRLAVPSALMVCLEMWSFELLVLLSGLLPNPVLETSVLSICLNTSGTMWMIP 307

Query: 329 YGLAATVSTRVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNI 354
           +GL+   STR+SNEL AGNP+ AK+AVRVV  + + ES ++   L    NI
Sbjct: 308 FGLSGAASTRISNELGAGNPKVAKLAVRVVICIAVAESIVIGSVLILIRNI 354

BLAST of Spg021589 vs. TAIR 10
Match: AT2G04040.1 (MATE efflux family protein)

HSP 1 Score: 165.6 bits (418), Expect = 7.1e-41
Identity = 122/345 (35.36%), Positives = 176/345 (51.01%), Query Frame = 0

Query: 52  RRSGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLC 111
           R + P +++T+      V+ V++ GH G EL LSGV++A SF   TGFS+        +C
Sbjct: 33  RLAAPMATVTIAQYLLPVISVMVAGHNG-ELQLSGVALANSFTNVTGFSI--------MC 92

Query: 112 SLFLRLKLSQELCNLFLRLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLE 171
            L   L   + LC      K Y+ +         G+   + +  ++ + F   + W ++E
Sbjct: 93  GLVGAL---ETLCGQAYGAKQYEKI---------GTYAYSAIASNIPICFLISILWLYIE 152

Query: 172 LWKLYVGKHMGQNNIK-SLEFILIVAKF-------------------------------- 231
              + +G+    + I  S  F LI A F                                
Sbjct: 153 KILISLGQDPEISRIAGSYAFWLIPALFGQAIVIPLSRFLLTQGLVIPLLFTAVTTLLFH 212

Query: 232 -LSFWCVV--------GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIRE 291
            L  W +V        G A+A  +S     ++L  YV FS SC+KTR   SR+ + SI++
Sbjct: 213 VLVCWTLVFLFGLGCNGPAMATSVSFWFYAVILSCYVRFSSSCEKTRGFVSRDFVSSIKQ 272

Query: 292 FFRLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATV 351
           FF+  +PSA M CLEWW +E+++L SGLLPNPK+ETSVLSIC +I  LH+ I  G+AA V
Sbjct: 273 FFQYGIPSAAMICLEWWLFEILILCSGLLPNPKLETSVLSICLTIETLHYVISAGVAAAV 332

Query: 352 STRVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNIL 355
           STRVSN L AGNP+ A+V+V     L I+ES   S+ LF C NI+
Sbjct: 333 STRVSNNLGAGNPQVARVSVLAGLCLWIVESAFFSILLFTCRNII 356

BLAST of Spg021589 vs. TAIR 10
Match: AT2G04090.1 (MATE efflux family protein)

HSP 1 Score: 160.2 bits (404), Expect = 3.0e-39
Identity = 125/329 (37.99%), Positives = 185/329 (56.23%), Query Frame = 0

Query: 54  SGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLL--VILSAQGLC 113
           + P +++T+      V+ V++ GH G EL LSGV++AT+FA  +GF ++  ++ + + LC
Sbjct: 38  AAPMATVTVSQYLLPVISVMVAGHCG-ELQLSGVTLATAFANVSGFGIMYGLVGALETLC 97

Query: 114 SLFLRLK---------LSQELCNL----------FLRLKLYQGLREFKE-SFDSGS---- 173
                 K          S  + N+          F   KL+  L +  + S  +GS    
Sbjct: 98  GQAYGAKQYTKIGTYTFSAIVSNVPIVVLISILWFYMDKLFVSLGQDPDISKVAGSYAVC 157

Query: 174 LQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNIKSLEFILIVAKFLSFWCVV-- 233
           L  AL+ Q+++      L  + L L  LY         I +L F + V   L +   +  
Sbjct: 158 LIPALLAQAVQQPLTRFLQTQGLVLPLLYCA-------ITTLLFHIPVCLILVYAFGLGS 217

Query: 234 -GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTCLE 293
            GAALA+G+S   NV++L LYV FS +C+KTR   S + +LS+++FF+  +PSA MT +E
Sbjct: 218 NGAALAIGLSYWFNVLILALYVRFSSACEKTRGFVSDDFVLSVKQFFQYGIPSAAMTTIE 277

Query: 294 WWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNPEA 353
           W  +E+++L SGLLPNPK+ETSVLSIC + + LH  IP G+ A  STR+SNEL AGNPE 
Sbjct: 278 WSLFELLILSSGLLPNPKLETSVLSICLTTSSLHCVIPMGIGAAGSTRISNELGAGNPEV 337

BLAST of Spg021589 vs. TAIR 10
Match: AT2G04100.1 (MATE efflux family protein)

HSP 1 Score: 156.4 bits (394), Expect = 4.3e-38
Identity = 127/331 (38.37%), Positives = 183/331 (55.29%), Query Frame = 0

Query: 52  RRSGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSA--QG 111
           R + P +++T+      V+ V++ GH   EL LSGV++ATSF   +GFS++  L+   + 
Sbjct: 36  RMALPMATVTVAQYLLPVISVMVAGH-RSELQLSGVALATSFTNVSGFSVMFGLAGALET 95

Query: 112 LCSLFLRLK---------LSQELCNL----------FLRLKLYQGLREFKE-SFDSGS-- 171
           LC      K          S  + N+          F   KL+  L +  + S  +GS  
Sbjct: 96  LCGQAYGAKQYAKIGTYTFSAIVSNVPIVVLISILWFYMDKLFVSLGQDPDISKVAGSYA 155

Query: 172 --LQEALVFQSLEVRFEEDLSWEWLELWKLYVGKHMGQNNIKSLEFILIVAKFLSFWCVV 231
             L  AL+ Q+++      L  + L L  LY         I +L F + V   L +   +
Sbjct: 156 VCLIPALLAQAVQQPLTRFLQTQGLVLPLLYCA-------ITTLLFHIPVCLILVYAFGL 215

Query: 232 ---GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFFRLAVPSAVMTC 291
              GAALA+G+S   NV++L LYV FS SC+KTR   S + +LS+++FF+  +PSA MT 
Sbjct: 216 GSNGAALAIGLSYWFNVLILALYVRFSSSCEKTRGFVSDDFVLSVKQFFQYGIPSAAMTT 275

Query: 292 LEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVSTRVSNELRAGNP 351
           +EW  +E ++L SGLLPNPK+ETSVLSIC + + LH+ IP G+ A  S RVSNEL AGNP
Sbjct: 276 IEWSLFEFLILSSGLLPNPKLETSVLSICLTTSSLHYVIPMGIGAAGSIRVSNELGAGNP 335

Query: 352 EAAKVAVRVVGILGIIESTIVSVTLFGCHNI 354
           E A++AV     L  +E+TI S  LF C +I
Sbjct: 336 EVARLAVFAGIFLWFLEATICSTLLFICRDI 358

BLAST of Spg021589 vs. TAIR 10
Match: AT2G04050.1 (MATE efflux family protein)

HSP 1 Score: 155.6 bits (392), Expect = 7.4e-38
Identity = 113/343 (32.94%), Positives = 171/343 (49.85%), Query Frame = 0

Query: 54  SGPRSSMTLFLRPQLVVIVIIVGHLGDELLLSGVSIATSFARATGFSLLVILSAQGLCSL 113
           + P +++T+      V+ V++ GH G EL LSGV++ATSF   +GFS+L  L+       
Sbjct: 35  AAPMAAVTIAQYLLPVISVMVAGHNG-ELQLSGVALATSFTNVSGFSILFGLAG------ 94

Query: 114 FLRLKLSQELCNLFLRLKLYQGLREFKESFDSGSLQEALVFQSLEVRFEEDLSWEWLELW 173
                  + LC      K Y+ +  +  S  + ++   ++   L         W ++E  
Sbjct: 95  -----ALETLCGQAYGAKQYEKIGTYTYSATASNIPICVLISVL---------WIYIEKL 154

Query: 174 KLYVGKHMGQNNIK------------SLEFILIVAKFL---------------------- 233
            + +G+    + +             +  F + + +FL                      
Sbjct: 155 LISLGQDPDISRVAGSYALWLIPALFAHAFFIPLTRFLLAQGLVLPLLYCTLTTLLFHIP 214

Query: 234 SFWCVV--------GAALALGISSCLNVILLGLYVFFSPSCKKTRAPFSREAILSIREFF 293
             W  V        GAA+A+ +S    V++L  YV +S SC KTR   S + +  I++FF
Sbjct: 215 VCWAFVYAFGLGSNGAAMAISVSFWFYVVILSCYVRYSSSCDKTRVFVSSDFVSCIKQFF 274

Query: 294 RLAVPSAVMTCLEWWSYEVILLLSGLLPNPKVETSVLSICFSITYLHFFIPYGLAATVST 353
              VPSA M CLEWW +E+++L SGLLPNPK+ETSVLSIC +   LH+ IP G+AA VST
Sbjct: 275 HFGVPSAAMVCLEWWLFELLILCSGLLPNPKLETSVLSICLTTASLHYVIPGGVAAAVST 334

Query: 354 RVSNELRAGNPEAAKVAVRVVGILGIIESTIVSVTLFGCHNIL 355
           RVSN+L AG P+ A+V+V     L ++ES   S  LF C NI+
Sbjct: 335 RVSNKLGAGIPQVARVSVLAGLCLWLVESAFFSTLLFTCRNII 356

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity  Description
XP_022141867.1  7.6e-69  53.66     protein DETOXIFICATION 1-like isoform X1 [Momordica charantia]
XP_022141869.1  7.6e-69  53.66     protein DETOXIFICATION 1-like isoform X3 [Momordica charantia]
XP_022141868.1  7.6e-69  53.66     protein DETOXIFICATION 1-like isoform X2 [Momordica charantia]
XP_022141871.1  1.4e-67  59.09     protein DETOXIFICATION 8-like isoform X5 [Momordica charantia]
XP_038889426.1  2.4e-67  55.06     protein DETOXIFICATION 8-like isoform X2 [Benincasa hispida]

ExPASy Swiss-Prot
Match Name  E-value  Identity  Description
Q9FHB6      7.7e-40  37.89     Protein DETOXIFICATION 16 OS=Arabidopsis thaliana OX=3702 GN=DTX16 PE=2 SV=1
Q9SIA5      1.0e-39  35.36     Protein DETOXIFICATION 1 OS=Arabidopsis thaliana OX=3702 GN=DTX1 PE=2 SV=1
Q9SIA1      4.2e-38  37.99     Protein DETOXIFICATION 5 OS=Arabidopsis thaliana OX=3702 GN=DTX5 PE=3 SV=2
Q8RWF5      6.1e-37  38.37     Protein DETOXIFICATION 6 OS=Arabidopsis thaliana OX=3702 GN=DTX6 PE=2 SV=1
Q9SIA4      1.0e-36  32.94     Protein DETOXIFICATION 3 OS=Arabidopsis thaliana OX=3702 GN=DTX3 PE=3 SV=1

ExPASy TrEMBL
Match Name  E-value  Identity  Description
A0A6J1CL18  3.7e-69  53.66     Protein DETOXIFICATION OS=Momordica charantia OX=3673 GN=LOC111012127 PE=3 SV=1
A0A6J1CK03  3.7e-69  53.66     Protein DETOXIFICATION OS=Momordica charantia OX=3673 GN=LOC111012127 PE=3 SV=1
A0A6J1CLS8  3.7e-69  53.66     Protein DETOXIFICATION OS=Momordica charantia OX=3673 GN=LOC111012127 PE=3 SV=1
A0A6J1CKI4  6.9e-68  59.09     protein DETOXIFICATION 8-like isoform X5 OS=Momordica charantia OX=3673 GN=LOC11...
A0A6J1CJB5  1.0e-66  49.85     Protein DETOXIFICATION OS=Momordica charantia OX=3673 GN=LOC111012127 PE=3 SV=1

TAIR 10
Match Name   E-value  Identity  Description
AT5G52450.1  5.5e-41  37.89     MATE efflux family protein
AT2G04040.1  7.1e-41  35.36     MATE efflux family protein
AT2G04090.1  3.0e-39  37.99     MATE efflux family protein
AT2G04100.1  4.3e-38  38.37     MATE efflux family protein
AT2G04050.1  7.4e-38  32.94     MATE efflux family protein

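
The Identity column in the summary tables is the percentage from the corresponding HSP header line in the alignments above (for example, 169/286 gives 59.09). The following is a minimal Python sketch of how such a header line could be parsed and the percentage re-derived. The function names `parse_hsp_stats` and `recompute_pct` are hypothetical helpers written for this illustration and are not part of any BLAST tool; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches the identity/positives header of an HSP block.
# "Posi?tives" tolerates the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and percentages from one HSP header line, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, _pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recompute_pct(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Header line taken from the XP_022141871.1 hit above.
line = "Identity = 169/286 (59.09%), Postives = 181/286 (63.29%), Query Frame = 0"
stats = parse_hsp_stats(line)
identity_check = recompute_pct(stats["identical"], stats["aligned_length"])
positives_check = recompute_pct(stats["positives"], stats["aligned_length"])
```

Here `identity_check` and `positives_check` can be compared against the `identity_pct` and `positives_pct` values reported in the header to confirm that a row was read correctly.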
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                        Source   Source Term      Source Description            Alignment
IPR002528  Multi antimicrobial extrusion protein  PFAM     PF01554          MatE                          coord: 256..351; e-value: 2.9E-14; score: 53.0
None       No IPR available                       PANTHER  PTHR11206:SF345  PROTEIN DETOXIFICATION        coord: 67..106; coord: 203..354
None       No IPR available                       PANTHER  PTHR11206        MULTIDRUG RESISTANCE PROTEIN  coord: 67..106; coord: 203..354
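
The `coord:` entries in the InterPro table give start and end residue positions of each match on the protein. As a rough sketch, the length of a matched span could be derived as below. This assumes the coordinates are 1-based and inclusive, which the page itself does not state, and `coord_span_length` is a hypothetical helper name used only for this example.

```python
import re


def coord_span_length(text):
    """Length in residues of the first 'coord: START..END' span in text.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    by the source page).
    """
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no 'coord: START..END' span found")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


# Spans taken from the InterPro table above.
pfam_len = coord_span_length("coord: 256..351")
panther_lens = [coord_span_length(c) for c in ("coord: 67..106", "coord: 203..354")]
```

Under that assumption, the PFAM MatE match covers `pfam_len` residues and the two PANTHER segments cover the lengths in `panther_lens`.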

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Spg021589.1   Spg021589.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0055085 transmembrane transport
cellular_component GO:0016020 membrane
molecular_function GO:0015297 antiporter activity
molecular_function GO:0042910 xenobiotic transmembrane transporter activity