Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, polypeptide, CDS, three_prime_UTR
ATTAAAAAAAAATTATATTAAATGAGAATTCTATGTCTGAAAATCAGTGAGCGATTTACAAACATAATTCCACCCTTTTCTTGTTTTCATTGATAACCAGAGAGCCATCAAAATTTATCTTCTTTTATACTGACAGACCTTCATATCGAACATTCAAATCATTTCTTTTCTTCCAACTCTCCTGAAATACGTAGAGATTCTGTATAAACTCCAATAGACCAAACAATTTGTACCTTTTAATTGAAAACTCAGCCGAGCGATCTTCGCCAGTGCAAGTTTACTTCCTGATTCTGCATGCGTACAGATGAAGATGATGTACGAGGAGAACAGAACAAGCTCCAGCTTTGCTTTGCGCTGCTCAACTAACTTGATGAAGCAAGAACAATCTGTCATCAATCTAATCGACGATAATAAAGCGAAGAGAAAAAGAAACCCAAATTACGAACAAGACAAGGTCTGAAAATACATACAAGAAGAATCTCGACAAAAAACTAGGGACGCAAATTTCTGGATTTTGATCTCTTTGAAGATAAACAGCGTATGCCCATTACTGGTTTTGATTAAAATCCGAAGGAAGTAAATTGGAAGGCGATGTCTTTTCGTGATTGGGTTCGAATCGGCTGTGAAAAGCACTTTTCTTGGGTTTAATTTAATGTGTTTTGATATGTTTGACTTCCGGTTTCGAAATGTGGGAATGTTTTTATCTGACACCCACTTAATTTAACAGAAAGGAACTAATTATTTGATTTAATTTTGGCGACTGTGAGCTTTTGTTTGATTTTCGTGTTTATTGCGTGGAATGTGGATAGGGAGAAAGGTAAGAATATTTCTTATTTCTATTTTTTTTGAGGGATTTTTTAATGATGTGAATTTTAAGGGCCTAGGGTTCGCTTTTCGGGCTCCAGTCTTCTATCGGTGCCCAAAATTTCTTCCCCTTTCGTATGAATTTAGGAGCTTCGGACTCTGTCTTGTCGTCGATTTTTCTTGATCTCATCGCGCTGGATGTTCAAATTTCCTTGATTTTTCCAATTGGGTCGGTGGGTTTCTCCGGTTTCTGTCATCTGAGGCGGTGAATTTCTTCTCCCTTCTTTTGCTGGTGGCTGGTTCGTTTAATTTTGAAGTTTTCTCTTATGGGTTTTGCTTTGGAGCTTTGTTTGTGATGGTTTTTATGCATGTCCTTGAGGTGATTTTACTCAACCAGCAGCTGAATGCATACTTTGCTCTTGCAGCACTTGTTGATCATAGTTGGAGATTCTTAACGGATAGGGTTTGAGTTCTTAGTAAGTTCTGCATTCTGTTTTTGCAGAAATTTTGTGCTGAAGTGGGTGAGTTTTAGGGTTTAATTGTTGGTCCGGAAAAAATTAGAGGTTCTGGACGGCTGTGATTTTGATCTTAAAGCCGTCTCTGAGGGGTTTTAGAATGATGGCATTATTTCAACATCAGAATGAGAATTCAGGTTCTTTACTTCTATATCATGGCCTATGGGTACCGGTCTCTGAGATTTTTAACTGATGCCAGGGCATTGGGGCTCTGTTTTGGCGAATGTTTGTGTTCTAACGGTTCCACAAGCAAGTTGAAAGGTCTCATTGCTTGAGGAAGTTCGCTTAATGCAATTTTCTCCCGGTTCTTCTTGTAAATGATGCTTCACTCCGTTGGAGCTGCTAGACAAACTTGTAGTTTACTTGCTGTTACCTGCGGAAGTGTACCTAAAGTAAAATACGAGGAGGATGTTGCTGTGGATAAGTTGAAATATCCCTTTCCAGAATTAGTTTCTTCCGGACGATTGGAGGTGCAAAGCTGCATTAGTTGTCATTTATCTGCAATTTTCCTACTCAATTAAAATGGTTCTTGTCAAAATATTTAACTGCTTGAATGTGTAGGTTCGAGTTTTGACAAATCCAAGCAAGGATGAATTTAGTAGAATTGTAGAATCATGTCAACCGAGCTTTGTCTACTTGCAAGGGGAACAACTTGAAAATGATGAAATTGGGTCTTTGGTT
TGGAATGGTGTTGATTTGTCTCTTGAAGATTTATGCGGACTATTCAATACTGCATTACCATCCACCGTATGCTCTGTCTTCCTTATATAATTGATGAATCAGTACTTTACATAACCATATCGTTTTTTCTTCTGCCCATTCACTCCTTTGTGATGGGGTTGTAGGAAGGGGTCGTTAGAATAGATTCAGTTTTTGATGACTTACTTCATCTATTTTGGTCAGGTGTATTTAGAAATCCCAGATGGAGGCGAAATAGCAGATGCTCTTCATTCTAAGGTGACTTCTATGAGGTTCTTAGAACACCACTTCAATTGCATTATGATGTTGATATGAATAAGGATGGCAGCCTGCCTGTCTAACATTCTTGCTTGATATCTTCTTTGTGAAAGAATGTTGATTGAACCAGATTCATCATTGTGCAGGGAATTCCCTATGTCATTTATTGGAACAACACATTTTCAAGTTATGCTGCAGCTCATTTTCGTAATGCATTGCTTTCAGTGGTGCAGAGGTAACATACTTGAAGGCTTTGCACTACTTGCTATTAGACAGACTAAAATTGTGTGGTTTTACGTGTCTATTTACCTTCTTCATGGATTCAAGTGAATACTTATCTTGACATTTTAGAGTAGAATAGAAGTTACTTTGTGATATATATATATATATTGTGCTTCTTATATGAGATCACCCTACTATGGAAGTTGTTTTGAGTAGTGTTTTTAAAAGCGTGAAGGAAATTAAAGAGCAATGGCCCTTTTAGGGCTTAAGTGCAAGGAGCAAAAAAAAAGGGTGAGGACTTTAATGAAGTGAAGTGCATGATATATAGTGAATATAATTTTAAAAATCAACAACAACATGAAAAACAGAATTCTCATGTAAAATGAATTAAGTCTCTCAAGGAGGATTCCCATGAAAAAAAAACACAGTCGTGTTTGCCCATGCATCTTATTTCTAGCTATCTACATGTCATTCTTGTACAAAACACAAACCAACTCGTTAGTGACACTGGTAGGCCCTCCCCCTCCCTACTTGTTGAGGGCACTTATTTTTGCATATCTTGCGTTTCTTTTACTTTTCTGTATGAACTTTTGTGCTTGTGCGTCTGCAGAGGCTGATAAATATGATTTTTATGTTTTATTTCACTTGTGGGCTGCAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACATGCTGCTTTTAGGCTTCATTGTGTGGGGAGCAATTATGCCCTTCCCGGGAATGCTGACGATATTAGGAATGATCTAGAGCCTCAGCTTATTGGGGAACCTCTAAAGATTAACGTAGAACCTCCCGAGATAGATGCAGGTGAAGATGAAGATGGTTCTTTAGAAACCCTCCCTGTCTTAAGTATACATGATAATAATGTGACCGTGAGATTTCTTATCTGTGGAATGTCTGGCACACCGGTAAGCAGTTTTAAGACGTTCATCTTATATTCATGTGCCATTATGTACACCTGCATATAGGTGTGCTGGTACGTGTACCTTTTCGATTATAAGTTATTTTACATGGTTTTCCTTATAGTTTTCATGCTTTATAGGATGCATGCTTATTGAGATCATTGGAGGATGGCCTTAATGCCCTTTTGAACATTGAAGTAAGTGTCATTCAGAATTTTGCTTGTTTTATTTATGGCTTGTAGGAGTTGGTCAATGACATTGTATGATATTATTTGTTAACATTTTCTAATGAAAGAACGGAAAAAATGTCTATGAACTGTTGTAACTTAAGTTGTTTTGGGGGCATTCCCTTCTTAACTGTCTTAGTTTTTGGTTTTTATGGGTGATACAGATACGTGGGAGTAAACTTCAGGGAAAGTTCAGGTAAGCTTCTTCTTGGAAAGAACTTATGTTGAAATATATTGGAATATTTTCCGTTTGAACACTTGAACCACTTTTCACATATGTTAACTCTATATCTAGGCTATGCCTTCTTGGTAGGAATTAAGTCTTCGGTTCAATGAATGTCAATTG
TACATGCTTTTAATACCTTTAAAAATATGGTTCAGTGCTCCTCCGCCACCTCTTCAAGCAGGATCCTTTTCTCGTGGTGTTGTGACAATGCGATGTGATATAGTGACCTGTAGTTCAGCCCACATCTCAATATTGGTGTCCGGTAGTGCTCATACTTGTTTTGATGATCAGGTTTGTTTCAAATTTGAACTATTTGACTTGTTTCTTATTGTTAATTTTTTCAAATAAGAGGAGTCTAATATCTGGTTGCTACTAATTTTTTCAGTTGTTGGAGAAACATATCAAACATGAGATTATTGAAAATAGCCAATTAGTTCATGCCATGCATGATTGTGAGGGCAACAAACATCACATGCACGAGCCTCGAAAATCTGCTTCAGTTGCTTGCGGGGCAACTGTATTTGAGGTTTCCATGAAGGTTCCCGCTTGGGCATCACAGGTCCCCCAACCTACCTGTTTCCCCATCCTCACTTAAATTAAGCACTCTAGAGTAAACTGTACATTGACTTATTCTTATTAAAAACAACGAACTTGATTGATTCATATTATCACTGAAATGTTGTTCTCTCTGGTTGAGGTTAAATTTATCAAAAGATTTGAATTTGGTTGAATAATTTGGAGTCATTTCATGTACGATATATTAATATCAATTTGATTGTCATGGATGCTGTTTCAGGTTTTGAGGCAACTAGCACCTGATATATCGTATCGGAGTTTAGTTGCACTTGGCATTGGGGGAGTTCAGGGTTTGCCTGTTGCTTCTTTTGAGAAAGAGGATGCTGAGCGGTTGCTCTTCTTCTATTCAAGGGATGAGAATGATAAACATTCAGATCAGTTGCTTGTAAGTGTATTGCCCAACTGGTTTAAACCACCTACTCCTAGTAGAAAGAGAGGGGAACCAAGCCAGGGAATAAGGAACACTCTTTCACTTGACAGTCTTGCATATGCAAATATTCCTTCCATTAGAAAAGTAGGTGGAGAGGAGCCTGCACCAATGAATGGGTTTAAGACACCCTTACTCCCAGCTAGGAAAAGATTAAAAGTAGCTACCATGAGGCCTATTCCACGTGTGCATAGGAATAAAATGACACCATTCTCTGGATTGGCTGAAGCAGATGGGAATAATGGAGGCCAACCCAAGGCTAGTTTGCCCGTCATTACCCCGTCAAAGCATGTAACTGTAGGATCAACTTCTGCAACACAAAGAAAATCTTTTTCAAGCTCATCTCAGTCTAAGCAGCAGATTGTTCCCTTAAATCCACTTCCTTTAAAGAAACATGGTTGTGGAAGAAACCCAATTCAAGATTGCTCTGAGGTAAGATACTATACTACAATCATTAAGGTTTAAACTTACAAGGGTTGGTGGATATCAAGTTATTGGTATGGTAGTATATTTTGGAAGTACAAAAATTTTCTGCAAGAAAACTTGAAGTCTATCAGATCAATGTGGAAATCCACGACTAAAATATGGATTTCATTCTACTGTTATTAATATTTCTTTCCCACCCTCATTGCAGGAGGAGTTCTTGAAGGATGTTATGGAGTTTTTACTACTTAGAGGACATTCGCGACTTATTCCTCAAGGTGGACTTGAGGAGTTCCCAGATGCCATACTCAATGGGAAGCGTCTTGATCTCTATAACTTGTATAAGGAGGTATGGTCATTGAATATCCAGAATAGAACATTGATTCCTCTTGTTTGGTACTTTAACTGGAAAGACCCATATTGAAGGTGTTGATCCGCTTTGTAGGTGGTCACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAATTGGAAGGGGCAGATCTTCTCTAAGATGCACAATTACACAATGACCAATAGAATGACTGTATGTACAGCAAGTCTTTTTTCATGGATTTGTAACCCTTCTGATGCTATTTAGTATTTTCTTAGCTTAAACTACAGGAATCTATTGGTAGATGACAGCTCTTTCGATTTAAACTAATTTCATATGTTGTAAGAAGTTGTTCCTCTATGT
TCTCACAGGGTGTTGGAAATACACTGAAAAGACATTATGAGACTTACCTTCTAGAATATGAATTGGCTCATGATGATGTAGATGGAGAATGCTGCCTTTTGTGTCACAGGTTTGTCAATTTGTTCTACTGATTTTGTTCATTCACATGGTGAAAGATGATTTTGAATTGTAGTAAGATCACTTCCTTACCATGGGATTTGCAACTGTTCAACAGTAGTGCAGCAGGGGATTGGGTGAACTGCGGTATTTGTGGTGAATGGGCCCATTTCGGGTGCGACCGGAGGCAGGGTCTCGGTGCATTTAAGGTATAATTTCCGATCATGATGAGACCAATTCTGGGAAAATCTTGAGTTTGTGAAACTTGAAAGTGAAAGGTGTGCTATCTGGTTAATCTTTTCTCTGTGAATCTGTTTCTTTACTTGCAGGATTATGCCAAAACAGATGGGTTAGAGTATGTTTGTCCACACTGTAGCATTACAACTTACAAGAAGAAACCACACAGAGTAGCAAACGGGTCTCCGCAAGGAATAACGAATCCACGGATACCTTGAATCAGTTTTGGTCTTCGTCTCGAGGTTTTGGCTGCGTTATCATTTTTTTTTGCCTATTCTTATCCTACACGAGAAATTATTCGTTCTATCGGCTCGTAGATGAGTTAAATTTGAGGAGGTGTGCTGCTAAACCATTAGTCGAAGGGTAAATTTATGAGACTGAGGATGAAAAGGAAGAGAGCCATAGGTTATTGTTTTTTGACCTCTTTCTCTGTTGCTAGGTAAGTAGCTTAGGAGAAAAGTTAGTAGCAGGCAGGCATGGCAGTTAATTTTAGTAGGGTGCTGTAAAATACTTCTATTCTTGTAGGCTAAGCCATGACTGATGTATTTTATTAATCAATGAAGATATCTAATTTACCATTTTTTATTGTATAATATGATCATCC
mRNA sequence
ATGATGCTTCACTCCGTTGGAGCTGCTAGACAAACTTGTAGTTTACTTGCTGTTACCTGCGGAAGTGTACCTAAAGTAAAATACGAGGAGGATGTTGCTGTGGATAAGTTGAAATATCCCTTTCCAGAATTAGTTTCTTCCGGACGATTGGAGGTTCGAGTTTTGACAAATCCAAGCAAGGATGAATTTAGTAGAATTGTAGAATCATGTCAACCGAGCTTTGTCTACTTGCAAGGGGAACAACTTGAAAATGATGAAATTGGGTCTTTGGTTTGGAATGGTGTTGATTTGTCTCTTGAAGATTTATGCGGACTATTCAATACTGCATTACCATCCACCGTGTATTTAGAAATCCCAGATGGAGGCGAAATAGCAGATGCTCTTCATTCTAAGGGAATTCCCTATGTCATTTATTGGAACAACACATTTTCAAGTTATGCTGCAGCTCATTTTCGTAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACATGCTGCTTTTAGGCTTCATTGTGTGGGGAGCAATTATGCCCTTCCCGGGAATGCTGACGATATTAGGAATGATCTAGAGCCTCAGCTTATTGGGGAACCTCTAAAGATTAACGTAGAACCTCCCGAGATAGATGCAGGTGAAGATGAAGATGGTTCTTTAGAAACCCTCCCTGTCTTAAGTATACATGATAATAATGTGACCGTGAGATTTCTTATCTGTGGAATGTCTGGCACACCGGATGCATGCTTATTGAGATCATTGGAGGATGGCCTTAATGCCCTTTTGAACATTGAAATACGTGGGAGTAAACTTCAGGGAAAGTTCAGTGCTCCTCCGCCACCTCTTCAAGCAGGATCCTTTTCTCGTGGTGTTGTGACAATGCGATGTGATATAGTGACCTGTAGTTCAGCCCACATCTCAATATTGGTGTCCGGTAGTGCTCATACTTGTTTTGATGATCAGTTGTTGGAGAAACATATCAAACATGAGATTATTGAAAATAGCCAATTAGTTCATGCCATGCATGATTGTGAGGGCAACAAACATCACATGCACGAGCCTCGAAAATCTGCTTCAGTTGCTTGCGGGGCAACTGTATTTGAGGTTTCCATGAAGGTTCCCGCTTGGGCATCACAGGTTTTGAGGCAACTAGCACCTGATATATCGTATCGGAGTTTAGTTGCACTTGGCATTGGGGGAGTTCAGGGTTTGCCTGTTGCTTCTTTTGAGAAAGAGGATGCTGAGCGGTTGCTCTTCTTCTATTCAAGGGATGAGAATGATAAACATTCAGATCAGTTGCTTGTAAGTGTATTGCCCAACTGGTTTAAACCACCTACTCCTAGTAGAAAGAGAGGGGAACCAAGCCAGGGAATAAGGAACACTCTTTCACTTGACAGTCTTGCATATGCAAATATTCCTTCCATTAGAAAAGTAGGTGGAGAGGAGCCTGCACCAATGAATGGGTTTAAGACACCCTTACTCCCAGCTAGGAAAAGATTAAAAGTAGCTACCATGAGGCCTATTCCACGTGTGCATAGGAATAAAATGACACCATTCTCTGGATTGGCTGAAGCAGATGGGAATAATGGAGGCCAACCCAAGGCTAGTTTGCCCGTCATTACCCCGTCAAAGCATGTAACTGTAGGATCAACTTCTGCAACACAAAGAAAATCTTTTTCAAGCTCATCTCAGTCTAAGCAGCAGATTGTTCCCTTAAATCCACTTCCTTTAAAGAAACATGGTTGTGGAAGAAACCCAATTCAAGATTGCTCTGAGGAGGAGTTCTTGAAGGATGTTATGGAGTTTTTACTACTTAGAGGACATTCGCGACTTATTCCTCAAGGTGGACTTGAGGAGTTCCCAGATGCCATACTCAATGGGAAGCGTCTTGATCTCTATAACTTGTATAAGGAGGTGGTCACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAATTGGAAGGGGCAGATCTTCTCTAA
GATGCACAATTACACAATGACCAATAGAATGACTGGTGTTGGAAATACACTGAAAAGACATTATGAGACTTACCTTCTAGAATATGAATTGGCTCATGATGATGTAGATGGAGAATGCTGCCTTTTGTGTCACAGTAGTGCAGCAGGGGATTGGGTGAACTGCGGTATTTGTGGTGAATGGGCCCATTTCGGGTGCGACCGGAGGCAGGGTCTCGGTGCATTTAAGGATTATGCCAAAACAGATGGGTTAGAGTATGTTTGTCCACACTGTAGCATTACAACTTACAAGAAGAAACCACACAGAGTAGCAAACGGGTCTCCGCAAGGAATAACGAATCCACGGATACCTTGA
Coding sequence (CDS)
ATGATGCTTCACTCCGTTGGAGCTGCTAGACAAACTTGTAGTTTACTTGCTGTTACCTGCGGAAGTGTACCTAAAGTAAAATACGAGGAGGATGTTGCTGTGGATAAGTTGAAATATCCCTTTCCAGAATTAGTTTCTTCCGGACGATTGGAGGTTCGAGTTTTGACAAATCCAAGCAAGGATGAATTTAGTAGAATTGTAGAATCATGTCAACCGAGCTTTGTCTACTTGCAAGGGGAACAACTTGAAAATGATGAAATTGGGTCTTTGGTTTGGAATGGTGTTGATTTGTCTCTTGAAGATTTATGCGGACTATTCAATACTGCATTACCATCCACCGTGTATTTAGAAATCCCAGATGGAGGCGAAATAGCAGATGCTCTTCATTCTAAGGGAATTCCCTATGTCATTTATTGGAACAACACATTTTCAAGTTATGCTGCAGCTCATTTTCGTAATGCATTGCTTTCAGTGGTGCAGAGTTCATCTACTCATACATGGGATGCTTTTCAGCTTGCACATGCTGCTTTTAGGCTTCATTGTGTGGGGAGCAATTATGCCCTTCCCGGGAATGCTGACGATATTAGGAATGATCTAGAGCCTCAGCTTATTGGGGAACCTCTAAAGATTAACGTAGAACCTCCCGAGATAGATGCAGGTGAAGATGAAGATGGTTCTTTAGAAACCCTCCCTGTCTTAAGTATACATGATAATAATGTGACCGTGAGATTTCTTATCTGTGGAATGTCTGGCACACCGGATGCATGCTTATTGAGATCATTGGAGGATGGCCTTAATGCCCTTTTGAACATTGAAATACGTGGGAGTAAACTTCAGGGAAAGTTCAGTGCTCCTCCGCCACCTCTTCAAGCAGGATCCTTTTCTCGTGGTGTTGTGACAATGCGATGTGATATAGTGACCTGTAGTTCAGCCCACATCTCAATATTGGTGTCCGGTAGTGCTCATACTTGTTTTGATGATCAGTTGTTGGAGAAACATATCAAACATGAGATTATTGAAAATAGCCAATTAGTTCATGCCATGCATGATTGTGAGGGCAACAAACATCACATGCACGAGCCTCGAAAATCTGCTTCAGTTGCTTGCGGGGCAACTGTATTTGAGGTTTCCATGAAGGTTCCCGCTTGGGCATCACAGGTTTTGAGGCAACTAGCACCTGATATATCGTATCGGAGTTTAGTTGCACTTGGCATTGGGGGAGTTCAGGGTTTGCCTGTTGCTTCTTTTGAGAAAGAGGATGCTGAGCGGTTGCTCTTCTTCTATTCAAGGGATGAGAATGATAAACATTCAGATCAGTTGCTTGTAAGTGTATTGCCCAACTGGTTTAAACCACCTACTCCTAGTAGAAAGAGAGGGGAACCAAGCCAGGGAATAAGGAACACTCTTTCACTTGACAGTCTTGCATATGCAAATATTCCTTCCATTAGAAAAGTAGGTGGAGAGGAGCCTGCACCAATGAATGGGTTTAAGACACCCTTACTCCCAGCTAGGAAAAGATTAAAAGTAGCTACCATGAGGCCTATTCCACGTGTGCATAGGAATAAAATGACACCATTCTCTGGATTGGCTGAAGCAGATGGGAATAATGGAGGCCAACCCAAGGCTAGTTTGCCCGTCATTACCCCGTCAAAGCATGTAACTGTAGGATCAACTTCTGCAACACAAAGAAAATCTTTTTCAAGCTCATCTCAGTCTAAGCAGCAGATTGTTCCCTTAAATCCACTTCCTTTAAAGAAACATGGTTGTGGAAGAAACCCAATTCAAGATTGCTCTGAGGAGGAGTTCTTGAAGGATGTTATGGAGTTTTTACTACTTAGAGGACATTCGCGACTTATTCCTCAAGGTGGACTTGAGGAGTTCCCAGATGCCATACTCAATGGGAAGCGTCTTGATCTCTATAACTTGTATAAGGAGGTGGTCACCCGAGGAGGCTTTCATGTCGGCAATGGTATCAATTGGAAGGGGCAGATCTTCTCTAA
GATGCACAATTACACAATGACCAATAGAATGACTGGTGTTGGAAATACACTGAAAAGACATTATGAGACTTACCTTCTAGAATATGAATTGGCTCATGATGATGTAGATGGAGAATGCTGCCTTTTGTGTCACAGTAGTGCAGCAGGGGATTGGGTGAACTGCGGTATTTGTGGTGAATGGGCCCATTTCGGGTGCGACCGGAGGCAGGGTCTCGGTGCATTTAAGGATTATGCCAAAACAGATGGGTTAGAGTATGTTTGTCCACACTGTAGCATTACAACTTACAAGAAGAAACCACACAGAGTAGCAAACGGGTCTCCGCAAGGAATAACGAATCCACGGATACCTTGA
Protein sequence
MMLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKDEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDGGEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHCVGSNYALPGNADDIRNDLEPQLIGEPLKINVEPPEIDAGEDEDGSLETLPVLSIHDNNVTVRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDAERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYANIPSIRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNGGQPKASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPRIP
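The protein sequence above corresponds to a translation of the coding sequence. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1). The `translate` helper and the `CODON_TABLE` name are illustrative and not part of the database; the short test strings are the first and last codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in pasted sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGATGCTTCACTCCGTTGGAGCTGCTAGA"))  # start of the CDS
    print(translate("AATCCACGGATACCTTGA"))              # end of the CDS
```

Applied to the full CDS (with its line breaks joined), this should reproduce the listed protein, provided the sequence was copied without gaps.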
Homology
BLAST of Spg023560 vs. NCBI nr
Match:
XP_022949406.1 (AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita moschata] >XP_022949407.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita moschata] >XP_022949408.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita moschata] >XP_022949411.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita moschata])
HSP 1 Score: 1499.6 bits (3881), Expect = 0.0e+00
Identity = 716/782 (91.56%), Positives = 751/782 (96.04%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKD 61
MLHS+GAARQTCSLLAVTCG +PKVK EEDVA LKYPFPELVSSGRLEV+VLTNPSK+
Sbjct: 1 MLHSIGAARQTCSLLAVTCGRLPKVKCEEDVAEHNLKYPFPELVSSGRLEVQVLTNPSKN 60
Query: 62 EFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDG 121
EF RIVESCQPSFVYLQGEQLENDE+GSLVWNGVDLSLEDLCGLF+TALP+TVYLE+P+G
Sbjct: 61 EFGRIVESCQPSFVYLQGEQLENDELGSLVWNGVDLSLEDLCGLFDTALPTTVYLELPNG 120
Query: 122 GEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 181
G+IA+ LHSKGIPYVIYWNNTFS YAAAHFRNALLSVV+SSSTHTWDAFQLAHAAFRLHC
Sbjct: 121 GKIAETLHSKGIPYVIYWNNTFSCYAAAHFRNALLSVVESSSTHTWDAFQLAHAAFRLHC 180
Query: 182 VGSNYALPGNADDIRNDLEPQLIGEPLKINVEPPEIDAGEDEDGSLETLPVLSIHDNNVT 241
VG NYALPGNADD R+DLEPQLIGEP KIN+EPPE+DAGEDED SLE +PV+S+HDNNVT
Sbjct: 181 VGRNYALPGNADDFRSDLEPQLIGEPPKINIEPPELDAGEDEDASLEAVPVISVHDNNVT 240
Query: 242 VRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM 301
+R LICG+ TPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
Sbjct: 241 MRLLICGLPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM 300
Query: 302 RCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHHMHEP 361
RCDIVTCSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCEGNKHHMH+P
Sbjct: 301 RCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHVMHDCEGNKHHMHKP 360
Query: 362 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDA 421
RKSASVACGATVFEVSMKVPAWASQVLRQLAPD+S+RSLVALGIGGVQG PVASFEKEDA
Sbjct: 361 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSHRSLVALGIGGVQGFPVASFEKEDA 420
Query: 422 ERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYANIPS 481
ERLLFF SRDENDKHSDQLLVSVLP+WFKPPTPSRKR EPSQG+RNTLS DSLAYANIPS
Sbjct: 421 ERLLFFCSRDENDKHSDQLLVSVLPHWFKPPTPSRKRVEPSQGMRNTLSHDSLAYANIPS 480
Query: 482 IRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNGGQPK 541
+R+VG EEPAPMNGFK PLLPARKRLKVATM+PIP VHRNKM FSG E DGN+GGQPK
Sbjct: 481 VRRVGREEPAPMNGFKAPLLPARKRLKVATMKPIPLVHRNKMKLFSGWTEGDGNSGGQPK 540
Query: 542 ASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDCSEEE 601
ASLP +TPSKHVTVGSTSATQRKSFSSSSQSKQQI+PLNPLPLKKHGCGRNP+QDCSEEE
Sbjct: 541 ASLPAVTPSKHVTVGSTSATQRKSFSSSSQSKQQIIPLNPLPLKKHGCGRNPVQDCSEEE 600
Query: 602 FLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWK 661
FLKDVMEFLLLRGHSRLIPQGG+EEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWK
Sbjct: 601 FLKDVMEFLLLRGHSRLIPQGGVEEFPDAVLNGKRLDLYNLYKEVVTRGGFHVGNGINWK 660
Query: 662 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 721
GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC
Sbjct: 661 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 720
Query: 722 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPR 781
GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH VANGSPQGITNPR
Sbjct: 721 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHSVANGSPQGITNPR 780
Query: 782 IP 784
IP
Sbjct: 781 IP 782
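The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the summary text of the hit above; `hsp_percentages` is a hypothetical helper written for illustration, and the regular expression assumes the summary layout used on this page.

```python
import re


def hsp_percentages(summary: str) -> dict:
    """Recompute identity/positives percentages from an HSP summary line."""
    result = {}
    # "Posi?tives" also accepts the misspelling "Postives" seen in some reports.
    pattern = r"(Identity|Posi?tives) = (\d+)/(\d+)"
    for label, hits, length in re.findall(pattern, summary):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100.0 * int(hits) / int(length), 2)
    return result


if __name__ == "__main__":
    line = "Identity = 716/782 (91.56%), Positives = 751/782 (96.04%), Query Frame = 0"
    print(hsp_percentages(line))
```

For this hit, 716/782 gives 91.56% identity and 751/782 gives 96.04% positives, matching the reported values.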
BLAST of Spg023560 vs. NCBI nr
Match:
XP_023524673.1 (AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023524674.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023524675.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023524676.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1496.9 bits (3874), Expect = 0.0e+00
Identity = 717/782 (91.69%), Positives = 748/782 (95.65%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKD 61
MLHS+GAARQTCSLLAVTCG +PKVK EEDVA LKYPFPEL SSGRLEV+VLTNPSK+
Sbjct: 1 MLHSIGAARQTCSLLAVTCGRLPKVKCEEDVAEHNLKYPFPELASSGRLEVQVLTNPSKN 60
Query: 62 EFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDG 121
EF RIVESCQPSFVYLQGEQLENDE+GSLVWNGVDLSLEDLCGLF+TALP+TVYLE+P+G
Sbjct: 61 EFGRIVESCQPSFVYLQGEQLENDELGSLVWNGVDLSLEDLCGLFDTALPTTVYLELPNG 120
Query: 122 GEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 181
G IA+ LHSKGIPYVIYWNNTFS YAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC
Sbjct: 121 GIIAETLHSKGIPYVIYWNNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 180
Query: 182 VGSNYALPGNADDIRNDLEPQLIGEPLKINVEPPEIDAGEDEDGSLETLPVLSIHDNNVT 241
VG NYALPGNADD R+DLEPQLIGEP KINVEPPE+DAGEDED SLE LPV+S+HDNNVT
Sbjct: 181 VGRNYALPGNADDFRSDLEPQLIGEPPKINVEPPELDAGEDEDASLEALPVISVHDNNVT 240
Query: 242 VRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM 301
+R LICG+ TPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
Sbjct: 241 MRLLICGLPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM 300
Query: 302 RCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHHMHEP 361
RCDIVTCSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCEGNKHHMH+P
Sbjct: 301 RCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHVMHDCEGNKHHMHKP 360
Query: 362 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDA 421
RKSASVACGATVFEVSMKVPAWASQVLRQLAPD+S+RSLVALGIGGVQG PVASFEKEDA
Sbjct: 361 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSHRSLVALGIGGVQGFPVASFEKEDA 420
Query: 422 ERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYANIPS 481
ERLLFF SRDENDKHSDQLLVSVLP+WFKPPTPSRKR EPSQG+RNTLS DSLAYANIPS
Sbjct: 421 ERLLFFCSRDENDKHSDQLLVSVLPHWFKPPTPSRKRVEPSQGMRNTLSHDSLAYANIPS 480
Query: 482 IRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNGGQPK 541
+R+VG EEPAPMNGFK PLLPARKRLKVATMRPIP VHRNKM FSG E DGN+GGQPK
Sbjct: 481 VRRVGREEPAPMNGFKAPLLPARKRLKVATMRPIPLVHRNKMKLFSGWTEGDGNSGGQPK 540
Query: 542 ASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDCSEEE 601
ASLP +TPSKHVTVGSTSATQRKSFSSSSQSKQQI+PLNPLPLKKHGCGRNP+QDCSEEE
Sbjct: 541 ASLPAVTPSKHVTVGSTSATQRKSFSSSSQSKQQIIPLNPLPLKKHGCGRNPVQDCSEEE 600
Query: 602 FLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWK 661
FLKDVMEFLLLRGHSRLIPQGG+EEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWK
Sbjct: 601 FLKDVMEFLLLRGHSRLIPQGGVEEFPDAVLNGKRLDLYNLYKEVVTRGGFHVGNGINWK 660
Query: 662 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 721
GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNC
Sbjct: 661 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNC 720
Query: 722 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPR 781
GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH VANGSPQGITNPR
Sbjct: 721 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHSVANGSPQGITNPR 780
Query: 782 IP 784
+P
Sbjct: 781 VP 782
BLAST of Spg023560 vs. NCBI nr
Match:
XP_022998193.1 (AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita maxima] >XP_022998194.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita maxima] >XP_022998195.1 AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita maxima])
HSP 1 Score: 1488.0 bits (3851), Expect = 0.0e+00
Identity = 714/782 (91.30%), Positives = 745/782 (95.27%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKD 61
MLHS+GAARQTCSLLAVTCG +PKVK EEDVA LKYPFPELVSSGRLEV+VLTNPSK+
Sbjct: 1 MLHSIGAARQTCSLLAVTCGRLPKVKCEEDVAEHNLKYPFPELVSSGRLEVQVLTNPSKN 60
Query: 62 EFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDG 121
EFSRIVESCQPSFVYLQGEQLENDE+GSLVWNGVDLSLEDLCGLFNTALP+TVYLE+P+G
Sbjct: 61 EFSRIVESCQPSFVYLQGEQLENDELGSLVWNGVDLSLEDLCGLFNTALPTTVYLELPNG 120
Query: 122 GEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 181
G IA+ LHSKGIPYVIYWNNTFS YAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRL C
Sbjct: 121 GIIAETLHSKGIPYVIYWNNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLQC 180
Query: 182 VGSNYALPGNADDIRNDLEPQLIGEPLKINVEPPEIDAGEDEDGSLETLPVLSIHDNNVT 241
+G NYALPGNAD+ R+DLEPQLIGEP KI VEPPE+DAG DED SLE LPV+S+HDNNVT
Sbjct: 181 MGRNYALPGNADNFRSDLEPQLIGEPPKIIVEPPELDAGADEDASLEALPVISVHDNNVT 240
Query: 242 VRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM 301
+R LICG+ TPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQA SFSRGVVTM
Sbjct: 241 MRLLICGLPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAESFSRGVVTM 300
Query: 302 RCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHHMHEP 361
RCDIVTCSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCEGNKHHMH+P
Sbjct: 301 RCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHVMHDCEGNKHHMHKP 360
Query: 362 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDA 421
RKSASVACGATVFEVSMKVPAWASQVLRQLAPD+S+RSLVALGIGGVQG PVASFEKEDA
Sbjct: 361 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSHRSLVALGIGGVQGFPVASFEKEDA 420
Query: 422 ERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYANIPS 481
ERLLFF SRDENDKHSDQLLVSVLPNWFKPPTPSRKR EPSQGIRN L DSLAYANIPS
Sbjct: 421 ERLLFFCSRDENDKHSDQLLVSVLPNWFKPPTPSRKRVEPSQGIRNALLHDSLAYANIPS 480
Query: 482 IRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNGGQPK 541
+R+VG EEPAPMNGFK PLLPARKRLKVATMRPIP VHRNKM FSG E DGNNG QPK
Sbjct: 481 VRRVGREEPAPMNGFKAPLLPARKRLKVATMRPIPLVHRNKMKLFSGWTEGDGNNGSQPK 540
Query: 542 ASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDCSEEE 601
ASLPV+TPSKHVT+GSTSATQRKSFSSSSQSKQQI+PLNPLPLKKHGCGRNP+QDCSEEE
Sbjct: 541 ASLPVVTPSKHVTIGSTSATQRKSFSSSSQSKQQIIPLNPLPLKKHGCGRNPVQDCSEEE 600
Query: 602 FLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWK 661
FLKDVMEFLLLRGHSRLIPQGG+EEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWK
Sbjct: 601 FLKDVMEFLLLRGHSRLIPQGGVEEFPDAVLNGKRLDLYNLYKEVVTRGGFHVGNGINWK 660
Query: 662 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 721
GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC
Sbjct: 661 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 720
Query: 722 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPR 781
GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANGSPQGITNPR
Sbjct: 721 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHSLANGSPQGITNPR 780
Query: 782 IP 784
+P
Sbjct: 781 LP 782
BLAST of Spg023560 vs. NCBI nr
Match:
KAG7036396.1 (AT-rich interactive domain-containing protein 4, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1479.5 bits (3829), Expect = 0.0e+00
Identity = 716/790 (90.63%), Positives = 746/790 (94.43%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKD 61
MLHS+GAARQTCSLLAVTCG +PKVK EEDVA LKYPFPELVSSGRLEV+VLTNPS +
Sbjct: 1 MLHSIGAARQTCSLLAVTCGRLPKVKCEEDVAEHNLKYPFPELVSSGRLEVQVLTNPSTN 60
Query: 62 EFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDG 121
EF RIVESCQPSFVYLQGEQLENDE+GSLVWNGVDLSLEDL GLF+TALP+TVYLE+P+G
Sbjct: 61 EFGRIVESCQPSFVYLQGEQLENDELGSLVWNGVDLSLEDLSGLFDTALPTTVYLELPNG 120
Query: 122 GEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 181
G IA+ LHSKGIPYVIYWNNTFS YAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC
Sbjct: 121 GIIAETLHSKGIPYVIYWNNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 180
Query: 182 VGSNYALPGNADDIRNDLEPQLIGEPLKINVEPPEIDAGEDEDGSLETLPVLSIHDNNVT 241
VG NYALPGNADD R+DLEPQLIGEP KINVEPPE+DAGEDED SLE LPV+S+HDNNVT
Sbjct: 181 VGRNYALPGNADDFRSDLEPQLIGEPPKINVEPPELDAGEDEDASLEALPVISVHDNNVT 240
Query: 242 VRFLICGMSGTPDACLLRSLEDGLNALLNIE--------IRGSKLQGKFSAPPPPLQAGS 301
+R LICG+ TPDACLLRSLEDGLNALLNIE IRGSKLQGKFSAPPPPLQAGS
Sbjct: 241 MRLLICGLPCTPDACLLRSLEDGLNALLNIEELTNDILYIRGSKLQGKFSAPPPPLQAGS 300
Query: 302 FSRGVVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEG 361
FSRGVVTMRCDIVTCSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCEG
Sbjct: 301 FSRGVVTMRCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHVMHDCEG 360
Query: 362 NKHHMHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPV 421
NKHHMH+PRKSASVACGATVFEVSMKVPAWASQVLRQLAPD+S+RSLVALGIGGVQG PV
Sbjct: 361 NKHHMHKPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSHRSLVALGIGGVQGFPV 420
Query: 422 ASFEKEDAERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDS 481
ASFEKEDAERLLFF SRDENDKHSDQLLVSVLP+WFKPPTPSRKR EPSQG+RNTLS DS
Sbjct: 421 ASFEKEDAERLLFFCSRDENDKHSDQLLVSVLPHWFKPPTPSRKRVEPSQGMRNTLSHDS 480
Query: 482 LAYANIPSIRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEAD 541
LAYANIPS+R+VG EEPAPMNGFK PLLPARKRLKVATMRPIP VHRNKM FSG E D
Sbjct: 481 LAYANIPSVRRVGREEPAPMNGFKAPLLPARKRLKVATMRPIPLVHRNKMKLFSGWTEGD 540
Query: 542 GNNGGQPKASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNP 601
GN+GGQPKASLP +TPSKHVTVGSTSATQRKSFSSSSQSKQQI+PLNPLPLKKHGCGRNP
Sbjct: 541 GNSGGQPKASLPAVTPSKHVTVGSTSATQRKSFSSSSQSKQQIIPLNPLPLKKHGCGRNP 600
Query: 602 IQDCSEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFH 661
+QDCSEEEFLKDVMEFLLLRGHSRLIPQGG+EEFPDA+LNGKRLDLYNLYKEVVTRGGFH
Sbjct: 601 VQDCSEEEFLKDVMEFLLLRGHSRLIPQGGVEEFPDAVLNGKRLDLYNLYKEVVTRGGFH 660
Query: 662 VGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSS 721
VGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCH S
Sbjct: 661 VGNGINWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCH-S 720
Query: 722 AAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGS 781
AAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY CPHCSITTYKKKPH VANGS
Sbjct: 721 AAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYGCPHCSITTYKKKPHSVANGS 780
Query: 782 PQGITNPRIP 784
PQGITNPRIP
Sbjct: 781 PQGITNPRIP 789
BLAST of Spg023560 vs. NCBI nr
Match:
XP_038883881.1 (AT-rich interactive domain-containing protein 4-like [Benincasa hispida] >XP_038883888.1 AT-rich interactive domain-containing protein 4-like [Benincasa hispida])
HSP 1 Score: 1471.1 bits (3807), Expect = 0.0e+00
Identity = 720/785 (91.72%), Positives = 746/785 (95.03%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKD 61
MLHSV AARQTCSLLAVTCGSVPK+K EE+V DKL+YPFPELVSSGRLEVRVL NPSKD
Sbjct: 1 MLHSVVAARQTCSLLAVTCGSVPKIKCEEEVDEDKLRYPFPELVSSGRLEVRVLANPSKD 60
Query: 62 EFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDG 121
EFSRIVES PSFVYLQGEQL NDEIGSLVWNGVDLSLEDLCGLFNT LP+ VYLEIP+G
Sbjct: 61 EFSRIVESYLPSFVYLQGEQLGNDEIGSLVWNGVDLSLEDLCGLFNTTLPTIVYLEIPNG 120
Query: 122 GEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 181
G IA+ALHSKGIPY++YWN+TFS YAAAHFRNALLSVVQSSSTHTWDAFQLA AAF+L+C
Sbjct: 121 GRIAEALHSKGIPYLMYWNSTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLARAAFKLYC 180
Query: 182 VGSNYALPGNADD-IRNDLEPQLIGEPLKINVEPPEIDA--GEDEDGSLETLPVLSIHDN 241
VGSNY LPG ADD I +DLEPQLIGEPLKINVEPPE+DA GED DGSLETLP +SIHDN
Sbjct: 181 VGSNYGLPGIADDSIMSDLEPQLIGEPLKINVEPPEVDAGEGEDGDGSLETLPAISIHDN 240
Query: 242 NVTVRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGV 301
NVTVRFLICG+ TPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGV
Sbjct: 241 NVTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGV 300
Query: 302 VTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHHM 361
VTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEN+QLVHAMHDCEGNKHHM
Sbjct: 301 VTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENNQLVHAMHDCEGNKHHM 360
Query: 362 HEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEK 421
HEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPD+SYRSLVALGIGGVQGLPVASFEK
Sbjct: 361 HEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLVALGIGGVQGLPVASFEK 420
Query: 422 EDAERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYAN 481
EDAERLLFF S DENDKHS+QLLVSVLP+WFKPPTPSRKR EPSQGIR+TLS DSLAYAN
Sbjct: 421 EDAERLLFFCSGDENDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRSTLSHDSLAYAN 480
Query: 482 IPSIRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNGG 541
IPSIR+V EEPAPMNGFK PLLP RKRLKVA+MRP+PRVHRNK+TPFSGLAE D NNG
Sbjct: 481 IPSIRRVAREEPAPMNGFKAPLLPTRKRLKVASMRPVPRVHRNKITPFSGLAEVDWNNGS 540
Query: 542 QPKASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDCS 601
KASLPV+TPSKHVTVGSTSAT RKSFSSSSQSK QI+ LNPLPLKKHGCGRNPIQDCS
Sbjct: 541 LSKASLPVVTPSKHVTVGSTSATHRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCS 600
Query: 602 EEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGI 661
EEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGI
Sbjct: 601 EEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGI 660
Query: 662 NWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDW 721
NWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDW
Sbjct: 661 NWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDW 720
Query: 722 VNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIT 781
VNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIT
Sbjct: 721 VNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIT 780
Query: 782 NPRIP 784
NPRIP
Sbjct: 781 NPRIP 784
BLAST of Spg023560 vs. ExPASy Swiss-Prot
Match:
Q6NQ79 (AT-rich interactive domain-containing protein 4 OS=Arabidopsis thaliana OX=3702 GN=ARID4 PE=1 SV=1)
HSP 1 Score: 915.2 bits (2364), Expect = 4.9e-265
Identity = 461/775 (59.48%), Positives = 562/775 (72.52%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGS-VPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSK 61
M H G +R C+++AV G+ + + D + KYPFP+L SSGRL+ +VL NP+
Sbjct: 1 MFHGQGFSRNRCNVVAVVSGAELCDTNNQIDGTSHQPKYPFPDLSSSGRLKFQVLNNPTP 60
Query: 62 DEFSRIVESCQPSFVYLQGEQL-ENDEIGSLVWNGVDLSLED-LCGLFNTALPSTVYLEI 121
+EF V S FVYLQGE ++DE+G LV D S D L LF + LP+TVYLE+
Sbjct: 61 EEFQVAVNSSATDFVYLQGEHSGDSDEVGPLVLGYTDFSTPDALVTLFGSTLPTTVYLEL 120
Query: 122 PDGGEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFR 181
P+G E+A AL+SKG+ YVIYW N FS YAA HFR++L SV+QSS + TWD F +A A+FR
Sbjct: 121 PNGEELAQALYSKGVQYVIYWKNVFSKYAACHFRHSLFSVIQSSCSDTWDVFHVAEASFR 180
Query: 182 LHCVGSNYALPGNADDIRN-DLEPQLIGEPLKINVEPPEIDAGEDEDGSLETLPVLSIHD 241
L+C N LP N++ N ++ P L+GEP KI+V PE D E+E+ SLE+LP + I+D
Sbjct: 181 LYCTSDNAVLPSNSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEEN-SLESLPSIKIYD 240
Query: 242 NNVTVRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRG 301
+VTVRFL+CG T D LL SL DGLNALL IE+RGSKL + SAP PPLQAG+F+RG
Sbjct: 241 EDVTVRFLLCGPPCTVDTFLLGSLMDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRG 300
Query: 302 VVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHH 361
VVTMRCD+ TCSSAHIS+LVSG+A TCF DQLLE HIKHE++E QLVH++ + E K
Sbjct: 301 VVTMRCDVSTCSSAHISMLVSGNAQTCFSDQLLENHIKHEVVEKIQLVHSVVNSEETKRG 360
Query: 362 MHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFE 421
EPR+SAS+ACGA+V EVSM+VP WA QVLRQLAPD+SYRSLV LG+ +QGL VASFE
Sbjct: 361 FSEPRRSASIACGASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFE 420
Query: 422 KEDAERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYA 481
K+DAERLLFF + ND + L+S +PNW PP P+RKR EP R + +++
Sbjct: 421 KDDAERLLFFCGQQINDTSNHDALLSKIPNWLTPPLPTRKRSEP---CRESKEIEN---- 480
Query: 482 NIPSIRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNG 541
GG P +++ VA +RPIP R+KM PFSG +E +G
Sbjct: 481 --------GG--------------PTSRKINVAALRPIPHTRRHKMIPFSGYSEIGRFDG 540
Query: 542 GQPKASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDC 601
K SLP+ P KH G T T RK+FS S Q K QI+ LNPLPLKKH CGR IQ C
Sbjct: 541 DHTKGSLPM--PPKHGASGGTPVTHRKAFSGSYQRK-QIISLNPLPLKKHDCGRAHIQVC 600
Query: 602 SEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNG 661
SEEEFL+DVM+FLL+RGH+RL+P GGL EFPDA+LN KRLDL+NLY+EVV+RGGFHVGNG
Sbjct: 601 SEEEFLRDVMQFLLIRGHTRLVPPGGLAEFPDAVLNSKRLDLFNLYREVVSRGGFHVGNG 660
Query: 662 INWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGD 721
INWKGQ+FSKM N+T+TNRMTGVGNTLKRHYETYLLEYE AHDDVDGECCL+C SS AGD
Sbjct: 661 INWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAHDDVDGECCLICRSSTAGD 720
Query: 722 WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG 773
WVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK + +NG
Sbjct: 721 WVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSVSNYRKKSQKTSNG 742
BLAST of Spg023560 vs. ExPASy TrEMBL
Match:
A0A6J1GBY3 (AT-rich interactive domain-containing protein 4-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111452767 PE=4 SV=1)
HSP 1 Score: 1499.6 bits (3881), Expect = 0.0e+00
Identity = 716/782 (91.56%), Positives = 751/782 (96.04%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKD 61
MLHS+GAARQTCSLLAVTCG +PKVK EEDVA LKYPFPELVSSGRLEV+VLTNPSK+
Sbjct: 1 MLHSIGAARQTCSLLAVTCGRLPKVKCEEDVAEHNLKYPFPELVSSGRLEVQVLTNPSKN 60
Query: 62 EFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDG 121
EF RIVESCQPSFVYLQGEQLENDE+GSLVWNGVDLSLEDLCGLF+TALP+TVYLE+P+G
Sbjct: 61 EFGRIVESCQPSFVYLQGEQLENDELGSLVWNGVDLSLEDLCGLFDTALPTTVYLELPNG 120
Query: 122 GEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 181
G+IA+ LHSKGIPYVIYWNNTFS YAAAHFRNALLSVV+SSSTHTWDAFQLAHAAFRLHC
Sbjct: 121 GKIAETLHSKGIPYVIYWNNTFSCYAAAHFRNALLSVVESSSTHTWDAFQLAHAAFRLHC 180
Query: 182 VGSNYALPGNADDIRNDLEPQLIGEPLKINVEPPEIDAGEDEDGSLETLPVLSIHDNNVT 241
VG NYALPGNADD R+DLEPQLIGEP KIN+EPPE+DAGEDED SLE +PV+S+HDNNVT
Sbjct: 181 VGRNYALPGNADDFRSDLEPQLIGEPPKINIEPPELDAGEDEDASLEAVPVISVHDNNVT 240
Query: 242 VRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM 301
+R LICG+ TPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM
Sbjct: 241 MRLLICGLPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM 300
Query: 302 RCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHHMHEP 361
RCDIVTCSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCEGNKHHMH+P
Sbjct: 301 RCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHVMHDCEGNKHHMHKP 360
Query: 362 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDA 421
RKSASVACGATVFEVSMKVPAWASQVLRQLAPD+S+RSLVALGIGGVQG PVASFEKEDA
Sbjct: 361 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSHRSLVALGIGGVQGFPVASFEKEDA 420
Query: 422 ERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYANIPS 481
ERLLFF SRDENDKHSDQLLVSVLP+WFKPPTPSRKR EPSQG+RNTLS DSLAYANIPS
Sbjct: 421 ERLLFFCSRDENDKHSDQLLVSVLPHWFKPPTPSRKRVEPSQGMRNTLSHDSLAYANIPS 480
Query: 482 IRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNGGQPK 541
+R+VG EEPAPMNGFK PLLPARKRLKVATM+PIP VHRNKM FSG E DGN+GGQPK
Sbjct: 481 VRRVGREEPAPMNGFKAPLLPARKRLKVATMKPIPLVHRNKMKLFSGWTEGDGNSGGQPK 540
Query: 542 ASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDCSEEE 601
ASLP +TPSKHVTVGSTSATQRKSFSSSSQSKQQI+PLNPLPLKKHGCGRNP+QDCSEEE
Sbjct: 541 ASLPAVTPSKHVTVGSTSATQRKSFSSSSQSKQQIIPLNPLPLKKHGCGRNPVQDCSEEE 600
Query: 602 FLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWK 661
FLKDVMEFLLLRGHSRLIPQGG+EEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWK
Sbjct: 601 FLKDVMEFLLLRGHSRLIPQGGVEEFPDAVLNGKRLDLYNLYKEVVTRGGFHVGNGINWK 660
Query: 662 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 721
GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC
Sbjct: 661 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 720
Query: 722 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPR 781
GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH VANGSPQGITNPR
Sbjct: 721 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHSVANGSPQGITNPR 780
Query: 782 IP 784
IP
Sbjct: 781 IP 782
BLAST of Spg023560 vs. ExPASy TrEMBL
Match:
A0A6J1KBW8 (AT-rich interactive domain-containing protein 4-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492918 PE=4 SV=1)
HSP 1 Score: 1488.0 bits (3851), Expect = 0.0e+00
Identity = 714/782 (91.30%), Positives = 745/782 (95.27%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKD 61
MLHS+GAARQTCSLLAVTCG +PKVK EEDVA LKYPFPELVSSGRLEV+VLTNPSK+
Sbjct: 1 MLHSIGAARQTCSLLAVTCGRLPKVKCEEDVAEHNLKYPFPELVSSGRLEVQVLTNPSKN 60
Query: 62 EFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDG 121
EFSRIVESCQPSFVYLQGEQLENDE+GSLVWNGVDLSLEDLCGLFNTALP+TVYLE+P+G
Sbjct: 61 EFSRIVESCQPSFVYLQGEQLENDELGSLVWNGVDLSLEDLCGLFNTALPTTVYLELPNG 120
Query: 122 GEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 181
G IA+ LHSKGIPYVIYWNNTFS YAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRL C
Sbjct: 121 GIIAETLHSKGIPYVIYWNNTFSCYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLQC 180
Query: 182 VGSNYALPGNADDIRNDLEPQLIGEPLKINVEPPEIDAGEDEDGSLETLPVLSIHDNNVT 241
+G NYALPGNAD+ R+DLEPQLIGEP KI VEPPE+DAG DED SLE LPV+S+HDNNVT
Sbjct: 181 MGRNYALPGNADNFRSDLEPQLIGEPPKIIVEPPELDAGADEDASLEALPVISVHDNNVT 240
Query: 242 VRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVVTM 301
+R LICG+ TPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQA SFSRGVVTM
Sbjct: 241 MRLLICGLPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAESFSRGVVTM 300
Query: 302 RCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHHMHEP 361
RCDIVTCSSAHIS+LVSGSAHTCFDDQLLEKHIKHEIIENSQLVH MHDCEGNKHHMH+P
Sbjct: 301 RCDIVTCSSAHISVLVSGSAHTCFDDQLLEKHIKHEIIENSQLVHVMHDCEGNKHHMHKP 360
Query: 362 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKEDA 421
RKSASVACGATVFEVSMKVPAWASQVLRQLAPD+S+RSLVALGIGGVQG PVASFEKEDA
Sbjct: 361 RKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSHRSLVALGIGGVQGFPVASFEKEDA 420
Query: 422 ERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYANIPS 481
ERLLFF SRDENDKHSDQLLVSVLPNWFKPPTPSRKR EPSQGIRN L DSLAYANIPS
Sbjct: 421 ERLLFFCSRDENDKHSDQLLVSVLPNWFKPPTPSRKRVEPSQGIRNALLHDSLAYANIPS 480
Query: 482 IRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNGGQPK 541
+R+VG EEPAPMNGFK PLLPARKRLKVATMRPIP VHRNKM FSG E DGNNG QPK
Sbjct: 481 VRRVGREEPAPMNGFKAPLLPARKRLKVATMRPIPLVHRNKMKLFSGWTEGDGNNGSQPK 540
Query: 542 ASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDCSEEE 601
ASLPV+TPSKHVT+GSTSATQRKSFSSSSQSKQQI+PLNPLPLKKHGCGRNP+QDCSEEE
Sbjct: 541 ASLPVVTPSKHVTIGSTSATQRKSFSSSSQSKQQIIPLNPLPLKKHGCGRNPVQDCSEEE 600
Query: 602 FLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWK 661
FLKDVMEFLLLRGHSRLIPQGG+EEFPDA+LNGKRLDLYNLYKEVVTRGGFHVGNGINWK
Sbjct: 601 FLKDVMEFLLLRGHSRLIPQGGVEEFPDAVLNGKRLDLYNLYKEVVTRGGFHVGNGINWK 660
Query: 662 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 721
GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC
Sbjct: 661 GQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 720
Query: 722 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGITNPR 781
GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPH +ANGSPQGITNPR
Sbjct: 721 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHSLANGSPQGITNPR 780
Query: 782 IP 784
+P
Sbjct: 781 LP 782
BLAST of Spg023560 vs. ExPASy TrEMBL
Match:
A0A6J1DIE1 (AT-rich interactive domain-containing protein 4-like OS=Momordica charantia OX=3673 GN=LOC111020321 PE=4 SV=1)
HSP 1 Score: 1457.6 bits (3772), Expect = 0.0e+00
Identity = 717/786 (91.22%), Positives = 742/786 (94.40%), Query Frame = 0
Query: 1 MMLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSK 60
MMLHSVG ARQTCSLLAVTCGSVPKVK EEDVA D+LKYPFPELVSSGRLEVRVLTNPSK
Sbjct: 1 MMLHSVGPARQTCSLLAVTCGSVPKVKCEEDVAEDRLKYPFPELVSSGRLEVRVLTNPSK 60
Query: 61 DEFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPD 120
DEF+RIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLF+TALP TVYLEIP+
Sbjct: 61 DEFTRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFHTALPITVYLEIPN 120
Query: 121 GGEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLH 180
GG A+ALHSKGIPYV+YWNNT S YAAAHFRN LLSVVQSSSTHTWDAFQLAHAAFRLH
Sbjct: 121 GGRTAEALHSKGIPYVMYWNNTLSCYAAAHFRNGLLSVVQSSSTHTWDAFQLAHAAFRLH 180
Query: 181 CVGSNYALPGNADDIRNDLEPQLIGEPLKINVEPPEI---DAGEDEDGSLETLPVLSIHD 240
C SNYALPG+ D I +LEPQLIGEPLKI+VEPPEI DAGEDED SL TLP +SIHD
Sbjct: 181 CARSNYALPGDDDIISCNLEPQLIGEPLKISVEPPEIDAGDAGEDEDDSLGTLPAISIHD 240
Query: 241 NNVTVRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRG 300
NNVT+RFLICG+ TPDACLLRSLEDGLNALLNIEIRGSKLQGKFSA PPPLQAGSFSRG
Sbjct: 241 NNVTMRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSASPPPLQAGSFSRG 300
Query: 301 VVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHH 360
VVTMRCD+VTCSSAHI+ILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHA+ DCEGN+H
Sbjct: 301 VVTMRCDMVTCSSAHIAILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHALRDCEGNQHC 360
Query: 361 MHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFE 420
MHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPD+SYRSLVALGIGGVQGLPVASFE
Sbjct: 361 MHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLVALGIGGVQGLPVASFE 420
Query: 421 KEDAERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYA 480
KEDAER LFF SRD NDKHSDQL +SVLP+WFKPP PSRKR EPSQGI +T+S DSLAYA
Sbjct: 421 KEDAERFLFFCSRDGNDKHSDQLFLSVLPSWFKPPIPSRKRVEPSQGI-STVSHDSLAYA 480
Query: 481 NIPSIRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNG 540
NIPSIR+VGGEE APMNGFK LLPARKRLKVATMRPIPRVHRNKMTPFSGL EADGNNG
Sbjct: 481 NIPSIRRVGGEERAPMNGFKATLLPARKRLKVATMRPIPRVHRNKMTPFSGLTEADGNNG 540
Query: 541 GQPKASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDC 600
PKASLPV+TPSKHVTVGSTSATQRKSFSSSSQSK QI+ LNPLPLKKHGCGRNPIQ C
Sbjct: 541 YLPKASLPVVTPSKHVTVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQYC 600
Query: 601 SEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNG 660
SEEEFLKDVMEFLLLRGHSRLIPQGGL EFPDAILNGKRLDLYNLYKEVVTRGGFHVGNG
Sbjct: 601 SEEEFLKDVMEFLLLRGHSRLIPQGGLSEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNG 660
Query: 661 INWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGD 720
INWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGD
Sbjct: 661 INWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGD 720
Query: 721 WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGI 780
WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKP+RVANGSPQGI
Sbjct: 721 WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPYRVANGSPQGI 780
Query: 781 TNPRIP 784
TNPRIP
Sbjct: 781 TNPRIP 784
BLAST of Spg023560 vs. ExPASy TrEMBL
Match:
A0A6J1GVQ9 (AT-rich interactive domain-containing protein 4-like OS=Cucurbita moschata OX=3662 GN=LOC111457916 PE=4 SV=1)
HSP 1 Score: 1442.9 bits (3734), Expect = 0.0e+00
Identity = 708/786 (90.08%), Positives = 736/786 (93.64%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKD 61
MLHSV AARQTCSLLAVTCGSV K K EEDV DKLKYPFP LVSSGRLEVR LTNPS D
Sbjct: 1 MLHSVIAARQTCSLLAVTCGSVLKAKCEEDVDEDKLKYPFPGLVSSGRLEVRALTNPSTD 60
Query: 62 EFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDG 121
EFSRIVESC PSFVYLQGEQL NDEIGSLVWNGVDL LEDLCGLFNTALP+ VYLEIP+G
Sbjct: 61 EFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLFLEDLCGLFNTALPTVVYLEIPNG 120
Query: 122 GEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 181
G IA+ALHSKGIPYV+YWN+TFS YAAAHFRNAL SV+QSSSTHTWDAFQLA AAFRLHC
Sbjct: 121 GRIAEALHSKGIPYVMYWNSTFSCYAAAHFRNALFSVLQSSSTHTWDAFQLARAAFRLHC 180
Query: 182 VGSNYALPGNADDIRNDLEPQLIGEPLKINVEPPEIDA--GEDEDGSLETLPVLSIHDNN 241
+GS++ALPG D I + LEPQ+ GEPLKINVEPP++D GEDEDGSLETL +SIHDNN
Sbjct: 181 MGSSHALPGIVDSITSGLEPQVFGEPLKINVEPPKVDVGEGEDEDGSLETLTAISIHDNN 240
Query: 242 VTVRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGVV 301
VTVRFLICG+ TPDACLLRSLEDGLNALLNIEIRG KLQGKFSAPPPPLQAGSF+RGVV
Sbjct: 241 VTVRFLICGVPCTPDACLLRSLEDGLNALLNIEIRGCKLQGKFSAPPPPLQAGSFARGVV 300
Query: 302 TMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHHMH 361
TMRCDIVTCSSAHISILVSGS HTCFDDQLLEKHIKHEIIEN+QLVHAM+DCE NKHHMH
Sbjct: 301 TMRCDIVTCSSAHISILVSGSPHTCFDDQLLEKHIKHEIIENNQLVHAMYDCEDNKHHMH 360
Query: 362 EPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEKE 421
EPRKSASVACGATVFEVSMKVPAWASQVLRQLAPD+SYRSLVALGIGGVQGLPVASFEKE
Sbjct: 361 EPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDMSYRSLVALGIGGVQGLPVASFEKE 420
Query: 422 DAERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYANI 481
DAERLLFF S+D NDKHSDQLLVSVLP+WFKPP PSRKR EPSQGIR+TLS D LAYANI
Sbjct: 421 DAERLLFFCSKDVNDKHSDQLLVSVLPSWFKPPPPSRKRVEPSQGIRSTLSHDRLAYANI 480
Query: 482 PSIRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNGGQ 541
P IR+VG EEPAPMNGFKTPLL RKRLKVA+MRPIPRVHRNKMTPFSGL EADGNNGGQ
Sbjct: 481 PFIRRVGREEPAPMNGFKTPLLATRKRLKVASMRPIPRVHRNKMTPFSGLTEADGNNGGQ 540
Query: 542 PKASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDCSE 601
PKA PV+TPSKHVTVGSTSATQRKSFSSSSQSK QI+ LNPLPLKKHGCGRNPIQDCSE
Sbjct: 541 PKACFPVVTPSKHVTVGSTSATQRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCSE 600
Query: 602 EEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGIN 661
EEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGIN
Sbjct: 601 EEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGIN 660
Query: 662 WKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWV 721
WKGQIFSKMHNYTM+NRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMHNYTMSNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWV 720
Query: 722 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY-KKKPHRVANGSPQGIT 781
NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTY KKKPHRVANGSPQG+T
Sbjct: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYNKKKPHRVANGSPQGLT 780
Query: 782 N-PRIP 784
N PRIP
Sbjct: 781 NPPRIP 785
BLAST of Spg023560 vs. ExPASy TrEMBL
Match:
A0A0A0LEG9 (ARID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G646580 PE=4 SV=1)
HSP 1 Score: 1437.9 bits (3721), Expect = 0.0e+00
Identity = 699/785 (89.04%), Positives = 739/785 (94.14%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGSVPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSKD 61
MLHSV AARQTCSLLAVTCG+VPKVK EE+V DKLKYPFPELVS GRLEVRVL NPSKD
Sbjct: 1 MLHSVVAARQTCSLLAVTCGNVPKVKCEEEVDEDKLKYPFPELVSCGRLEVRVLANPSKD 60
Query: 62 EFSRIVESCQPSFVYLQGEQLENDEIGSLVWNGVDLSLEDLCGLFNTALPSTVYLEIPDG 121
EFSRIVESC PSFVYLQGEQL NDEIGSLVWNGVDLSLEDLCGLFN ALP+ VYLEIPDG
Sbjct: 61 EFSRIVESCLPSFVYLQGEQLGNDEIGSLVWNGVDLSLEDLCGLFNAALPTFVYLEIPDG 120
Query: 122 GEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFRLHC 181
G IA+ALHSKGIPY+IYWN+TFS YAAAHFR+ALLSVVQSSSTHTWDAFQLA AAFRL+
Sbjct: 121 GRIAEALHSKGIPYLIYWNSTFSCYAAAHFRHALLSVVQSSSTHTWDAFQLARAAFRLYS 180
Query: 182 VGSNYALPGNADD-IRNDLEPQLIGEPLKINVEPPEIDA--GEDEDGSLETLPVLSIHDN 241
VGSNY LPG ADD + +DLEPQLIGEPLKI+VEPPE+D GEDEDGSLE LP ++IHDN
Sbjct: 181 VGSNYGLPGIADDSMMSDLEPQLIGEPLKIDVEPPELDVGEGEDEDGSLEALPAINIHDN 240
Query: 242 NVTVRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRGV 301
NVT+RFLICG+ TPD CLLRSLEDGL+ALL IE+RGSKLQGKFSAPPPPLQAGSFSRGV
Sbjct: 241 NVTMRFLICGVPCTPDTCLLRSLEDGLDALLKIEMRGSKLQGKFSAPPPPLQAGSFSRGV 300
Query: 302 VTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHHM 361
VTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIE++QLVHA+HDCEGNKHHM
Sbjct: 301 VTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIEHNQLVHAIHDCEGNKHHM 360
Query: 362 HEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEK 421
H+PRKSAS+ACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEK
Sbjct: 361 HKPRKSASIACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFEK 420
Query: 422 EDAERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYAN 481
EDAERLLFF S D NDKHS+QLLVSVLP+WFKPPTPSRKR EPSQGIRN+LS DSL+YA+
Sbjct: 421 EDAERLLFFCSGDGNDKHSEQLLVSVLPSWFKPPTPSRKRVEPSQGIRNSLSHDSLSYAH 480
Query: 482 IPSIRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNGG 541
IP+IR+VG E+P PMNGFK L PARK+LKVA+MRP+PR+HRNKMTPF+GL E DGNNGG
Sbjct: 481 IPAIRRVGREDPVPMNGFKASLHPARKKLKVASMRPVPRLHRNKMTPFAGLTEVDGNNGG 540
Query: 542 QPKASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDCS 601
KASL ++TP KHVTVGSTSAT RKSFSSSSQSK QI+ LNPLPLKKHGCGRNPIQDCS
Sbjct: 541 LSKASLSIVTPPKHVTVGSTSATHRKSFSSSSQSK-QIISLNPLPLKKHGCGRNPIQDCS 600
Query: 602 EEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGI 661
EEEFLKDVMEFLLLRGH+RLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGI
Sbjct: 601 EEEFLKDVMEFLLLRGHTRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGI 660
Query: 662 NWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDW 721
NWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDW
Sbjct: 661 NWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDW 720
Query: 722 VNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIT 781
VNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIT
Sbjct: 721 VNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANGSPQGIT 780
Query: 782 NPRIP 784
NPRIP
Sbjct: 781 NPRIP 784
BLAST of Spg023560 vs. TAIR 10
Match:
AT3G43240.1 (ARID/BRIGHT DNA-binding domain-containing protein)
HSP 1 Score: 915.2 bits (2364), Expect = 3.5e-266
Identity = 461/775 (59.48%), Positives = 562/775 (72.52%), Query Frame = 0
Query: 2 MLHSVGAARQTCSLLAVTCGS-VPKVKYEEDVAVDKLKYPFPELVSSGRLEVRVLTNPSK 61
M H G +R C+++AV G+ + + D + KYPFP+L SSGRL+ +VL NP+
Sbjct: 1 MFHGQGFSRNRCNVVAVVSGAELCDTNNQIDGTSHQPKYPFPDLSSSGRLKFQVLNNPTP 60
Query: 62 DEFSRIVESCQPSFVYLQGEQL-ENDEIGSLVWNGVDLSLED-LCGLFNTALPSTVYLEI 121
+EF V S FVYLQGE ++DE+G LV D S D L LF + LP+TVYLE+
Sbjct: 61 EEFQVAVNSSATDFVYLQGEHSGDSDEVGPLVLGYTDFSTPDALVTLFGSTLPTTVYLEL 120
Query: 122 PDGGEIADALHSKGIPYVIYWNNTFSSYAAAHFRNALLSVVQSSSTHTWDAFQLAHAAFR 181
P+G E+A AL+SKG+ YVIYW N FS YAA HFR++L SV+QSS + TWD F +A A+FR
Sbjct: 121 PNGEELAQALYSKGVQYVIYWKNVFSKYAACHFRHSLFSVIQSSCSDTWDVFHVAEASFR 180
Query: 182 LHCVGSNYALPGNADDIRN-DLEPQLIGEPLKINVEPPEIDAGEDEDGSLETLPVLSIHD 241
L+C N LP N++ N ++ P L+GEP KI+V PE D E+E+ SLE+LP + I+D
Sbjct: 181 LYCTSDNAVLPSNSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEEN-SLESLPSIKIYD 240
Query: 242 NNVTVRFLICGMSGTPDACLLRSLEDGLNALLNIEIRGSKLQGKFSAPPPPLQAGSFSRG 301
+VTVRFL+CG T D LL SL DGLNALL IE+RGSKL + SAP PPLQAG+F+RG
Sbjct: 241 EDVTVRFLLCGPPCTVDTFLLGSLMDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRG 300
Query: 302 VVTMRCDIVTCSSAHISILVSGSAHTCFDDQLLEKHIKHEIIENSQLVHAMHDCEGNKHH 361
VVTMRCD+ TCSSAHIS+LVSG+A TCF DQLLE HIKHE++E QLVH++ + E K
Sbjct: 301 VVTMRCDVSTCSSAHISMLVSGNAQTCFSDQLLENHIKHEVVEKIQLVHSVVNSEETKRG 360
Query: 362 MHEPRKSASVACGATVFEVSMKVPAWASQVLRQLAPDISYRSLVALGIGGVQGLPVASFE 421
EPR+SAS+ACGA+V EVSM+VP WA QVLRQLAPD+SYRSLV LG+ +QGL VASFE
Sbjct: 361 FSEPRRSASIACGASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFE 420
Query: 422 KEDAERLLFFYSRDENDKHSDQLLVSVLPNWFKPPTPSRKRGEPSQGIRNTLSLDSLAYA 481
K+DAERLLFF + ND + L+S +PNW PP P+RKR EP R + +++
Sbjct: 421 KDDAERLLFFCGQQINDTSNHDALLSKIPNWLTPPLPTRKRSEP---CRESKEIEN---- 480
Query: 482 NIPSIRKVGGEEPAPMNGFKTPLLPARKRLKVATMRPIPRVHRNKMTPFSGLAEADGNNG 541
GG P +++ VA +RPIP R+KM PFSG +E +G
Sbjct: 481 --------GG--------------PTSRKINVAALRPIPHTRRHKMIPFSGYSEIGRFDG 540
Query: 542 GQPKASLPVITPSKHVTVGSTSATQRKSFSSSSQSKQQIVPLNPLPLKKHGCGRNPIQDC 601
K SLP+ P KH G T T RK+FS S Q K QI+ LNPLPLKKH CGR IQ C
Sbjct: 541 DHTKGSLPM--PPKHGASGGTPVTHRKAFSGSYQRK-QIISLNPLPLKKHDCGRAHIQVC 600
Query: 602 SEEEFLKDVMEFLLLRGHSRLIPQGGLEEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNG 661
SEEEFL+DVM+FLL+RGH+RL+P GGL EFPDA+LN KRLDL+NLY+EVV+RGGFHVGNG
Sbjct: 601 SEEEFLRDVMQFLLIRGHTRLVPPGGLAEFPDAVLNSKRLDLFNLYREVVSRGGFHVGNG 660
Query: 662 INWKGQIFSKMHNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGD 721
INWKGQ+FSKM N+T+TNRMTGVGNTLKRHYETYLLEYE AHDDVDGECCL+C SS AGD
Sbjct: 661 INWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAHDDVDGECCLICRSSTAGD 720
Query: 722 WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSITTYKKKPHRVANG 773
WVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEYVCP+CS++ Y+KK + +NG
Sbjct: 721 WVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSVSNYRKKSQKTSNG 742
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022949406.1 | 0.0e+00 | 91.56 | AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita mosch...
XP_023524673.1 | 0.0e+00 | 91.69 | AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita pepo ...
XP_022998193.1 | 0.0e+00 | 91.30 | AT-rich interactive domain-containing protein 4-like isoform X1 [Cucurbita maxim...
KAG7036396.1 | 0.0e+00 | 90.63 | AT-rich interactive domain-containing protein 4, partial [Cucurbita argyrosperma...
XP_038883881.1 | 0.0e+00 | 91.72 | AT-rich interactive domain-containing protein 4-like [Benincasa hispida] >XP_038...
Match Name | E-value | Identity | Description
Q6NQ79 | 4.9e-265 | 59.48 | AT-rich interactive domain-containing protein 4 OS=Arabidopsis thaliana OX=3702 ...
Match Name | E-value | Identity | Description
A0A6J1GBY3 | 0.0e+00 | 91.56 | AT-rich interactive domain-containing protein 4-like isoform X1 OS=Cucurbita mos...
A0A6J1KBW8 | 0.0e+00 | 91.30 | AT-rich interactive domain-containing protein 4-like isoform X1 OS=Cucurbita max...
A0A6J1DIE1 | 0.0e+00 | 91.22 | AT-rich interactive domain-containing protein 4-like OS=Momordica charantia OX=3...
A0A6J1GVQ9 | 0.0e+00 | 90.08 | AT-rich interactive domain-containing protein 4-like OS=Cucurbita moschata OX=36...
A0A0A0LEG9 | 0.0e+00 | 89.04 | ARID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G646580 PE=4 S...
Match Name | E-value | Identity | Description
AT3G43240.1 | 3.5e-266 | 59.48 | ARID/BRIGHT DNA-binding domain-containing protein
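
The Identity column in the tables above is the percentage reported on each HSP summary line (for example, 716 identical residues over 782 aligned positions gives 91.56%). The following is a minimal Python sketch of how those percentages can be recomputed from a summary line. The function name `parse_hsp_stats` and the returned field names are hypothetical and are not part of any BLAST tool or of this database.

```python
import re

# Hypothetical pattern for the HSP summary line shown in the reports above.
# "Posi?tives" also accepts the misspelling "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, positives, pos_len = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "aligned_length": ident_len,
        # Percent identity = identical residues / aligned positions.
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives": positives,
        # Percent positives = conservative or identical matches / aligned positions.
        "positives_pct": round(100 * positives / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 716/782 (91.56%), Postives = 751/782 (96.04%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, gives a quick consistency check when the reports are scraped from a web page.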