Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS, three_prime_UTR
ATGATCAATGGGTCTGCCGAAAGTACACCATCTTCTTACTTTCATGAATTTAGTTATGATGACAATATCTTCACGGGTAACAAACCCTCCCTTCGGGGATGCACCTCAGGAAGCAGTTTTCAACTTGAGAGTACTTCCATTCTTGGTGACAAACTGTACATTCAAAATGATGTCATCAAAAGAATCCAAAAGCAGGGAATCCCTGATGATGAAGTTGATGTTCTAAAGCTTGACGGTTACATCCAGGGTTCTGATTTTTATGCTGGAGACTCATTGCATGCTGAGGTATTGTTGTCTGACTCTTACTAGAGCTAACAATTTATGCGTTTCTCAAGATCTAAAACTTCTGTCCTTTTTTTTTCCTGTTGTCGGTTACTTTTTTTTGGGCAACAGTTTACTGAAGAAAATATATACTCATGTCATTTGGACAAGCACGTGCAGAAGTTTTTCTCAAGTTATCAGACTAGAAATTCCCCAGATGTTCACGTGACCCCAAATCCCAGATTAGCCTCAGAATGGGATGTTGATTGCTTCAGTGTTAGGGATGGGGTTGAAAGGAACTGGAGATCTAGAGACAGGACTCCCTTCAGGGATTTGGTGGATGGTGAGGATAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTCCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGTATCTTTTCAATTGGAGATTACCTGGAAAAGATTGGGAAAAGGCTTATGGAAGCTCAGAGCTTAAGTTTGGACATCAAGCTTTTAAACAGAAGTACGTTTCTGTTGAAAGGCCTAGAAGATGCAAATCAGCTCCACCTTCTTACAAAAGAAAAACTAGTTTCTATTGCCTGTACCGAAGAAAGGAAGAAAAGCATAATGCCGCCGGTTTCTATGGCCTTGACCAAAGAAAAACTGATAAGTTTAATGCCACAAATTTCTATTGCATGGACCAAGGGAAAGAAGAAAAGCTTAGGGCATCGGCCTTCCTTGACAGCCCACCTCATTTAGGTATTTTCATTTAAGAAGTCATTACTAGTAATGCATTGTCATTGATGAAATATTAATGCTGTGCCAATACGTACAGAACTAGCTGAGCTGAGAGATTCCAAACATTTCTCTAGTACTAATAATCTTTATATTAAGCCAAGTCCTCTTGATGACTTATCGATGGGAACTAGGTAGAGATCCATTCCTTTCCCCTATCAATTTAAAATTGAAGTTGTTATCATAATATAATGCATCTTTTCATCACTTCCAGAACAGATATGACAAAGACGCCTGCTATTACGGGAAATAATAAAGAGAAACAAGAAGGAAAAATTTCCAAGCAGTTCCAATCCGATGTTAAAGTTACTGCATCTGCTTTAGGTAACTGCACTATGTATATGAAACTGATTATTTCTTTGTTCTTAAGGATTTGGAATTCAAAATTTCAATTCTTAGGGAGGGAATTGGTATTGAAAGATTTTATGTGATCAGAAAAACATGGAAAGAAAATTGTGCTGTAATTTCTTTCCAGTAGATTTATCTTTATTATCATTGACATATTAGTGATTAATAAGTTAAATTCTCATCAAGAATAGTCTTTACAATATATAGCATGCTGCATTAGTTCATAATTTTGTTGTTCTTGAATATTGAGTTTTATATGATGGTAGCATCATCTTTACTGGCGGAAATAAAATTTCAGTTCTTTTTGCAGTTGCCATATTAGTGAAAATTTAAATATCCTCCAATTCTATTTAAATGGCATATTTTTTATCCTGTTTCGCCTTCAGAATTATGCTCAAAGGAAACTCGAGAGTCAGATTTATGGATCAAATGGAAAAATTGCTGTCCGACTACAGTTAAGTTTGAAAGTTGATTTGTTTTCTAT
AGTTTGATATGAGGCTTCTCTTTTCTATTTTTATTTATTTATTTATTTGAGTAACATTTAATAGTTTTCCTTTATTGATTTCAGAGAAATGATGGGCCACGTGCTTTTGAAGATGAAGTTAGTATACTTGATATCTCTTCAGGATTCCTATCTCTTGCCAGAAATTCCTTAGTTCCCAAATCCATCGATAAGAATTTCCTTGAAGATGCCAAAGTTCTTCTACAGCTTGATAAGAAATTCATTCCAGTTGTTTCTGGTGGAATACTGGCTGTTATTGATCAGGTTAGTTCACTCCCTTCTGGTTCATTCACTTCAGGTTTTAAGTGAAAGACAATATCTCCACCCCTTAAAAATCTTTCTTCTCCTATTTCTCTTCGAGTCTAATGTATGACAAGGTAGCGAAATAGGCAATCACCTTCAAACACTTTGTACCCGAAGGAGTAGGCCATTGAGGAACTAGGTTTGAAGTACCTCTCCTAGTATGAGTGGGTTTCCTGCCAACTTCGTTTTGTTGTGTGATAATATATCCAAGTGTAAGGGTTTATGATATGCCACATAAGAAGGGGCATTGGGTAACATTTAGATAGCATCTACTTGAGAGGTCTGAGGAAGATGGCAATCTCTTCTTTCATTAGCTTTGTCGATCCAAGGAATGCAAATGTAGAAACGGAAGTAGTTGAATTGATTCTAGATCAACAACTTTATAGCAATTGGGGAATTTTTTTCCTCAATTTTATAGCGTGTCACCTGACTCACGTTCCAGTGTGGAAACAACAAAGGTGCTCATTGAAGATTGTTGGAACAAGGAAAATCAAGTATGAAGTGTTCATCTTAAAAAGAACTTATTTGGTCTTTCAGTCTGGTTTCCTTTTTTTTTTTTTTTCTTTTTTTTACTGAAGAAGTAGATTAGGTTGTCTGCGTACTTTGATACATTACTTGGATACAGCACACTCATGCCTTGTGATTTTTCGTGCACCCCATATACCAACATATATGTGCACACCAATTGACAAGTCTTTCTAAAGCACGGGGTGGGGTAATGATGTTTTAAATGGTGCTTTCTCCAGCATGCTGCAGATGAAAGAATCCGACTTGAAGATCTTCGTCAAAAGGTAAACATTATATATTTGATGGAATAATTCTATGATTTCTAATAAAGCAATATATGTATATACACAGTTGTTGTCTGGTGAAGCAAAGACAATAGCCTATCTGGAGGATGAACATGAACTGGTAAGTTTATGTTTTTTGATGTTGTCTGGTCTCAATCTGTCATGCTTTGATTGAAAGGGATAAATTATTCTTGCTGTTTGTTTACTACAGGTGCTGCCTGAAATTGGGTACCAGTTGTTGTACAACTATAGTGATCAGGTTAAAGAGTGGGGTTGGATCTGTAATATTCATGCTCAAGATTCGAAATCCTTCCAAAGGTATATTTAATTCCTTTGGGGTTTGTCTACTTTGCCAGTACTGCTCAAGTTTGCCGAAGTAATATTTGGTTCTCTGTGCAGGAATTTGAATATCCTATACAAGCAGGAAACGGTCATCACGCTAATGGCAGTTAATCACCATCTTCCTTGCTTCGATAATTATTGCATTTAAACTTTGAAACCAAACCATGCCAACTGTATTAAAATAGCAATTGTCAAATTTGTGCTTGAAATTTGTAACGAATTTATATTTTTGTTTGAAAATTCATGTATTTTTATTTTATGTTTGTATAAAATTTCATGTGTATGACACAGGTACCTTGCATACTAGGAGTTAATTTATCTGATGCAGATCTGCTGGAGTTTCTTGATCAGGTAACTGAGGACCTTGCGCTTTAAAACCATTTTTGCTTGGTTCATCCATTAATTTTGTTGAGCTGTTACATTGTTCCCTAAACACCTGGAATAATTTTGCTAATGTCGCTATAGCTGCATATATAAGCTTGCTAAGACTTTTTTCTTGGATCGAAAGCTTGCTGATACAGATGGCTCATCAACAATGCCGCCATC
TGTGCTTCGAGTTCTTAATTCAAAGGCCTGCAGAGGTACATGCTTAAGCTTGTTTAATGTGTGTAAATACCGTTCACCATGGGGAATTAAATGTCGTGTATAACAGGTTGACTTGCACCTTTTTCGGATGATGTCCATAGCTGTTACATGACAAATTTTTTTTTTTTTGGGGTAATAAATGCTGTTGAAGAAATGACCTCCATCATGTAGTGTCTCACTTCTCTGGCAACAGTCTGTAGCTCAGTGCTATGTGTGTATCTGCATGTTGATCCTTTTTCACTGAAAACTTTTATTTGATAAAATTACTTGATTAGGTGCAATTATGTTTGGAGACTCTTTGTTACCTTCAGAGTGTTCCCTTATTGTTGATGAACTGAAGCAGACTTCTCTGTGTTTCCAAGTGAGCGAATAAGTTTTCAAGCATGTTTTCAAAGTTCTGATATATATATATATATATATATATATATCTCTTTTGGTGGGTGGAAGGAGAAAATAACTCTTAATTGATGAGTGTGAGATGTTTAATGTGCAGTGCGCCCATGGACGACCAACTACAGTACCTCTCGTGAACTTGGAGGCATTGCACAAGCAGATAAGGGAGATGGAAATATTAGATAAAAATGGTTCGAATGGAACGTGGCATGGGCTGCGACGACATGAGCTGAGCATTGAACGGATGTTGCAGCACATAGGTTCGGCCTAAGGTTCATAGTGCCCATGGTAAATGCTGCCTTACCATCAAGGAGAATAATTGAGAGGGTGCCCCGGTAATACCAGTCTACAGAAGCGGATGGATTGAGTAGCATTCTTAGATCATGTATACAACCAGTCTACGATGATTTGTGAAAAAGAGCAATGTTTCTGTAGTTTAGTTTTCCTTGGAATTTCGGTTACAACTCATATTATAAACAGTTGTCTGCCCTCTAGCTGCATAAATCCTTATGCATTTTGCCTGTCATGGTGCTCTGTGTGTGAATAAAACTAAATGACTTGTTCTTTTCTCTTGTAAATTGGTCATTGGAATAATGAAACCCCTCCCTATTGATAAGTAATAAACTACAGGCAGCTAAGTTGAGAAAAATGGCCATTTAATTAGCCCAAATA
mRNA sequence
ATGATCAATGGGTCTGCCGAAAGTACACCATCTTCTTACTTTCATGAATTTAGTTATGATGACAATATCTTCACGGGTAACAAACCCTCCCTTCGGGGATGCACCTCAGGAAGCAGTTTTCAACTTGAGAGTACTTCCATTCTTGGTGACAAACTGTACATTCAAAATGATGTCATCAAAAGAATCCAAAAGCAGGGAATCCCTGATGATGAAGTTGATGTTCTAAAGCTTGACGGTTACATCCAGGGTTCTGATTTTTATGCTGGAGACTCATTGCATGCTGAGTTTACTGAAGAAAATATATACTCATGTCATTTGGACAAGCACGTGCAGAAGTTTTTCTCAAGTTATCAGACTAGAAATTCCCCAGATGTTCACGTGACCCCAAATCCCAGATTAGCCTCAGAATGGGATGTTGATTGCTTCAGTGTTAGGGATGGGGTTGAAAGGAACTGGAGATCTAGAGACAGGACTCCCTTCAGGGATTTGGTGGATGGTGAGGATAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTCCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGTATCTTTTCAATTGGAGATTACCTGGAAAAGATTGGGAAAAGGCTTATGGAAGCTCAGAGCTTAAGTTTGGACATCAAGCTTTTAAACAGAAGTACGTTTCTGTTGAAAGGCCTAGAAGATGCAAATCAGCTCCACCTTCTTACAAAAGAAAAACTAGTTTCTATTGCCTGTACCGAAGAAAGGAAGAAAAGCATAATGCCGCCGGTTTCTATGGCCTTGACCAAAGAAAAACTGATAAGTTTAATGCCACAAATTTCTATTGCATGGACCAAGGGAAAGAAGAAAAGCTTAGGGCATCGGCCTTCCTTGACAGCCCACCTCATTTAGAACTAGCTGAGCTGAGAGATTCCAAACATTTCTCTAGTACTAATAATCTTTATATTAAGCCAAGTCCTCTTGATGACTTATCGATGGGAACTAGAACAGATATGACAAAGACGCCTGCTATTACGGGAAATAATAAAGAGAAACAAGAAGGAAAAATTTCCAAGCAGTTCCAATCCGATGTTAAAGTTACTGCATCTGCTTTAGAATTATGCTCAAAGGAAACTCGAGAGTCAGATTTATGGATCAAATGGAAAAATTGCTGTCCGACTACAAGAAATGATGGGCCACGTGCTTTTGAAGATGAAGTTAGTATACTTGATATCTCTTCAGGATTCCTATCTCTTGCCAGAAATTCCTTAGTTCCCAAATCCATCGATAAGAATTTCCTTGAAGATGCCAAAGTTCTTCTACAGCTTGATAAGAAATTCATTCCAGTTGTTTCTGGTGGAATACTGGCTGTTATTGATCAGCATGCTGCAGATGAAAGAATCCGACTTGAAGATCTTCGTCAAAAGTTGTTGTCTGGTGAAGCAAAGACAATAGCCTATCTGGAGGATGAACATGAACTGGTGCTGCCTGAAATTGGGTACCAGTTGTTGTACAACTATAGTGATCAGGTTAAAGAGTGGGGTTGGATCTGTAATATTCATGCTCAAGATTCGAAATCCTTCCAAAGGAATTTGAATATCCTATACAAGCAGGAAACGGTCATCACGCTAATGGCAGTACCTTGCATACTAGGAGTTAATTTATCTGATGCAGATCTGCTGGAGTTTCTTGATCAGCTTGCTGATACAGATGGCTCATCAACAATGCCGCCATCTGTGCTTCGAGTTCTTAATTCAAAGGCCTGCAGAGGTGCAATTATGTTTGGAGACTCTTTGTTACCTTCAGAGTGTTCCCTTATTGTTGATGAACTGAAGCAGACTTCTCTGTGTTTCCAATGCGCCCATGGACGACCAACTACAGTACCTCT
CGTGAACTTGGAGGCATTGCACAAGCAGATAAGGGAGATGGAAATATTAGATAAAAATGGTTCGAATGGAACGTGGCATGGGCTGCGACGACATGAGCTGAGCATTGAACGGATGTTGCAGCACATAGGTTCGGCCTAAGGTTCATAGTGCCCATGGTAAATGCTGCCTTACCATCAAGGAGAATAATTGAGAGGGTGCCCCGGTAATACCAGTCTACAGAAGCGGATGGATTGAGTAGCATTCTTAGATCATGTATACAACCAGTCTACGATGATTTGTGAAAAAGAGCAATGTTTCTGTAGTTTAGTTTTCCTTGGAATTTCGGTTACAACTCATATTATAAACAGTTGTCTGCCCTCTAGCTGCATAAATCCTTATGCATTTTGCCTGTCATGGTGCTCTGTGTGTGAATAAAACTAAATGACTTGTTCTTTTCTCTTGTAAATTGGTCATTGGAATAATGAAACCCCTCCCTATTGATAAGTAATAAACTACAGGCAGCTAAGTTGAGAAAAATGGCCATTTAATTAGCCCAAATA
Coding sequence (CDS)
ATGATCAATGGGTCTGCCGAAAGTACACCATCTTCTTACTTTCATGAATTTAGTTATGATGACAATATCTTCACGGGTAACAAACCCTCCCTTCGGGGATGCACCTCAGGAAGCAGTTTTCAACTTGAGAGTACTTCCATTCTTGGTGACAAACTGTACATTCAAAATGATGTCATCAAAAGAATCCAAAAGCAGGGAATCCCTGATGATGAAGTTGATGTTCTAAAGCTTGACGGTTACATCCAGGGTTCTGATTTTTATGCTGGAGACTCATTGCATGCTGAGTTTACTGAAGAAAATATATACTCATGTCATTTGGACAAGCACGTGCAGAAGTTTTTCTCAAGTTATCAGACTAGAAATTCCCCAGATGTTCACGTGACCCCAAATCCCAGATTAGCCTCAGAATGGGATGTTGATTGCTTCAGTGTTAGGGATGGGGTTGAAAGGAACTGGAGATCTAGAGACAGGACTCCCTTCAGGGATTTGGTGGATGGTGAGGATAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTCCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGTATCTTTTCAATTGGAGATTACCTGGAAAAGATTGGGAAAAGGCTTATGGAAGCTCAGAGCTTAAGTTTGGACATCAAGCTTTTAAACAGAAGTACGTTTCTGTTGAAAGGCCTAGAAGATGCAAATCAGCTCCACCTTCTTACAAAAGAAAAACTAGTTTCTATTGCCTGTACCGAAGAAAGGAAGAAAAGCATAATGCCGCCGGTTTCTATGGCCTTGACCAAAGAAAAACTGATAAGTTTAATGCCACAAATTTCTATTGCATGGACCAAGGGAAAGAAGAAAAGCTTAGGGCATCGGCCTTCCTTGACAGCCCACCTCATTTAGAACTAGCTGAGCTGAGAGATTCCAAACATTTCTCTAGTACTAATAATCTTTATATTAAGCCAAGTCCTCTTGATGACTTATCGATGGGAACTAGAACAGATATGACAAAGACGCCTGCTATTACGGGAAATAATAAAGAGAAACAAGAAGGAAAAATTTCCAAGCAGTTCCAATCCGATGTTAAAGTTACTGCATCTGCTTTAGAATTATGCTCAAAGGAAACTCGAGAGTCAGATTTATGGATCAAATGGAAAAATTGCTGTCCGACTACAAGAAATGATGGGCCACGTGCTTTTGAAGATGAAGTTAGTATACTTGATATCTCTTCAGGATTCCTATCTCTTGCCAGAAATTCCTTAGTTCCCAAATCCATCGATAAGAATTTCCTTGAAGATGCCAAAGTTCTTCTACAGCTTGATAAGAAATTCATTCCAGTTGTTTCTGGTGGAATACTGGCTGTTATTGATCAGCATGCTGCAGATGAAAGAATCCGACTTGAAGATCTTCGTCAAAAGTTGTTGTCTGGTGAAGCAAAGACAATAGCCTATCTGGAGGATGAACATGAACTGGTGCTGCCTGAAATTGGGTACCAGTTGTTGTACAACTATAGTGATCAGGTTAAAGAGTGGGGTTGGATCTGTAATATTCATGCTCAAGATTCGAAATCCTTCCAAAGGAATTTGAATATCCTATACAAGCAGGAAACGGTCATCACGCTAATGGCAGTACCTTGCATACTAGGAGTTAATTTATCTGATGCAGATCTGCTGGAGTTTCTTGATCAGCTTGCTGATACAGATGGCTCATCAACAATGCCGCCATCTGTGCTTCGAGTTCTTAATTCAAAGGCCTGCAGAGGTGCAATTATGTTTGGAGACTCTTTGTTACCTTCAGAGTGTTCCCTTATTGTTGATGAACTGAAGCAGACTTCTCTGTGTTTCCAATGCGCCCATGGACGACCAACTACAGTACCTCT
CGTGAACTTGGAGGCATTGCACAAGCAGATAAGGGAGATGGAAATATTAGATAAAAATGGTTCGAATGGAACGTGGCATGGGCTGCGACGACATGAGCTGAGCATTGAACGGATGTTGCAGCACATAGGTTCGGCCTAA
Protein sequence
MINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKRIQKQGIPDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTRNSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPGKDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAAGFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYIKPSPLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESDLWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA
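The protein sequence above should correspond to a frame-0 translation of the CDS. As a rough check, the sketch below translates the first and last few codons of the CDS with the standard genetic code. The names translate, cds_start and cds_end are illustrative and not part of the database; only short excerpts of the CDS are used, and the standard (nuclear) code is an assumption.

```python
# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Excerpts copied from the CDS above: its first 24 and last 18 nucleotides.
cds_start = "ATGATCAATGGGTCTGCCGAAAGT"
cds_end = "CACATAGGTTCGGCCTAA"

print(translate(cds_start))  # MINGSAES
print(translate(cds_end))    # HIGSA
```

Both results agree with the first eight and last five residues of the protein sequence listed above.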
Homology
BLAST of CmoCh06G001530 vs. ExPASy Swiss-Prot
Match:
F4JN26 (DNA mismatch repair protein MLH3 OS=Arabidopsis thaliana OX=3702 GN=MLH3 PE=2 SV=2)
HSP 1 Score: 377.1 bits (967), Expect = 4.4e-103
Identity = 218/471 (46.28%), Positives = 294/471 (62.42%), Query Frame = 0
Query: 248 YGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAAGFYGLDQ 307
Y + KF + Q +R +R +SAPP Y+ K F L + + K
Sbjct: 733 YSIRKEKFSYMDGTQNNAGKQRSKRSRSAPPFYREKKRFISLSCKSDTK----------P 792
Query: 308 RKTDKFNATNFYCMDQ---GKEEKLRASAFLD-SPPHLELAELRDSKHFSSTNNLYIKPS 367
+ +D + C+ Q + L+ S D S H++ E K SS ++L
Sbjct: 793 KNSDPSEPDDLECLTQPCNASQMHLKCSILDDVSYDHIQETE----KRLSSASDL----- 852
Query: 368 PLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESDLWI 427
S G RT ++T +++ E S++F +K T
Sbjct: 853 ---KASAGCRTVHSET-----QDEDVHEDFSSEEFLDPIKSTT----------------- 912
Query: 428 KWK-NCCPTTRNDGPRAFEDEVSILDISSGFLSL-ARNSLVPKSIDKNFLEDAKVLLQLD 487
KW+ NC + + + DISSG L L + SLVP+SI+++ LEDAKVL Q+D
Sbjct: 913 KWRHNCAVSQVPKESHELHGQDGVFDISSGLLHLRSDESLVPESINRHSLEDAKVLQQVD 972
Query: 488 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 547
KK+IP+V+ G +A++DQHAADERIRLE+LR K+L+G+A+T+ YL + ELVLPE+GYQLL
Sbjct: 973 KKYIPIVACGTVAIVDQHAADERIRLEELRTKVLAGKARTVTYLSADQELVLPEMGYQLL 1032
Query: 548 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 607
+YS+Q+++WGWICNI + S SF++N++I+ ++ T ITL AVPCILGVNLSD DLLEFL
Sbjct: 1033 QSYSEQIRDWGWICNITVEGSTSFKKNMSIIQRKPTPITLNAVPCILGVNLSDVDLLEFL 1092
Query: 608 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHGR 667
QLADTDGSST+PPSVLRVLNSKACRGAIMFGDSLLPSECSLI+D LKQTSLCFQCAHGR
Sbjct: 1093 QQLADTDGSSTIPPSVLRVLNSKACRGAIMFGDSLLPSECSLIIDGLKQTSLCFQCAHGR 1152
Query: 668 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA 713
PTTVPLV+L+ALHKQI ++ WHGL+R E++++R + +A
Sbjct: 1153 PTTVPLVDLKALHKQIAKL------SGRQVWHGLQRREITLDRAKSRLDNA 1153
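The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below parses one such line and recomputes both values. The function names are illustrative, and the pattern is written to accept either the "Positives" or the "Postives" spelling; it is a minimal example, not a general BLAST parser.

```python
import re

# Matches the "Identity = a/n (p%), Positives = b/n (q%)" part of an HSP line.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches: int, aligned: int) -> float:
    """Matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned, 2)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts and reported percentages from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: " + line)
    ident, length, ident_pct, pos, _, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


line = "Identity = 218/471 (46.28%), Positives = 294/471 (62.42%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(percent(stats["identical"], stats["length"]))  # 46.28
print(percent(stats["positive"], stats["length"]))   # 62.42
```

The recomputed values match the percentages reported for the F4JN26 hit.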
BLAST of CmoCh06G001530 vs. ExPASy Swiss-Prot
Match:
Q9UHC1 (DNA mismatch repair protein Mlh3 OS=Homo sapiens OX=9606 GN=MLH3 PE=1 SV=3)
HSP 1 Score: 96.7 bits (239), Expect = 1.2e-18
Identity = 79/255 (30.98%), Positives = 122/255 (47.84%), Query Frame = 0
Query: 447 LDISSGFL-SLA---RNSLVPKSIDKNFLEDAKVLLQLDKKFIPVV-----------SGG 506
+D+SSG SLA N L P K + +VL Q+D KFI + G
Sbjct: 1158 VDVSSGQAESLAVKIHNILYPYRFTKGMIHSMQVLQQVDNKFIACLMSTKTEENGEAGGN 1217
Query: 507 ILAVIDQHAADERIRLEDL-------RQKLLSGEAKTI-AYLEDEHELVLPEIGYQLLYN 566
+L ++DQHAA ERIRLE L +Q SG K + + L E+ + E +LL+
Sbjct: 1218 LLVLVDQHAAHERIRLEQLIIDSYEKQQAQGSGRKKLLSSTLIPPLEITVTEEQRRLLWC 1277
Query: 567 YSDQVKEWGW-ICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFLD 626
Y +++ G DS + + + + L + ++ + + E L+
Sbjct: 1278 YHKNLEDLGLEFVFPDTSDSLVLVGKVPLCFVEREANELRRGRSTVTKSIVEEFIREQLE 1337
Query: 627 QLADTDG-SSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHGR 677
L T G T+P +V +VL S+AC GAI F D L E +++ L L FQCAHGR
Sbjct: 1338 LLQTTGGIQGTLPLTVQKVLASQACHGAIKFNDGLSLQESCRLIEALSSCQLPFQCAHGR 1397
BLAST of CmoCh06G001530 vs. ExPASy Swiss-Prot
Match:
Q12083 (DNA mismatch repair protein MLH3 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=MLH3 PE=1 SV=1)
HSP 1 Score: 93.6 bits (231), Expect = 9.7e-18
Identity = 71/233 (30.47%), Positives = 114/233 (48.93%), Query Frame = 0
Query: 465 SIDKNFLEDAKVLLQLDKKFI-------PVVSGGILAVIDQHAADERIRLEDLRQKLLSG 524
SI ++ L +V+ Q+DKKFI + + +L ++DQHA DERIRLE+L LL+
Sbjct: 484 SISRSVLAKYEVINQVDKKFILIRCLDQSIHNCPLLVLVDQHACDERIRLEELFYSLLT- 543
Query: 525 EAKTIAYLEDEHELVLPEIG---YQLLYNYSDQVKEWG-------WICNIHAQDSKSFQR 584
E T ++ + + E+ L +Y + K+WG + K+
Sbjct: 544 EVVTGTFVARDLKDCCIEVDRTEADLFKHYQSEFKKWGIGYETIEGTMETSLLEIKTLPE 603
Query: 585 NLNILYKQE----TVITLMAVPCI-----LGVNLSDADLLEFLDQLADTDGSSTMPPSVL 644
L Y + ++ L + L ++LS + +D+L SS +P
Sbjct: 604 MLTSKYNGDKDYLKMVLLQHAHDLKDFKKLPMDLSHFENYTSVDKLYWWKYSSCVPTVFH 663
Query: 645 RVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHGRPTTVPLVNLE 672
+LNSKACR A+MFGD L EC +++ +L + F+CAHGRP+ VP+ L+
Sbjct: 664 EILNSKACRSAVMFGDELTRQECIILISKLSRCHNPFECAHGRPSMVPIAELK 715
BLAST of CmoCh06G001530 vs. ExPASy Swiss-Prot
Match:
P54280 (DNA mismatch repair protein pms1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=pms1 PE=3 SV=1)
HSP 1 Score: 84.3 bits (207), Expect = 5.9e-15
Identity = 73/275 (26.55%), Positives = 122/275 (44.36%), Query Frame = 0
Query: 397 QFQSDVKVTASALELCSKETRESDLWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSL 456
+F + ++ S ++ K+ SD +K+ N + ED +++ + FL +
Sbjct: 555 KFSKKINISLSGVQ---KDIVRSDALLKFSNKIGVVHDISDENQEDHLNLTVHKADFLRM 614
Query: 457 ARNSLVPKSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLS 516
+V+ Q ++ FI VV G L +IDQHA+DE+ E L+ L+
Sbjct: 615 ------------------RVVGQFNRGFIVVVHGNNLFIIDQHASDEKFNYEHLKSNLVI 674
Query: 517 GEAKTIAYLEDEHELVLPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQE 576
+ +LVLP+ L +E I +I K F +++ +
Sbjct: 675 ----------NSQDLVLPK-RLDLA-----ATEETVLIDHIDLIRRKGFGVAIDLNQRVG 734
Query: 577 TVITLMAVPCILGVNLSDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSL 636
TL++VP V +DLLE + L++ + R+L SKACR ++M G +L
Sbjct: 735 NRCTLLSVPTSKNVIFDTSDLLEIISVLSEHPQIDPFSSRLERMLASKACRSSVMIGRAL 792
Query: 637 LPSECSLIVDELKQTSLCFQCAHGRPTTVPLVNLE 672
SE + IV L + S + C HGRPT L+ L+
Sbjct: 795 TISEMNTIVRHLAELSKPWNCPHGRPTMRHLLRLK 792
BLAST of CmoCh06G001530 vs. ExPASy Swiss-Prot
Match:
Q941I6 (DNA mismatch repair protein PMS1 OS=Arabidopsis thaliana OX=3702 GN=PMS1 PE=1 SV=1)
HSP 1 Score: 69.7 bits (169), Expect = 1.5e-10
Identity = 64/232 (27.59%), Positives = 95/232 (40.95%), Query Frame = 0
Query: 457 ARNSLVPKSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLS 516
A S + + K +VL Q + FI L ++DQHAADE+ E L + +
Sbjct: 688 AATSELERLFRKEDFRRMQVLGQFNLGFIIAKLERDLFIVDQHAADEKFNFEHLARSTVL 747
Query: 517 GEAKTIAYLEDEHELVLPEIGYQLLYNYSDQVKEWGWIC--NIHAQDSKSFQRNLNILYK 576
+ + L E + PE +L + D ++E G++ N A K F+
Sbjct: 748 NQQPLLQPLNLE---LSPEEEVTVLM-HMDIIRENGFLLEENPSAPPGKHFR-------- 807
Query: 577 QETVITLMAVPCILGVNLSDADLLEFLDQLADTDG-------------SSTMPPSVLRVL 636
L A+P + DL + + L D G S P V +L
Sbjct: 808 ------LRAIPYSKNITFGVEDLKDLISTLGDNHGECSVASSYKTSKTDSICPSRVRAML 867
Query: 637 NSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHGRPTTVPLVNLEAL 674
S+ACR ++M GD L +E IV+ L + C HGRPT LV+L L
Sbjct: 868 ASRACRSSVMIGDPLRKNEMQKIVEHLADLESPWNCPHGRPTMRHLVDLTTL 901
BLAST of CmoCh06G001530 vs. ExPASy TrEMBL
Match:
A0A6J1FI48 (DNA mismatch repair protein MLH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445300 PE=3 SV=1)
HSP 1 Score: 1448.3 bits (3748), Expect = 0.0e+00
Identity = 712/712 (100.00%), Positives = 712/712 (100.00%), Query Frame = 0
Query: 1 MINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIK 60
MINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIK
Sbjct: 536 MINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIK 595
Query: 61 RIQKQGIPDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTR 120
RIQKQGIPDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTR
Sbjct: 596 RIQKQGIPDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTR 655
Query: 121 NSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLR 180
NSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLR
Sbjct: 656 NSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLR 715
Query: 181 SSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLP 240
SSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLP
Sbjct: 716 SSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLP 775
Query: 241 GKDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAA 300
GKDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAA
Sbjct: 776 GKDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAA 835
Query: 301 GFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYI 360
GFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYI
Sbjct: 836 GFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYI 895
Query: 361 KPSPLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESD 420
KPSPLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESD
Sbjct: 896 KPSPLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESD 955
Query: 421 LWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQL 480
LWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQL
Sbjct: 956 LWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQL 1015
Query: 481 DKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQL 540
DKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQL
Sbjct: 1016 DKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQL 1075
Query: 541 LYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEF 600
LYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEF
Sbjct: 1076 LYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEF 1135
Query: 601 LDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHG 660
LDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHG
Sbjct: 1136 LDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHG 1195
Query: 661 RPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA 713
RPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA
Sbjct: 1196 RPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA 1247
BLAST of CmoCh06G001530 vs. ExPASy TrEMBL
Match:
A0A6J1I5L5 (DNA mismatch repair protein MLH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470127 PE=3 SV=1)
HSP 1 Score: 1401.3 bits (3626), Expect = 0.0e+00
Identity = 687/712 (96.49%), Positives = 698/712 (98.03%), Query Frame = 0
Query: 1 MINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIK 60
MINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIK
Sbjct: 537 MINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIK 596
Query: 61 RIQKQGIPDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTR 120
RIQKQGIPDDEVDVLKLDGYIQGS FYAGDSLHAEF EENIYSCHLDKHVQKFFSSYQTR
Sbjct: 597 RIQKQGIPDDEVDVLKLDGYIQGSGFYAGDSLHAEFAEENIYSCHLDKHVQKFFSSYQTR 656
Query: 121 NSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLR 180
NSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVD EDKGCGFDSDIMLR
Sbjct: 657 NSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDCEDKGCGFDSDIMLR 716
Query: 181 SSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLP 240
SSKKNYIPSCIDS+LIIDDVLD REDLSTSLEKSNNF+HSSPVSPNMHSCQKYL NWRLP
Sbjct: 717 SSKKNYIPSCIDSKLIIDDVLDIREDLSTSLEKSNNFEHSSPVSPNMHSCQKYLSNWRLP 776
Query: 241 GKDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAA 300
G+DWEKAYGSSELKFGH+AFKQKYVSVER RRCKSAPPSYKRKTSFYCLY+RKEEKHNAA
Sbjct: 777 GRDWEKAYGSSELKFGHKAFKQKYVSVERRRRCKSAPPSYKRKTSFYCLYQRKEEKHNAA 836
Query: 301 GFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYI 360
GFYGLDQRKTDKFNATNFYCMDQGK+EKLRASAFLDSPPHLEL +LRDSKHFS TNNLYI
Sbjct: 837 GFYGLDQRKTDKFNATNFYCMDQGKDEKLRASAFLDSPPHLELGQLRDSKHFSGTNNLYI 896
Query: 361 KPSPLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESD 420
PSPLDDLSMGTRTDMTK P ITGNNKEKQEGK+SKQFQSDVKVTASALELCSKET+ES
Sbjct: 897 NPSPLDDLSMGTRTDMTKMPTITGNNKEKQEGKVSKQFQSDVKVTASALELCSKETQESY 956
Query: 421 LWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQL 480
LWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQL
Sbjct: 957 LWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQL 1016
Query: 481 DKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQL 540
DKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQL
Sbjct: 1017 DKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQL 1076
Query: 541 LYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEF 600
LYNYSDQVKEWGWICNIHAQDSK FQRNLNILYKQETVITLMAVPCILGVNLSDADLLEF
Sbjct: 1077 LYNYSDQVKEWGWICNIHAQDSKCFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEF 1136
Query: 601 LDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHG 660
LDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIV+ELKQTSLCFQCAHG
Sbjct: 1137 LDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHG 1196
Query: 661 RPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA 713
RPTTVPLVNLEALHKQIREMEILDKNG NGTWHGLRRHELSIERMLQH+GSA
Sbjct: 1197 RPTTVPLVNLEALHKQIREMEILDKNGLNGTWHGLRRHELSIERMLQHVGSA 1248
BLAST of CmoCh06G001530 vs. ExPASy TrEMBL
Match:
A0A1S3BJQ0 (DNA mismatch repair protein MLH3 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490644 PE=3 SV=1)
HSP 1 Score: 1055.8 bits (2729), Expect = 7.9e-305
Identity = 545/716 (76.12%), Positives = 598/716 (83.52%), Query Frame = 0
Query: 2 INGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKR 61
I GSAESTPSSYFHEFSYDD IF GNKPSL GC+S SSF YIQNDVI R
Sbjct: 525 ITGSAESTPSSYFHEFSYDDCIFMGNKPSLTGCSSMSSFH----------PYIQNDVIDR 584
Query: 62 IQKQGIPDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTRN 121
Q QG+ DDEVD++KLD YI+GSDF AG SLHAE H+Q F SSYQTRN
Sbjct: 585 TQMQGMLDDEVDIMKLDAYIKGSDFCAGSSLHAE-------------HMQMFLSSYQTRN 644
Query: 122 SPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRS 181
SP+ H+T LA+EWDVDCFSVRD VER+WRSRDRTPF+ LVD ++KGC FD DIML S
Sbjct: 645 SPNAHMTSKSILATEWDVDCFSVRDEVERSWRSRDRTPFKQLVDDDEKGCRFDYDIMLSS 704
Query: 182 SKK-NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLP 241
SKK NY S DS I+DDV DTRE+L L+KSNNF+HSSP SP+MHS QKY NWRLP
Sbjct: 705 SKKNNYKSSYTDSATIVDDVFDTRENLGNFLKKSNNFEHSSPRSPDMHSRQKYFSNWRLP 764
Query: 242 GKDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAA 301
+D EKAYGSSE KFGHQAFKQKY SVERPRR KSAPP YKRKTSFYCL ++K E+ NAA
Sbjct: 765 ERDCEKAYGSSEPKFGHQAFKQKYCSVERPRRGKSAPPFYKRKTSFYCLDQQKAERPNAA 824
Query: 302 GFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYI 361
FY L++ K D+ +A++FYCMDQGK EKL+AS FLDSPPHLE ELRDS+H S T+N Y+
Sbjct: 825 SFYCLNEGKADQSSASSFYCMDQGKVEKLKASVFLDSPPHLEPVELRDSEHVSGTSNQYV 884
Query: 362 KPSPLDDLSMGT---RTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETR 421
KP P+DDL + T RTD K AI GN++EKQ G+ISKQ QSDVKVT SA+ELCSKET+
Sbjct: 885 KPFPVDDLLVETRSSRTDTIKMSAIMGNSEEKQ-GEISKQSQSDVKVTESAIELCSKETQ 944
Query: 422 E-SDLWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKV 481
E SDLWIKWKNCCPTTRN+ AF+DEVSILDISSGFLSLA NSLVP IDKNFL++AKV
Sbjct: 945 ESSDLWIKWKNCCPTTRNEDSHAFDDEVSILDISSGFLSLASNSLVPDLIDKNFLQNAKV 1004
Query: 482 LLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEI 541
LLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKT AYL+ EHEL LPEI
Sbjct: 1005 LLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTTAYLDAEHELALPEI 1064
Query: 542 GYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDAD 601
GYQLLYNY+DQVKEWGWICNIHAQDSKSF+ NLNIL+KQETVITLMAVPCILGVNLSD D
Sbjct: 1065 GYQLLYNYADQVKEWGWICNIHAQDSKSFRSNLNILHKQETVITLMAVPCILGVNLSDVD 1124
Query: 602 LLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQ 661
LLEFL QLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIV+ELKQTSLCFQ
Sbjct: 1125 LLEFLHQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQ 1184
Query: 662 CAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA 713
CAHGRPTTVPLVNLEALHKQI+E+EI K+GSNGTW+GL RHELSIERMLQ + SA
Sbjct: 1185 CAHGRPTTVPLVNLEALHKQIKELEIHGKSGSNGTWNGLGRHELSIERMLQRLSSA 1216
BLAST of CmoCh06G001530 vs. ExPASy TrEMBL
Match:
A0A1S4DXG5 (DNA mismatch repair protein MLH3 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103490644 PE=3 SV=1)
HSP 1 Score: 1055.8 bits (2729), Expect = 7.9e-305
Identity = 545/716 (76.12%), Positives = 598/716 (83.52%), Query Frame = 0
Query: 2 INGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKR 61
I GSAESTPSSYFHEFSYDD IF GNKPSL GC+S SSF YIQNDVI R
Sbjct: 500 ITGSAESTPSSYFHEFSYDDCIFMGNKPSLTGCSSMSSFH----------PYIQNDVIDR 559
Query: 62 IQKQGIPDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTRN 121
Q QG+ DDEVD++KLD YI+GSDF AG SLHAE H+Q F SSYQTRN
Sbjct: 560 TQMQGMLDDEVDIMKLDAYIKGSDFCAGSSLHAE-------------HMQMFLSSYQTRN 619
Query: 122 SPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRS 181
SP+ H+T LA+EWDVDCFSVRD VER+WRSRDRTPF+ LVD ++KGC FD DIML S
Sbjct: 620 SPNAHMTSKSILATEWDVDCFSVRDEVERSWRSRDRTPFKQLVDDDEKGCRFDYDIMLSS 679
Query: 182 SKK-NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLP 241
SKK NY S DS I+DDV DTRE+L L+KSNNF+HSSP SP+MHS QKY NWRLP
Sbjct: 680 SKKNNYKSSYTDSATIVDDVFDTRENLGNFLKKSNNFEHSSPRSPDMHSRQKYFSNWRLP 739
Query: 242 GKDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAA 301
+D EKAYGSSE KFGHQAFKQKY SVERPRR KSAPP YKRKTSFYCL ++K E+ NAA
Sbjct: 740 ERDCEKAYGSSEPKFGHQAFKQKYCSVERPRRGKSAPPFYKRKTSFYCLDQQKAERPNAA 799
Query: 302 GFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYI 361
FY L++ K D+ +A++FYCMDQGK EKL+AS FLDSPPHLE ELRDS+H S T+N Y+
Sbjct: 800 SFYCLNEGKADQSSASSFYCMDQGKVEKLKASVFLDSPPHLEPVELRDSEHVSGTSNQYV 859
Query: 362 KPSPLDDLSMGT---RTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETR 421
KP P+DDL + T RTD K AI GN++EKQ G+ISKQ QSDVKVT SA+ELCSKET+
Sbjct: 860 KPFPVDDLLVETRSSRTDTIKMSAIMGNSEEKQ-GEISKQSQSDVKVTESAIELCSKETQ 919
Query: 422 E-SDLWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKV 481
E SDLWIKWKNCCPTTRN+ AF+DEVSILDISSGFLSLA NSLVP IDKNFL++AKV
Sbjct: 920 ESSDLWIKWKNCCPTTRNEDSHAFDDEVSILDISSGFLSLASNSLVPDLIDKNFLQNAKV 979
Query: 482 LLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEI 541
LLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKT AYL+ EHEL LPEI
Sbjct: 980 LLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTTAYLDAEHELALPEI 1039
Query: 542 GYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDAD 601
GYQLLYNY+DQVKEWGWICNIHAQDSKSF+ NLNIL+KQETVITLMAVPCILGVNLSD D
Sbjct: 1040 GYQLLYNYADQVKEWGWICNIHAQDSKSFRSNLNILHKQETVITLMAVPCILGVNLSDVD 1099
Query: 602 LLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQ 661
LLEFL QLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIV+ELKQTSLCFQ
Sbjct: 1100 LLEFLHQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQ 1159
Query: 662 CAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA 713
CAHGRPTTVPLVNLEALHKQI+E+EI K+GSNGTW+GL RHELSIERMLQ + SA
Sbjct: 1160 CAHGRPTTVPLVNLEALHKQIKELEIHGKSGSNGTWNGLGRHELSIERMLQRLSSA 1191
BLAST of CmoCh06G001530 vs. ExPASy TrEMBL
Match:
A0A1S4DXG1 (DNA mismatch repair protein MLH3 isoform X8 OS=Cucumis melo OX=3656 GN=LOC103490644 PE=4 SV=1)
HSP 1 Score: 1055.8 bits (2729), Expect = 7.9e-305
Identity = 545/716 (76.12%), Positives = 598/716 (83.52%), Query Frame = 0
Query: 2 INGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKR 61
I GSAESTPSSYFHEFSYDD IF GNKPSL GC+S SSF YIQNDVI R
Sbjct: 246 ITGSAESTPSSYFHEFSYDDCIFMGNKPSLTGCSSMSSFH----------PYIQNDVIDR 305
Query: 62 IQKQGIPDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTRN 121
Q QG+ DDEVD++KLD YI+GSDF AG SLHAE H+Q F SSYQTRN
Sbjct: 306 TQMQGMLDDEVDIMKLDAYIKGSDFCAGSSLHAE-------------HMQMFLSSYQTRN 365
Query: 122 SPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRS 181
SP+ H+T LA+EWDVDCFSVRD VER+WRSRDRTPF+ LVD ++KGC FD DIML S
Sbjct: 366 SPNAHMTSKSILATEWDVDCFSVRDEVERSWRSRDRTPFKQLVDDDEKGCRFDYDIMLSS 425
Query: 182 SKK-NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLP 241
SKK NY S DS I+DDV DTRE+L L+KSNNF+HSSP SP+MHS QKY NWRLP
Sbjct: 426 SKKNNYKSSYTDSATIVDDVFDTRENLGNFLKKSNNFEHSSPRSPDMHSRQKYFSNWRLP 485
Query: 242 GKDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAA 301
+D EKAYGSSE KFGHQAFKQKY SVERPRR KSAPP YKRKTSFYCL ++K E+ NAA
Sbjct: 486 ERDCEKAYGSSEPKFGHQAFKQKYCSVERPRRGKSAPPFYKRKTSFYCLDQQKAERPNAA 545
Query: 302 GFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYI 361
FY L++ K D+ +A++FYCMDQGK EKL+AS FLDSPPHLE ELRDS+H S T+N Y+
Sbjct: 546 SFYCLNEGKADQSSASSFYCMDQGKVEKLKASVFLDSPPHLEPVELRDSEHVSGTSNQYV 605
Query: 362 KPSPLDDLSMGT---RTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETR 421
KP P+DDL + T RTD K AI GN++EKQ G+ISKQ QSDVKVT SA+ELCSKET+
Sbjct: 606 KPFPVDDLLVETRSSRTDTIKMSAIMGNSEEKQ-GEISKQSQSDVKVTESAIELCSKETQ 665
Query: 422 E-SDLWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKV 481
E SDLWIKWKNCCPTTRN+ AF+DEVSILDISSGFLSLA NSLVP IDKNFL++AKV
Sbjct: 666 ESSDLWIKWKNCCPTTRNEDSHAFDDEVSILDISSGFLSLASNSLVPDLIDKNFLQNAKV 725
Query: 482 LLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEI 541
LLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKT AYL+ EHEL LPEI
Sbjct: 726 LLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTTAYLDAEHELALPEI 785
Query: 542 GYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDAD 601
GYQLLYNY+DQVKEWGWICNIHAQDSKSF+ NLNIL+KQETVITLMAVPCILGVNLSD D
Sbjct: 786 GYQLLYNYADQVKEWGWICNIHAQDSKSFRSNLNILHKQETVITLMAVPCILGVNLSDVD 845
Query: 602 LLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQ 661
LLEFL QLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIV+ELKQTSLCFQ
Sbjct: 846 LLEFLHQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQ 905
Query: 662 CAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA 713
CAHGRPTTVPLVNLEALHKQI+E+EI K+GSNGTW+GL RHELSIERMLQ + SA
Sbjct: 906 CAHGRPTTVPLVNLEALHKQIKELEIHGKSGSNGTWNGLGRHELSIERMLQRLSSA 937
BLAST of CmoCh06G001530 vs. TAIR 10
Match:
AT4G35520.1 (MUTL protein homolog 3)
HSP 1 Score: 367.5 bits (942), Expect = 2.5e-101
Identity = 218/485 (44.95%), Positives = 294/485 (60.62%), Query Frame = 0
Query: 248 YGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAAGFYGLDQ 307
Y + KF + Q +R +R +SAPP Y+ K F L + + K
Sbjct: 733 YSIRKEKFSYMDGTQNNAGKQRSKRSRSAPPFYREKKRFISLSCKSDTK----------P 792
Query: 308 RKTDKFNATNFYCMDQ---GKEEKLRASAFLD-SPPHLELAELRDSKHFSSTNNLYIKPS 367
+ +D + C+ Q + L+ S D S H++ E K SS ++L
Sbjct: 793 KNSDPSEPDDLECLTQPCNASQMHLKCSILDDVSYDHIQETE----KRLSSASDL----- 852
Query: 368 PLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESDLWI 427
S G RT ++T +++ E S++F +K T
Sbjct: 853 ---KASAGCRTVHSET-----QDEDVHEDFSSEEFLDPIKSTT----------------- 912
Query: 428 KWK-NCCPTTRNDGPRAFEDEVSILDISSGFLSL-ARNSLVPKSIDKNFLEDAKVLLQLD 487
KW+ NC + + + DISSG L L + SLVP+SI+++ LEDAKVL Q+D
Sbjct: 913 KWRHNCAVSQVPKESHELHGQDGVFDISSGLLHLRSDESLVPESINRHSLEDAKVLQQVD 972
Query: 488 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHEL---------- 547
KK+IP+V+ G +A++DQHAADERIRLE+LR K+L+G+A+T+ YL + EL
Sbjct: 973 KKYIPIVACGTVAIVDQHAADERIRLEELRTKVLAGKARTVTYLSADQELFINDALLIFV 1032
Query: 548 ----VLPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCI 607
VLPE+GYQLL +YS+Q+++WGWICNI + S SF++N++I+ ++ T ITL AVPCI
Sbjct: 1033 LTLKVLPEMGYQLLQSYSEQIRDWGWICNITVEGSTSFKKNMSIIQRKPTPITLNAVPCI 1092
Query: 608 LGVNLSDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDE 667
LGVNLSD DLLEFL QLADTDGSST+PPSVLRVLNSKACRGAIMFGDSLLPSECSLI+D
Sbjct: 1093 LGVNLSDVDLLEFLQQLADTDGSSTIPPSVLRVLNSKACRGAIMFGDSLLPSECSLIIDG 1152
Query: 668 LKQTSLCFQCAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQ 713
LKQTSLCFQCAHGRPTTVPLV+L+ALHKQI ++ WHGL+R E++++R
Sbjct: 1153 LKQTSLCFQCAHGRPTTVPLVDLKALHKQIAKL------SGRQVWHGLQRREITLDRAKS 1167
BLAST of CmoCh06G001530 vs. TAIR 10
Match:
AT4G02460.1 (DNA mismatch repair protein, putative)
HSP 1 Score: 69.7 bits (169), Expect = 1.1e-11
Identity = 64/232 (27.59%), Positives = 95/232 (40.95%), Query Frame = 0
Query: 457 ARNSLVPKSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLS 516
A S + + K +VL Q + FI L ++DQHAADE+ E L + +
Sbjct: 688 AATSELERLFRKEDFRRMQVLGQFNLGFIIAKLERDLFIVDQHAADEKFNFEHLARSTVL 747
Query: 517 GEAKTIAYLEDEHELVLPEIGYQLLYNYSDQVKEWGWIC--NIHAQDSKSFQRNLNILYK 576
+ + L E + PE +L + D ++E G++ N A K F+
Sbjct: 748 NQQPLLQPLNLE---LSPEEEVTVLM-HMDIIRENGFLLEENPSAPPGKHFR-------- 807
Query: 577 QETVITLMAVPCILGVNLSDADLLEFLDQLADTDG-------------SSTMPPSVLRVL 636
L A+P + DL + + L D G S P V +L
Sbjct: 808 ------LRAIPYSKNITFGVEDLKDLISTLGDNHGECSVASSYKTSKTDSICPSRVRAML 867
Query: 637 NSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHGRPTTVPLVNLEAL 674
S+ACR ++M GD L +E IV+ L + C HGRPT LV+L L
Sbjct: 868 ASRACRSSVMIGDPLRKNEMQKIVEHLADLESPWNCPHGRPTMRHLVDLTTL 901
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
F4JN26 | 4.4e-103 | 46.28 | DNA mismatch repair protein MLH3 OS=Arabidopsis thaliana OX=3702 GN=MLH3 PE=2 SV...
Q9UHC1 | 1.2e-18 | 30.98 | DNA mismatch repair protein Mlh3 OS=Homo sapiens OX=9606 GN=MLH3 PE=1 SV=3
Q12083 | 9.7e-18 | 30.47 | DNA mismatch repair protein MLH3 OS=Saccharomyces cerevisiae (strain ATCC 204508...
P54280 | 5.9e-15 | 26.55 | DNA mismatch repair protein pms1 OS=Schizosaccharomyces pombe (strain 972 / ATCC...
Q941I6 | 1.5e-10 | 27.59 | DNA mismatch repair protein PMS1 OS=Arabidopsis thaliana OX=3702 GN=PMS1 PE=1 SV...
Match Name | E-value | Identity (%) | Description
A0A6J1FI48 | 0.0e+00 | 100.00 | DNA mismatch repair protein MLH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1I5L5 | 0.0e+00 | 96.49 | DNA mismatch repair protein MLH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A1S3BJQ0 | 7.9e-305 | 76.12 | DNA mismatch repair protein MLH3 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490...
A0A1S4DXG5 | 7.9e-305 | 76.12 | DNA mismatch repair protein MLH3 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103490...
A0A1S4DXG1 | 7.9e-305 | 76.12 | DNA mismatch repair protein MLH3 isoform X8 OS=Cucumis melo OX=3656 GN=LOC103490...
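The pipe-separated summary rows above can be read programmatically. The sketch below splits a row into its first four fields and converts the E-value and identity to numbers. BlastHit and parse_hit_row are hypothetical names chosen for illustration, and any extra trailing columns are ignored.

```python
from typing import NamedTuple


class BlastHit(NamedTuple):
    accession: str
    evalue: float
    identity_pct: float
    description: str


def parse_hit_row(row: str) -> BlastHit:
    """Parse one 'Match Name | E-value | Identity | Description' row."""
    fields = [field.strip() for field in row.split("|")]
    accession, evalue, identity, description = fields[:4]
    return BlastHit(accession, float(evalue), float(identity), description)


rows = [
    "Q9UHC1 | 1.2e-18 | 30.98 | DNA mismatch repair protein Mlh3 OS=Homo sapiens OX=9606 GN=MLH3 PE=1 SV=3",
    "F4JN26 | 4.4e-103 | 46.28 | DNA mismatch repair protein MLH3 OS=Arabidopsis thaliana OX=3702 GN=MLH3 PE=2 SV...",
]

# Order hits from most to least significant (smallest E-value first).
hits = sorted((parse_hit_row(r) for r in rows), key=lambda h: h.evalue)
for hit in hits:
    print(hit.accession, hit.evalue, hit.identity_pct)
```

Sorting by E-value puts the Arabidopsis thaliana MLH3 hit (F4JN26) ahead of the human one, which matches the order in the Swiss-Prot table.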