Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTCGTCCGAGCCTAAGTTTGTTTCAAGTAGATTAAAACGACGATTGAATCGGAACTCGAAGCGGCAAAAATCGAGCAAATCGTTGATGTTTCAATCAATAGCACATTCACTATCTTTCTCAAGCGCCCGCTTTCTTCATCGCTCGGTGATCAATCGTCGAACGATTTGCAGTCTATCTTTGCCGATGTCTAGTTGGTCCTGCAAAAAATGCACATTCCTCAACCCACCTTCCCAAAAGGCAGCCTGCAAAATCTGTCTGTCTCCTTCATCTCCTCCGCCATCACCGTCTTCTTCCTCCGCCCCTCAATGGTCCTGCAAGGCCTGCACCTTCCTGAACCCATATAAGAATTCCGATTGCGAGCTCTGTGGCACTAGGGCTCCGTCCCTCTCGCTTTCGAGTTTCAAGGATTTGATTGATGTCAGCGAGGATGCGGATGCGGATTCTTCTGTTGGGTCTGTGTTCTTTCCCTTGCAGCCCTGCAAGAAAAGGAAACTAGACGATCCTGTTCCTGTGGTGGGTGGTGTCAATTTCGCGGAATTGGGCGCATTTCGTGGTGTTAAGGCATCGACGAACACTATTGCTGAAATGGGTTAGTCTTATTGTGTTGATTTTGTGTCAATATTATTGTTTTTTCAAGAACAATGTAGTTTTAAGTCGTATTAATGGGGTAGTTGTATTGTCTCGTTCTTGATGGGTGGCAGGGGATTCTAGTTCTAGGACAAGTTTGATACCCATAAAGATTTTGACTTACAATGTATGGTTCCGAGAAGATTTGGAGATGCGTAATAGAATGAGAGCCCTTGGACAACTTATCCAACGGCATTCACCAGATGTTATTTGTTTCCAGGTACTCCATCCTCAGTTTTATACACGTCTTGTGTGAGAGTAACGAACCATTTCTTTTAAGGGAGTGGAAAACTTTGCCTAGTAGACATGTTTTAAGACTATGAGGCTGATGGTGATACGTAACGGGCCAAAGCGAACTATATTTGCTAGCGGTGGGCTTGGATTGTTACAAATGGTATCATAGTCAGACACCAAGCGGTGTGCCAACGAGGACGTTGGGCCCTCAAGGGGGTGGATTGAGATCCTACTTGGAGAGGGGAACAAACCATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGTAGGCACGTTTTAAACTCGTGAGGCTGACGGCGATATGTAACGGGCCAAAGCGAACAGTATTTGCTAGCGGTGGACATGGGCTATTACAAATGGTATCAAAGCCAGACACTAGGCGGTGTGCTAACGAGGACGTTGGACCCACACGGGGGTGGATTGTGAGATCCCACTTGGAGAGGGGAATGAACCATTCCTTGTAAGGGTATGGAAACCTCTCCCTAGCAAACACGTTTTAAACCGTGAGACTGACGACGATATGTAACGGGCCAAAACGGACAATATTTGCTAGCGGTGGACTTGGGAGGTTACAAATGGTATCAGAGCCAGATATCGGGTGGTGTGCCAACGTGGGGGTGGATTGTGAGATCTCACTTGGAGAGGGGAACGAACCATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGCATACGCGTTTTAAAACTGTGAGGTTGATGGCGATACATAACGGGTCAAAACGAACAATATTTGCTAGCGATGGACTTGAGCCGTTACATTTTGAGTTTCTAGAATTTTAGTTTGTGATGCATTCTTTTACGTTTACAATGTGTTGCAGGAAGTTACTCCAGATATATATAACATCTTCCAGATTACCAATTGGTGGAAAGTTTATCGCTGCTCGGTCAAGAAGGATGCTCATTCAAGGGGATACTTTTGTATGCTGGTGAGCTTTATGTTACCTCCTTTGCTTGATATTTCATTTCCTATAAAGCATATTTTGGCTTCTGGGTGAAGTTTAGTTTTTATAGTGGATATGGTTGATGAAATGATGTCAAATGGACTCCCCACAAAACTAGCAAAGAGAGCCTTTCCAGCCAACAACCCAAGTCCACCAC
TAGCTAATAATGTTCGTTTTGGCCCGTTACGTATCGTCGTCAACCTCACGGTTCTAAAACGCATTTGTTAGGGAGAAGTTTTCACATCCTAATAACGAATGCTTCGTTCACTTCTCCAACCGATGTGGGATCTCACAATCCAACCCCTTTGGGCCCTAGCGTTCTCGCTAGCACATTGCCCGATGTCTGGCTCTGATACCATTTGTAATAGCCCAAGCCCACTGCTAGTATATATTGTCTGTCTTGGCCCGTTATGTATCGTTGTTAGCCTCACGGTTTTAAAACGTGTCTGTTAGGGAGAGGTTTCCACACCCTAATAAGAAATGCGTCGTTCACCTTCGACCTTTCCAACCGATATGGGATCTCACAATCCTATCCCCTTGGGGCCAGCATCCTTGTTGGCTCACCACCTGGTGTTTGGCTTTGATACCATTTGTAACAACCCAAGCCCACTGCTAGCATATATTGTCTATCTTGGCCCGTTATGTATCGCTGTTAGCCTCACGGTTTTAAAATGCGTCTGTTAGGGAGAGGTTTCCACACCCTAATAAGCAATTTTTCATTCATCTCTCTAACCAATGTGGGATCTCACACTCCACCCCCCTTAGGGCCTAGCGTCGTCGCTGGCACACCGTTCGGTGTCTAGCTCTTATACCATTTGTAACAACCCAAACCCACCACTAGCATATATTGTCTGCTTTGGTCCGTTACATATCGTCGTCAGCTTCACGGTTTTAAAACGCGTCTGTTGGGGAGAGGTTTCCACATCCTAATAAGGAATGCTTTTTTCACATCTCCAACTGATGTGAGATCTCACAATTACAAAAGCTAGCATTCTCAACGGCCACAGCAGCCCTTTTCAAAATCCGTGGAGTATAAGCATAGGTTTTGTGTACTGTTTTACATATCTTCTAATCTTATTCTCGTTCTAGTTCTTATGTCGTCTGCACGGAGAGGTGTCGAAAAGTAGAATACGAGAGTGACGAACCTGTTATTGCTCTAGTACTATGCTTCATTTCAACATTTTATATCATAAACTGATGATATAATTTATCCATTCACTGAGCAGTTGAGCAAACTGCCGGTGAAATCCTTCAGTTGTCAACCATTTTCCAATTCCATAATGGGGAGAGAACTCTGCGTTGCCAATCTTGAAGTTCAAAATGGCCTTTCATTGACAGTAGCAACAAGCCATCTTGAGAGTCCTTGCCCTGCACCTCCAAAATGGAATCAAATGTACAGCAAAGAGCGTGTAATTCAAGCCAAAGAAGCCATCGACTTTCTCAATGAAAATCCGAACGTCGTTTTCGGCGGTGACATGAACTGGGACGATAAGTTGGATGGTCAGTTTCCTTTTCCCGATGACTGGATTGATGCCTGGGAAGAATTACGCCCCGGTGAAAATGGTTGGACATACGATACCAAATCGAACAAGATGTTATCTGGGAACCGTACGCTGCAAAGACGTCTGGATCGATTCGTTTGTAAGTTACAAGATTTCAAGGTAAGTTCCATTGTAATGATTGGGACTGATCCAATTCCTGAATTAACATACACAAAGGAGAAGAAAGTAGGTAAAGAAATGAAGATGCTTGAGCTCCCTGTTTTGCCCAGTGATCATTATGGCCTGCTCTTGACCATAAGCAGCCTGTAA
mRNA sequence
ATGTCGTCCGAGCCTAAGTTTGTTTCAAGTAGATTAAAACGACGATTGAATCGGAACTCGAAGCGGCAAAAATCGAGCAAATCGTTGATGTTTCAATCAATAGCACATTCACTATCTTTCTCAAGCGCCCGCTTTCTTCATCGCTCGGTGATCAATCGTCGAACGATTTGCAGTCTATCTTTGCCGATGTCTAGTTGGTCCTGCAAAAAATGCACATTCCTCAACCCACCTTCCCAAAAGGCAGCCTGCAAAATCTGTCTGTCTCCTTCATCTCCTCCGCCATCACCGTCTTCTTCCTCCGCCCCTCAATGGTCCTGCAAGGCCTGCACCTTCCTGAACCCATATAAGAATTCCGATTGCGAGCTCTGTGGCACTAGGGCTCCGTCCCTCTCGCTTTCGAGTTTCAAGGATTTGATTGATGTCAGCGAGGATGCGGATGCGGATTCTTCTGTTGGGTCTGTGTTCTTTCCCTTGCAGCCCTGCAAGAAAAGGAAACTAGACGATCCTGTTCCTGTGGTGGGTGGTGTCAATTTCGCGGAATTGGGCGCATTTCGTGGTGTTAAGGCATCGACGAACACTATTGCTGAAATGGGGGATTCTAGTTCTAGGACAAGTTTGATACCCATAAAGATTTTGACTTACAATGTATGGTTCCGAGAAGATTTGGAGATGCGTAATAGAATGAGAGCCCTTGGACAACTTATCCAACGGCATTCACCAGATGTTATTTGTTTCCAGGAAGTTACTCCAGATATATATAACATCTTCCAGATTACCAATTGGTGGAAAGTTTATCGCTGCTCGGTCAAGAAGGATGCTCATTCAAGGGGATACTTTTGTATGCTGTTGAGCAAACTGCCGGTGAAATCCTTCAGTTGTCAACCATTTTCCAATTCCATAATGGGGAGAGAACTCTGCGTTGCCAATCTTGAAGTTCAAAATGGCCTTTCATTGACAGTAGCAACAAGCCATCTTGAGAGTCCTTGCCCTGCACCTCCAAAATGGAATCAAATGTACAGCAAAGAGCGTGTAATTCAAGCCAAAGAAGCCATCGACTTTCTCAATGAAAATCCGAACGTCGTTTTCGGCGGTGACATGAACTGGGACGATAAGTTGGATGGTCAGTTTCCTTTTCCCGATGACTGGATTGATGCCTGGGAAGAATTACGCCCCGGTGAAAATGGTTGGACATACGATACCAAATCGAACAAGATGTTATCTGGGAACCGTACGCTGCAAAGACGTCTGGATCGATTCGTTTGTAAGTTACAAGATTTCAAGGTAAGTTCCATTGTAATGATTGGGACTGATCCAATTCCTGAATTAACATACACAAAGGAGAAGAAAGTAGGTAAAGAAATGAAGATGCTTGAGCTCCCTGTTTTGCCCAGTGATCATTATGGCCTGCTCTTGACCATAAGCAGCCTGTAA
Coding sequence (CDS)
ATGTCGTCCGAGCCTAAGTTTGTTTCAAGTAGATTAAAACGACGATTGAATCGGAACTCGAAGCGGCAAAAATCGAGCAAATCGTTGATGTTTCAATCAATAGCACATTCACTATCTTTCTCAAGCGCCCGCTTTCTTCATCGCTCGGTGATCAATCGTCGAACGATTTGCAGTCTATCTTTGCCGATGTCTAGTTGGTCCTGCAAAAAATGCACATTCCTCAACCCACCTTCCCAAAAGGCAGCCTGCAAAATCTGTCTGTCTCCTTCATCTCCTCCGCCATCACCGTCTTCTTCCTCCGCCCCTCAATGGTCCTGCAAGGCCTGCACCTTCCTGAACCCATATAAGAATTCCGATTGCGAGCTCTGTGGCACTAGGGCTCCGTCCCTCTCGCTTTCGAGTTTCAAGGATTTGATTGATGTCAGCGAGGATGCGGATGCGGATTCTTCTGTTGGGTCTGTGTTCTTTCCCTTGCAGCCCTGCAAGAAAAGGAAACTAGACGATCCTGTTCCTGTGGTGGGTGGTGTCAATTTCGCGGAATTGGGCGCATTTCGTGGTGTTAAGGCATCGACGAACACTATTGCTGAAATGGGGGATTCTAGTTCTAGGACAAGTTTGATACCCATAAAGATTTTGACTTACAATGTATGGTTCCGAGAAGATTTGGAGATGCGTAATAGAATGAGAGCCCTTGGACAACTTATCCAACGGCATTCACCAGATGTTATTTGTTTCCAGGAAGTTACTCCAGATATATATAACATCTTCCAGATTACCAATTGGTGGAAAGTTTATCGCTGCTCGGTCAAGAAGGATGCTCATTCAAGGGGATACTTTTGTATGCTGTTGAGCAAACTGCCGGTGAAATCCTTCAGTTGTCAACCATTTTCCAATTCCATAATGGGGAGAGAACTCTGCGTTGCCAATCTTGAAGTTCAAAATGGCCTTTCATTGACAGTAGCAACAAGCCATCTTGAGAGTCCTTGCCCTGCACCTCCAAAATGGAATCAAATGTACAGCAAAGAGCGTGTAATTCAAGCCAAAGAAGCCATCGACTTTCTCAATGAAAATCCGAACGTCGTTTTCGGCGGTGACATGAACTGGGACGATAAGTTGGATGGTCAGTTTCCTTTTCCCGATGACTGGATTGATGCCTGGGAAGAATTACGCCCCGGTGAAAATGGTTGGACATACGATACCAAATCGAACAAGATGTTATCTGGGAACCGTACGCTGCAAAGACGTCTGGATCGATTCGTTTGTAAGTTACAAGATTTCAAGGTAAGTTCCATTGTAATGATTGGGACTGATCCAATTCCTGAATTAACATACACAAAGGAGAAGAAAGTAGGTAAAGAAATGAAGATGCTTGAGCTCCCTGTTTTGCCCAGTGATCATTATGGCCTGCTCTTGACCATAAGCAGCCTGTAA
Protein sequence
MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLSLPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDCELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAELGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNVVFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL
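The protein sequence above is the conceptual translation of the coding sequence. As a minimal sketch, assuming the standard genetic code and reading frame 0, the snippet below translates the first 18 and last 12 nucleotides of the CDS and reproduces the first six residues (MSSEPK) and last three residues (SSL) of the protein. The function name `translate` and the variable names are illustrative only; they are not part of the database page.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGTCGTCCGAGCCTAAG"  # first 18 nt of the CDS listed above
cds_end = "AGCAGCCTGTAA"          # last 12 nt of the CDS listed above

print(translate(cds_start))  # MSSEPK
print(translate(cds_end))    # SSL
```

Passing the full CDS string to `translate` should give the complete protein, provided the CDS length is a multiple of three.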
Homology
BLAST of Csor.00g043910 vs. ExPASy Swiss-Prot
Match:
Q5XJA0 (Tyrosyl-DNA phosphodiesterase 2 OS=Danio rerio OX=7955 GN=tdp2 PE=1 SV=3)
HSP 1 Score: 67.4 bits (163), Expect = 5.0e-10
Identity = 78/289 (26.99%), Positives = 125/289 (43.25%), Query Frame = 0
Query: 195 AEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTPDIYN 254
AE G + S + I+++NV + L + +R R L + ++PDV+ QE+ P
Sbjct: 109 AENGTAKSEVEDSKLSIISWNVDGLDTLNLADRARGLCSYLALYTPDVVFLQELIPAYVQ 168
Query: 255 IFQITNWWKVYRCSVKKDAHSRGYFC-MLLSKLPVKSFSCQ--PFSNSIMGRELCVANLE 314
+ K + + GYF ++L K VK + F + M R L +A +
Sbjct: 169 YLK-----KRAVSYLFFEGSDDGYFTGIMLRKSRVKFLESEIICFPTTQMMRNLLIAQVT 228
Query: 315 VQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPN---VVFGGDMNW 374
+G L + TSHLES C NQ S+ER Q + + + E P V+F GD N
Sbjct: 229 F-SGQKLYLMTSHLES-CK-----NQ--SQERTKQLRVVLQKIKEAPEDAIVIFAGDTNL 288
Query: 375 -DDKLDGQFPFPDDWIDAWEELRPGEN-GWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDF 434
D ++ P D WE+L E+ +T+DTK+N + + R DR
Sbjct: 289 RDAEVANVGGLPAGVCDVWEQLGKQEHCRYTWDTKANSNKTVPYVSRCRFDR-------- 348
Query: 435 KVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISS 476
I + P +T +G M+ L+ SDH+G+ T ++
Sbjct: 349 ----IFLRSAKTAPPVTPDHMALIG--MEKLDCGRYTSDHWGIYCTFNT 369
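The percentages in each HSP header follow directly from the counts beside them: matching positions divided by alignment length. A small sketch that recomputes them from a header line is shown here; the regular expression and the function name `hsp_percentages` are assumptions made for illustration, and the pattern accepts either spelling of the Positives label.

```python
import re

# Captures "Identity = a/b" and "Positives = c/d" from an HSP header line.
HSP_LINE = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP identity line")
    identical, length, positive, positive_length = map(int, match.groups())
    return (
        round(100.0 * identical / length, 2),
        round(100.0 * positive / positive_length, 2),
    )


header = "Identity = 78/289 (26.99%), Positives = 125/289 (43.25%), Query Frame = 0"
print(hsp_percentages(header))  # (26.99, 43.25)
```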
BLAST of Csor.00g043910 vs. ExPASy Swiss-Prot
Match:
Q9JLG8 (Calpain-15 OS=Mus musculus OX=10090 GN=Capn15 PE=1 SV=1)
HSP 1 Score: 47.4 bits (111), Expect = 5.4e-04
Identity = 26/66 (39.39%), Positives = 32/66 (48.48%), Query Frame = 0
Query: 63 MSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPS----SSSAPQWSCKACTFLNPYKNS 122
+ WSC +CTFLNP Q+ C IC +P P S +W C CTF N
Sbjct: 4 VGEWSCARCTFLNPAGQR-QCSICEAPRHKPDLDQILRLSVEEQKWPCARCTFRNFLGKE 63
Query: 123 DCELCG 125
CE+CG
Sbjct: 64 ACEVCG 68
BLAST of Csor.00g043910 vs. NCBI nr
Match:
KAG6573182.1 (Tyrosyl-DNA phosphodiesterase 2, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 961 bits (2483), Expect = 0.0
Identity = 476/476 (100.00%), Positives = 476/476 (100.00%), Query Frame = 0
Query: 1 MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS
Sbjct: 1 MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
Query: 61 LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC 120
LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC
Sbjct: 61 LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC 120
Query: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE 180
ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE
Sbjct: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE 180
Query: 181 LGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP 240
LGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP
Sbjct: 181 LGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP 240
Query: 241 DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI 300
DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI
Sbjct: 241 DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI 300
Query: 301 MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV 360
MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV
Sbjct: 301 MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV 360
Query: 361 VFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 420
VFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV
Sbjct: 361 VFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 420
Query: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL
Sbjct: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
BLAST of Csor.00g043910 vs. NCBI nr
Match:
XP_022954436.1 (uncharacterized protein LOC111456698 [Cucurbita moschata])
HSP 1 Score: 948 bits (2451), Expect = 0.0
Identity = 470/476 (98.74%), Positives = 472/476 (99.16%), Query Frame = 0
Query: 1 MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
MSSEPKFVS RLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS
Sbjct: 1 MSSEPKFVSRRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
Query: 61 LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC 120
LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSA QWSCKACTFLNPYKNSDC
Sbjct: 61 LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSASQWSCKACTFLNPYKNSDC 120
Query: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE 180
ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE
Sbjct: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE 180
Query: 181 LGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP 240
LGAFRGVKASTNTIAEMGDSSSRT+L PIKILTYNVWFREDLEMRNRMRALGQLIQRHSP
Sbjct: 181 LGAFRGVKASTNTIAEMGDSSSRTNLTPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP 240
Query: 241 DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI 300
DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI
Sbjct: 241 DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI 300
Query: 301 MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV 360
MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV
Sbjct: 301 MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV 360
Query: 361 VFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 420
VFGGDMNWDDK DGQFPFPD+WIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV
Sbjct: 361 VFGGDMNWDDKSDGQFPFPDNWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 420
Query: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL
Sbjct: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
BLAST of Csor.00g043910 vs. NCBI nr
Match:
XP_023542187.1 (uncharacterized protein LOC111802155 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 931 bits (2407), Expect = 0.0
Identity = 464/476 (97.48%), Positives = 467/476 (98.11%), Query Frame = 0
Query: 1 MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
MSS+ VS RLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS
Sbjct: 1 MSSD---VSRRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
Query: 61 LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC 120
+ MSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC
Sbjct: 61 VSMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC 120
Query: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE 180
ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGG NFAE
Sbjct: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGDNFAE 180
Query: 181 LGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP 240
LGAFRG+KASTNTIAEMGDSSSRTSL PIKILTYNVWFREDLEMRNRMRALGQLIQRHSP
Sbjct: 181 LGAFRGIKASTNTIAEMGDSSSRTSLTPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP 240
Query: 241 DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI 300
DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI
Sbjct: 241 DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI 300
Query: 301 MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV 360
MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFL ENPNV
Sbjct: 301 MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLKENPNV 360
Query: 361 VFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 420
VFGGDMNWDDK DGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV
Sbjct: 361 VFGGDMNWDDKSDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 420
Query: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL
Sbjct: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 473
BLAST of Csor.00g043910 vs. NCBI nr
Match:
XP_022994193.1 (uncharacterized protein LOC111490005 [Cucurbita maxima])
HSP 1 Score: 894 bits (2310), Expect = 0.0
Identity = 437/447 (97.76%), Positives = 440/447 (98.43%), Query Frame = 0
Query: 30 MFQSIAHSLSFSSARFLHRSVINRRTICSLSLPMSSWSCKKCTFLNPPSQKAACKICLSP 89
MFQSIA SLSFSSARFLHRSVINRRT CSLS+PMSSWSCKKCTFLNPPSQKAACKICLSP
Sbjct: 1 MFQSIARSLSFSSARFLHRSVINRRTFCSLSVPMSSWSCKKCTFLNPPSQKAACKICLSP 60
Query: 90 SSPPPSPSSSSAPQWSCKACTFLNPYKNSDCELCGTRAPSLSLSSFKDLIDVSEDADADS 149
SSPPPSPSSSSAPQWSCKACTFLNPYKNSDCELCGTRAPSLSLSSFKDLIDVSEDADADS
Sbjct: 61 SSPPPSPSSSSAPQWSCKACTFLNPYKNSDCELCGTRAPSLSLSSFKDLIDVSEDADADS 120
Query: 150 SVGSVFFPLQPCKKRKLDDPVPVVGGVNFAELGAFRGVKASTNTIAEMGDSSSRTSLIPI 209
SVGSVFFPLQPCKKRKLDDPVPVVGG NFAELGAFRGVKAS NTIAEMGDSSSRTSL PI
Sbjct: 121 SVGSVFFPLQPCKKRKLDDPVPVVGGDNFAELGAFRGVKASANTIAEMGDSSSRTSLTPI 180
Query: 210 KILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTPDIYNIFQITNWWKVYRCSV 269
KILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTPDIYNIFQITNWWKVYRCSV
Sbjct: 181 KILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTPDIYNIFQITNWWKVYRCSV 240
Query: 270 KKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELCVANLEVQNGLSLTVATSHLESPC 329
KKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELC+ANLEVQNGLSLTVATSHLESPC
Sbjct: 241 KKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELCIANLEVQNGLSLTVATSHLESPC 300
Query: 330 PAPPKWNQMYSKERVIQAKEAIDFLNENPNVVFGGDMNWDDKLDGQFPFPDDWIDAWEEL 389
PAPPKWNQMYSKERVIQAKEAI+FL ENPNVVFGGDMNWDDKLDGQFPFPDDWIDAWEEL
Sbjct: 301 PAPPKWNQMYSKERVIQAKEAINFLKENPNVVFGGDMNWDDKLDGQFPFPDDWIDAWEEL 360
Query: 390 RPGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSSIVMIGTDPIPELTYTKEKK 449
PGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSSIVMIGTDPIPELTYTKEKK
Sbjct: 361 HPGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSSIVMIGTDPIPELTYTKEKK 420
Query: 450 VGKEMKMLELPVLPSDHYGLLLTISSL 476
VGKEMKMLELPVLPSDHYGLLLTISSL
Sbjct: 421 VGKEMKMLELPVLPSDHYGLLLTISSL 447
BLAST of Csor.00g043910 vs. NCBI nr
Match:
KAG7012360.1 (Tyrosyl-DNA phosphodiesterase 2, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 780 bits (2014), Expect = 9.99e-283
Identity = 406/476 (85.29%), Positives = 407/476 (85.50%), Query Frame = 0
Query: 1 MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS
Sbjct: 1 MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
Query: 61 LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC 120
LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC
Sbjct: 61 LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC 120
Query: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE 180
ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE
Sbjct: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE 180
Query: 181 LGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP 240
LGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYN
Sbjct: 181 LGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYN------------------------- 240
Query: 241 DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI 300
LSKLPVKSFSCQPF NSI
Sbjct: 241 ------------------------------------------LSKLPVKSFSCQPFPNSI 300
Query: 301 MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV 360
MGRELC+ANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFL ENPNV
Sbjct: 301 MGRELCIANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLKENPNV 360
Query: 361 VFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 420
VFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV
Sbjct: 361 VFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 409
Query: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL
Sbjct: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 409
BLAST of Csor.00g043910 vs. ExPASy TrEMBL
Match:
A0A6J1GQX2 (uncharacterized protein LOC111456698 OS=Cucurbita moschata OX=3662 GN=LOC111456698 PE=4 SV=1)
HSP 1 Score: 948 bits (2451), Expect = 0.0
Identity = 470/476 (98.74%), Positives = 472/476 (99.16%), Query Frame = 0
Query: 1 MSSEPKFVSSRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
MSSEPKFVS RLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS
Sbjct: 1 MSSEPKFVSRRLKRRLNRNSKRQKSSKSLMFQSIAHSLSFSSARFLHRSVINRRTICSLS 60
Query: 61 LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSAPQWSCKACTFLNPYKNSDC 120
LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSA QWSCKACTFLNPYKNSDC
Sbjct: 61 LPMSSWSCKKCTFLNPPSQKAACKICLSPSSPPPSPSSSSASQWSCKACTFLNPYKNSDC 120
Query: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE 180
ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE
Sbjct: 121 ELCGTRAPSLSLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAE 180
Query: 181 LGAFRGVKASTNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP 240
LGAFRGVKASTNTIAEMGDSSSRT+L PIKILTYNVWFREDLEMRNRMRALGQLIQRHSP
Sbjct: 181 LGAFRGVKASTNTIAEMGDSSSRTNLTPIKILTYNVWFREDLEMRNRMRALGQLIQRHSP 240
Query: 241 DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI 300
DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI
Sbjct: 241 DVICFQEVTPDIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSI 300
Query: 301 MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV 360
MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV
Sbjct: 301 MGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNV 360
Query: 361 VFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 420
VFGGDMNWDDK DGQFPFPD+WIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV
Sbjct: 361 VFGGDMNWDDKSDGQFPFPDNWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFV 420
Query: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL
Sbjct: 421 CKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
BLAST of Csor.00g043910 vs. ExPASy TrEMBL
Match:
A0A6J1K4H6 (uncharacterized protein LOC111490005 OS=Cucurbita maxima OX=3661 GN=LOC111490005 PE=4 SV=1)
HSP 1 Score: 894 bits (2310), Expect = 0.0
Identity = 437/447 (97.76%), Positives = 440/447 (98.43%), Query Frame = 0
Query: 30 MFQSIAHSLSFSSARFLHRSVINRRTICSLSLPMSSWSCKKCTFLNPPSQKAACKICLSP 89
MFQSIA SLSFSSARFLHRSVINRRT CSLS+PMSSWSCKKCTFLNPPSQKAACKICLSP
Sbjct: 1 MFQSIARSLSFSSARFLHRSVINRRTFCSLSVPMSSWSCKKCTFLNPPSQKAACKICLSP 60
Query: 90 SSPPPSPSSSSAPQWSCKACTFLNPYKNSDCELCGTRAPSLSLSSFKDLIDVSEDADADS 149
SSPPPSPSSSSAPQWSCKACTFLNPYKNSDCELCGTRAPSLSLSSFKDLIDVSEDADADS
Sbjct: 61 SSPPPSPSSSSAPQWSCKACTFLNPYKNSDCELCGTRAPSLSLSSFKDLIDVSEDADADS 120
Query: 150 SVGSVFFPLQPCKKRKLDDPVPVVGGVNFAELGAFRGVKASTNTIAEMGDSSSRTSLIPI 209
SVGSVFFPLQPCKKRKLDDPVPVVGG NFAELGAFRGVKAS NTIAEMGDSSSRTSL PI
Sbjct: 121 SVGSVFFPLQPCKKRKLDDPVPVVGGDNFAELGAFRGVKASANTIAEMGDSSSRTSLTPI 180
Query: 210 KILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTPDIYNIFQITNWWKVYRCSV 269
KILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTPDIYNIFQITNWWKVYRCSV
Sbjct: 181 KILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTPDIYNIFQITNWWKVYRCSV 240
Query: 270 KKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELCVANLEVQNGLSLTVATSHLESPC 329
KKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELC+ANLEVQNGLSLTVATSHLESPC
Sbjct: 241 KKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELCIANLEVQNGLSLTVATSHLESPC 300
Query: 330 PAPPKWNQMYSKERVIQAKEAIDFLNENPNVVFGGDMNWDDKLDGQFPFPDDWIDAWEEL 389
PAPPKWNQMYSKERVIQAKEAI+FL ENPNVVFGGDMNWDDKLDGQFPFPDDWIDAWEEL
Sbjct: 301 PAPPKWNQMYSKERVIQAKEAINFLKENPNVVFGGDMNWDDKLDGQFPFPDDWIDAWEEL 360
Query: 390 RPGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSSIVMIGTDPIPELTYTKEKK 449
PGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSSIVMIGTDPIPELTYTKEKK
Sbjct: 361 HPGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSSIVMIGTDPIPELTYTKEKK 420
Query: 450 VGKEMKMLELPVLPSDHYGLLLTISSL 476
VGKEMKMLELPVLPSDHYGLLLTISSL
Sbjct: 421 VGKEMKMLELPVLPSDHYGLLLTISSL 447
BLAST of Csor.00g043910 vs. ExPASy TrEMBL
Match:
A0A6J1CF88 (tyrosyl-DNA phosphodiesterase 2 OS=Momordica charantia OX=3673 GN=LOC111010713 PE=4 SV=1)
HSP 1 Score: 781 bits (2016), Expect = 1.12e-282
Identity = 381/447 (85.23%), Positives = 408/447 (91.28%), Query Frame = 0
Query: 31 FQSIAHSLSFSSARFLHRSVINRRTICSLSLPMSSWSCKKCTFLNPPSQKAACKICLSPS 90
FQ+ SL FS RFLH V N +T SLS+PMS+WSCKKCTF+N PSQK ACKICLSPS
Sbjct: 4 FQTRPDSLIFSFGRFLHCPVTNFQTFRSLSVPMSTWSCKKCTFINSPSQKTACKICLSPS 63
Query: 91 SPPPSP-SSSSAPQWSCKACTFLNPYKNSDCELCGTRAPSLSLSSFKDLIDVSEDADADS 150
SPPP P SSSSAP+WSCKACTFLNPY +SDCELCGTRAP+LSLSSFKDLI++SEDADA S
Sbjct: 64 SPPPPPPSSSSAPKWSCKACTFLNPYNSSDCELCGTRAPALSLSSFKDLIEISEDADAGS 123
Query: 151 SVGSVFFPLQPCKKRKLDDPVPVVGGVNFAELGAFRGVKASTNTIAEMGDSSSRTSLIPI 210
SVGSVFFPLQPCKKRKLDDPVPVVG +FAELGAFR +KAS T+AEMGDSS+RTSL I
Sbjct: 124 SVGSVFFPLQPCKKRKLDDPVPVVGHDDFAELGAFRDIKASGKTVAEMGDSSTRTSLTSI 183
Query: 211 KILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTPDIYNIFQITNWWKVYRCSV 270
KIL+YNVWFREDLEM NRMRALGQLIQRHSPDV+CFQEVTP IYN FQI NWWKVYRCSV
Sbjct: 184 KILSYNVWFREDLEMHNRMRALGQLIQRHSPDVVCFQEVTPAIYNFFQIFNWWKVYRCSV 243
Query: 271 KKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELCVANLEVQNGLSLTVATSHLESPC 330
KDAHSRGYFC+LLSKLPVKSFS +PF NSIMGRELC+ANLE+QNG+SLTVATSHLESPC
Sbjct: 244 SKDAHSRGYFCLLLSKLPVKSFSVKPFFNSIMGRELCIANLELQNGISLTVATSHLESPC 303
Query: 331 PAPPKWNQMYSKERVIQAKEAIDFLNENPNVVFGGDMNWDDKLDGQFPFPDDWIDAWEEL 390
PAPPKWNQMYSKERVIQAKEAID L E+PNV+FGGDMNWDDKLDG+FPFPD WIDAWEEL
Sbjct: 304 PAPPKWNQMYSKERVIQAKEAIDSLKESPNVIFGGDMNWDDKLDGRFPFPDGWIDAWEEL 363
Query: 391 RPGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSSIVMIGTDPIPELTYTKEKK 450
RPGENGWTYDTKSNKMLSGNRTLQ+RLDRFVCKLQD+K SSI MIGTDPIP L+YTKEKK
Sbjct: 364 RPGENGWTYDTKSNKMLSGNRTLQKRLDRFVCKLQDYKASSIEMIGTDPIPGLSYTKEKK 423
Query: 451 VGKEMKMLELPVLPSDHYGLLLTISSL 476
VGKEMK LELPVLPSDHYGLLLTISSL
Sbjct: 424 VGKEMKELELPVLPSDHYGLLLTISSL 450
BLAST of Csor.00g043910 vs. ExPASy TrEMBL
Match:
E5GC61 (Endonuclease/exonuclease/phosphatase family protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)
HSP 1 Score: 775 bits (2002), Expect = 4.86e-280
Identity = 385/466 (82.62%), Positives = 415/466 (89.06%), Query Frame = 0
Query: 23 QKSSKSLMFQSIAHSLSFSSAR---------FLH-RSVINRRTICSLSLPMSSWSCKKCT 82
QKSS+S MF +I S S SS+ FLH R+V NR T S SL MSSWSCKKCT
Sbjct: 16 QKSSESSMFPTIESSSSSSSSSLRSLNSIGFFLHHRTVENRPTFLSFSLSMSSWSCKKCT 75
Query: 83 FLNPPSQKAACKICLSPSSPPPSPSSSSA--PQWSCKACTFLNPYKNSDCELCGTRAPSL 142
FLNP SQKAACKICLSPSSPPPS SSSS+ P+WSCKACTFLN + NS+CELCGTRAP+L
Sbjct: 76 FLNPSSQKAACKICLSPSSPPPSSSSSSSTTPKWSCKACTFLNSFTNSECELCGTRAPAL 135
Query: 143 SLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAELGAFRGVKAS 202
SLSSFKDLIDVSEDA+ADSSVGSVFFPLQPCKKRK+DDPVP+ +FAEL AF+G KAS
Sbjct: 136 SLSSFKDLIDVSEDANADSSVGSVFFPLQPCKKRKMDDPVPLESHGDFAELSAFQGTKAS 195
Query: 203 TNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTP 262
N +AEMG SSSR +L P+KI+TYNVWFREDLE+RNRMRALGQLIQRHSPDVICFQEVTP
Sbjct: 196 MNAVAEMGGSSSRANLKPVKIMTYNVWFREDLELRNRMRALGQLIQRHSPDVICFQEVTP 255
Query: 263 DIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELCVANL 322
IY+IFQITNWWKVYRCSV KD+HSRGYFCMLLSKLPVKSFSCQPF NSIMGRELC+ NL
Sbjct: 256 AIYDIFQITNWWKVYRCSVIKDSHSRGYFCMLLSKLPVKSFSCQPFPNSIMGRELCIGNL 315
Query: 323 EVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNVVFGGDMNWDD 382
EVQNG+SLTVATSHLESPCPAPPKWNQMYSKERV+QAK+A+DFL E PNV+FGGDMNWDD
Sbjct: 316 EVQNGISLTVATSHLESPCPAPPKWNQMYSKERVVQAKQAVDFLKETPNVIFGGDMNWDD 375
Query: 383 KLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSS 442
KLDGQFPFPD WIDAWEELRPGENGWTYDTKSNKMLSGNRTLQ+RLDRF+CKLQDFKV+S
Sbjct: 376 KLDGQFPFPDGWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQKRLDRFICKLQDFKVNS 435
Query: 443 IVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
I MIGTD IP LTYTKEKKVGKEMK LELPVLPSDHYGLLLTISSL
Sbjct: 436 IEMIGTDSIPGLTYTKEKKVGKEMKTLELPVLPSDHYGLLLTISSL 481
BLAST of Csor.00g043910 vs. ExPASy TrEMBL
Match:
A0A1S3B294 (tyrosyl-DNA phosphodiesterase 2 OS=Cucumis melo OX=3656 GN=LOC103485200 PE=4 SV=1)
HSP 1 Score: 775 bits (2002), Expect = 4.86e-280
Identity = 385/466 (82.62%), Positives = 415/466 (89.06%), Query Frame = 0
Query: 23 QKSSKSLMFQSIAHSLSFSSAR---------FLH-RSVINRRTICSLSLPMSSWSCKKCT 82
QKSS+S MF +I S S SS+ FLH R+V NR T S SL MSSWSCKKCT
Sbjct: 16 QKSSESSMFPTIESSSSSSSSSLRSLNSIGFFLHHRTVENRPTFLSFSLSMSSWSCKKCT 75
Query: 83 FLNPPSQKAACKICLSPSSPPPSPSSSSA--PQWSCKACTFLNPYKNSDCELCGTRAPSL 142
FLNP SQKAACKICLSPSSPPPS SSSS+ P+WSCKACTFLN + NS+CELCGTRAP+L
Sbjct: 76 FLNPSSQKAACKICLSPSSPPPSSSSSSSTTPKWSCKACTFLNSFTNSECELCGTRAPAL 135
Query: 143 SLSSFKDLIDVSEDADADSSVGSVFFPLQPCKKRKLDDPVPVVGGVNFAELGAFRGVKAS 202
SLSSFKDLIDVSEDA+ADSSVGSVFFPLQPCKKRK+DDPVP+ +FAEL AF+G KAS
Sbjct: 136 SLSSFKDLIDVSEDANADSSVGSVFFPLQPCKKRKMDDPVPLESHGDFAELSAFQGTKAS 195
Query: 203 TNTIAEMGDSSSRTSLIPIKILTYNVWFREDLEMRNRMRALGQLIQRHSPDVICFQEVTP 262
N +AEMG SSSR +L P+KI+TYNVWFREDLE+RNRMRALGQLIQRHSPDVICFQEVTP
Sbjct: 196 MNAVAEMGGSSSRANLKPVKIMTYNVWFREDLELRNRMRALGQLIQRHSPDVICFQEVTP 255
Query: 263 DIYNIFQITNWWKVYRCSVKKDAHSRGYFCMLLSKLPVKSFSCQPFSNSIMGRELCVANL 322
IY+IFQITNWWKVYRCSV KD+HSRGYFCMLLSKLPVKSFSCQPF NSIMGRELC+ NL
Sbjct: 256 AIYDIFQITNWWKVYRCSVIKDSHSRGYFCMLLSKLPVKSFSCQPFPNSIMGRELCIGNL 315
Query: 323 EVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAIDFLNENPNVVFGGDMNWDD 382
EVQNG+SLTVATSHLESPCPAPPKWNQMYSKERV+QAK+A+DFL E PNV+FGGDMNWDD
Sbjct: 316 EVQNGISLTVATSHLESPCPAPPKWNQMYSKERVVQAKQAVDFLKETPNVIFGGDMNWDD 375
Query: 383 KLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQRRLDRFVCKLQDFKVSS 442
KLDGQFPFPD WIDAWEELRPGENGWTYDTKSNKMLSGNRTLQ+RLDRF+CKLQDFKV+S
Sbjct: 376 KLDGQFPFPDGWIDAWEELRPGENGWTYDTKSNKMLSGNRTLQKRLDRFICKLQDFKVNS 435
Query: 443 IVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLLTISSL 476
I MIGTD IP LTYTKEKKVGKEMK LELPVLPSDHYGLLLTISSL
Sbjct: 436 IEMIGTDSIPGLTYTKEKKVGKEMKTLELPVLPSDHYGLLLTISSL 481
BLAST of Csor.00g043910 vs. TAIR 10
Match:
AT1G11800.1 (endonuclease/exonuclease/phosphatase family protein)
HSP 1 Score: 509.6 bits (1311), Expect = 2.7e-144
Identity = 261/423 (61.70%), Positives = 311/423 (73.52%), Query Frame = 0
Query: 64 SSWSCKKCTFLNPPSQKAACKICLSP------SSPPPSPSSSS--APQWSCKACTFLNPY 123
SSWSC KCTFLN SQK C ICL+P S PPPS S S+ +W+CKACTFLN Y
Sbjct: 18 SSWSCNKCTFLNSASQKLNCMICLAPVSLPSLSPPPPSLSISANDEAKWACKACTFLNTY 77
Query: 124 KNSDCELCGTRAPSLSLSSFKDLIDVS-EDADADSSVGSVFFPLQPCKKRK-LDDPVPVV 183
KNS C++CGTR+P+ SL F+DL D E DADSSVGSVFFPL+ C KRK +DD V V
Sbjct: 78 KNSICDVCGTRSPTSSLLGFQDLTDSGLESNDADSSVGSVFFPLRRCIKRKAMDDDVVEV 137
Query: 184 GGVNFAELGAFRGVKASTNTIAEMG-DSSSRTSLIPIKILTYNVWFREDLEMRNRMRALG 243
G + +GV I G S S T L +KIL+YNVWFREDLE+ RMRA+G
Sbjct: 138 DGASVV-CSESQGVMKKNKEIETKGVASDSGTPLTCLKILSYNVWFREDLELNLRMRAIG 197
Query: 244 QLIQRHSPDVICFQEVTPDIYNIFQITNWWKVYRCSVKKD-AHSRGYFCMLLSKLPVKSF 303
LIQ HSP +ICFQEVTP+IY+IF+ +NWWK Y CSV D A SRGY+CMLLSKL VKSF
Sbjct: 198 HLIQLHSPHLICFQEVTPEIYDIFRKSNWWKAYSCSVSVDVAVSRGYYCMLLSKLGVKSF 257
Query: 304 SCQPFSNSIMGRELCVANLEVQNGLSLTVATSHLESPCPAPPKWNQMYSKERVIQAKEAI 363
S + F NSIMGREL +A +EV L ATSHLESPCP PPKW+QM+S+ERV QAKEAI
Sbjct: 258 SSKSFGNSIMGRELSIAEVEVPGRKPLVFATSHLESPCPGPPKWDQMFSRERVEQAKEAI 317
Query: 364 DFLNENPNVVFGGDMNWDDKLDGQFPFPDDWIDAWEELRPGENGWTYDTKSNKMLSGNRT 423
+ L N NV+FGGDMNW DKLDG+FP PD W+D WE L+PG+ G+TYDTK+N MLSGNR
Sbjct: 318 EILRPNANVIFGGDMNWCDKLDGKFPLPDKWVDVWEVLKPGDLGFTYDTKANPMLSGNRA 377
Query: 424 LQRRLDRFVCKLQDFKVSSIVMIGTDPIPELTYTKEKKVGKEMKMLELPVLPSDHYGLLL 475
LQ+RLDR +C+L D+K+ I M+G + IP L+Y KEKKV ++K LELPVLPSDH+GLL+
Sbjct: 378 LQKRLDRILCRLDDYKLGGIEMVGKEAIPGLSYVKEKKVRGDIKKLELPVLPSDHFGLLV 437
The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q5XJA0 | 5.0e-10 | 26.99 | Tyrosyl-DNA phosphodiesterase 2 OS=Danio rerio OX=7955 GN=tdp2 PE=1 SV=3
Q9JLG8 | 5.4e-04 | 39.39 | Calpain-15 OS=Mus musculus OX=10090 GN=Capn15 PE=1 SV=1
NCBI nr
Match Name | E-value | Identity (%) | Description
KAG6573182.1 | 0.0 | 100.00 | Tyrosyl-DNA phosphodiesterase 2, partial [Cucurbita argyrosperma subsp. sororia]
XP_022954436.1 | 0.0 | 98.74 | uncharacterized protein LOC111456698 [Cucurbita moschata]
XP_023542187.1 | 0.0 | 97.48 | uncharacterized protein LOC111802155 [Cucurbita pepo subsp. pepo]
XP_022994193.1 | 0.0 | 97.76 | uncharacterized protein LOC111490005 [Cucurbita maxima]
KAG7012360.1 | 9.99e-283 | 85.29 | Tyrosyl-DNA phosphodiesterase 2, partial [Cucurbita argyrosperma subsp. argyrosp...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GQX2 | 0.0 | 98.74 | uncharacterized protein LOC111456698 OS=Cucurbita moschata OX=3662 GN=LOC1114566...
A0A6J1K4H6 | 0.0 | 97.76 | uncharacterized protein LOC111490005 OS=Cucurbita maxima OX=3661 GN=LOC111490005...
A0A6J1CF88 | 1.12e-282 | 85.23 | tyrosyl-DNA phosphodiesterase 2 OS=Momordica charantia OX=3673 GN=LOC111010713 P...
E5GC61 | 4.86e-280 | 82.62 | Endonuclease/exonuclease/phosphatase family protein OS=Cucumis melo subsp. melo ...
A0A1S3B294 | 4.86e-280 | 82.62 | tyrosyl-DNA phosphodiesterase 2 OS=Cucumis melo OX=3656 GN=LOC103485200 PE=4 SV=...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G11800.1 | 2.7e-144 | 61.70 | endonuclease/exonuclease/phosphatase family protein
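For downstream use, the pipe-delimited summary rows can be read into records and ranked by E-value. The following is a rough sketch, assuming each row carries at least four cells in the order match name, E-value, identity, description; any trailing cells are ignored. The helper name `parse_row` and the dictionary keys are hypothetical.

```python
def parse_row(row: str) -> dict:
    """Split one summary row into typed fields; extra trailing cells are dropped."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # e.g. "2.7e-144" parses as a float
        "identity": float(identity),  # percent identity
        "description": description,
    }


rows = [
    "Q5XJA0 | 5.0e-10 | 26.99 | Tyrosyl-DNA phosphodiesterase 2 OS=Danio rerio OX=7955 GN=tdp2 PE=1 SV=3",
    "Q9JLG8 | 5.4e-04 | 39.39 | Calpain-15 OS=Mus musculus OX=10090 GN=Capn15 PE=1 SV=1",
    "AT1G11800.1 | 2.7e-144 | 61.70 | endonuclease/exonuclease/phosphatase family protein",
]

# Smaller E-values indicate more significant hits, so sort ascending.
hits = sorted((parse_row(row) for row in rows), key=lambda hit: hit["evalue"])
print([hit["name"] for hit in hits])  # ['AT1G11800.1', 'Q5XJA0', 'Q9JLG8']
```

E-values from different databases are not strictly comparable because they depend on database size, so a ranking like this is best applied within a single table.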