Cp4.1LG02g07650 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG02g07650
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: DUF21 domain-containing protein
Location: Cp4.1LG02: 633236 .. 637697 (+)
RNA-Seq Expression: Cp4.1LG02g07650
Synteny: Cp4.1LG02g07650
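
The Location field packs a sequence name, start and end coordinates, and a strand into a single string. A minimal Python sketch of splitting it apart is shown here. The names `Location`, `parse_location`, and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Hypothetical pattern mirroring the "Cp4.1LG02: 633236 .. 637697 (+)" layout.
_LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text: str) -> Location:
    """Split a location string into sequence name, start, end, and strand."""
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Cp4.1LG02: 633236 .. 637697 (+)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the gene spans 4462 positions on the plus strand.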
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCGGTGTTGTGAGTCGAAGTTCTTTCTGTTCCTGTTGATAATCGTAGGACTGGTGGCCTTCGCTGGACTCATGGCTGGGCTCACTTTGGGTCTCATGTCTCTTGGCCTCGTTGACCTCGAAGTTCTTATAAAGTCTGGCCGCCCTCAGGATCGCAAACATGCTGGTAATCTCTATCACTTCCCACAGCTCTTGAAATTCATTTTGAACCTACTTGAATCCTGGTGATCTCAGAGGGCTTATCACCTGAATGTTCTTGGCGTTTCAGGAACCTTTTCATTGTTTGTAATTTGTTAGGGCCTGTTGGCTATTGAAATTATGCAATACTGTGTTTAGGAACGGGGAATTAGGTAGTTGTGTTTGTCTGCATGCTTATGAACTGAGTGGATAGGTTGATTGGGAAGTAACATTGTTGTGATTTCTTTGGCAGCTAAGATATTGCCAGTGGTAAAGAATCAGCATCTTCTGCTTTGCACACTTTTGATTGGTAACTCTTTGGCAATGGAGGTTAGTAGCTACATTTGCACATCTCTGTTTGTGGAGCTTGCATAATTTGCTAATTAAACTTAACGTTGTTGTATGTGTCCCTCCTGCAGGCTCTTCCGATATTCTTAGACATGATCGTACCTCCTTGGCTGGCAGTCCTTGTCTCGGTTACTCTTATTCTCATGTTTGGAGAGGTACTTCTGATCCGACCTATCCGAAAATAATTTGTACATTTGGATATTATTCTGTTTATGTGACGTATACTTCACTGCAGATATTACCGCAAGCAATTTGCACTCGTTATGGGTTGAAAGTTGGGGCAATAATGGCACCTTTTGTTCGCGTTCTTCTCATGATTTTCTTTCCCATTTCATATCCAATTAGTAAGGTAAACTAGTGGAAATGATCATTGCCCTTACCTCTTCTAAGTTCTAAGCAAGGAAATGAAAATTGTAGACTGAGAATGATAGGATAATAGTTTTTTGGCCCTATTTACATGATATTTGGGAAGCTTTTTTGAGTATCTTATATAGCTACAAGGAACCAATAAATTTACTGAGTCACTCATTTGGTTTCTTTTTCGAAATTTAGGTTCTTGATTGGATGTTGGGTAAAGGGCACGCTGTACTCTTGAGGAGAGCAGAGCTCAAGACTTTTGTGAACTTTCATGGCAATGAGGTTTGTCAATCAACTTGGGCATGTAGTCATGGGCATCAGTTGAATGAATTAATATGCTTGGATCTGAGTGCTCTAAAGGAATATTATCCTTATGTGTGCTCTGATTTTCCTCTTACTCCATTCAAGTTTATCTGATGATTATGGTGTATAAAATCTATGTTGGCTTAGGCTGGAAAAGGTGGAGATTTAACTCACGATGAGACTACTATCATTGCTGGGGCACTTGAATTGACAGAAAAGACAGCCAAAGATGCCATGACATCGATATCAAATGCATTTTCCCTTGATCTGGATGCAACTCTTGATTTGTGGGTTTCTAATATATATATATATTTTTTTAATTTTAAAATTTCAGTTCTTGTGTCGGTCTCGCTTGGTTTGCTCGTTATTTCTTTTTTCTAAAGCCCTTTCTCTATATGGGCAGGGAGACGCTCAATGCTATTATGACGAAAGGCCATAGTCGAGTTCCTGTTTATTCTGGAGATCCAAAAAACATAATTGGACTAGTTCTGGTAATTCATTTGCATCATCCATAGATATGACAAACAATAACATTTCTTGATAGGTTGAAATGAATTCTCTTTCTTGCCTTTTCTGTCGTGGCTCCTAATAGTGGGATAATGGGAGCTAAGAATGAAATTCAGAAATGGCTAAGAGGGAATTCACAGGTCCCTCTTGCTTCCCGATTTTAAAAATCAGTGATACTAAACATGCAAAGCTGAAGTACCTTGCATACACCATAAACCAGCTCTTGCACAAATTAGTGAACTTTCGCTATGCTATACTTTTGGACAAAGGAGTTTAAGAATGTGAGCTAAATTGTTAGACTGCCGA
CCAACGATTAGGAAAATAGAGAGCCTTGGTCAAGCTGAGTTGTTTCAAATTAAAATTTGCAACATATAAACCAAGTACAAAGCCTAATTGAGCAGATATGATTAGTTTGAATGGTGTACCTCTAATTTGCGGGTTGCTCTGGCATATGTTTTTCAATATGTTTATTCAAGTGAGTCGAAGTTAGTGAAATTTGTTTTCATTCCTGACTGACACACATATATATATATATTCATTTGGTGTGTGTTTATTGTGCCAATGATAGTTGGACTTGAGATATAGTCCATGGAATGCAGCATCTGCTCATTTGCTCCATCTTGAACATCTTGCAGGTTAAAAATCTGTTAACTGTTGATCCAGAAGATAGAGTTCTCCTTAGAAAAATGATTATTAGAAAAATTCCACGGTAAGGAGAATATCACTTGCTTCTCTCTTTATGCCTTCAGTTGAAATTACCTGATGACCAAGCTTGACCAGTTTTCATTCCTTCAACTACTTCAGGGTTTCTGAAGACATGCCTCTATATGACATTTTAAATGAATTTCAAAAGGGCCATAGTCACATTGCTGTGGTATTCAAGAAACATGGCTACCAATCTGAGGCATTGCTGAAAAAAGGTTAGGATGTGCAAAAAAGCTGCGAGTTCTTATAATTGGATAAAAAGAAAGTAATTATGAACAATAGACTATCCAGTCATGTTTAGGGCAGTTGGGCTTGAGATCACATAAATTTTAAATTTGTTGTCTATGCTGTCCATTCCTTTATGGCTCAGATTTTACGTTCATAGTATTTCTCATCCAAATTTGTTAAGGTTCTAGACATTGGTTAGCTTGAACTCATATTTGCTCGAGTCATTGAGCTCCGCCTCTTTTTAATGCTTTCCCTGTTTCCAAATTCCTTGGTATCGTATTAGATCCACCAATTACAGGGGATTCACATCAAATGCTCTTAGTTGTGCTGCTTTCACTGTGAATCTATTGTTACTTTTGACATTTCCTTTTCTTGTCTGGTTGATTCTCAGACAATGGTGTTGACTCAACTGCTGGTGCTGCTACTCATAATTTAGCGATGAAAATGGAATTAGTTGATGCTCAAACAATAGCTGAAAAGGCCGGGGGCGAACAGACGAAGAAAAGTCCACCAGCTACTCCTGCCTTTAAAAAACGACACAGAGGTTGTTCATTCTGCATTTTAGATGTCGAAAATGCGCCTCTTCCTGTCTTACCACCTGGTGAAGAGGTGGTTGGTGTCATTACTATGGAGGATGTGATTGAAGAACTTCTTCAGGTAAGGAATTTTTCTGTAACTTGCACCTAAGCATGGCTTCTAAAGTTGGGAGGTTATCTCTGCATGCGCTGCAGTGTCATCCTCCATATCGAAAATGTTGGTATGTTAGGATGTCTGGTGGTTAACATCTTAAGTCGAAGACATTGTGCCTTTTCTTTTTCGGATACCAGCTCCTTGATTGTTGTCTAAGTTATTGGCTTCATAAAAATGGATTAGTTAGTTTGTATTGAGCTGGTTGGCTGTATGTTACTGTTAAATCATCCATCTTACCCATTATTTGAATTTGATCTTGCGTGTCATATATTCTTATGTCAGGAGGAGATATTAGACGAGACAGACGAGTATGTCAATATCCACAACAGGTACATCTTACTAGCTTTAGAAAAGTTAAGAACCCGGACTCGAACTCGAACTCGTGCATGGTTTTGCTTTCTTGCTGAGATTGCACCATTACAAATCTAAGGATTGGATCCATTTTGTTGTTGTAAATATATTTCTTATGAAATCATTTGATGTTTTTGTTGGATGATCACAAAAATTAAACGGAATTCTTCCAAATGCAGAATAAAAATCAACATGCAACCATCTCCGGAAAAACCAAGCACCAATCCACCTCAGCTTTCCCCAAATGTTAATACGTGACTGCTTTGTTACCCAGCAATGCCGGGTCGGACAAGTTTTCACGGGGTCTTCCGAATTGGCTACTGCAGCTCTTC
CCCACTCTCTTCACTATGCCTGCCTTCGATAGTCAGGTTTTACTAAAACTTTTTTCTTTTAATCATACAAGAGATTTAAACTGGAAGAATGGAAGGTGAAATCTAGATAAGAGTTAGCTGTAGCGTAATTTATCTCTATGATGGCATAGAGATGTATATTTTCCCAAGGTTAGATGATTGCATTTTGGAAATGACGTTCTTTGATTGTATAAGAATAATATGTAGTTATCTTCAATCCAAAGCAAGTGTATCAATTAAGTGACATATAGTTGTTTTTGGTATGGAATACCAACTCTTGATATCATTATGTCGCATAGCAATTTGATTTTTGTAATGGGTTCTACATCATATTCAAATTGAAAGTTTATTATGAAAGAGGATATTCTATAAAGTGTTTTTTCATTTTTGCTTGGTAGCTCGTCTTGACCATTGAGTCGTGATGATAATGAAGTAAATGGCTTT

mRNA sequence

ATGTTCGGACTGGTGGCCTTCGCTGGACTCATGGCTGGGCTCACTTTGGGTCTCATGTCTCTTGGCCTCGTTGACCTCGAAGTTCTTATAAAGTCTGGCCGCCCTCAGGATCGCAAACATGCTGCTAAGATATTGCCAGTGGTAAAGAATCAGCATCTTCTGCTTTGCACACTTTTGATTGGTAACTCTTTGGCAATGGAGGCTCTTCCGATATTCTTAGACATGATCGTACCTCCTTGGCTGGCAGTCCTTGTCTCGGTTACTCTTATTCTCATGTTTGGAGAGATATTACCGCAAGCAATTTGCACTCGTTATGGGTTGAAAGTTGGGGCAATAATGGCACCTTTTGTTCGCGTTCTTCTCATGATTTTCTTTCCCATTTCATATCCAATTAGTAAGGTTCTTGATTGGATGTTGGGTAAAGGGCACGCTGTACTCTTGAGGAGAGCAGAGCTCAAGACTTTTGTGAACTTTCATGGCAATGAGGCTGGAAAAGGTGGAGATTTAACTCACGATGAGACTACTATCATTGCTGGGGCACTTGAATTGACAGAAAAGACAGCCAAAGATGCCATGACATCGATATCAAATGCATTTTCCCTTGATCTGGATGCAACTCTTGATTTGGAGACGCTCAATGCTATTATGACGAAAGGCCATAGTCGAGTTCCTGTTTATTCTGGAGATCCAAAAAACATAATTGGACTAGTTCTGGTTAAAAATCTGTTAACTGTTGATCCAGAAGATAGAGTTCTCCTTAGAAAAATGATTATTAGAAAAATTCCACGGGTTTCTGAAGACATGCCTCTATATGACATTTTAAATGAATTTCAAAAGGGCCATAGTCACATTGCTGTGGTATTCAAGAAACATGGCTACCAATCTGAGGCATTGCTGAAAAAAGACAATGGTGTTGACTCAACTGCTGGTGCTGCTACTCATAATTTAGCGATGAAAATGGAATTAGTTGATGCTCAAACAATAGCTGAAAAGGCCGGGGGCGAACAGACGAAGAAAAGTCCACCAGCTACTCCTGCCTTTAAAAAACGACACAGAGGTTGTTCATTCTGCATTTTAGATGTCGAAAATGCGCCTCTTCCTGTCTTACCACCTGGTGAAGAGGTGGTTGGTGTCATTACTATGGAGGATGTGATTGAAGAACTTCTTCAGGAGGAGATATTAGACGAGACAGACGAGTATGTCAATATCCACAACAGAATAAAAATCAACATGCAACCATCTCCGGAAAAACCAAGCACCAATCCACCTCAGCTTTCCCCAAATGTTAATACGTGACTGCTTTGTTACCCAGCAATGCCGGGTCGGACAAGTTTTCACGGGGTCTTCCGAATTGGCTACTGCAGCTCTTCCCCACTCTCTTCACTATGCCTGCCTTCGATAGTCAGGTTTTACTAAAACTTTTTTCTTTTAATCATACAAGAGATTTAAACTGGAAGAATGGAAGGTGAAATCTAGATAAGAGTTAGCTGTAGCGTAATTTATCTCTATGATGGCATAGAGATGTATATTTTCCCAAGGTTAGATGATTGCATTTTGGAAATGACGTTCTTTGATTGTATAAGAATAATATGTAGTTATCTTCAATCCAAAGCAAGTGTATCAATTAAGTGACATATAGTTGTTTTTGGTATGGAATACCAACTCTTGATATCATTATGTCGCATAGCAATTTGATTTTTGTAATGGGTTCTACATCATATTCAAATTGAAAGTTTATTATGAAAGAGGATATTCTATAAAGTGTTTTTTCATTTTTGCTTGGTAGCTCGTCTTGACCATTGAGTCGTGATGATAATGAAGTAAATGGCTTT

Coding sequence (CDS)

ATGTTCGGACTGGTGGCCTTCGCTGGACTCATGGCTGGGCTCACTTTGGGTCTCATGTCTCTTGGCCTCGTTGACCTCGAAGTTCTTATAAAGTCTGGCCGCCCTCAGGATCGCAAACATGCTGCTAAGATATTGCCAGTGGTAAAGAATCAGCATCTTCTGCTTTGCACACTTTTGATTGGTAACTCTTTGGCAATGGAGGCTCTTCCGATATTCTTAGACATGATCGTACCTCCTTGGCTGGCAGTCCTTGTCTCGGTTACTCTTATTCTCATGTTTGGAGAGATATTACCGCAAGCAATTTGCACTCGTTATGGGTTGAAAGTTGGGGCAATAATGGCACCTTTTGTTCGCGTTCTTCTCATGATTTTCTTTCCCATTTCATATCCAATTAGTAAGGTTCTTGATTGGATGTTGGGTAAAGGGCACGCTGTACTCTTGAGGAGAGCAGAGCTCAAGACTTTTGTGAACTTTCATGGCAATGAGGCTGGAAAAGGTGGAGATTTAACTCACGATGAGACTACTATCATTGCTGGGGCACTTGAATTGACAGAAAAGACAGCCAAAGATGCCATGACATCGATATCAAATGCATTTTCCCTTGATCTGGATGCAACTCTTGATTTGGAGACGCTCAATGCTATTATGACGAAAGGCCATAGTCGAGTTCCTGTTTATTCTGGAGATCCAAAAAACATAATTGGACTAGTTCTGGTTAAAAATCTGTTAACTGTTGATCCAGAAGATAGAGTTCTCCTTAGAAAAATGATTATTAGAAAAATTCCACGGGTTTCTGAAGACATGCCTCTATATGACATTTTAAATGAATTTCAAAAGGGCCATAGTCACATTGCTGTGGTATTCAAGAAACATGGCTACCAATCTGAGGCATTGCTGAAAAAAGACAATGGTGTTGACTCAACTGCTGGTGCTGCTACTCATAATTTAGCGATGAAAATGGAATTAGTTGATGCTCAAACAATAGCTGAAAAGGCCGGGGGCGAACAGACGAAGAAAAGTCCACCAGCTACTCCTGCCTTTAAAAAACGACACAGAGGTTGTTCATTCTGCATTTTAGATGTCGAAAATGCGCCTCTTCCTGTCTTACCACCTGGTGAAGAGGTGGTTGGTGTCATTACTATGGAGGATGTGATTGAAGAACTTCTTCAGGAGGAGATATTAGACGAGACAGACGAGTATGTCAATATCCACAACAGAATAAAAATCAACATGCAACCATCTCCGGAAAAACCAAGCACCAATCCACCTCAGCTTTCCCCAAATGTTAATACGTGA

Protein sequence

MFGLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGNSLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLMIFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALELTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLLTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDNGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVENAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNPPQLSPNVNT
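
The protein sequence above can be related back to the CDS by translating it codon by codon. The sketch below is one way to do that in Python, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database's own tooling. It raises `KeyError` on any codon containing characters other than A, C, G, or T.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTCGGACTGGTGGCC"))  # first six codons of the CDS above
print(translate("GTTAATACGTGA"))        # last four codons, ending in the stop codon
```

The two fragments translate to `MFGLVA` and `VNT`, which agree with the start and end of the protein sequence listed on this page.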
Homology
BLAST of Cp4.1LG02g07650 vs. ExPASy Swiss-Prot
Match: Q8RY60 (DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF7 PE=1 SV=1)

HSP 1 Score: 583.2 bits (1502), Expect = 2.5e-165
Identity = 319/438 (72.83%), Positives = 351/438 (80.14%), Query Frame = 0

Query: 1   MFGLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLI 60
           +  LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDR +A KI PVVKNQHLLLCTLLI
Sbjct: 19  IIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAGKIFPVVKNQHLLLCTLLI 78

Query: 61  GNSLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVL 120
           GNS+AMEALPIFLD IVPPWLA+L+SVTLIL+FGEI+PQA+CTRYGLKVGAIMAPFVRVL
Sbjct: 79  GNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVCTRYGLKVGAIMAPFVRVL 138

Query: 121 LMIFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGA 180
           L++FFPISYPISKVLDWMLGKGH VLLRRAELKTFVNFHGNEAGKGGDLT DET+II GA
Sbjct: 139 LVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNEAGKGGDLTTDETSIITGA 198

Query: 181 LELTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVK 240
           LELTEKTAKDAMT ISNAFSL+LD  L+LETLN IM+ GHSRVPVY  +P +IIGL+LVK
Sbjct: 199 LELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSRVPVYFRNPTHIIGLILVK 258

Query: 241 NLLTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLK 300
           NLL VD    V LRKM +RKIPRVSE MPLYDILNEFQKGHSHIAVV+K    Q ++   
Sbjct: 259 NLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHSHIAVVYKDLDEQEQSPET 318

Query: 301 KDNGVDSTAGAAT-------------------HNLAMKMELVDAQTIAEKAGGEQT---K 360
            +NG++      T                        K+E  DA++   + G EQ    K
Sbjct: 319 SENGIERRKNKKTKDELFKDSCRKPKAQFEVSEKEVFKIETGDAKSGKSENGEEQQGSGK 378

Query: 361 KSPPATPAFKKRHRGCSFCILDVENAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETD 417
            S  A PA KKRHRGCSFCILD+EN P+P  P  EEVVGVITMEDVIEELLQEEILDETD
Sbjct: 379 TSLLAAPA-KKRHRGCSFCILDIENTPIPDFPTNEEVVGVITMEDVIEELLQEEILDETD 438
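
The percentages in each HSP summary line are the identical (or positively scoring) column counts divided by the alignment length. A small Python sketch that reproduces the figures for the Q8RY60 hit follows; the function name `percent` is an illustrative choice.

```python
def percent(count: int, alignment_length: int) -> float:
    """Share of alignment columns, as a percentage rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts taken from the HSP summary of the Q8RY60 match above.
identity = percent(319, 438)
positives = percent(351, 438)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

This gives 72.83% identity and 80.14% positives, matching the values reported for that hit.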

BLAST of Cp4.1LG02g07650 vs. ExPASy Swiss-Prot
Match: Q9ZQR4 (DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF3 PE=2 SV=2)

HSP 1 Score: 423.7 bits (1088), Expect = 2.5e-117
Identity = 236/406 (58.13%), Positives = 295/406 (72.66%), Query Frame = 0

Query: 4   LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGNS 63
           LV FAGLM+GLTLGLMS+ LVDLEVL KSG P+DR HAAKILPVVKNQHLLLCTLLI N+
Sbjct: 22  LVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAAKILPVVKNQHLLLCTLLICNA 81

Query: 64  LAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLMI 123
            AMEALPIFLD +V  W A+L+SVTLIL+FGEI+PQ++C+R+GL +GA +APFVRVL+ I
Sbjct: 82  AAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVCSRHGLAIGATVAPFVRVLVWI 141

Query: 124 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 183
             P+++PISK+LD++LG G   L RRAELKT V+ HGNEAGKGG+LTHDETTIIAGALEL
Sbjct: 142 CLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNEAGKGGELTHDETTIIAGALEL 201

Query: 184 TEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLL 243
           +EK AKDAMT IS+ F +D++A LD + +N I+ KGHSRVPVY     NIIGLVLVKNLL
Sbjct: 202 SEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSRVPVYYEQRTNIIGLVLVKNLL 261

Query: 244 TVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKK----HGYQS---- 303
           T++P++ + ++ + IR+IPRV E +PLYDILNEFQKGHSH+AVV ++    H  QS    
Sbjct: 262 TINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAVVVRQCDKIHPLQSNDAA 321

Query: 304 -EALLKKDNGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGC 363
            E + +    VD         L  +  L   ++   +A    ++         K+  +  
Sbjct: 322 NETVNEVRVDVDYERSPQETKLKRRRSLQKWKSFPNRANSLGSRS--------KRWSKDN 381

Query: 364 SFCILDVENAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEY 401
              IL +   PLP L   E+ VG+ITMEDVIEELLQEEI DETD +
Sbjct: 382 DADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDETDHH 419

BLAST of Cp4.1LG02g07650 vs. ExPASy Swiss-Prot
Match: Q8VZI2 (DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF6 PE=1 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 9.5e-117
Identity = 242/411 (58.88%), Positives = 296/411 (72.02%), Query Frame = 0

Query: 4   LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGNS 63
           LV FAGLM+GLTLGLMSL LVDLEVL KSG P+ RK+AAKILPVVKNQHLLL TLLI N+
Sbjct: 22  LVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAAKILPVVKNQHLLLVTLLICNA 81

Query: 64  LAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLMI 123
            AME LPIFLD +V  W A+L+SVTLIL+FGEI+PQ+IC+RYGL +GA +APFVRVL+ I
Sbjct: 82  AAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSICSRYGLAIGATVAPFVRVLVFI 141

Query: 124 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 183
             P+++PISK+LD++LG   A L RRAELKT V+FHGNEAGKGG+LTHDETTIIAGALEL
Sbjct: 142 CLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNEAGKGGELTHDETTIIAGALEL 201

Query: 184 TEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLL 243
           +EK  KDAMT IS+ F +D++A LD + +N I+ KGHSRVPVY   P NIIGLVLVKNLL
Sbjct: 202 SEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSRVPVYYEQPTNIIGLVLVKNLL 261

Query: 244 TVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFK----------KHGY 303
           T++P++ + ++ + IR+IPRV E +PLYDILNEFQKG SH+AVV +          K+G 
Sbjct: 262 TINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLSHMAVVVRQCDKIHPLPSKNGS 321

Query: 304 QSEALLKKDNGVDSTAGAATHNLAMKMELVDAQTIAEKA----GGEQTKKSPPATPAFKK 363
             EA +  D+  + T       L  K  L   ++   +A    GG ++KK      A   
Sbjct: 322 VKEARVDVDS--EGTPTPQERMLRTKRSLQKWKSFPNRASSFKGGSKSKKWSKDNDA--- 381

Query: 364 RHRGCSFCILDVENAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEY 401
                   IL +   PLP L   EE VG+ITMEDVIEELLQEEI DETD +
Sbjct: 382 -------DILQLNGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDETDHH 420

BLAST of Cp4.1LG02g07650 vs. ExPASy Swiss-Prot
Match: Q9LTD8 (DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF5 PE=2 SV=2)

HSP 1 Score: 405.6 bits (1041), Expect = 7.0e-112
Identity = 220/417 (52.76%), Positives = 293/417 (70.26%), Query Frame = 0

Query: 4   LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGNS 63
           LV FAGLM+GLTLGLMSL +V+LEV+IK+G P DRK+A KILP+VKNQHLLLCTLLIGN+
Sbjct: 23  LVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEKILPLVKNQHLLLCTLLIGNA 82

Query: 64  LAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLMI 123
           LAMEALPIF+D ++P W A+L+SVTLIL FGEI+PQA+C+RYGL +GA ++  VR+++++
Sbjct: 83  LAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCSRYGLSIGAKLSFLVRLIIIV 142

Query: 124 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 183
           FFP+SYPISK+LD +LGK H+ LL RAELK+ V  HGNEAGKGG+LTHDETTII+GAL++
Sbjct: 143 FFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEAGKGGELTHDETTIISGALDM 202

Query: 184 TEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLL 243
           ++K+AKDAMT +S  FSLD++  LD +T+  I + GHSR+P+YS +P  IIG +LVKNL+
Sbjct: 203 SQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRIPIYSVNPNVIIGFILVKNLI 262

Query: 244 TVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDN 303
            V PED   +R + IR++P+V  ++PLYDILN FQ G SH+A V     + +      + 
Sbjct: 263 KVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSHMAAVVGTKNHTNTNTPVHEK 322

Query: 304 GVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVEN 363
            ++ +               DA              S PA  + +  H+     I  + +
Sbjct: 323 SINGSPNK------------DANVFL----------SIPALNSSETSHQSPIRYIDSISD 382

Query: 364 APLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPST 421
                    EEV+G+IT+EDV+EEL+QEEI DETD+YV +H RI INM  S   P T
Sbjct: 383 -------EDEEVIGIITLEDVMEELIQEEIYDETDQYVELHKRITINMPMSGNSPET 410

BLAST of Cp4.1LG02g07650 vs. ExPASy Swiss-Prot
Match: Q67XQ0 (DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBSDUF1 PE=1 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 5.7e-106
Identity = 217/410 (52.93%), Positives = 289/410 (70.49%), Query Frame = 0

Query: 4   LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGNS 63
           LV FAG+M+GLTLGLMSLGLV+LE+L +SG P ++K AA I PVV+ QH LL TLL+ N+
Sbjct: 45  LVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNA 104

Query: 64  LAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLMI 123
           +AME LPI+LD +   ++A+++SVT +L FGE++PQAICTRYGL VGA     VR+L+ +
Sbjct: 105 MAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTL 164

Query: 124 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 183
            +PI++PI K+LD +LG   A L RRA+LK  V+ H  EAGKGG+LTHDETTII+GAL+L
Sbjct: 165 CYPIAFPIGKILDLVLGHNDA-LFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDL 224

Query: 184 TEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLL 243
           TEKTA++AMT I + FSLD+++ LD E +  I+ +GHSRVPVYSG+PKN+IGL+LVK+LL
Sbjct: 225 TEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLL 284

Query: 244 TVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQS--EALLKK 303
           TV PE   L+  + IR+IPRV  DMPLYDILNEFQKG SH+A V K  G      + L +
Sbjct: 285 TVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLE 344

Query: 304 DNGVDSTAGAATHNLAMKMELVDAQTIA--EKAGGEQTKKSPPATPAFKKRHRGCSFCIL 363
           ++  +S     T  L +K E      I   +KA G+   ++  + P       G S    
Sbjct: 345 EHTDESNDSDLTAPLLLKREGNHDNVIVTIDKANGQSFFQNNESGP------HGFSHTSE 404

Query: 364 DVENAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKI 410
            +E+          EV+G+IT+EDV EELLQEEI+DETDEYV++H RI++
Sbjct: 405 AIEDG---------EVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRV 438

BLAST of Cp4.1LG02g07650 vs. NCBI nr
Match: XP_023525037.1 (DUF21 domain-containing protein At1g47330 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 821 bits (2120), Expect = 5.34e-299
Identity = 429/429 (100.00%), Positives = 429/429 (100.00%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN
Sbjct: 21  GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 80

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM
Sbjct: 81  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 140

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 141 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 200

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL
Sbjct: 201 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 260

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD
Sbjct: 261 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 320

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
           NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE
Sbjct: 321 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 380

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP
Sbjct: 381 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 440

Query: 423 PQLSPNVNT 431
           PQLSPNVNT
Sbjct: 441 PQLSPNVNT 449

BLAST of Cp4.1LG02g07650 vs. NCBI nr
Match: XP_022941204.1 (DUF21 domain-containing protein At1g47330 [Cucurbita moschata])

HSP 1 Score: 808 bits (2087), Expect = 5.70e-294
Identity = 421/429 (98.14%), Positives = 426/429 (99.30%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLVAFAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN
Sbjct: 21  GLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 80

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALPIFLDMIVPPW+AVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM
Sbjct: 81  SLAMEALPIFLDMIVPPWVAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 140

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 141 VFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 200

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAK+AMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL
Sbjct: 201 LTEKTAKNAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 260

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDPEDRV LRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD
Sbjct: 261 LTVDPEDRVPLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 320

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
           NGVDS AGAATHNLAMKME VDAQTIAEKAGG+QTKKSPPATPAFKKRHRGCSFCILDVE
Sbjct: 321 NGVDSAAGAATHNLAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVE 380

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP
Sbjct: 381 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 440

Query: 423 PQLSPNVNT 431
           PQLSPNVNT
Sbjct: 441 PQLSPNVNT 449

BLAST of Cp4.1LG02g07650 vs. NCBI nr
Match: KAG6608536.1 (DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 806 bits (2081), Expect = 4.68e-293
Identity = 420/429 (97.90%), Positives = 425/429 (99.07%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLVAFAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN
Sbjct: 21  GLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 80

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALPIFLDMIVPPW+AVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM
Sbjct: 81  SLAMEALPIFLDMIVPPWVAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 140

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 141 VFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 200

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAK+AMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL
Sbjct: 201 LTEKTAKNAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 260

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDPEDRV LRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD
Sbjct: 261 LTVDPEDRVPLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 320

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
           NGVDS AGAATHNLAMKME VDAQTIAEKAGG+QTKKSPPATPAFKKRHRGCSFCILDVE
Sbjct: 321 NGVDSAAGAATHNLAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVE 380

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP
Sbjct: 381 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 440

Query: 423 PQLSPNVNT 431
           PQLSP VNT
Sbjct: 441 PQLSPKVNT 449

BLAST of Cp4.1LG02g07650 vs. NCBI nr
Match: XP_022981956.1 (DUF21 domain-containing protein At1g47330 [Cucurbita maxima])

HSP 1 Score: 800 bits (2065), Expect = 1.28e-290
Identity = 416/429 (96.97%), Positives = 422/429 (98.37%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLVAFAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN
Sbjct: 21  GLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 80

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALP+FLDMIVPPW+AVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLL 
Sbjct: 81  SLAMEALPVFLDMIVPPWVAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLT 140

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 141 VFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 200

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNI+GLVLVKNL
Sbjct: 201 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIVGLVLVKNL 260

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDPED V LRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD
Sbjct: 261 LTVDPEDGVPLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 320

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
           NGVDS AGAATHN AMKME VDAQTIAEKAGG+QTKKSPPATPAFKKRHRGCSFCILDVE
Sbjct: 321 NGVDSAAGAATHNFAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVE 380

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLPVLP GEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP
Sbjct: 381 NAPLPVLPAGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 440

Query: 423 PQLSPNVNT 431
           PQLSPNVNT
Sbjct: 441 PQLSPNVNT 449

BLAST of Cp4.1LG02g07650 vs. NCBI nr
Match: XP_038904672.1 (DUF21 domain-containing protein At1g47330-like isoform X1 [Benincasa hispida])

HSP 1 Score: 774 bits (1999), Expect = 1.35e-280
Identity = 407/428 (95.09%), Positives = 412/428 (96.26%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN
Sbjct: 21  GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 80

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALPIFLDMIVPPW AVLVSVTLILMFGEILPQAICTRYGLKVGAIMAP VR+LLM
Sbjct: 81  SLAMEALPIFLDMIVPPWAAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPLVRILLM 140

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 141 VFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 200

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL
Sbjct: 201 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 260

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDPEDRV LRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHG+QSE LLKKD
Sbjct: 261 LTVDPEDRVSLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGHQSETLLKKD 320

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
           NGVDS   AA  N+ MKME   AQTIAEKAGG+QTKKSPPATPAFKKRHRGCSFCILDVE
Sbjct: 321 NGVDSGDAAAAQNIGMKME--SAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVE 380

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLPV PPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP
Sbjct: 381 NAPLPVFPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 440

Query: 423 PQLSPNVN 430
            Q SPNVN
Sbjct: 441 SQPSPNVN 446

BLAST of Cp4.1LG02g07650 vs. ExPASy TrEMBL
Match: A0A6J1FMM0 (DUF21 domain-containing protein At1g47330 OS=Cucurbita moschata OX=3662 GN=LOC111446582 PE=4 SV=1)

HSP 1 Score: 808 bits (2087), Expect = 2.76e-294
Identity = 421/429 (98.14%), Positives = 426/429 (99.30%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLVAFAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN
Sbjct: 21  GLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 80

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALPIFLDMIVPPW+AVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM
Sbjct: 81  SLAMEALPIFLDMIVPPWVAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 140

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 141 VFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 200

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAK+AMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL
Sbjct: 201 LTEKTAKNAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 260

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDPEDRV LRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD
Sbjct: 261 LTVDPEDRVPLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 320

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
           NGVDS AGAATHNLAMKME VDAQTIAEKAGG+QTKKSPPATPAFKKRHRGCSFCILDVE
Sbjct: 321 NGVDSAAGAATHNLAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVE 380

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP
Sbjct: 381 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 440

Query: 423 PQLSPNVNT 431
           PQLSPNVNT
Sbjct: 441 PQLSPNVNT 449

BLAST of Cp4.1LG02g07650 vs. ExPASy TrEMBL
Match: A0A6J1IXZ7 (DUF21 domain-containing protein At1g47330 OS=Cucurbita maxima OX=3661 GN=LOC111480949 PE=4 SV=1)

HSP 1 Score: 800 bits (2065), Expect = 6.21e-291
Identity = 416/429 (96.97%), Positives = 422/429 (98.37%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLVAFAGLMAGLTLGLMSLGLVDLEVL+KSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN
Sbjct: 21  GLVAFAGLMAGLTLGLMSLGLVDLEVLMKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 80

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALP+FLDMIVPPW+AVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLL 
Sbjct: 81  SLAMEALPVFLDMIVPPWVAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLT 140

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 141 VFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 200

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNI+GLVLVKNL
Sbjct: 201 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIVGLVLVKNL 260

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDPED V LRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD
Sbjct: 261 LTVDPEDGVPLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 320

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
           NGVDS AGAATHN AMKME VDAQTIAEKAGG+QTKKSPPATPAFKKRHRGCSFCILDVE
Sbjct: 321 NGVDSAAGAATHNFAMKMESVDAQTIAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVE 380

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLPVLP GEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP
Sbjct: 381 NAPLPVLPAGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 440

Query: 423 PQLSPNVNT 431
           PQLSPNVNT
Sbjct: 441 PQLSPNVNT 449

BLAST of Cp4.1LG02g07650 vs. ExPASy TrEMBL
Match: A0A1S3CRQ3 (DUF21 domain-containing protein At1g47330 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503947 PE=4 SV=1)

HSP 1 Score: 768 bits (1983), Expect = 1.93e-278
Identity = 402/428 (93.93%), Positives = 409/428 (95.56%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN
Sbjct: 21  GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 80

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALPIFLDMIVPPW AVLVSVTLILMFGEILPQAICTRYGLKVGAIMAP VR+LLM
Sbjct: 81  SLAMEALPIFLDMIVPPWAAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPLVRILLM 140

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 141 VFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 200

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL
Sbjct: 201 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 260

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDPEDRV L+KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHG+QSE L  KD
Sbjct: 261 LTVDPEDRVSLKKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGHQSETLPNKD 320

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
           NGVDS   AA  N+ MKME VDAQT+AEKAGG+QTKKSPPATPAFKKRHRGCSFCILDVE
Sbjct: 321 NGVDSGDAAAAQNIGMKMESVDAQTVAEKAGGQQTKKSPPATPAFKKRHRGCSFCILDVE 380

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLPV PP EEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPS N 
Sbjct: 381 NAPLPVFPPREEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSINQ 440

Query: 423 PQLSPNVN 430
            Q SPNVN
Sbjct: 441 SQPSPNVN 448
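
The percentages in an HSP summary line follow directly from the residue counts. As a minimal sketch (the helper name `parse_hsp_summary` and its regular expression are illustrative assumptions, not part of any BLAST toolkit), the summary line of the A0A1S3CRQ3 hit can be parsed and its percentages recomputed:

```python
import re

# Hypothetical helper for HSP summary lines as they appear on this page.
# The pattern also tolerates the "Postives" spelling seen in some report renderings.
_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    m = _HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        # Recomputed from the counts, rounded like the report (two decimals).
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }

summary = parse_hsp_summary(
    "Identity = 402/428 (93.93%), Positives = 409/428 (95.56%), Query Frame = 0"
)
print(summary["identity_pct"], summary["positive_pct"])  # 93.93 95.56
```

The recomputed values agree with the reported ones, which is a quick consistency check when hit tables are copied by hand.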

BLAST of Cp4.1LG02g07650 vs. ExPASy TrEMBL
Match: A0A0A0LGK3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G856000 PE=4 SV=1)

HSP 1 Score: 762 bits (1968), Expect = 3.72e-276
Identity = 401/428 (93.69%), Positives = 409/428 (95.56%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN
Sbjct: 21  GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 80

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALPIFLDMIVPPW AVLVSVTLILMFGEILPQAICTRYGLKVGAIMAP VR+LL+
Sbjct: 81  SLAMEALPIFLDMIVPPWAAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPLVRILLI 140

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 141 VFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 200

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL
Sbjct: 201 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 260

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDPEDRV L+KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHG+QSE L KKD
Sbjct: 261 LTVDPEDRVSLKKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGHQSETLPKKD 320

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
            GV+S   AA  N+ MKME VDAQT+AEKAGG QTKKSPPATPAFKKRHRGCSFCILDVE
Sbjct: 321 IGVNSGDAAAAQNIGMKMESVDAQTVAEKAGGLQTKKSPPATPAFKKRHRGCSFCILDVE 380

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLPV P GEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEK S N 
Sbjct: 381 NAPLPVFPLGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKLSINQ 440

Query: 423 PQLSPNVN 430
           PQLSPNVN
Sbjct: 441 PQLSPNVN 448

BLAST of Cp4.1LG02g07650 vs. ExPASy TrEMBL
Match: A0A6J1CFY3 (DUF21 domain-containing protein At1g47330-like OS=Momordica charantia OX=3673 GN=LOC111011071 PE=4 SV=1)

HSP 1 Score: 761 bits (1964), Expect = 3.05e-274
Identity = 394/424 (92.92%), Positives = 405/424 (95.52%), Query Frame = 0

Query: 3   GLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGN 62
           GLV FAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKI PVVKNQHLLLCTLLIGN
Sbjct: 102 GLVTFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKIFPVVKNQHLLLCTLLIGN 161

Query: 63  SLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLM 122
           SLAMEALPIFLD IVPPW A+LVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVR+LLM
Sbjct: 162 SLAMEALPIFLDKIVPPWAAILVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRILLM 221

Query: 123 IFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 182
           +FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE
Sbjct: 222 LFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALE 281

Query: 183 LTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 242
           LTEKTAKDAMT ISNAFSLDLDATLDL+TLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL
Sbjct: 282 LTEKTAKDAMTPISNAFSLDLDATLDLDTLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNL 341

Query: 243 LTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 302
           LTVDP+DR+ L+KMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD
Sbjct: 342 LTVDPDDRISLKKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKD 401

Query: 303 NGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVE 362
           NGVDS A AAT NL MK+E VDAQT AEK GG+Q KKSPPATPAFKKRH+GCSFCILDVE
Sbjct: 402 NGVDSGADAATQNLVMKLESVDAQTTAEKGGGQQIKKSPPATPAFKKRHKGCSFCILDVE 461

Query: 363 NAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPSTNP 422
           NAPLP+ PP EEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQ SPEKP TNP
Sbjct: 462 NAPLPIFPPSEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQASPEKPGTNP 521

Query: 423 PQLS 426
            QLS
Sbjct: 522 SQLS 525
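
The middle line of each alignment block encodes how the identity and positive counts arise: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below (the function name `midline_counts` is hypothetical, and it assumes the midline has already been cut to the aligned columns with its padding spaces intact) tallies one block:

```python
def midline_counts(midline):
    """Count identities and positives from one BLAST alignment midline.

    Letters are identical residues, '+' are conservative substitutions,
    and spaces are mismatches or gaps.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    # Positives include identities plus conservative substitutions.
    return identities, identities + conservative, len(midline)

# Final block of the alignment above: Query "PQLS" against Sbjct "SQLS".
identities, positives, length = midline_counts(" QLS")
print(identities, positives, length)  # 3 3 4
```

Summing these tallies over every block of an HSP should reproduce the counts in its `Identity` and `Positives` fields.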

BLAST of Cp4.1LG02g07650 vs. TAIR 10
Match: AT1G47330.1 (CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 583.2 bits (1502), Expect = 1.7e-166
Identity = 319/438 (72.83%), Positives = 351/438 (80.14%), Query Frame = 0

Query: 1   MFGLVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLI 60
           +  LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDR +A KI PVVKNQHLLLCTLLI
Sbjct: 19  IIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAGKIFPVVKNQHLLLCTLLI 78

Query: 61  GNSLAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVL 120
           GNS+AMEALPIFLD IVPPWLA+L+SVTLIL+FGEI+PQA+CTRYGLKVGAIMAPFVRVL
Sbjct: 79  GNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVCTRYGLKVGAIMAPFVRVL 138

Query: 121 LMIFFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGA 180
           L++FFPISYPISKVLDWMLGKGH VLLRRAELKTFVNFHGNEAGKGGDLT DET+II GA
Sbjct: 139 LVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNEAGKGGDLTTDETSIITGA 198

Query: 181 LELTEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVK 240
           LELTEKTAKDAMT ISNAFSL+LD  L+LETLN IM+ GHSRVPVY  +P +IIGL+LVK
Sbjct: 199 LELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSRVPVYFRNPTHIIGLILVK 258

Query: 241 NLLTVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLK 300
           NLL VD    V LRKM +RKIPRVSE MPLYDILNEFQKGHSHIAVV+K    Q ++   
Sbjct: 259 NLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHSHIAVVYKDLDEQEQSPET 318

Query: 301 KDNGVDSTAGAAT-------------------HNLAMKMELVDAQTIAEKAGGEQT---K 360
            +NG++      T                        K+E  DA++   + G EQ    K
Sbjct: 319 SENGIERRKNKKTKDELFKDSCRKPKAQFEVSEKEVFKIETGDAKSGKSENGEEQQGSGK 378

Query: 361 KSPPATPAFKKRHRGCSFCILDVENAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETD 417
            S  A PA KKRHRGCSFCILD+EN P+P  P  EEVVGVITMEDVIEELLQEEILDETD
Sbjct: 379 TSLLAAPA-KKRHRGCSFCILDIENTPIPDFPTNEEVVGVITMEDVIEELLQEEILDETD 438

BLAST of Cp4.1LG02g07650 vs. TAIR 10
Match: AT2G14520.1 (CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 423.7 bits (1088), Expect = 1.8e-118
Identity = 236/406 (58.13%), Positives = 295/406 (72.66%), Query Frame = 0

Query: 4   LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGNS 63
           LV FAGLM+GLTLGLMS+ LVDLEVL KSG P+DR HAAKILPVVKNQHLLLCTLLI N+
Sbjct: 22  LVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAAKILPVVKNQHLLLCTLLICNA 81

Query: 64  LAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLMI 123
            AMEALPIFLD +V  W A+L+SVTLIL+FGEI+PQ++C+R+GL +GA +APFVRVL+ I
Sbjct: 82  AAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVCSRHGLAIGATVAPFVRVLVWI 141

Query: 124 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 183
             P+++PISK+LD++LG G   L RRAELKT V+ HGNEAGKGG+LTHDETTIIAGALEL
Sbjct: 142 CLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNEAGKGGELTHDETTIIAGALEL 201

Query: 184 TEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLL 243
           +EK AKDAMT IS+ F +D++A LD + +N I+ KGHSRVPVY     NIIGLVLVKNLL
Sbjct: 202 SEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSRVPVYYEQRTNIIGLVLVKNLL 261

Query: 244 TVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKK----HGYQS---- 303
           T++P++ + ++ + IR+IPRV E +PLYDILNEFQKGHSH+AVV ++    H  QS    
Sbjct: 262 TINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAVVVRQCDKIHPLQSNDAA 321

Query: 304 -EALLKKDNGVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGC 363
            E + +    VD         L  +  L   ++   +A    ++         K+  +  
Sbjct: 322 NETVNEVRVDVDYERSPQETKLKRRRSLQKWKSFPNRANSLGSRS--------KRWSKDN 381

Query: 364 SFCILDVENAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEY 401
              IL +   PLP L   E+ VG+ITMEDVIEELLQEEI DETD +
Sbjct: 382 DADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDETDHH 419

BLAST of Cp4.1LG02g07650 vs. TAIR 10
Match: AT4G33700.1 (CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 421.8 bits (1083), Expect = 6.7e-118
Identity = 242/411 (58.88%), Positives = 296/411 (72.02%), Query Frame = 0

Query: 4   LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGNS 63
           LV FAGLM+GLTLGLMSL LVDLEVL KSG P+ RK+AAKILPVVKNQHLLL TLLI N+
Sbjct: 22  LVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAAKILPVVKNQHLLLVTLLICNA 81

Query: 64  LAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLMI 123
            AME LPIFLD +V  W A+L+SVTLIL+FGEI+PQ+IC+RYGL +GA +APFVRVL+ I
Sbjct: 82  AAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSICSRYGLAIGATVAPFVRVLVFI 141

Query: 124 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 183
             P+++PISK+LD++LG   A L RRAELKT V+FHGNEAGKGG+LTHDETTIIAGALEL
Sbjct: 142 CLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNEAGKGGELTHDETTIIAGALEL 201

Query: 184 TEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLL 243
           +EK  KDAMT IS+ F +D++A LD + +N I+ KGHSRVPVY   P NIIGLVLVKNLL
Sbjct: 202 SEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSRVPVYYEQPTNIIGLVLVKNLL 261

Query: 244 TVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFK----------KHGY 303
           T++P++ + ++ + IR+IPRV E +PLYDILNEFQKG SH+AVV +          K+G 
Sbjct: 262 TINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLSHMAVVVRQCDKIHPLPSKNGS 321

Query: 304 QSEALLKKDNGVDSTAGAATHNLAMKMELVDAQTIAEKA----GGEQTKKSPPATPAFKK 363
             EA +  D+  + T       L  K  L   ++   +A    GG ++KK      A   
Sbjct: 322 VKEARVDVDS--EGTPTPQERMLRTKRSLQKWKSFPNRASSFKGGSKSKKWSKDNDA--- 381

Query: 364 RHRGCSFCILDVENAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEY 401
                   IL +   PLP L   EE VG+ITMEDVIEELLQEEI DETD +
Sbjct: 382 -------DILQLNGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDETDHH 420

BLAST of Cp4.1LG02g07650 vs. TAIR 10
Match: AT5G52790.1 (CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 405.6 bits (1041), Expect = 5.0e-113
Identity = 220/417 (52.76%), Positives = 293/417 (70.26%), Query Frame = 0

Query: 4   LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGNS 63
           LV FAGLM+GLTLGLMSL +V+LEV+IK+G P DRK+A KILP+VKNQHLLLCTLLIGN+
Sbjct: 23  LVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEKILPLVKNQHLLLCTLLIGNA 82

Query: 64  LAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLMI 123
           LAMEALPIF+D ++P W A+L+SVTLIL FGEI+PQA+C+RYGL +GA ++  VR+++++
Sbjct: 83  LAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCSRYGLSIGAKLSFLVRLIIIV 142

Query: 124 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 183
           FFP+SYPISK+LD +LGK H+ LL RAELK+ V  HGNEAGKGG+LTHDETTII+GAL++
Sbjct: 143 FFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEAGKGGELTHDETTIISGALDM 202

Query: 184 TEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLL 243
           ++K+AKDAMT +S  FSLD++  LD +T+  I + GHSR+P+YS +P  IIG +LVKNL+
Sbjct: 203 SQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRIPIYSVNPNVIIGFILVKNLI 262

Query: 244 TVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQSEALLKKDN 303
            V PED   +R + IR++P+V  ++PLYDILN FQ G SH+A V     + +      + 
Sbjct: 263 KVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSHMAAVVGTKNHTNTNTPVHEK 322

Query: 304 GVDSTAGAATHNLAMKMELVDAQTIAEKAGGEQTKKSPPATPAFKKRHRGCSFCILDVEN 363
            ++ +               DA              S PA  + +  H+     I  + +
Sbjct: 323 SINGSPNK------------DANVFL----------SIPALNSSETSHQSPIRYIDSISD 382

Query: 364 APLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKINMQPSPEKPST 421
                    EEV+G+IT+EDV+EEL+QEEI DETD+YV +H RI INM  S   P T
Sbjct: 383 -------EDEEVIGIITLEDVMEELIQEEIYDETDQYVELHKRITINMPMSGNSPET 410

BLAST of Cp4.1LG02g07650 vs. TAIR 10
Match: AT4G14240.1 (CBS domain-containing protein with a domain of unknown function (DUF21))

HSP 1 Score: 386.0 bits (990), Expect = 4.1e-107
Identity = 217/410 (52.93%), Positives = 289/410 (70.49%), Query Frame = 0

Query: 4   LVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRKHAAKILPVVKNQHLLLCTLLIGNS 63
           LV FAG+M+GLTLGLMSLGLV+LE+L +SG P ++K AA I PVV+ QH LL TLL+ N+
Sbjct: 45  LVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNA 104

Query: 64  LAMEALPIFLDMIVPPWLAVLVSVTLILMFGEILPQAICTRYGLKVGAIMAPFVRVLLMI 123
           +AME LPI+LD +   ++A+++SVT +L FGE++PQAICTRYGL VGA     VR+L+ +
Sbjct: 105 MAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTL 164

Query: 124 FFPISYPISKVLDWMLGKGHAVLLRRAELKTFVNFHGNEAGKGGDLTHDETTIIAGALEL 183
            +PI++PI K+LD +LG   A L RRA+LK  V+ H  EAGKGG+LTHDETTII+GAL+L
Sbjct: 165 CYPIAFPIGKILDLVLGHNDA-LFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDL 224

Query: 184 TEKTAKDAMTSISNAFSLDLDATLDLETLNAIMTKGHSRVPVYSGDPKNIIGLVLVKNLL 243
           TEKTA++AMT I + FSLD+++ LD E +  I+ +GHSRVPVYSG+PKN+IGL+LVK+LL
Sbjct: 225 TEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLL 284

Query: 244 TVDPEDRVLLRKMIIRKIPRVSEDMPLYDILNEFQKGHSHIAVVFKKHGYQS--EALLKK 303
           TV PE   L+  + IR+IPRV  DMPLYDILNEFQKG SH+A V K  G      + L +
Sbjct: 285 TVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLE 344

Query: 304 DNGVDSTAGAATHNLAMKMELVDAQTIA--EKAGGEQTKKSPPATPAFKKRHRGCSFCIL 363
           ++  +S     T  L +K E      I   +KA G+   ++  + P       G S    
Sbjct: 345 EHTDESNDSDLTAPLLLKREGNHDNVIVTIDKANGQSFFQNNESGP------HGFSHTSE 404

Query: 364 DVENAPLPVLPPGEEVVGVITMEDVIEELLQEEILDETDEYVNIHNRIKI 410
            +E+          EV+G+IT+EDV EELLQEEI+DETDEYV++H RI++
Sbjct: 405 AIEDG---------EVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRV 438

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q8RY60	2.5e-165	72.83	DUF21 domain-containing protein At1g47330 OS=Arabidopsis thaliana OX=3702 GN=CBS...
Q9ZQR4	2.5e-117	58.13	DUF21 domain-containing protein At2g14520 OS=Arabidopsis thaliana OX=3702 GN=CBS...
Q8VZI2	9.5e-117	58.88	DUF21 domain-containing protein At4g33700 OS=Arabidopsis thaliana OX=3702 GN=CBS...
Q9LTD8	7.0e-112	52.76	DUF21 domain-containing protein At5g52790 OS=Arabidopsis thaliana OX=3702 GN=CBS...
Q67XQ0	5.7e-106	52.93	DUF21 domain-containing protein At4g14240 OS=Arabidopsis thaliana OX=3702 GN=CBS...

Match Name	E-value	Identity	Description
XP_023525037.1	5.34e-299	100.00	DUF21 domain-containing protein At1g47330 [Cucurbita pepo subsp. pepo]
XP_022941204.1	5.70e-294	98.14	DUF21 domain-containing protein At1g47330 [Cucurbita moschata]
KAG6608536.1	4.68e-293	97.90	DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia]
XP_022981956.1	1.28e-290	96.97	DUF21 domain-containing protein At1g47330 [Cucurbita maxima]
XP_038904672.1	1.35e-280	95.09	DUF21 domain-containing protein At1g47330-like isoform X1 [Benincasa hispida]

Match Name	E-value	Identity	Description
A0A6J1FMM0	2.76e-294	98.14	DUF21 domain-containing protein At1g47330 OS=Cucurbita moschata OX=3662 GN=LOC11...
A0A6J1IXZ7	6.21e-291	96.97	DUF21 domain-containing protein At1g47330 OS=Cucurbita maxima OX=3661 GN=LOC1114...
A0A1S3CRQ3	1.93e-278	93.93	DUF21 domain-containing protein At1g47330 isoform X1 OS=Cucumis melo OX=3656 GN=...
A0A0A0LGK3	3.72e-276	93.69	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G856000 PE=4 SV=1
A0A6J1CFY3	3.05e-274	92.92	DUF21 domain-containing protein At1g47330-like OS=Momordica charantia OX=3673 GN...

Match Name	E-value	Identity	Description
AT1G47330.1	1.7e-166	72.83	CBS domain-containing protein with a domain of unknown function (DUF21)
AT2G14520.1	1.8e-118	58.13	CBS domain-containing protein with a domain of unknown function (DUF21)
AT4G33700.1	6.7e-118	58.88	CBS domain-containing protein with a domain of unknown function (DUF21)
AT5G52790.1	5.0e-113	52.76	CBS domain-containing protein with a domain of unknown function (DUF21)
AT4G14240.1	4.1e-107	52.93	CBS domain-containing protein with a domain of unknown function (DUF21)

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000644	CBS domain	PFAM	PF00571	CBS	coord: 201..245; e-value: 0.0064; score: 16.9
IPR000644	CBS domain	PROSITE	PS51371	CBS	coord: 192..252; score: 8.602781
IPR002550	CNNM, transmembrane domain	PFAM	PF01595	DUF21	coord: 2..171; e-value: 1.5E-35; score: 122.4
IPR002550	CNNM, transmembrane domain	PROSITE	PS51846	CNNM	coord: 1..173; score: 52.814751
None	No IPR available	GENE3D	3.10.580.10	-	coord: 345..396; e-value: 4.3E-6; score: 28.7
None	No IPR available	GENE3D	3.10.580.10	-	coord: 171..314; e-value: 2.7E-40; score: 139.8
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 411..431
None	No IPR available	PANTHER	PTHR12064:SF36	DOMAIN-CONTAINING PROTEIN, PUTATIVE, EXPRESSED-RELATED	coord: 2..423
None	No IPR available	SUPERFAMILY	54631	CBS-domain pair	coord: 176..292
IPR045095	Ancient conserved domain protein family	PANTHER	PTHR12064	ANCIENT CONSERVED DOMAIN PROTEIN-RELATED	coord: 2..423
IPR044751	Ion transporter-like, CBS domain	CDD	cd04590	CBS_pair_CorC_HlyC_assoc	coord: 187..292; e-value: 2.99543E-29; score: 108.737
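
The `coord` fields give each signature's position on the protein. As a rough sketch, assuming the coordinates are 1-based and inclusive (the helper names `coord_span` and `overlap` are illustrative, not part of InterProScan), the spans can be turned into lengths and checked for overlap:

```python
import re

def coord_span(alignment):
    """Return (start, end, length) for a 'coord: start..end' entry.

    Assumes 1-based, inclusive residue coordinates.
    """
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", alignment)
    if m is None:
        raise ValueError("no coord field found")
    start, end = int(m.group(1)), int(m.group(2))
    return start, end, end - start + 1

def overlap(a, b):
    """Number of residues shared by two inclusive (start, end) ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

duf21 = coord_span("coord: 2..171")       # PFAM PF01595 (DUF21)
cbs_pair = coord_span("coord: 187..292")  # CDD cd04590 (CBS pair)
print(duf21[2], cbs_pair[2], overlap(duf21[:2], cbs_pair[:2]))  # 170 106 0
```

Under that assumption, the DUF21 transmembrane region and the CBS pair occupy separate, non-overlapping stretches of the protein.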

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG02g07650.1	Cp4.1LG02g07650.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0010960	magnesium ion homeostasis
cellular_component	GO:0016021	integral component of membrane
cellular_component	GO:0043231	intracellular membrane-bounded organelle