Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDSinitialstart_codonintroninternalterminalstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGCTCGCAGTCGTCGCCACCAAGTCCAGTTGACATATTTCCAAAGAGTAAAAGGAGCGAAAAACAGTCGCCGACGGTGCTCTTACTACCAGTCGCCGACGCTGAAGCCGGACCGCGTGTGTTGGAGGAAACATGGAGAGGGCTGAGGGAGGAACGTCGTCGACACCATACGGCGGTGGAGGAATCGGAGGCAAAGTTCGTAAGCCAAACACAAGAAAGCCGCTGCCTTCCCCTTACGCTCGGCCAGTGCATAACCAATCGCATAGGCGTTGGCTTTCGAAGCTCGTTGATCCGGCCTACCGGCTCATTACCGGCGGCGCCACCCGATTGCTTCCATATTTGTTCCCGAAACCACTGCCCTCTAATGCCCTTCCGTCTCCTGGAGACGAAGATCAAGGTCATTTGCGCTCTCCCCCAATCTTCTCTACTGCTTTGGGTTTTGGACATCGCATTATCTTGGTTGAAATCATTATCCTTTTTATACTCTCTATTTTAATTTTAGGCATTTTAGGAAATTGATGGCTTTTATGATCCTAGTTAACTGACTGGTGAGGTTTCTAATTTTTTCCCCTTCGTAAGCATCTTTCTGCATTTGCAACGTAAAATTTCTAATTGACATAAGTGAAAGTATTGTAGCTATGTTGTTACCATGCTGCTTGCATATATCTAGTAGACATTGTTTACATCAGACTCCAGAGAATGTTATTATTGGTTTGTTTTGTACATTCTCTTGTAAGATTAGTTGTTGATGTGAATTCTTTTTGAGCAGATAAAGTGGAGGCAGAGGTGGAGGATAACGTCTCTGGAGAAGAACCCCAAAATGTAAATGTAGCTCTTTTTTGTTATTTTGTTACATTTTTGCTGTGATGCCGTCTTATGAATGAGTATCAGGGTATTGTTTTGATAACAGACAAATGAACAGTCATATCCAAAAGTGCTTACTGGGGTTTTATCAAAATCAGATAGAAGTCAATCGAAATAAAAGACATAGGGCAAGGGTTACCCACTCTTAACAGTCACTTTTTGCTGGTGGCCGTACTTTGATTGGTTTGTAGCCTAGAAGTTTATTACTGTTTTGACCTTGTGAGTGCGCTCATGCTTGTCTTCCAGCTGAAGTTATTGATCTTCCATTAAACTTTGATGTTCAGTCAGATGTTGATATTTGTTTTCTGCGTGAAATCTGACTGTTTACTAGTTTAAATTTTATTTACCAGGTGTCGTTTTTTTTTTCTCTTGTTTCACTTTTCTGCATAGCAATATTCTTATTCTGGTCCTTATGGCTTGGAGTTACAAAATCACATTGCTTGTCCAAACTTTTTGGTTAACACCCACGGAATGATTTCCTATGTCCTGTTTTTGTTTTTATTTCATTCCATTTTACACTTTTACTTTTCATTTGAAAAATTATATGCATGCAATAATGTGATTCATCAACATACAGCCTGGTTCATTTCTTGCAGCTAGGGGTTTCTACCTTAGTTGGATTACCTGGTTCTAGTGGAGAGGCAAATAGATCAGAGAACAATTCTGATTTTAATGGCTGCCAAAAGGACAAAGAAAATAATGCATTAGGCGGGAATGGAAAAATTGATGTTGAAAAATGGATCCAAGGAAAAACATTTTCGAGGTAACTTGCAGAGTTTGACTATTTTGTTTGTTCTATCAACGAAGTGGTGTGTCATTTATTGTCTAAGAAACATGGACATGGACACAATCCAACATGATATTGATGTAGATATGATGACTCATTTACTAACAAATTTAGACAGAGACATGACGAGGACTTGTTTGGTAAAATATCCATTTTCTGATTATATTTGAACAGATCTCAATAATTTACAGACAGAAAGTGTATTTTAATTTTACAATATTGGTGTTTGGAAGTCAGTTTGAAAGTGATATAATATGCAATATTTTGAAGAAGTTGAAGAAGCCAATCCCAACACATGCTTGACTAGTATCTGAGTTGTCCACAACACGTGAAACCAAATCATGATCCCTAAACTATAGTGTGTTCATCTTCAACTTTTTCATCACAGACTTCAGTTGAAGTGTTTCGTCTCGTTGAAATGCAACTGGACGAAACATTATTGTGAAGCAACTTATTTTATTTATAATTCCATGGTGGTAGTACTTTCCTCTGCTATGAATAAATAGGTTATTTGATTTTGAGGTTGCATCCAAGGGATAGTCAAATGTCCATATATAGCAGTGTCAGTAGTCTCGATAATTTCATTTGCAACAATATTTAACAGCATTGCCTCTGTGTCTAAACTGGTAATGGTGCAATGCCTTATGTCCGTCCAAGGGTAGTCAAGCGTCCAATATTGGTGTTCTACCATATGCTGGTAGTGGTGCAATGTCTGTAGTCTCTTTTGATCCTCTTAAACATTTTCCACAGCCACATAGCCAGTTTGAGTACTGCATTTCTCTAAATTGGCTGATATACTATGCAGGGATGAAGTGAGTCGTTTATTAGAGGTACTACGATCAAGGGCTCTTGAACCTTCTAATAAAGTGGAAGACAATACATTTTCCCCACAGAGCATTGAAAAACAAGTTGAGCAGCCATCTACTGCAAATAGAGTTCTTGAAATGCCTCGTGAAGGAAAGCAAGAAGAATTGGAGAGAGCTACGGGGGGAAACTTAACTCCTCATCCACATTCATTGGTCAGTAGGGTGAATCCTTGAGTAGTTTCTATAGAAAGTAAATATTTCCTGCTGTTATGAACTCAGAACTTCTTTTACACATTAGTGCGGACTGTAGGATTATTTCTGCCATTTTCACGATAGCTTATGTTCCTTTTATAGGTTTTATTAGTTCATGCTAGTATTTAGCGATTGTGTGTAATGCTGTTCCATAGACAGTGTTCTTTTTCTATATATATGTCTATACACACACACACAAATTTGAAGATCAAAAAATCATCCTAGGTAACTTATTTCTGTATTCATTAATAGTATAATGTATTAAACATCAAGGAGACCGGTTCCAGTATTGGGATGGTCAAAAACTAGATCAAACTGAATGCCTTATCCATTACTACACGATTACTAGGTTTTTCATATTTATAGTTATATTCAAGTTATTGTTTTAGGACTCTTCTTATTACTAAGTAATTAAATTATGTAGAAGATATCTGAAAATCAAATTGTAAAACTAGTGCTGATGAGACATTCATTAATCAAATCAGTAAAATGAATTATCTCTGTTAGGAATATGGTAAATTTGGTTAATGTTGGAAAATCAATAGGAAATTAAGATGTTAAAGCTCTAGGATTGGGATTTGTACAAGAATCTACAAATCTTCATTGCCAAAGTTTTCAATTTTTGTTCACTGTTTAGAGCTTGCTATTCAAGTTTTTCTGCCTCTAATGATTCAGCATTGAGTGTACTTTAGCAGAAACTAAGAGAAGTTGGAGCATCACCTGTGGATATTGCAAGAGCATACATGAGCAACCAAAAATCTGAACCAGGCTTAGCTTCGGACAAGATGCCAGATGATGAAAAGGCTTTGCGTCATGGTGATCATCAAATGTTTAAGCCTTTTATTCCATCAATGTCCCCCAATCCTTCAACTTGTTGGCCTGGTGCCATGTCAGAAAGTCAACGTGGTTATGTAACTCCAAGAAGTCAAAGAGGTAGATTTGGTCTTCATAATTTCCCTCGGACTCCATATTCTAGGAGTATCTTTTCAATGTCCAAATCCAAGTCTAAGGTATATAGAAACGGTCTTCAATTCCATTTTTTTTGGGTTATGACCTAGTGTATGTAACCAGACCATATGATTAACCATTCCTTTACATGTTAACAGCTAACTCAGTTGCAAGGAGATGGCCAAAAGTTTGTGAATACACCATCACCTCTCTGGCAGAGGTCACGATCTCCAGCTTATTCCATGGTAATTTCTTTTATCATTAAGCACTCAATGACTAAAAGCCAATAATATATGACCACAGGAACTCTTGTTGAGAAGTTCTTTTGTAGCATGTATATTGAGCTTCTGTACAGAACAAATTTGCGTCCATAATGAGTAGCCATTTTAAGGTAGTATAAAAGCTGATCATTTTAACGAGAAGAATAAAAGCTGAAATATTATAGAGAACTGTAGTATACTTTTTGTGCATGTGTATTACAGGGTAGAGCTAGAAACTTCTGTAGGGGAGGGCAAAATTTTACACTTGAACTTCCAAAATACGCTATTTTAATTTTCAAAATTCCAAAGGCAAGTTATAGGAAAAACTATTAACTTAAAAAACTTGGTGGTTTTATGTATGTATGACTCCCGTATTGAGCATGTCATTTATGTGTTATTGACCCTTCAACCTTTACTTTGACCGAAATATTCAGTTATCCTTGATCTTTTGTTTCTTTCATCCGCTTTCTGGTAGGATGTACATTGACCTTGCGGTTTATCCATTACCTTGGTGCCTCTATGAAAATGTGGCTATGGTTGTTGCTGTTGTGCTGCTTTGTCAAGCCCAAGTATAATTCTTTTGCCATAACCCTGATATAAACAAGTTAGATTTATGATTTTTCATCTAAACTAGAACTTAACTGATTAGCTGCAAGAGTTTCCTTATCTATGGGCCTATAGGTGGTTGCTTAATTTTTAATTATTGAGCAACTCATTCCAGGACGAGCTTTTAGTTCAGTTTGTAGTGGTGAAGTGTCTGATTGTACGTTAAGTTCTGAATGCAACTTTCGATACTGAATATTGCATTAATGCTTCCTGAGCATTATCCGCTGCTGGTTCTTATTAGTGGTTTGCGTTTGTAGTGTGCCAAGCTCAAGTTCCTTCCGATTACTTCTATCATTATTATTTTTTCTGTTTGTCTATATAGATGACTTCAAGCAAGGATCCATTGGATGAGGCAACTGGTTCCATTGGACTGACTTGTAGCCTTCAGCATAAGGCATCTGCAGTTACTAATTCCAGACGATCTGCTTACTTTTATCCACCTCAACAACCAGAGATGGAAATAGAGAACAATATTTCGGAAGCAATTTTCCCTGATATGAAGAAGAATCTAGATCGTGGAGGAGCAAGCACCATTCCTCTATCACAATCAGTGGGAATCAACAACTCTGAGTCGAGTCTACCGACTGTCCGTCCACAGTCCAGTCAGGTTGCTAGGACAATCCTAGAGCATATTACTAGAAACCCACCTACTCCTAAAGAAAAGACTGAAGAGTTAAAGAGAGCAATTGAATGGAAGAAAACCCCATCTGCCAATGTACCATCGGTCAAGCCAAATGAAACCAGTAGTTTGGCCGTAGACATAGATTCTCACCAAAAAGCAAACCAAGTAGATCAGAACTGTCACCCCCAATTGAGCGATGAGGGGAAAACCATGTCCACGGTTCTTCCAAAGGAGGGTGCTGGCAGAAATCCTGATGCTGCAAACCAGAATCCTTACGGTCTGAAGTTTAGGCTTAGCAATGCTGAATCAAAACACAAGGATGATGCAGGCTTAAATATTGGTAGCTCCTCGCCTAAGGTATCACTGCAACTTTGTAATTGGTTTTAGTTATTTAAATGTATTACTGACCGACTTACAGATGCTTCTTGTGTATGTATTTTGATATTATTAGCATATGCTTGAAGAGTTCTTAATTTCTGGTTGAATTTATGGGCAGGCTGTTCCCAAGATCTTTCCAGCTCTTGGATCCGAAGTGTGGACTCAAATCAAGCCTTCCCCCTCTCTTGGAGGTAAACCTATTTTTCCATCCATTACGATCAGCAAGCCCGAGTCAAAATGGGCATTTTCTTCCGACAGTGGTTCGGCATTTACTTTCCCTGTTTCGGGAGCATCCTCAGGAATGCTCTCAGAACCACCAAGACCATCTATCTTTCCATCAACCAGCCTTGGGGGAGGTCAGCCTCTGTTATTGAAGCCTGAGACTCCAGTTCCTTCATACAATTTTGATTCAAAGAAGACCAGCCCTAGCCTTGTTTTCTCATTCCCTTCAATAAACAGTGATACAATCGGCCCTGAAGCCTCAAATATTAAGTTTAGCTTTGGATCCGATGATCATACGAGACTTTCCTTCGGTTCTGTTGGGAAAGATGCAGTTTGTTGCTAACTGGTTAGACTTCTGAAGTATAGTTTTGAGTGGAATGACCACAAGAATTGCCAAAACGGGGTTAGAAACTAGTGCCAAATACCCTAGTGTTTATGAACAAATAGGAAATTGCAAGCTCAATCATAAACAGTAAACAGCCTCAGTGCCAGTTGACGGCTTTATGCATATGTTCCATTCCATTCCATTCATGATTATGTTATCAGGGCAATGCTTTAGTTTCCTGTACATTAGTTTGGATTTGATCAAGCTGTCACATCATTAAGTAAGAACAGGATTCCTCATTTAAGTTTGCTTGGTTTGATGTTTGAAGCAGGCAGAGTAGTTTCACCACATGTGAAGAGTTTGATGTAGATTCCCAAAGAATTCCAGAAAAATCTGGGAAAAGGGAATGGGATACCTACCTACATCCTAATTCCTTCCTACGCATGAAAGAAAGAAGTAGATATCTGAATAGATGACAGGGTAGTAGGGAAGGGCATAGGACAAGAACAGGTCCACGTGGTCAGGCTCCAAATCACAAGTGGAGGAACAGGTTGTGGTAGCCAGAGAGAGGGACTCACATGTGTTGCTCTAAATTTTGCATGTAACGAGGAAGTCTTTCCATCCTATCACAGGTAAAAGATGAGATAAACTCGAATGGTGGTGGGGCGGGCGGGCGACTGAGGGGCCCCCCCCATCCCCCATTTTTGAAGCATCATCAAAGTTGGGAATGTGATAGAATAGGGAGGAAACAAAGACAACCCTTCAGCCTGATGAAGATTTGAAGAGAGAGAGAGAGAGAGAGATAGAAAGGTCCAAGAAGAAAAGTGAGTGGGTGAGTGCGGAAGGGAAGGGGGGGAGGGGCTTGTGTCTGGACTGTGTGGGGAAGAAGAAAGTCAGAGAGATTGAAATGAGATAGGAAGTGAAATGAATGCGTAGCAGACACGAGTGTGACGTTTCAGATTCTAAATGTCAGTAAAGTTTTTGTGAAGATGTGAGGGTGGTGGTGCCGGTGCTGCTGCTGCTGCTGCTGCTACTGCTGCAGCGCCCAGTGGCAGCGGCCGTATGACTTCCACTCACCACCGCTACCTCTGCTGCTGCTGCTGCTCCTGCTCCACTCTATACTCTTTTCTCATGTCTTTTTCCAACTGA
mRNA sequence
ATGGGCTCGCAGTCGTCGCCACCAAGTCCAGTTGACATATTTCCAAAGAGTAAAAGGAGCGAAAAACAGTCGCCGACGGTGCTCTTACTACCAGTCGCCGACGCTGAAGCCGGACCGCGTGTGTTGGAGGAAACATGGAGAGGGCTGAGGGAGGAACGTCGTCGACACCATACGGCGGTGGAGGAATCGGAGGCAAAGCGTTGGCTTTCGAAGCTCGTTGATCCGGCCTACCGGCTCATTACCGGCGGCGCCACCCGATTGCTTCCATATTTGTTCCCGAAACCACTGCCCTCTAATGCCCTTCCGTCTCCTGGAGACGAAGATCAAGATAAAGTGGAGGCAGAGGTGGAGGATAACGTCTCTGGAGAAGAACCCCAAAATCTAGGGGTTTCTACCTTAGTTGGATTACCTGGTTCTAGTGGAGAGGCAAATAGATCAGAGAACAATTCTGATTTTAATGGCTGCCAAAAGGACAAAGAAAATAATGCATTAGGCGGGAATGGAAAAATTGATGTTGAAAAATGGATCCAAGGAAAAACATTTTCGAGGGATGAAGTGAGTCGTTTATTAGAGGTACTACGATCAAGGGCTCTTGAACCTTCTAATAAAGTGGAAGACAATACATTTTCCCCACAGAGCATTGAAAAACAAGTTGAGCAGCCATCTACTGCAAATAGAGTTCTTGAAATGCCTCGTGAAGGAAAGCAAGAAGAATTGGAGAGAGCTACGGGGGGAAACTTAACTCCTCATCCACATTCATTGAAACTAAGAGAAGTTGGAGCATCACCTGTGGATATTGCAAGAGCATACATGAGCAACCAAAAATCTGAACCAGGCTTAGCTTCGGACAAGATGCCAGATGATGAAAAGGCTTTGCGTCATGGTGATCATCAAATGTTTAAGCCTTTTATTCCATCAATGTCCCCCAATCCTTCAACTTGTTGGCCTGGTGCCATGTCAGAAAGTCAACGTGGTTATGTAACTCCAAGAAGTCAAAGAGGTAGATTTGGTCTTCATAATTTCCCTCGGACTCCATATTCTAGGAGTATCTTTTCAATGTCCAAATCCAAGTCTAAGCTAACTCAGTTGCAAGGAGATGGCCAAAAGTTTGTGAATACACCATCACCTCTCTGGCAGAGGTCACGATCTCCAGCTTATTCCATGATGACTTCAAGCAAGGATCCATTGGATGAGGCAACTGGTTCCATTGGACTGACTTGTAGCCTTCAGCATAAGGCATCTGCAGTTACTAATTCCAGACGATCTGCTTACTTTTATCCACCTCAACAACCAGAGATGGAAATAGAGAACAATATTTCGGAAGCAATTTTCCCTGATATGAAGAAGAATCTAGATCGTGGAGGAGCAAGCACCATTCCTCTATCACAATCAGTGGGAATCAACAACTCTGAGTCGAGTCTACCGACTGTCCGTCCACAGTCCAGTCAGGTTGCTAGGACAATCCTAGAGCATATTACTAGAAACCCACCTACTCCTAAAGAAAAGACTGAAGAGTTAAAGAGAGCAATTGAATGGAAGAAAACCCCATCTGCCAATGTACCATCGGTCAAGCCAAATGAAACCAGTAGTTTGGCCGTAGACATAGATTCTCACCAAAAAGCAAACCAAGTAGATCAGAACTGTCACCCCCAATTGAGCGATGAGGGGAAAACCATGTCCACGGTTCTTCCAAAGGAGGGTGCTGGCAGAAATCCTGATGCTGCAAACCAGAATCCTTACGGTCTGAAGTTTAGGCTTAGCAATGCTGAATCAAAACACAAGGATGATGCAGGCTTAAATATTGGTAGCTCCTCGCCTAAGGCTGTTCCCAAGATCTTTCCAGCTCTTGGATCCGAAGTGTGGACTCAAATCAAGCCTTCCCCCTCTCTTGGAGGTAAACCTATTTTTCCATCCATTACGATCAGCAAGCCCGAGTCAAAATGGGCATTTTCTTCCGACAGTGGTTCGGCATTTACTTTCCCTGTTTCGGGAGCATCCTCAGGAATGCTCTCAGAACCACCAAGACCATCTATCTTTCCATCAACCAGCCTTGGGGGAGTGATACAATCGGCCCTGAAGCCTCAAATATTAAGTTTAGCTTTGGATCCGATGATCATACGAGACTTTCCTTCGGTTCTGTTGGGAAAGATGCAGTTTGTTGCTAACTGTGAGGGTGGTGGTGCCGGTGCTGCTGCTGCTGCTGCTGCTACTGCTGCAGCGCCCAGTGGCAGCGGCCGTATGACTTCCACTCACCACCGCTACCTCTGCTGCTGCTGCTGCTCCTGCTCCACTCTATACTCTTTTCTCATGTCTTTTTCCAACTGA
Coding sequence (CDS)
ATGGGCTCGCAGTCGTCGCCACCAAGTCCAGTTGACATATTTCCAAAGAGTAAAAGGAGCGAAAAACAGTCGCCGACGGTGCTCTTACTACCAGTCGCCGACGCTGAAGCCGGACCGCGTGTGTTGGAGGAAACATGGAGAGGGCTGAGGGAGGAACGTCGTCGACACCATACGGCGGTGGAGGAATCGGAGGCAAAGCGTTGGCTTTCGAAGCTCGTTGATCCGGCCTACCGGCTCATTACCGGCGGCGCCACCCGATTGCTTCCATATTTGTTCCCGAAACCACTGCCCTCTAATGCCCTTCCGTCTCCTGGAGACGAAGATCAAGATAAAGTGGAGGCAGAGGTGGAGGATAACGTCTCTGGAGAAGAACCCCAAAATCTAGGGGTTTCTACCTTAGTTGGATTACCTGGTTCTAGTGGAGAGGCAAATAGATCAGAGAACAATTCTGATTTTAATGGCTGCCAAAAGGACAAAGAAAATAATGCATTAGGCGGGAATGGAAAAATTGATGTTGAAAAATGGATCCAAGGAAAAACATTTTCGAGGGATGAAGTGAGTCGTTTATTAGAGGTACTACGATCAAGGGCTCTTGAACCTTCTAATAAAGTGGAAGACAATACATTTTCCCCACAGAGCATTGAAAAACAAGTTGAGCAGCCATCTACTGCAAATAGAGTTCTTGAAATGCCTCGTGAAGGAAAGCAAGAAGAATTGGAGAGAGCTACGGGGGGAAACTTAACTCCTCATCCACATTCATTGAAACTAAGAGAAGTTGGAGCATCACCTGTGGATATTGCAAGAGCATACATGAGCAACCAAAAATCTGAACCAGGCTTAGCTTCGGACAAGATGCCAGATGATGAAAAGGCTTTGCGTCATGGTGATCATCAAATGTTTAAGCCTTTTATTCCATCAATGTCCCCCAATCCTTCAACTTGTTGGCCTGGTGCCATGTCAGAAAGTCAACGTGGTTATGTAACTCCAAGAAGTCAAAGAGGTAGATTTGGTCTTCATAATTTCCCTCGGACTCCATATTCTAGGAGTATCTTTTCAATGTCCAAATCCAAGTCTAAGCTAACTCAGTTGCAAGGAGATGGCCAAAAGTTTGTGAATACACCATCACCTCTCTGGCAGAGGTCACGATCTCCAGCTTATTCCATGATGACTTCAAGCAAGGATCCATTGGATGAGGCAACTGGTTCCATTGGACTGACTTGTAGCCTTCAGCATAAGGCATCTGCAGTTACTAATTCCAGACGATCTGCTTACTTTTATCCACCTCAACAACCAGAGATGGAAATAGAGAACAATATTTCGGAAGCAATTTTCCCTGATATGAAGAAGAATCTAGATCGTGGAGGAGCAAGCACCATTCCTCTATCACAATCAGTGGGAATCAACAACTCTGAGTCGAGTCTACCGACTGTCCGTCCACAGTCCAGTCAGGTTGCTAGGACAATCCTAGAGCATATTACTAGAAACCCACCTACTCCTAAAGAAAAGACTGAAGAGTTAAAGAGAGCAATTGAATGGAAGAAAACCCCATCTGCCAATGTACCATCGGTCAAGCCAAATGAAACCAGTAGTTTGGCCGTAGACATAGATTCTCACCAAAAAGCAAACCAAGTAGATCAGAACTGTCACCCCCAATTGAGCGATGAGGGGAAAACCATGTCCACGGTTCTTCCAAAGGAGGGTGCTGGCAGAAATCCTGATGCTGCAAACCAGAATCCTTACGGTCTGAAGTTTAGGCTTAGCAATGCTGAATCAAAACACAAGGATGATGCAGGCTTAAATATTGGTAGCTCCTCGCCTAAGGCTGTTCCCAAGATCTTTCCAGCTCTTGGATCCGAAGTGTGGACTCAAATCAAGCCTTCCCCCTCTCTTGGAGGTAAACCTATTTTTCCATCCATTACGATCAGCAAGCCCGAGTCAAAATGGGCATTTTCTTCCGACAGTGGTTCGGCATTTACTTTCCCTGTTTCGGGAGCATCCTCAGGAATGCTCTCAGAACCACCAAGACCATCTATCTTTCCATCAACCAGCCTTGGGGGAGTGATACAATCGGCCCTGAAGCCTCAAATATTAAGTTTAGCTTTGGATCCGATGATCATACGAGACTTTCCTTCGGTTCTGTTGGGAAAGATGCAGTTTGTTGCTAACTGTGAGGGTGGTGGTGCCGGTGCTGCTGCTGCTGCTGCTGCTACTGCTGCAGCGCCCAGTGGCAGCGGCCGTATGACTTCCACTCACCACCGCTACCTCTGCTGCTGCTGCTGCTCCTGCTCCACTCTATACTCTTTTCTCATGTCTTTTTCCAACTGA
Protein sequence
MGSQSSPPSPVDIFPKSKRSEKQSPTVLLLPVADAEAGPRVLEETWRGLREERRRHHTAVEESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDNVSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGKTFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEELERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQMFKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSKLTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNSRRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRPQSSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQKANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGLNIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFTFPVSGASSGMLSEPPRPSIFPSTSLGGVIQSALKPQILSLALDPMIIRDFPSVLLGKMQFVANCEGGGAGAAAAAAATAAAPSGSGRMTSTHHRYLCCCCCSCSTLYSFLMSFSN
Homology
BLAST of Csor.00g216300 vs. ExPASy Swiss-Prot
Match:
Q9CAF4 (Nuclear pore complex protein NUP1 OS=Arabidopsis thaliana OX=3702 GN=NUP1 PE=1 SV=1)
HSP 1 Score: 88.2 bits (217), Expect = 4.4e-16
Identity = 172/658 (26.14%), Postives = 247/658 (37.54%), Query Frame = 0
Query: 68 WLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDNVSGEEPQN 127
WLSKLVDPA RLIT A RL L K L S P E Q ++ E V+ E
Sbjct: 58 WLSKLVDPAQRLITYSAQRLFGSLSRKRLGSGETPLQSPEQQKQLP---ERGVNQET--- 117
Query: 128 LGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNAL---GGNGKIDVEKWIQGKTFSRD 187
G N S NG + ++ NA +G D+EK +QGKTF+R
Sbjct: 118 -----------KVGHKEDVSNLSMKNGLIRMEDTNASVDPPKDGFTDLEKILQGKTFTRS 177
Query: 188 EVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEELERATG 247
EV RL +LRS+A + S E+ + V P + R P G L
Sbjct: 178 EVDRLTTLLRSKAADSSTMNEEQR---NEVGMVVRHPPSHERDRTHPDNGSMNTLVSTPP 237
Query: 248 GNLTPHPHSLKLREVGASPVDIARAYMSNQKSEP-----GLASDKMPDDEKALRHGDHQM 307
G+L L E ASP +A+AYM ++ SE GL +D L
Sbjct: 238 GSLR------TLDECIASPAQLAKAYMGSRPSEVTPSMLGLRGQAGREDSVFLNR----- 297
Query: 308 FKPFIPSMSPNPS-TCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSR--------SI 367
PF P SP S P + G+VTPRS RGR +++ RTPYSR S+
Sbjct: 298 -TPF-PQKSPTMSLVTKPSGQRPLENGFVTPRS-RGRSAVYSMARTPYSRPQSSVKIGSL 357
Query: 368 FSMSKSKSKLTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQ 427
F S SK + + G Q F S L +RS LD GS+G ++
Sbjct: 358 FQASPSKWEESLPSGSRQGF---QSGLKRRS------------SVLDNDIGSVGPVRRIR 417
Query: 428 HKASAVTNSRRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNS 487
K++ + S P + + + N GG T S+ +
Sbjct: 418 QKSNLSSRS----LALPVSESPLSVRAN---------------GGEKTTHTSKDSAEDIP 477
Query: 488 ESSLPTVRPQSSQVARTILEH----ITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPN 547
SS V +SS++A IL+ ++ +P + + + R K + P N
Sbjct: 478 GSSFNLVPTKSSEMASKILQQLDKLVSTREKSPSKLSPSMLRGPALKSLQNVEAPKFLGN 537
Query: 548 ETSSLAVDIDSHQKANQVDQN--CHPQLSDEGKTMSTV--LPKEGAGRNPDAANQN---- 607
A DS + ++ + L+ KT V K G+ ++ D +
Sbjct: 538 LPEKKANSPDSSYQKQEISRESVSREVLAQSEKTGDAVDGTSKTGSSKDQDMRGKGVYMP 597
Query: 608 ---------PYGLKFRLSNAESKHKDDAGLNIGSSSPKAVPK--IFPALGSEVWTQIKPS 667
P FR+S E + D L S+ + K F S + S
Sbjct: 598 LTNSLEEHPPKKRSFRMSAHEDFLELDDDLGAASTPCEVAEKQNAFEVEKSHI------S 641
Query: 668 PSLGGKPIFPSITI-----------SKPESKWAFSSDSGSAFTFPVSGA-SSGMLSEP 674
+G KP+ PS + S+ S + ++ FP+ S M SEP
Sbjct: 658 MPIGEKPLTPSEAMPSTSYISNGDASQGTSNGSLETERNKFVAFPIEAVQQSNMASEP 641
BLAST of Csor.00g216300 vs. NCBI nr
Match:
KAG6589842.1 (Nuclear pore complex protein NUP1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1511 bits (3912), Expect = 0.0
Identity = 774/774 (100.00%), Postives = 774/774 (100.00%), Query Frame = 0
Query: 1 MGSQSSPPSPVDIFPKSKRSEKQSPTVLLLPVADAEAGPRVLEETWRGLREERRRHHTAV 60
MGSQSSPPSPVDIFPKSKRSEKQSPTVLLLPVADAEAGPRVLEETWRGLREERRRHHTAV
Sbjct: 1 MGSQSSPPSPVDIFPKSKRSEKQSPTVLLLPVADAEAGPRVLEETWRGLREERRRHHTAV 60
Query: 61 EESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDNV 120
EESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDNV
Sbjct: 61 EESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDNV 120
Query: 121 SGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGKT 180
SGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGKT
Sbjct: 121 SGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGKT 180
Query: 181 FSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEELE 240
FSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEELE
Sbjct: 181 FSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEELE 240
Query: 241 RATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQMF 300
RATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQMF
Sbjct: 241 RATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQMF 300
Query: 301 KPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSKL 360
KPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSKL
Sbjct: 301 KPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSKL 360
Query: 361 TQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNSR 420
TQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNSR
Sbjct: 361 TQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNSR 420
Query: 421 RSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRPQ 480
RSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRPQ
Sbjct: 421 RSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRPQ 480
Query: 481 SSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQK 540
SSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQK
Sbjct: 481 SSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQK 540
Query: 541 ANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGLN 600
ANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGLN
Sbjct: 541 ANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGLN 600
Query: 601 IGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFTF 660
IGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFTF
Sbjct: 601 IGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFTF 660
Query: 661 PVSGASSGMLSEPPRPSIFPSTSLGGVIQSALKPQILSLALDPMIIRDFPSVLLGKMQFV 720
PVSGASSGMLSEPPRPSIFPSTSLGGVIQSALKPQILSLALDPMIIRDFPSVLLGKMQFV
Sbjct: 661 PVSGASSGMLSEPPRPSIFPSTSLGGVIQSALKPQILSLALDPMIIRDFPSVLLGKMQFV 720
Query: 721 ANCEGGGAGAAAAAAATAAAPSGSGRMTSTHHRYLCCCCCSCSTLYSFLMSFSN 774
ANCEGGGAGAAAAAAATAAAPSGSGRMTSTHHRYLCCCCCSCSTLYSFLMSFSN
Sbjct: 721 ANCEGGGAGAAAAAAATAAAPSGSGRMTSTHHRYLCCCCCSCSTLYSFLMSFSN 774
BLAST of Csor.00g216300 vs. NCBI nr
Match:
XP_022960828.1 (nuclear pore complex protein NUP1-like isoform X1 [Cucurbita moschata])
HSP 1 Score: 1218 bits (3151), Expect = 0.0
Identity = 624/660 (94.55%), Postives = 632/660 (95.76%), Query Frame = 0
Query: 60 VEESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 119
V +RWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN
Sbjct: 39 VHNQSHRRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 98
Query: 120 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 179
VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK
Sbjct: 99 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 158
Query: 180 TFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 239
TFSRDEVSRLLEVL+SRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL
Sbjct: 159 TFSRDEVSRLLEVLQSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 218
Query: 240 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 299
ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM
Sbjct: 219 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 278
Query: 300 FKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 359
FKPFIPSMSPNPSTCWP AMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK
Sbjct: 279 FKPFIPSMSPNPSTCWPSAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 338
Query: 360 LTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 419
LTQLQGDGQKFVNTPSPLWQRSRSP YSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS
Sbjct: 339 LTQLQGDGQKFVNTPSPLWQRSRSPTYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 398
Query: 420 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 479
RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP
Sbjct: 399 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 458
Query: 480 QSSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 539
QSSQV RTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ
Sbjct: 459 QSSQVVRTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 518
Query: 540 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 599
KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL
Sbjct: 519 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 578
Query: 600 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFT 659
NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFT
Sbjct: 579 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFT 638
Query: 660 FPVSGASSGMLSEPPRPSIFPSTSLGGVIQSALKPQ--ILSLALD-----PMIIRDFPSV 712
FPVSGASSGMLSEPP PSIFPSTSLGG LK + + S + D P ++ FPS+
Sbjct: 639 FPVSGASSGMLSEPPTPSIFPSTSLGGGQPLLLKTETPVPSYSFDSKKTSPSLVFSFPSI 698
BLAST of Csor.00g216300 vs. NCBI nr
Match:
XP_023515697.1 (nuclear pore complex protein NUP1-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023515698.1 nuclear pore complex protein NUP1-like isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1182 bits (3057), Expect = 0.0
Identity = 606/660 (91.82%), Postives = 623/660 (94.39%), Query Frame = 0
Query: 60 VEESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 119
V +RWLSKLVDP YRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN
Sbjct: 39 VHNQSQRRWLSKLVDPTYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 98
Query: 120 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 179
VSGEEPQN GVSTLVGLPGSSGEANRSEN+SDFNGCQKDKENNALGGNGKIDVEKWIQGK
Sbjct: 99 VSGEEPQNQGVSTLVGLPGSSGEANRSENDSDFNGCQKDKENNALGGNGKIDVEKWIQGK 158
Query: 180 TFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 239
TFSRDEVSRLLEVL+SRALEPSNKVEDNTFSPQSIEKQVE PSTANRVLEMPREGKQEEL
Sbjct: 159 TFSRDEVSRLLEVLQSRALEPSNKVEDNTFSPQSIEKQVEPPSTANRVLEMPREGKQEEL 218
Query: 240 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 299
ERAT GNLTP PHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM
Sbjct: 219 ERATWGNLTPRPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 278
Query: 300 FKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 359
PFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK
Sbjct: 279 SMPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 338
Query: 360 LTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 419
LTQLQGDGQKFVNTPSPLWQRSRSPAYS+MTSSKDPLDE TGSIGLTCSLQHKASA TNS
Sbjct: 339 LTQLQGDGQKFVNTPSPLWQRSRSPAYSVMTSSKDPLDEGTGSIGLTCSLQHKASAATNS 398
Query: 420 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 479
RRSAYFYPPQQPEME+ENNISEAIFPDMKKNL+RGGAS IPLSQSVGINNSESSLPTVRP
Sbjct: 399 RRSAYFYPPQQPEMEVENNISEAIFPDMKKNLERGGASIIPLSQSVGINNSESSLPTVRP 458
Query: 480 QSSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 539
QSSQVARTILEHITRNPPTPKEKTEELKRA+EWKKTPS+NV SVKPNETSSLAVD+DSHQ
Sbjct: 459 QSSQVARTILEHITRNPPTPKEKTEELKRAVEWKKTPSSNVLSVKPNETSSLAVDVDSHQ 518
Query: 540 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 599
KANQVDQNCHPQLSD+GKTMSTVLPKEGAG NPDAANQNPYGLKFRLSNAESKHKDDAGL
Sbjct: 519 KANQVDQNCHPQLSDKGKTMSTVLPKEGAGINPDAANQNPYGLKFRLSNAESKHKDDAGL 578
Query: 600 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFT 659
NIGSSSPKAVPKIFPALGSEV TQIKPSPSLGGKPIFPSITI+KPESKWAFSSDSGSAFT
Sbjct: 579 NIGSSSPKAVPKIFPALGSEVGTQIKPSPSLGGKPIFPSITINKPESKWAFSSDSGSAFT 638
Query: 660 FPVSGASSGMLSEPPRPSIFPSTSLGGVIQSALKPQ--ILSLALD-----PMIIRDFPSV 712
FPVSGASSGMLSEPP PSIFPSTSLGG LKP+ + S + D P ++ FPS+
Sbjct: 639 FPVSGASSGMLSEPPTPSIFPSTSLGGGQPLLLKPETPVPSYSFDSKKTSPSLVFSFPSI 698
BLAST of Csor.00g216300 vs. NCBI nr
Match:
XP_022987623.1 (nuclear pore complex protein NUP1-like isoform X1 [Cucurbita maxima])
HSP 1 Score: 1167 bits (3020), Expect = 0.0
Identity = 603/660 (91.36%), Postives = 620/660 (93.94%), Query Frame = 0
Query: 60 VEESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 119
V +RWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVE EVEDN
Sbjct: 39 VHNQSQRRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEVEVEDN 98
Query: 120 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 179
VSGEEPQN GVSTLVGLPGSSGEANRSEN+SDFNGCQKDKENNALGGNGKIDVEKWIQGK
Sbjct: 99 VSGEEPQNKGVSTLVGLPGSSGEANRSENDSDFNGCQKDKENNALGGNGKIDVEKWIQGK 158
Query: 180 TFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 239
TFSRDEVSRLL VL+SRALEPSNKVEDNTFSPQSIEKQVEQ STANRVLEMPREGKQEEL
Sbjct: 159 TFSRDEVSRLLVVLQSRALEPSNKVEDNTFSPQSIEKQVEQLSTANRVLEMPREGKQEEL 218
Query: 240 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 299
ERAT GNLTPHPHSLKLREVGASPVDIAR YMSNQKSEPGLASDKMPDDEKALRHGDHQM
Sbjct: 219 ERATWGNLTPHPHSLKLREVGASPVDIARVYMSNQKSEPGLASDKMPDDEKALRHGDHQM 278
Query: 300 FKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 359
KPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK
Sbjct: 279 PKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 338
Query: 360 LTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 419
LTQLQGD QKFVNTPSPLW+RSRSPAYSMMTSSKDPLDEATGSIGLT SLQHK SAVTNS
Sbjct: 339 LTQLQGDDQKFVNTPSPLWRRSRSPAYSMMTSSKDPLDEATGSIGLTSSLQHKTSAVTNS 398
Query: 420 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 479
RRSAYFYPPQQPEME+ENNISEAIFPDMKKNL+RGGASTIPLSQSVGINNSESSLPT+RP
Sbjct: 399 RRSAYFYPPQQPEMEVENNISEAIFPDMKKNLERGGASTIPLSQSVGINNSESSLPTLRP 458
Query: 480 QSSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 539
QSSQVARTILEHITRNPPTPKEKTEELKRAI+WKKTPS+NV SVKPNETSSLAVDIDSHQ
Sbjct: 459 QSSQVARTILEHITRNPPTPKEKTEELKRAIDWKKTPSSNVLSVKPNETSSLAVDIDSHQ 518
Query: 540 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 599
KANQVDQNCHPQLSD+GKTMSTVLPKEGAGRNPDAANQNPY LKFRLSNAESKHKDDAGL
Sbjct: 519 KANQVDQNCHPQLSDKGKTMSTVLPKEGAGRNPDAANQNPYCLKFRLSNAESKHKDDAGL 578
Query: 600 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFT 659
NIGSSSPKAVPKIF ALGSEV TQIK SPSLGGKPIFPSITI+KPESKWAFSSDSGSAFT
Sbjct: 579 NIGSSSPKAVPKIFRALGSEVGTQIKHSPSLGGKPIFPSITINKPESKWAFSSDSGSAFT 638
Query: 660 FPVSGASSGMLSEPPRPSIFPSTSLGGVIQSALKPQ--ILSLALD-----PMIIRDFPSV 712
FPVSGASSGMLSEPP PSIFPSTSLGG KP+ + S + D P ++ FPS+
Sbjct: 639 FPVSGASSGMLSEPPTPSIFPSTSLGGGQPLLFKPETPVPSYSFDSKKTSPSLVFSFPSI 698
BLAST of Csor.00g216300 vs. NCBI nr
Match:
KAG7023514.1 (Nuclear pore complex protein NUP1 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1127 bits (2915), Expect = 0.0
Identity = 590/656 (89.94%), Postives = 599/656 (91.31%), Query Frame = 0
Query: 60 VEESEAKRWLSKLVD------PA-----YRLITGGATRLLPYLFPKPLPSNALPSPGDED 119
V +RWLSKLVD PA + + + L+P+ L + +P +
Sbjct: 39 VHNQSHRRWLSKLVDLPTGSLPAEPPDCFHICSRNHCPLMPFR----LLETKIKTPENVI 98
Query: 120 QDKVEAEVEDNVSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNG 179
DKVEAEVEDNVSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNG
Sbjct: 99 IDKVEAEVEDNVSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNG 158
Query: 180 KIDVEKWIQGKTFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVL 239
KIDVEKWIQGKTFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVL
Sbjct: 159 KIDVEKWIQGKTFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVL 218
Query: 240 EMPREGKQEELERATGGNLTPHPHSL------------------KLREVGASPVDIARAY 299
EMPREGKQEELERATGGNLTPHPHSL KLREVGASPVDIA AY
Sbjct: 219 EMPREGKQEELERATGGNLTPHPHSLVSRYNVLNIKETGSSIGMKLREVGASPVDIASAY 278
Query: 300 MSNQKSEPGLASDKMPDDEKALRHGDHQMFKPFIPSMSPNPSTCWPGAMSESQRGYVTPR 359
MSNQKSEPGLASDKMPDDEKALRHGDHQMFKPFIPSMSPNPSTCWPGAMSESQRGYVTPR
Sbjct: 279 MSNQKSEPGLASDKMPDDEKALRHGDHQMFKPFIPSMSPNPSTCWPGAMSESQRGYVTPR 338
Query: 360 SQRGRFGLHNFPRTPYSRSIFSMSKSKSKLTQLQGDGQKFVNTPSPLWQRSRSPAYSMMT 419
SQRGRFGLHNFPRTPYSRSIFSMSKSKSKLTQLQGDGQKFVNTPSPLWQRSRSPAYSMMT
Sbjct: 339 SQRGRFGLHNFPRTPYSRSIFSMSKSKSKLTQLQGDGQKFVNTPSPLWQRSRSPAYSMMT 398
Query: 420 SSKDPLDEATGSIGLTCSLQHKASAVTNSRRSAYFYPPQQPEMEIENNISEAIFPDMKKN 479
SSKDPLDEATGSIGLTCSLQHKASAVTNSRRSAYFYPPQQPEMEIENNISEAIFPDMKKN
Sbjct: 399 SSKDPLDEATGSIGLTCSLQHKASAVTNSRRSAYFYPPQQPEMEIENNISEAIFPDMKKN 458
Query: 480 LDRGGASTIPLSQSVGINNSESSLPTVRPQSSQVARTILEHITRNPPTPKEKTEELKRAI 539
LDRGGASTIPLSQSVGINNSESSLPTVRPQSSQVARTILEHITRNPPTPKEKTEELKRAI
Sbjct: 459 LDRGGASTIPLSQSVGINNSESSLPTVRPQSSQVARTILEHITRNPPTPKEKTEELKRAI 518
Query: 540 EWKKTPSANVPSVKPNETSSLAVDIDSHQKANQVDQNCHPQLSDEGKTMSTVLPKEGAGR 599
EWKKTPSANVPSVKPNETSSLAVDIDSHQKANQVDQNCHPQLSDEGKTMSTVLPKEGAGR
Sbjct: 519 EWKKTPSANVPSVKPNETSSLAVDIDSHQKANQVDQNCHPQLSDEGKTMSTVLPKEGAGR 578
Query: 600 NPDAANQNPYGLKFRLSNAESKHKDDAGLNIGSSSPKAVPKIFPALGSEVWTQIKPSPSL 659
NPDAANQNPYGLKFR SNAESKHKDDAGLNIGSSSPKAVPKIFPALGSEVWTQIKPSPSL
Sbjct: 579 NPDAANQNPYGLKFRFSNAESKHKDDAGLNIGSSSPKAVPKIFPALGSEVWTQIKPSPSL 638
Query: 660 GGKPIFPSITISKPESKWAFSSDSGSAFTFPVSGASSGMLSEPPRPSIFPSTSLGG 686
GGKPIFPSITISKPESKWAFSSDSGSAFTFPVSGASSGMLSEPPRPSIFPSTSLGG
Sbjct: 639 GGKPIFPSITISKPESKWAFSSDSGSAFTFPVSGASSGMLSEPPRPSIFPSTSLGG 690
BLAST of Csor.00g216300 vs. ExPASy TrEMBL
Match:
A0A6J1HA42 (nuclear pore complex protein NUP1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111461519 PE=4 SV=1)
HSP 1 Score: 1218 bits (3151), Expect = 0.0
Identity = 624/660 (94.55%), Postives = 632/660 (95.76%), Query Frame = 0
Query: 60 VEESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 119
V +RWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN
Sbjct: 39 VHNQSHRRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 98
Query: 120 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 179
VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK
Sbjct: 99 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 158
Query: 180 TFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 239
TFSRDEVSRLLEVL+SRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL
Sbjct: 159 TFSRDEVSRLLEVLQSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 218
Query: 240 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 299
ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM
Sbjct: 219 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 278
Query: 300 FKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 359
FKPFIPSMSPNPSTCWP AMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK
Sbjct: 279 FKPFIPSMSPNPSTCWPSAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 338
Query: 360 LTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 419
LTQLQGDGQKFVNTPSPLWQRSRSP YSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS
Sbjct: 339 LTQLQGDGQKFVNTPSPLWQRSRSPTYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 398
Query: 420 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 479
RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP
Sbjct: 399 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 458
Query: 480 QSSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 539
QSSQV RTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ
Sbjct: 459 QSSQVVRTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 518
Query: 540 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 599
KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL
Sbjct: 519 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 578
Query: 600 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFT 659
NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFT
Sbjct: 579 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFT 638
Query: 660 FPVSGASSGMLSEPPRPSIFPSTSLGGVIQSALKPQ--ILSLALD-----PMIIRDFPSV 712
FPVSGASSGMLSEPP PSIFPSTSLGG LK + + S + D P ++ FPS+
Sbjct: 639 FPVSGASSGMLSEPPTPSIFPSTSLGGGQPLLLKTETPVPSYSFDSKKTSPSLVFSFPSI 698
BLAST of Csor.00g216300 vs. ExPASy TrEMBL
Match:
A0A6J1JJZ8 (nuclear pore complex protein NUP1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111485124 PE=4 SV=1)
HSP 1 Score: 1167 bits (3020), Expect = 0.0
Identity = 603/660 (91.36%), Postives = 620/660 (93.94%), Query Frame = 0
Query: 60 VEESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 119
V +RWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVE EVEDN
Sbjct: 39 VHNQSQRRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEVEVEDN 98
Query: 120 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 179
VSGEEPQN GVSTLVGLPGSSGEANRSEN+SDFNGCQKDKENNALGGNGKIDVEKWIQGK
Sbjct: 99 VSGEEPQNKGVSTLVGLPGSSGEANRSENDSDFNGCQKDKENNALGGNGKIDVEKWIQGK 158
Query: 180 TFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 239
TFSRDEVSRLL VL+SRALEPSNKVEDNTFSPQSIEKQVEQ STANRVLEMPREGKQEEL
Sbjct: 159 TFSRDEVSRLLVVLQSRALEPSNKVEDNTFSPQSIEKQVEQLSTANRVLEMPREGKQEEL 218
Query: 240 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 299
ERAT GNLTPHPHSLKLREVGASPVDIAR YMSNQKSEPGLASDKMPDDEKALRHGDHQM
Sbjct: 219 ERATWGNLTPHPHSLKLREVGASPVDIARVYMSNQKSEPGLASDKMPDDEKALRHGDHQM 278
Query: 300 FKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 359
KPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK
Sbjct: 279 PKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 338
Query: 360 LTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 419
LTQLQGD QKFVNTPSPLW+RSRSPAYSMMTSSKDPLDEATGSIGLT SLQHK SAVTNS
Sbjct: 339 LTQLQGDDQKFVNTPSPLWRRSRSPAYSMMTSSKDPLDEATGSIGLTSSLQHKTSAVTNS 398
Query: 420 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 479
RRSAYFYPPQQPEME+ENNISEAIFPDMKKNL+RGGASTIPLSQSVGINNSESSLPT+RP
Sbjct: 399 RRSAYFYPPQQPEMEVENNISEAIFPDMKKNLERGGASTIPLSQSVGINNSESSLPTLRP 458
Query: 480 QSSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 539
QSSQVARTILEHITRNPPTPKEKTEELKRAI+WKKTPS+NV SVKPNETSSLAVDIDSHQ
Sbjct: 459 QSSQVARTILEHITRNPPTPKEKTEELKRAIDWKKTPSSNVLSVKPNETSSLAVDIDSHQ 518
Query: 540 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 599
KANQVDQNCHPQLSD+GKTMSTVLPKEGAGRNPDAANQNPY LKFRLSNAESKHKDDAGL
Sbjct: 519 KANQVDQNCHPQLSDKGKTMSTVLPKEGAGRNPDAANQNPYCLKFRLSNAESKHKDDAGL 578
Query: 600 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFT 659
NIGSSSPKAVPKIF ALGSEV TQIK SPSLGGKPIFPSITI+KPESKWAFSSDSGSAFT
Sbjct: 579 NIGSSSPKAVPKIFRALGSEVGTQIKHSPSLGGKPIFPSITINKPESKWAFSSDSGSAFT 638
Query: 660 FPVSGASSGMLSEPPRPSIFPSTSLGGVIQSALKPQ--ILSLALD-----PMIIRDFPSV 712
FPVSGASSGMLSEPP PSIFPSTSLGG KP+ + S + D P ++ FPS+
Sbjct: 639 FPVSGASSGMLSEPPTPSIFPSTSLGGGQPLLFKPETPVPSYSFDSKKTSPSLVFSFPSI 698
BLAST of Csor.00g216300 vs. ExPASy TrEMBL
Match:
A0A6J1HA80 (nuclear pore complex protein NUP1-like isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111461519 PE=4 SV=1)
HSP 1 Score: 1118 bits (2892), Expect = 0.0
Identity = 564/580 (97.24%), Postives = 569/580 (98.10%), Query Frame = 0
Query: 60 VEESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 119
V +RWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN
Sbjct: 39 VHNQSHRRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 98
Query: 120 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 179
VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK
Sbjct: 99 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 158
Query: 180 TFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 239
TFSRDEVSRLLEVL+SRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL
Sbjct: 159 TFSRDEVSRLLEVLQSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 218
Query: 240 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 299
ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM
Sbjct: 219 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 278
Query: 300 FKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 359
FKPFIPSMSPNPSTCWP AMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK
Sbjct: 279 FKPFIPSMSPNPSTCWPSAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 338
Query: 360 LTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 419
LTQLQGDGQKFVNTPSPLWQRSRSP YSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS
Sbjct: 339 LTQLQGDGQKFVNTPSPLWQRSRSPTYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 398
Query: 420 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 479
RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP
Sbjct: 399 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 458
Query: 480 QSSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 539
QSSQV RTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ
Sbjct: 459 QSSQVVRTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 518
Query: 540 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 599
KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL
Sbjct: 519 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 578
Query: 600 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSI 639
NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGG+ + P +
Sbjct: 579 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGRVVSPHV 618
BLAST of Csor.00g216300 vs. ExPASy TrEMBL
Match:
A0A6J1HC90 (uncharacterized protein LOC111461519 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111461519 PE=4 SV=1)
HSP 1 Score: 1098 bits (2840), Expect = 0.0
Identity = 562/592 (94.93%), Postives = 569/592 (96.11%), Query Frame = 0
Query: 128 LGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGKTFSRDEVS 187
LGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGKTFSRDEVS
Sbjct: 36 LGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGKTFSRDEVS 95
Query: 188 RLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEELERATGGNL 247
RLLEVL+SRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEELERATGGNL
Sbjct: 96 RLLEVLQSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEELERATGGNL 155
Query: 248 TPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQMFKPFIPSM 307
TPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQMFKPFIPSM
Sbjct: 156 TPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQMFKPFIPSM 215
Query: 308 SPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSKLTQLQGDG 367
SPNPSTCWP AMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSKLTQLQGDG
Sbjct: 216 SPNPSTCWPSAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSKLTQLQGDG 275
Query: 368 QKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNSRRSAYFYP 427
QKFVNTPSPLWQRSRSP YSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNSRRSAYFYP
Sbjct: 276 QKFVNTPSPLWQRSRSPTYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNSRRSAYFYP 335
Query: 428 PQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRPQSSQVART 487
PQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRPQSSQV RT
Sbjct: 336 PQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRPQSSQVVRT 395
Query: 488 ILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQKANQVDQN 547
ILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQKANQVDQN
Sbjct: 396 ILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQKANQVDQN 455
Query: 548 CHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGLNIGSSSPK 607
CHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGLNIGSSSPK
Sbjct: 456 CHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGLNIGSSSPK 515
Query: 608 AVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFTFPVSGASS 667
AVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFTFPVSGASS
Sbjct: 516 AVPKIFPALGSEVWTQIKPSPSLGGKPIFPSITISKPESKWAFSSDSGSAFTFPVSGASS 575
Query: 668 GMLSEPPRPSIFPSTSLGGVIQSALKPQ--ILSLALD-----PMIIRDFPSV 712
GMLSEPP PSIFPSTSLGG LK + + S + D P ++ FPS+
Sbjct: 576 GMLSEPPTPSIFPSTSLGGGQPLLLKTETPVPSYSFDSKKTSPSLVFSFPSI 627
BLAST of Csor.00g216300 vs. ExPASy TrEMBL
Match:
A0A6J1JJE0 (nuclear pore complex protein NUP1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111485124 PE=4 SV=1)
HSP 1 Score: 1067 bits (2760), Expect = 0.0
Identity = 544/580 (93.79%), Postives = 557/580 (96.03%), Query Frame = 0
Query: 60 VEESEAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDN 119
V +RWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVE EVEDN
Sbjct: 39 VHNQSQRRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEVEVEDN 98
Query: 120 VSGEEPQNLGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNALGGNGKIDVEKWIQGK 179
VSGEEPQN GVSTLVGLPGSSGEANRSEN+SDFNGCQKDKENNALGGNGKIDVEKWIQGK
Sbjct: 99 VSGEEPQNKGVSTLVGLPGSSGEANRSENDSDFNGCQKDKENNALGGNGKIDVEKWIQGK 158
Query: 180 TFSRDEVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEEL 239
TFSRDEVSRLL VL+SRALEPSNKVEDNTFSPQSIEKQVEQ STANRVLEMPREGKQEEL
Sbjct: 159 TFSRDEVSRLLVVLQSRALEPSNKVEDNTFSPQSIEKQVEQLSTANRVLEMPREGKQEEL 218
Query: 240 ERATGGNLTPHPHSLKLREVGASPVDIARAYMSNQKSEPGLASDKMPDDEKALRHGDHQM 299
ERAT GNLTPHPHSLKLREVGASPVDIAR YMSNQKSEPGLASDKMPDDEKALRHGDHQM
Sbjct: 219 ERATWGNLTPHPHSLKLREVGASPVDIARVYMSNQKSEPGLASDKMPDDEKALRHGDHQM 278
Query: 300 FKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 359
KPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK
Sbjct: 279 PKPFIPSMSPNPSTCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSRSIFSMSKSKSK 338
Query: 360 LTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQHKASAVTNS 419
LTQLQGD QKFVNTPSPLW+RSRSPAYSMMTSSKDPLDEATGSIGLT SLQHK SAVTNS
Sbjct: 339 LTQLQGDDQKFVNTPSPLWRRSRSPAYSMMTSSKDPLDEATGSIGLTSSLQHKTSAVTNS 398
Query: 420 RRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNSESSLPTVRP 479
RRSAYFYPPQQPEME+ENNISEAIFPDMKKNL+RGGASTIPLSQSVGINNSESSLPT+RP
Sbjct: 399 RRSAYFYPPQQPEMEVENNISEAIFPDMKKNLERGGASTIPLSQSVGINNSESSLPTLRP 458
Query: 480 QSSQVARTILEHITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPNETSSLAVDIDSHQ 539
QSSQVARTILEHITRNPPTPKEKTEELKRAI+WKKTPS+NV SVKPNETSSLAVDIDSHQ
Sbjct: 459 QSSQVARTILEHITRNPPTPKEKTEELKRAIDWKKTPSSNVLSVKPNETSSLAVDIDSHQ 518
Query: 540 KANQVDQNCHPQLSDEGKTMSTVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGL 599
KANQVDQNCHPQLSD+GKTMSTVLPKEGAGRNPDAANQNPY LKFRLSNAESKHKDDAGL
Sbjct: 519 KANQVDQNCHPQLSDKGKTMSTVLPKEGAGRNPDAANQNPYCLKFRLSNAESKHKDDAGL 578
Query: 600 NIGSSSPKAVPKIFPALGSEVWTQIKPSPSLGGKPIFPSI 639
NIGSSSPKAVPKIF ALGSEV TQIK SPSLGG+ + P +
Sbjct: 579 NIGSSSPKAVPKIFRALGSEVGTQIKHSPSLGGRVVSPHV 618
BLAST of Csor.00g216300 vs. TAIR 10
Match:
AT5G20200.1 (nucleoporin-related )
HSP 1 Score: 241.1 bits (614), Expect = 2.9e-63
Identity = 221/675 (32.74%), Postives = 327/675 (48.44%), Query Frame = 0
Query: 49 LREERRRHHTAVEES-------EAKRWLSKLVDPAYRLITGGATRLLPYLFPKPLPSNAL 108
L+ + R H A S + + W+S++VDPAYR+I+GGATR+LPY F + AL
Sbjct: 28 LKRQSARRHAATPYSRPTQNQVQRRPWISRIVDPAYRIISGGATRILPYFFSNAASAPAL 87
Query: 109 PSPGDEDQDKVEAEVEDNVSGEEPQNLGVS-----TLVGLPGSSGEANRSENNSDFNGCQ 168
+P EDQ++ + E+++N +P +S + + G SG AN +E N + +
Sbjct: 88 AAP-PEDQNQHQGELQNNPQDNDPSVTPISNKPEPASIEVGGPSGTANVNEGNFSISAQR 147
Query: 169 KDKENNALGGNGKI-DVEKWIQGKTFSRDEVSRLLEVLRSRALE-PSNKVEDNTFSPQSI 228
+ K AL + I ++E+ ++GKTFS+ E+ RL+E++ SRA++ P K ++
Sbjct: 148 RGKA--ALNDDVAISELERLMEGKTFSQAEIDRLIEMISSRAIDLPDVKRDERNLEIPLR 207
Query: 229 EKQVEQPSTANRVLEMPREGKQEELERATGGNLTPHPHSL-----KLR-EVGASPVDIAR 288
E + S ++ E P GK E TP S+ K+R EVG SP ++A+
Sbjct: 208 EGAKKNMSLFDKAKE-PIGGKDANSE--IWATPTPLAKSIILDGDKIRDEVGLSPAELAK 267
Query: 289 AYMSNQKSEPGLASDKMPDDEKALRHGDHQMFKPFIPSMSPNPSTCWPGAMSESQRGYVT 348
AYM Q S + + +EK + K + S S PS CWPG S Q G+ T
Sbjct: 268 AYMGGQTSSSS-SQGFVARNEKDCLDRSMLVGKSSLASPSSKPSACWPGIKSSEQSGFAT 327
Query: 349 PRSQRGRFGLHNFPRTPYSRSIFSMSKSKSKLTQLQGDGQKFV-NTPSPLWQRSRSPAYS 408
P+S+R +GL NFPRTPYSR+I +S SKSKL QLQ D K + N SP +S Y
Sbjct: 328 PQSRRESYGLQNFPRTPYSRTI--LSNSKSKLMQLQNDSSKHLSNLQSP--SQSVERRYG 387
Query: 409 MMTSSKDPLDEATGSIGLTCSLQHKASAVTNSRRSAYFYPPQQPEMEIENNISEAIFPDM 468
++ +D GL + + T S S Y P + EN+ +
Sbjct: 388 QLSKGRDG--------GLFGPSRRTRQSATPSMVSPY-SRPSRGASRFENSA-------I 447
Query: 469 KKNLDRGGASTIPLSQSVGI---NNSESSLPTVRPQSSQVARTILEHI--TRNPPTPKEK 528
K+ + G +S + SQ +E TV SSQ+ARTIL+H+ T++ TPK K
Sbjct: 448 MKSSEAGESSYLSRSQITTYGKHKEAEVGTLTVPTHSSQIARTILDHLERTQSQSTPKNK 507
Query: 529 TEELKRAIEWK--------KTPSANVPSVKPNETSSLAVDIDSHQKANQVDQNCHPQLSD 588
T ELK A W+ + S++V +VK + ++ L DI + NQ P +
Sbjct: 508 TAELKLATSWRHPQSSKTVEKSSSDVTNVKKDGSAKLHEDIQNIFSQNQPSSVLKPPATT 567
Query: 589 EGKTMS----TVLPKEGAGRNPDAANQNPYGLKFRLSNAESKHKDDAGLNIGSSSPKAVP 648
G + T G R AA+ L++ + +G+SS A
Sbjct: 568 TGDIQNGMNKTASATNGIFRGTQAASSGGNALQYEFGKPKGSLSRSMHDELGTSSQDAAK 627
Query: 649 KIFPALGSEVWTQIK-PSPSLG-GKPIFPSITISKPESKWAFSSDSGSAFTFPVSGASSG 684
+ + G E K PS SLG KP+ PSI+++KP KWA S S + FTFPVS +
Sbjct: 628 AVPYSFGGETANLPKPPSHSLGNNKPVLPSISVAKPFQKWAVPSGSNAGFTFPVSSSDGT 675
BLAST of Csor.00g216300 vs. TAIR 10
Match:
AT3G10650.1 (BEST Arabidopsis thaliana protein match is: nucleoporin-related (TAIR:AT5G20200.1); Has 61042 Blast hits to 31782 proteins in 2093 species: Archae - 202; Bacteria - 16480; Metazoa - 16017; Fungi - 12552; Plants - 1653; Viruses - 629; Other Eukaryotes - 13509 (source: NCBI BLink). )
HSP 1 Score: 88.2 bits (217), Expect = 3.2e-17
Identity = 172/658 (26.14%), Postives = 247/658 (37.54%), Query Frame = 0
Query: 68 WLSKLVDPAYRLITGGATRLLPYLFPKPLPSNALPSPGDEDQDKVEAEVEDNVSGEEPQN 127
WLSKLVDPA RLIT A RL L K L S P E Q ++ E V+ E
Sbjct: 58 WLSKLVDPAQRLITYSAQRLFGSLSRKRLGSGETPLQSPEQQKQLP---ERGVNQET--- 117
Query: 128 LGVSTLVGLPGSSGEANRSENNSDFNGCQKDKENNAL---GGNGKIDVEKWIQGKTFSRD 187
G N S NG + ++ NA +G D+EK +QGKTF+R
Sbjct: 118 -----------KVGHKEDVSNLSMKNGLIRMEDTNASVDPPKDGFTDLEKILQGKTFTRS 177
Query: 188 EVSRLLEVLRSRALEPSNKVEDNTFSPQSIEKQVEQPSTANRVLEMPREGKQEELERATG 247
EV RL +LRS+A + S E+ + V P + R P G L
Sbjct: 178 EVDRLTTLLRSKAADSSTMNEEQR---NEVGMVVRHPPSHERDRTHPDNGSMNTLVSTPP 237
Query: 248 GNLTPHPHSLKLREVGASPVDIARAYMSNQKSEP-----GLASDKMPDDEKALRHGDHQM 307
G+L L E ASP +A+AYM ++ SE GL +D L
Sbjct: 238 GSLR------TLDECIASPAQLAKAYMGSRPSEVTPSMLGLRGQAGREDSVFLNR----- 297
Query: 308 FKPFIPSMSPNPS-TCWPGAMSESQRGYVTPRSQRGRFGLHNFPRTPYSR--------SI 367
PF P SP S P + G+VTPRS RGR +++ RTPYSR S+
Sbjct: 298 -TPF-PQKSPTMSLVTKPSGQRPLENGFVTPRS-RGRSAVYSMARTPYSRPQSSVKIGSL 357
Query: 368 FSMSKSKSKLTQLQGDGQKFVNTPSPLWQRSRSPAYSMMTSSKDPLDEATGSIGLTCSLQ 427
F S SK + + G Q F S L +RS LD GS+G ++
Sbjct: 358 FQASPSKWEESLPSGSRQGF---QSGLKRRS------------SVLDNDIGSVGPVRRIR 417
Query: 428 HKASAVTNSRRSAYFYPPQQPEMEIENNISEAIFPDMKKNLDRGGASTIPLSQSVGINNS 487
K++ + S P + + + N GG T S+ +
Sbjct: 418 QKSNLSSRS----LALPVSESPLSVRAN---------------GGEKTTHTSKDSAEDIP 477
Query: 488 ESSLPTVRPQSSQVARTILEH----ITRNPPTPKEKTEELKRAIEWKKTPSANVPSVKPN 547
SS V +SS++A IL+ ++ +P + + + R K + P N
Sbjct: 478 GSSFNLVPTKSSEMASKILQQLDKLVSTREKSPSKLSPSMLRGPALKSLQNVEAPKFLGN 537
Query: 548 ETSSLAVDIDSHQKANQVDQN--CHPQLSDEGKTMSTV--LPKEGAGRNPDAANQN---- 607
A DS + ++ + L+ KT V K G+ ++ D +
Sbjct: 538 LPEKKANSPDSSYQKQEISRESVSREVLAQSEKTGDAVDGTSKTGSSKDQDMRGKGVYMP 597
Query: 608 ---------PYGLKFRLSNAESKHKDDAGLNIGSSSPKAVPK--IFPALGSEVWTQIKPS 667
P FR+S E + D L S+ + K F S + S
Sbjct: 598 LTNSLEEHPPKKRSFRMSAHEDFLELDDDLGAASTPCEVAEKQNAFEVEKSHI------S 641
Query: 668 PSLGGKPIFPSITI-----------SKPESKWAFSSDSGSAFTFPVSGA-SSGMLSEP 674
+G KP+ PS + S+ S + ++ FP+ S M SEP
Sbjct: 658 MPIGEKPLTPSEAMPSTSYISNGDASQGTSNGSLETERNKFVAFPIEAVQQSNMASEP 641
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9CAF4 | 4.4e-16 | 26.14 | Nuclear pore complex protein NUP1 OS=Arabidopsis thaliana OX=3702 GN=NUP1 PE=1 S... | [more] |
Match Name | E-value | Identity | Description | |
KAG6589842.1 | 0.0 | 100.00 | Nuclear pore complex protein NUP1, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022960828.1 | 0.0 | 94.55 | nuclear pore complex protein NUP1-like isoform X1 [Cucurbita moschata] | [more] |
XP_023515697.1 | 0.0 | 91.82 | nuclear pore complex protein NUP1-like isoform X1 [Cucurbita pepo subsp. pepo] >... | [more] |
XP_022987623.1 | 0.0 | 91.36 | nuclear pore complex protein NUP1-like isoform X1 [Cucurbita maxima] | [more] |
KAG7023514.1 | 0.0 | 89.94 | Nuclear pore complex protein NUP1 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1HA42 | 0.0 | 94.55 | nuclear pore complex protein NUP1-like isoform X1 OS=Cucurbita moschata OX=3662 ... | [more] |
A0A6J1JJZ8 | 0.0 | 91.36 | nuclear pore complex protein NUP1-like isoform X1 OS=Cucurbita maxima OX=3661 GN... | [more] |
A0A6J1HA80 | 0.0 | 97.24 | nuclear pore complex protein NUP1-like isoform X3 OS=Cucurbita moschata OX=3662 ... | [more] |
A0A6J1HC90 | 0.0 | 94.93 | uncharacterized protein LOC111461519 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JJE0 | 0.0 | 93.79 | nuclear pore complex protein NUP1-like isoform X2 OS=Cucurbita maxima OX=3661 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT5G20200.1 | 2.9e-63 | 32.74 | nucleoporin-related | [more] |
AT3G10650.1 | 3.2e-17 | 26.14 | BEST Arabidopsis thaliana protein match is: nucleoporin-related (TAIR:AT5G20200.... | [more] |