HG10001002 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10001002
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: tRNA-uridine aminocarboxypropyltransferase
Location: Chr09: 13017582 .. 13022793 (+)
RNA-Seq Expression: HG10001002
Synteny: HG10001002
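As a quick check on the Location field, the length of the gene span can be derived from the two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: Chr09: 13017582 .. 13022793 (+)
gene_length = span_length(13017582, 13022793)
print(gene_length)
```

Under that assumption the gene spans 5,212 bp on the plus strand of Chr09.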
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTCTCCACTTTAAGGGCCTTTGCTCCAACATTCAATCGCTCTCCGTCTCGCGCGTTTTCAATCAAAACCCAGATGGATTCCATCACTAAAAACCCCGACGGCCACAGTTCCAGGTGCGCTTCCACCTCTAAACCCAGAGAAGCTCCGATCACGTTGCAGGAATGGCAGGGTTGGGGCTCCACGTCTCCTGTTCCCACCATGGTTACGGAAATTATCGAGGAATTGAAGGTTTTGGAGAAAAATATTGATGCCCAAATGAGTTTTGGTGGTACTGGGGGCAAGCTTCAGGTGTGCCACTAACCCATTGTATATGTTTTTCGCCTCTTCTTTTACTTTGTTACGTTTTGGTTCCTACTGCAACAGGTTCTGTTAAATCCTATACTCAATTATTGTTTCTATGTCTCTTGCCGTTGCATTCAACTCATCTAGATATTTTTAATGGAGTAAACGACAGCTTGAAACATTAAATTGATTCATGTCTTCTCTTTCAAAATGGTTGTGGAAGTTCTTCAATGAAGTGAATGTCATATGGAGACTTATTATTAGAAGTACGGTTTAGATAATCTCTGTTTGTCCACCAAACACCAGCTGCTAGAAATTTCCGTGACCCTTGGAATGGATTGACATTACTAAAAGTAAAACCAGGTCTTTCATGGAAAAAGGGATCACTTTTTAACTTAGCAATGGCAGAAAACTTGGGTAGTGGGAGGATATGTGGTTGGATTCTAACCCATCTTGCAATTTGTTCCCTAAAGAGTATGCATTGTCCAACACAAAAGGAAAGTTGGTCAGAGATGTTCAGCTCAGTTTTCCCTCAAGCTGGGAGCTTGTCTTTGTTAGTCAGAACGGTGATTAATCTTACTGAAATTTTGAAAAATGGGGAAGTTATCTTGTAAATCTCCTAATCTCAAATTGACCACCGTGAAAGTAAGTTGAACCTGAATTTAGCCAAGCGGATATGTGATGGAAACTTTCCTAAAAATGTAAAGTTCTTCCTTTAGACCTTGCCTTCTGAGAGCCTTAATATGCATGAAAGATTGTAAAAGAAATTTTGTATCTGTCTATATCTCCATGAGTTTGCAAAAATTGCTCCTCTAGTGAGGAATCCATAGACCACATTTTTATCCTCCATATTGTTACTACAAAAAGTTGGAAATTTATTTGCAAACTTTTCATCTGATTTGGTGTCATCCTAAAGACATCAGAAGCTGGCTTCAGGATCTCATCAGGTTGGTAGTTAAGTAGCAAAACTAGAATTCTTTGGTTAAGGGCCTGAAGGGATTATCCGTGGTTCATTTGCCAAAAAAAAAAGAGAAGGAAAAAAAAAGTTCAATGTTAAACTCAAAAGTCTGGTTCATTTCGCAAACATGTATTAACACTTAAATTTGTAGTGCTCAGCATAGGCTTTTTTGTAATTACTCCACATACTCCATCATTATAGGTTGGAAGGTCTTTTTGTAATCCTTTTCTTGGGAAAGTGGGATGCGTTGTCCATTTGTCCCCTAGGTTGTTTTGTTTGGCTCCAGTTTTTTAATGAAGTTCTCCGTTCCTTGTCAAAAAAGACAAAAAAAAAAAGTTACATTTTGGACTCAAATCGGAATCCTTACACATGTAATAAACACAAAACTACATGGTAATTCTCCTTGCTTCTCAAATGGCAGAACTATTCCTGCAGTATCTTTTTTGGTTATTACACATGTGAAAACGAATAAATTTTAGATATCCATTCCACGAACCATGAGGCTGTAGTGCTATATATGTCCCTTCTTATTTTTACTCATTGTGAAAAAGATTGCATTGCCAAGTTCAATTCCTCCATCCTACGTATTTAATCTATTCATTGTGAAAAAAATTGCATTGTTAATCTTTGGCCTTTTTTCAGGGATATTTTAAAACTCAGGAGGACAAAAAACACCGAGCTACCTATCAGGCTCTAGGTAGTTCTGAGCAGAAACTTCAATTCTTTTCTGCTCGACAAATAGCATGTCGTCTGCTTG
GGAGCCGAGATTATCTTTGTCAGAAGGTATAATCAATCTTAAATGATTATGTATTTCTTGGAGTTCTATTCATATTAACACCACCAATTTGTTGAATAGGTTTTTCTATTGCATTACTGATAGGCTGTTTAGTATCTACAGACCTTTATTATTACTTTATGTAACTTATCTGCCTTTTTTGAATTCTTTTGTGTCTGCTTAGTGCTGGCTGCCTTTTGAGGATTGTATGTGTTCAAGAGTCAAGCATTGTTCTCTATGGCATGGAGCACGGTTTTGGCTATATATGCATCCAAAGGTGTGTCTACTGCTGGTCAATTCTTGATCCTTCTCATCTACTAAACTTGACATAAAGTTTATCTAGAGTTTTTGTTCAGGATTTTCTGCGGCAGAACAACACAGGGAAGTTGTTGTTGCAAGTCTTTGGTGTAGAATCCACAACTTTGTGCCTCTATGGTATCTCAGAACATGAAGAAATCATGTGGAATGCATTCAAGTCAGCAGGTAACTGTATTCCTCGTGTAAGCTTTAACAAGTTTATAGCTTTATATTACTGTAGCAAGAACTAATGTAATATCTTATTCTAGGTCTAAGAAATTAAGTATTTCTTTAAGAGATTTATGGCATTCCTCTGAATCTAGTAATATGTTAGGAATGCTATTAGGATATTAAGGGTATATGAGTGAGTAGTTAGGGAGGTGGTTATGGAGTTTAGTTATAAATAAAGGAAGTGGGGAAGGAGGGTGTTAGACATTTGGTAATTGAGAGAGATTTCAATACGAGAGGGTCCAAGTATCTCGAATTACTTGTTTATATTGTAATTTCTTCACTGAAATTGCAATATAGTTCTCTTTGGTTTTTTTGTATATTTTCTGTGTTTGGATTGGATACTAACAAAATATATGTGGATAGTTGCTATCAGCCATAAACTGCCATGTGGTTTTCACACTGTAACTCATTAGCAAAGACGGTATGCCATGTAAAACTCTATTTTCTTCTTTTGTTTACTGTGGTTTTTTCAAATATTACTCAGGTAAAAGTAAGGTTTGTTGCCTTTATCCCAACAAGAATGCAATTTCCAAAGGTGTTCAGGAGGCCTTCAGCTCTGAACTGTCAACTAAGCTAGAAAACCCACAGCAAATGGTACATATTTTCTGAAGCAGTATAGTTAATGTCCGTTCTTGATTATACTACTCACTGTGCAGCTAGAATTTGACTTCATATTTCATATTATTAAATGGTAATGATAAAAGTAAAGACTACATGGTAGAAGAAATTCAACATACCTACGACATTTATGGTTGGATATACCTATGAAACATTATTTATGGAAGACTTCTAGGCTCAGCTATTCAATCCATACAGTTCCAAATAACGTACTTTGTATTCTGGCTGATGTTTTGTTTTTTTTGCTATGGTTCTGTTTTTGTAAAATTTTGTGTTCTTCCAGTTGAACAAGGTCATAACGTTCTTGTTGATTCTACTCATAGACTGATGGAGATGGAATTCTGAATTTTGTTTTGATTGATGGTACTTGGAGCAACTCTGCTGCAATGTTCAACCGACTCAAGGTAACGTTATGTAGCATTTTCGTATTATGTCGTCTAGGTGGGCATCTTCTGGGTTATTACTGGAGAAAGCCAAAGTTTTGCCCATAACAATATGCCTGAATATTTAATCCTGCATAATTGGGTGCTATTCTTGAATTAATACTTTTGGGATTATGTCTTAGTTTTTACTGTTTCATTTTTTTAACAAGTGGGATAGGAAAATCCACCCCAAGCAAATGTGAAACCAAAACAGGATTTTGAGTACTCCAAAAGGAGGCCTGCCATCTAGCTCCCCCATCTGTCAGCAATATTAAGAAAGATTCTCAAATTTCCTTTATTCCATGATTCCTGAAGAGTGACCTTATGAAGTTGGACCACCGTAGGATCCTGGTTTCTTGAGATTTTGCCCACGAAAGCAATTAAAGTATAGCATCCTACTTTTAGATGGCACTA
CCCAACAGAAATGGGCCTCAAACAGCTTCTTTCCACTATTTGATGGACTGCTACCATATATGCTTTGTTGGCTCTCAGTGAAATGGTGTAGTAGTAACATCCTGACAATAGGTCCTTGTTCGCTCTCTATTGTAATCACCTATATGTGTTTGGGGTTTCCCCATATTTCATTTATCGATGAAATTTTCTTTTATCAAAAAAATAGACAGTAGGTCCTTGTTCTGAGTATCAGGGACCTTTAGGAGGCCATGAAATCAATAAAAGATAAACATTATTCGGTTTGAACCTTCATAATGTTCTTTTGGGATCCTTCATTGATAGCTTATGTTCCTTTGTGTGATGTTTGATATTCAGACTGAGGTGGACCAAACTTTTGGAAACTACTCTAAAACTATATTCTGATGATGCAATTATGTAGGAGCAAGCTATATCGGTTTGGGGAGAGGACATTCCATGCATATCATTGGCTACGGGATCCTCTGCAATGCATAAACTCAGGTAAGAATTTGAGCTAGTCATCCAATATATTCGGGTTTTCTCGCATGATCTTTTTATAGGACATTGACTCTTAAGAGCTGCTTGTCTGAATGTGAAGGCCTCAACCATCATGGGACCGTACCTGCACTGCAGCAGCAGCAGCTAGCCTGCTCTTTGAGCTTCAACTCGTTCCAAAGTTCAGTTCGGTCGAATTGGAAAAACAGGGCGAAGCATTGGAAGATGCTCTGGAAGTACTGTTAGAAGCTCTCACCACTCGACGTATTCGGATGGGGCGGTCCATCACCCGTAAAGTAAGACATGCAAGCAGTTTTTGCTAATCGAGAATGATGTTCCTCGAAACGCCTGAGGTGAAAATTTTCAATGCTTTTATTGGTAGAATTATGCTAACATACGGCTTCTCACGCAGCTTGATTTAAATCTTTTCATTATACTTTTAGGATAATATCAAAACATTAGCCTTGTAGTTTGTCATTTGATCATACAATAATTTACTACATGTGGTGTTTGGAGGGGTGGGTTGGACTTACATTTGAAGGGATGAGTTGTCTAACTCCTTTTTCATTGGAGAAACCAAACAAACTTAAACAATACAAGCTTCCTAAACCTATAACTCCACCAGCACCAACATCAAATCCACCTCTTATGCCAACTCTACTAAAATTCATATTCGGATATGATACCTCAAAGACCGATGTAAAGGGTAGGTGTCACTAG

mRNA sequence

ATGGTCTCCACTTTAAGGGCCTTTGCTCCAACATTCAATCGCTCTCCGTCTCGCGCGTTTTCAATCAAAACCCAGATGGATTCCATCACTAAAAACCCCGACGGCCACAGTTCCAGGTGCGCTTCCACCTCTAAACCCAGAGAAGCTCCGATCACGTTGCAGGAATGGCAGGGTTGGGGCTCCACGTCTCCTGTTCCCACCATGGTTACGGAAATTATCGAGGAATTGAAGGTTTTGGAGAAAAATATTGATGCCCAAATGAGTTTTGGTGGTACTGGGGGCAAGCTTCAGGGATATTTTAAAACTCAGGAGGACAAAAAACACCGAGCTACCTATCAGGCTCTAGGTAGTTCTGAGCAGAAACTTCAATTCTTTTCTGCTCGACAAATAGCATGTCGTCTGCTTGGGAGCCGAGATTATCTTTGTCAGAAGTGCTGGCTGCCTTTTGAGGATTGTATGTGTTCAAGAGTCAAGCATTGTTCTCTATGGCATGGAGCACGGTTTTGGCTATATATGCATCCAAAGGATTTTCTGCGGCAGAACAACACAGGGAAGTTGTTGTTGCAAGTCTTTGGTGTAGAATCCACAACTTTGTGCCTCTATGGTATCTCAGAACATGAAGAAATCATGTGGAATGCATTCAAGTCAGCAGGTAAAAGTAAGGTTTGTTGCCTTTATCCCAACAAGAATGCAATTTCCAAAGGTGTTCAGGAGGCCTTCAGCTCTGAACTGTCAACTAAGCTAGAAAACCCACAGCAAATGACTGATGGAGATGGAATTCTGAATTTTGTTTTGATTGATGGTACTTGGAGCAACTCTGCTGCAATGTTCAACCGACTCAAGGAGCAAGCTATATCGGTTTGGGGAGAGGACATTCCATGCATATCATTGGCTACGGGATCCTCTGCAATGCATAAACTCAGGCCTCAACCATCATGGGACCGTACCTGCACTGCAGCAGCAGCAGCTAGCCTGCTCTTTGAGCTTCAACTCGTTCCAAAGTTCAGTTCGGTCGAATTGGAAAAACAGGGCGAAGCATTGGAAGATGCTCTGGAAGTACTGTTAGAAGCTCTCACCACTCGACGTATTCGGATGGGGCGGTCCATCACCCGTAAAGGGTGGGTTGGACTTACATTTGAAGGGATGAGTTGTCTAACTCCTTTTTCATTGGAGAAACCAAACAAACTTAAACAATACAAGCTTCCTAAACCTATAACTCCACCAGCACCAACATCAAATCCACCTCTTATGCCAACTCTACTAAAATTCATATTCGGATATGATACCTCAAAGACCGATGTAAAGGGTAGGTGTCACTAG

Coding sequence (CDS)

ATGGTCTCCACTTTAAGGGCCTTTGCTCCAACATTCAATCGCTCTCCGTCTCGCGCGTTTTCAATCAAAACCCAGATGGATTCCATCACTAAAAACCCCGACGGCCACAGTTCCAGGTGCGCTTCCACCTCTAAACCCAGAGAAGCTCCGATCACGTTGCAGGAATGGCAGGGTTGGGGCTCCACGTCTCCTGTTCCCACCATGGTTACGGAAATTATCGAGGAATTGAAGGTTTTGGAGAAAAATATTGATGCCCAAATGAGTTTTGGTGGTACTGGGGGCAAGCTTCAGGGATATTTTAAAACTCAGGAGGACAAAAAACACCGAGCTACCTATCAGGCTCTAGGTAGTTCTGAGCAGAAACTTCAATTCTTTTCTGCTCGACAAATAGCATGTCGTCTGCTTGGGAGCCGAGATTATCTTTGTCAGAAGTGCTGGCTGCCTTTTGAGGATTGTATGTGTTCAAGAGTCAAGCATTGTTCTCTATGGCATGGAGCACGGTTTTGGCTATATATGCATCCAAAGGATTTTCTGCGGCAGAACAACACAGGGAAGTTGTTGTTGCAAGTCTTTGGTGTAGAATCCACAACTTTGTGCCTCTATGGTATCTCAGAACATGAAGAAATCATGTGGAATGCATTCAAGTCAGCAGGTAAAAGTAAGGTTTGTTGCCTTTATCCCAACAAGAATGCAATTTCCAAAGGTGTTCAGGAGGCCTTCAGCTCTGAACTGTCAACTAAGCTAGAAAACCCACAGCAAATGACTGATGGAGATGGAATTCTGAATTTTGTTTTGATTGATGGTACTTGGAGCAACTCTGCTGCAATGTTCAACCGACTCAAGGAGCAAGCTATATCGGTTTGGGGAGAGGACATTCCATGCATATCATTGGCTACGGGATCCTCTGCAATGCATAAACTCAGGCCTCAACCATCATGGGACCGTACCTGCACTGCAGCAGCAGCAGCTAGCCTGCTCTTTGAGCTTCAACTCGTTCCAAAGTTCAGTTCGGTCGAATTGGAAAAACAGGGCGAAGCATTGGAAGATGCTCTGGAAGTACTGTTAGAAGCTCTCACCACTCGACGTATTCGGATGGGGCGGTCCATCACCCGTAAAGGGTGGGTTGGACTTACATTTGAAGGGATGAGTTGTCTAACTCCTTTTTCATTGGAGAAACCAAACAAACTTAAACAATACAAGCTTCCTAAACCTATAACTCCACCAGCACCAACATCAAATCCACCTCTTATGCCAACTCTACTAAAATTCATATTCGGATATGATACCTCAAAGACCGATGTAAAGGGTAGGTGTCACTAG

Protein sequence

MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREAPITLQEWQGWGSTSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMHPKDFLRQNNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAISKGVQEAFSSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIPCISLATGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEVLLEALTTRRIRMGRSITRKGWVGLTFEGMSCLTPFSLEKPNKLKQYKLPKPITPPAPTSNPPLMPTLLKFIFGYDTSKTDVKGRCH
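The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGGTCTCCACTTTAAGGGCCTTTGCTCCA"
cds_end = "GGTAGGTGTCACTAG"
print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues before the TAG stop
```

The two fragments translate to the first ten residues (MVSTLRAFAP) and the last four residues (GRCH) of the protein sequence shown above.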
Homology
BLAST of HG10001002 vs. NCBI nr
Match: XP_038901524.1 (uncharacterized protein LOC120088365 [Benincasa hispida])

HSP 1 Score: 726.1 bits (1873), Expect = 1.8e-205
Identity = 354/372 (95.16%), Positives = 362/372 (97.31%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREAPITLQEWQGWG 60
           MVSTLRAFAPTFNRSPSRAFS+KTQMDSI +NPDGHSSRCASTSK REA ITLQEWQGWG
Sbjct: 1   MVSTLRAFAPTFNRSPSRAFSVKTQMDSIIRNPDGHSSRCASTSKSREALITLQEWQGWG 60

Query: 61  STSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQALGSSEQ 120
           STSPVP MVTEIIEE+KVLEK IDAQMSFGG GGKLQGYF+TQEDKKHRATYQALGSSEQ
Sbjct: 61  STSPVPIMVTEIIEEMKVLEKTIDAQMSFGGNGGKLQGYFRTQEDKKHRATYQALGSSEQ 120

Query: 121 KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMHPKDFLRQ 180
           KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMHPKDFLRQ
Sbjct: 121 KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMHPKDFLRQ 180

Query: 181 NNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAISKGVQEAF 240
           NNTGKLLLQVFGVE+TTLCLYGISEHEEIMWN FKSAGKSKVCCLYPNKNA SKGVQEAF
Sbjct: 181 NNTGKLLLQVFGVEATTLCLYGISEHEEIMWNVFKSAGKSKVCCLYPNKNATSKGVQEAF 240

Query: 241 SSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIPCISLATG 300
           SSELSTKLE+ +QMTDGDGILNF+LIDGTWSNSAAMFNRLKEQAISVWGEDIPCISLATG
Sbjct: 241 SSELSTKLEDTEQMTDGDGILNFILIDGTWSNSAAMFNRLKEQAISVWGEDIPCISLATG 300

Query: 301 SSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEVLLEALTT 360
           SSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEVLLEALT 
Sbjct: 301 SSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEVLLEALTA 360

Query: 361 RRIRMGRSITRK 373
           RRIRMGRSITRK
Sbjct: 361 RRIRMGRSITRK 372

BLAST of HG10001002 vs. NCBI nr
Match: XP_004139941.1 (uncharacterized protein LOC101203963 isoform X1 [Cucumis sativus] >KGN46744.1 hypothetical protein Csa_021008 [Cucumis sativus])

HSP 1 Score: 704.5 bits (1817), Expect = 5.7e-199
Identity = 345/372 (92.74%), Positives = 354/372 (95.16%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREAPITLQEWQGWG 60
           MVSTLRAFAPTFNR PSRAFSIKTQMDSIT+NPDG++S  ASTSKP E PITLQEWQGWG
Sbjct: 1   MVSTLRAFAPTFNRYPSRAFSIKTQMDSITRNPDGYTSTSASTSKPTETPITLQEWQGWG 60

Query: 61  STSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQALGSSEQ 120
           STSPVPTMVTEII+ELKVLEK +DAQMSFGG GGKLQGYFKTQEDKKHRATYQALGSSEQ
Sbjct: 61  STSPVPTMVTEIIDELKVLEKTVDAQMSFGGNGGKLQGYFKTQEDKKHRATYQALGSSEQ 120

Query: 121 KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMHPKDFLRQ 180
           KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLW  ARFWLYMHPKDFLRQ
Sbjct: 121 KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWDRARFWLYMHPKDFLRQ 180

Query: 181 NNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAISKGVQEAF 240
           NNTGKLLLQVFG E+TTL LYGISEHEEIMWNAFKSAG+SKVCCLYPNKNA SKGVQEAF
Sbjct: 181 NNTGKLLLQVFGKEATTLSLYGISEHEEIMWNAFKSAGRSKVCCLYPNKNATSKGVQEAF 240

Query: 241 SSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIPCISLATG 300
            SELSTK EN QQMTDGDGILNF+LIDGTWSNSAAMFNRLKEQAI VWGEDIPCISL+TG
Sbjct: 241 GSELSTKQENTQQMTDGDGILNFILIDGTWSNSAAMFNRLKEQAILVWGEDIPCISLSTG 300

Query: 301 SSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEVLLEALTT 360
           SSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVE EKQGEALEDALEVLLEALT 
Sbjct: 301 SSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVEFEKQGEALEDALEVLLEALTA 360

Query: 361 RRIRMGRSITRK 373
           RRIRMGRSITRK
Sbjct: 361 RRIRMGRSITRK 372

BLAST of HG10001002 vs. NCBI nr
Match: XP_022140651.1 (uncharacterized protein LOC111011257 [Momordica charantia])

HSP 1 Score: 688.3 bits (1775), Expect = 4.2e-194
Identity = 336/379 (88.65%), Positives = 351/379 (92.61%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREA-------PITL 60
           M STLRAFAPTFNRSPS AFS KT MDS+ + PDGHSSRCASTS+PRE        PITL
Sbjct: 1   MFSTLRAFAPTFNRSPSLAFSTKTPMDSVVRGPDGHSSRCASTSEPRETRISSGGPPITL 60

Query: 61  QEWQGWGSTSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQ 120
           QEWQ WGS SP+PTMVTEI+EELKVLEKNIDAQMSFGG+GGKLQGYFKTQEDKKHRATYQ
Sbjct: 61  QEWQAWGSASPLPTMVTEIVEELKVLEKNIDAQMSFGGSGGKLQGYFKTQEDKKHRATYQ 120

Query: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMH 180
           ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVK CSLW GARFWLYMH
Sbjct: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKRCSLWGGARFWLYMH 180

Query: 181 PKDFLRQNNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAIS 240
           PKDFLRQNNTGKLLLQVFGVE+TTLCLYGISEHEEIMWNAFK+AGKSKVCCLYPNKNA S
Sbjct: 181 PKDFLRQNNTGKLLLQVFGVEATTLCLYGISEHEEIMWNAFKAAGKSKVCCLYPNKNATS 240

Query: 241 KGVQEAFSSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIP 300
           K VQ+AFSSELSTK E  Q+ TDGDGILNF+LIDGTWSNSAAMFNRLKEQA SVWGED+P
Sbjct: 241 KSVQDAFSSELSTKQECTQKTTDGDGILNFILIDGTWSNSAAMFNRLKEQANSVWGEDLP 300

Query: 301 CISLATGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEV 360
           CISL TGSSAMHKLRPQPSWDRTCTAAAAASLLFELQL+PKFSS+E +KQGEALEDALEV
Sbjct: 301 CISLTTGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLIPKFSSIEFDKQGEALEDALEV 360

Query: 361 LLEALTTRRIRMGRSITRK 373
           LLEALT RRIRMGRSITRK
Sbjct: 361 LLEALTARRIRMGRSITRK 379

BLAST of HG10001002 vs. NCBI nr
Match: KAG6604796.1 (hypothetical protein SDJN03_02113, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 682.2 bits (1759), Expect = 3.0e-192
Identity = 334/379 (88.13%), Positives = 349/379 (92.08%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTS-------KPREAPITL 60
           M+ST+RAFAP+FNRSPSRAFS KTQMDSIT+ PD   SRCAS S          EAPITL
Sbjct: 1   MLSTIRAFAPSFNRSPSRAFSTKTQMDSITRTPDARISRCASNSISTEARVTSEEAPITL 60

Query: 61  QEWQGWGSTSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQ 120
           QEWQGWGSTSP+PTMVTEIIEELK LEKNIDAQMSFGG GGKLQGYFKTQEDKKHRATYQ
Sbjct: 61  QEWQGWGSTSPMPTMVTEIIEELKALEKNIDAQMSFGGNGGKLQGYFKTQEDKKHRATYQ 120

Query: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMH 180
           ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHC LW GARFWLYMH
Sbjct: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCPLWSGARFWLYMH 180

Query: 181 PKDFLRQNNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAIS 240
           PKDFLRQNNTGKLLLQVFGVE+TTLCLYGISEHE+IMW+AFKSAGKSKVCCLYPNKNA S
Sbjct: 181 PKDFLRQNNTGKLLLQVFGVEATTLCLYGISEHEDIMWSAFKSAGKSKVCCLYPNKNATS 240

Query: 241 KGVQEAFSSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIP 300
           K VQEAFSSELSTK E  QQ TDGDGILNF+LIDGTWSNSAAMFNRL+EQA SVWGED+P
Sbjct: 241 KSVQEAFSSELSTKQERTQQTTDGDGILNFILIDGTWSNSAAMFNRLREQAKSVWGEDLP 300

Query: 301 CISLATGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEV 360
           CISL TGSSAMHKLRPQPSWDRTCTAAAAASLLFELQL+P+ SSV LEKQGEALED+LEV
Sbjct: 301 CISLTTGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLIPRLSSVRLEKQGEALEDSLEV 360

Query: 361 LLEALTTRRIRMGRSITRK 373
           LLEALT+RRIRMGRSITRK
Sbjct: 361 LLEALTSRRIRMGRSITRK 379

BLAST of HG10001002 vs. NCBI nr
Match: XP_022970715.1 (uncharacterized protein LOC111469615 [Cucurbita maxima] >XP_022970717.1 uncharacterized protein LOC111469615 [Cucurbita maxima])

HSP 1 Score: 678.3 bits (1749), Expect = 4.4e-191
Identity = 334/379 (88.13%), Positives = 349/379 (92.08%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREA-------PITL 60
           M+ST+RAFAP+FNRSPSRAFS KTQMDSITKNPD   SR AS S   EA       PITL
Sbjct: 1   MLSTIRAFAPSFNRSPSRAFSTKTQMDSITKNPDARISRWASNSISTEARVTSEETPITL 60

Query: 61  QEWQGWGSTSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQ 120
           QEWQGWGSTSP+PTMVTEIIEELK LEKNIDAQMSFGG GGKLQGYFKTQEDKKHRATYQ
Sbjct: 61  QEWQGWGSTSPMPTMVTEIIEELKALEKNIDAQMSFGGNGGKLQGYFKTQEDKKHRATYQ 120

Query: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMH 180
           ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHC LW GARFWLYMH
Sbjct: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCPLWSGARFWLYMH 180

Query: 181 PKDFLRQNNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAIS 240
           PKDFLRQNNTGKLLLQVFGVE+TTLCLYGISEHE+IMW+AFKSAGKSKVCCLYPNKNA S
Sbjct: 181 PKDFLRQNNTGKLLLQVFGVEATTLCLYGISEHEDIMWSAFKSAGKSKVCCLYPNKNATS 240

Query: 241 KGVQEAFSSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIP 300
           K VQ+AFSSELSTK E  QQ TDGDGILNF+LIDGTWSNSAAMFNRL+EQA SVWGED+P
Sbjct: 241 KSVQDAFSSELSTKQERTQQTTDGDGILNFILIDGTWSNSAAMFNRLREQAKSVWGEDLP 300

Query: 301 CISLATGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEV 360
           CISL TGSSAMHKLRPQPSWDRTCTAAAAASLLFELQL+P+ SSV LEKQGEALED+LEV
Sbjct: 301 CISLTTGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLIPRLSSVGLEKQGEALEDSLEV 360

Query: 361 LLEALTTRRIRMGRSITRK 373
           LLEALT+RRIRMGRSITRK
Sbjct: 361 LLEALTSRRIRMGRSITRK 379

BLAST of HG10001002 vs. ExPASy TrEMBL
Match: A0A0A0KD21 (tRNA-uridine aminocarboxypropyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G128540 PE=4 SV=1)

HSP 1 Score: 704.5 bits (1817), Expect = 2.7e-199
Identity = 345/372 (92.74%), Positives = 354/372 (95.16%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREAPITLQEWQGWG 60
           MVSTLRAFAPTFNR PSRAFSIKTQMDSIT+NPDG++S  ASTSKP E PITLQEWQGWG
Sbjct: 1   MVSTLRAFAPTFNRYPSRAFSIKTQMDSITRNPDGYTSTSASTSKPTETPITLQEWQGWG 60

Query: 61  STSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQALGSSEQ 120
           STSPVPTMVTEII+ELKVLEK +DAQMSFGG GGKLQGYFKTQEDKKHRATYQALGSSEQ
Sbjct: 61  STSPVPTMVTEIIDELKVLEKTVDAQMSFGGNGGKLQGYFKTQEDKKHRATYQALGSSEQ 120

Query: 121 KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMHPKDFLRQ 180
           KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLW  ARFWLYMHPKDFLRQ
Sbjct: 121 KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWDRARFWLYMHPKDFLRQ 180

Query: 181 NNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAISKGVQEAF 240
           NNTGKLLLQVFG E+TTL LYGISEHEEIMWNAFKSAG+SKVCCLYPNKNA SKGVQEAF
Sbjct: 181 NNTGKLLLQVFGKEATTLSLYGISEHEEIMWNAFKSAGRSKVCCLYPNKNATSKGVQEAF 240

Query: 241 SSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIPCISLATG 300
            SELSTK EN QQMTDGDGILNF+LIDGTWSNSAAMFNRLKEQAI VWGEDIPCISL+TG
Sbjct: 241 GSELSTKQENTQQMTDGDGILNFILIDGTWSNSAAMFNRLKEQAILVWGEDIPCISLSTG 300

Query: 301 SSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEVLLEALTT 360
           SSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVE EKQGEALEDALEVLLEALT 
Sbjct: 301 SSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVEFEKQGEALEDALEVLLEALTA 360

Query: 361 RRIRMGRSITRK 373
           RRIRMGRSITRK
Sbjct: 361 RRIRMGRSITRK 372

BLAST of HG10001002 vs. ExPASy TrEMBL
Match: A0A6J1CGP4 (tRNA-uridine aminocarboxypropyltransferase OS=Momordica charantia OX=3673 GN=LOC111011257 PE=4 SV=1)

HSP 1 Score: 688.3 bits (1775), Expect = 2.0e-194
Identity = 336/379 (88.65%), Positives = 351/379 (92.61%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREA-------PITL 60
           M STLRAFAPTFNRSPS AFS KT MDS+ + PDGHSSRCASTS+PRE        PITL
Sbjct: 1   MFSTLRAFAPTFNRSPSLAFSTKTPMDSVVRGPDGHSSRCASTSEPRETRISSGGPPITL 60

Query: 61  QEWQGWGSTSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQ 120
           QEWQ WGS SP+PTMVTEI+EELKVLEKNIDAQMSFGG+GGKLQGYFKTQEDKKHRATYQ
Sbjct: 61  QEWQAWGSASPLPTMVTEIVEELKVLEKNIDAQMSFGGSGGKLQGYFKTQEDKKHRATYQ 120

Query: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMH 180
           ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVK CSLW GARFWLYMH
Sbjct: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKRCSLWGGARFWLYMH 180

Query: 181 PKDFLRQNNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAIS 240
           PKDFLRQNNTGKLLLQVFGVE+TTLCLYGISEHEEIMWNAFK+AGKSKVCCLYPNKNA S
Sbjct: 181 PKDFLRQNNTGKLLLQVFGVEATTLCLYGISEHEEIMWNAFKAAGKSKVCCLYPNKNATS 240

Query: 241 KGVQEAFSSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIP 300
           K VQ+AFSSELSTK E  Q+ TDGDGILNF+LIDGTWSNSAAMFNRLKEQA SVWGED+P
Sbjct: 241 KSVQDAFSSELSTKQECTQKTTDGDGILNFILIDGTWSNSAAMFNRLKEQANSVWGEDLP 300

Query: 301 CISLATGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEV 360
           CISL TGSSAMHKLRPQPSWDRTCTAAAAASLLFELQL+PKFSS+E +KQGEALEDALEV
Sbjct: 301 CISLTTGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLIPKFSSIEFDKQGEALEDALEV 360

Query: 361 LLEALTTRRIRMGRSITRK 373
           LLEALT RRIRMGRSITRK
Sbjct: 361 LLEALTARRIRMGRSITRK 379

BLAST of HG10001002 vs. ExPASy TrEMBL
Match: A0A6J1I3N8 (tRNA-uridine aminocarboxypropyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111469615 PE=4 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 2.1e-191
Identity = 334/379 (88.13%), Positives = 349/379 (92.08%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREA-------PITL 60
           M+ST+RAFAP+FNRSPSRAFS KTQMDSITKNPD   SR AS S   EA       PITL
Sbjct: 1   MLSTIRAFAPSFNRSPSRAFSTKTQMDSITKNPDARISRWASNSISTEARVTSEETPITL 60

Query: 61  QEWQGWGSTSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQ 120
           QEWQGWGSTSP+PTMVTEIIEELK LEKNIDAQMSFGG GGKLQGYFKTQEDKKHRATYQ
Sbjct: 61  QEWQGWGSTSPMPTMVTEIIEELKALEKNIDAQMSFGGNGGKLQGYFKTQEDKKHRATYQ 120

Query: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMH 180
           ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHC LW GARFWLYMH
Sbjct: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCPLWSGARFWLYMH 180

Query: 181 PKDFLRQNNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAIS 240
           PKDFLRQNNTGKLLLQVFGVE+TTLCLYGISEHE+IMW+AFKSAGKSKVCCLYPNKNA S
Sbjct: 181 PKDFLRQNNTGKLLLQVFGVEATTLCLYGISEHEDIMWSAFKSAGKSKVCCLYPNKNATS 240

Query: 241 KGVQEAFSSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIP 300
           K VQ+AFSSELSTK E  QQ TDGDGILNF+LIDGTWSNSAAMFNRL+EQA SVWGED+P
Sbjct: 241 KSVQDAFSSELSTKQERTQQTTDGDGILNFILIDGTWSNSAAMFNRLREQAKSVWGEDLP 300

Query: 301 CISLATGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEV 360
           CISL TGSSAMHKLRPQPSWDRTCTAAAAASLLFELQL+P+ SSV LEKQGEALED+LEV
Sbjct: 301 CISLTTGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLIPRLSSVGLEKQGEALEDSLEV 360

Query: 361 LLEALTTRRIRMGRSITRK 373
           LLEALT+RRIRMGRSITRK
Sbjct: 361 LLEALTSRRIRMGRSITRK 379

BLAST of HG10001002 vs. ExPASy TrEMBL
Match: A0A6J1G763 (tRNA-uridine aminocarboxypropyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111451439 PE=4 SV=1)

HSP 1 Score: 676.4 bits (1744), Expect = 8.0e-191
Identity = 332/379 (87.60%), Positives = 347/379 (91.56%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTS-------KPREAPITL 60
           M+ST+RAFAP+FNRSPSRAFS KTQMDSITKNPD   SRCAS S         +E PITL
Sbjct: 1   MLSTIRAFAPSFNRSPSRAFSTKTQMDSITKNPDARISRCASNSISTESRGTSQETPITL 60

Query: 61  QEWQGWGSTSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQ 120
           QEWQGWGSTSP+PTMVTEIIEELK LEKNIDAQMSFGG GGKLQGYFKTQEDKKHRATYQ
Sbjct: 61  QEWQGWGSTSPMPTMVTEIIEELKALEKNIDAQMSFGGNGGKLQGYFKTQEDKKHRATYQ 120

Query: 121 ALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMH 180
           ALGSSEQKL FFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVK C LW GARFWLYMH
Sbjct: 121 ALGSSEQKLHFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKQCPLWSGARFWLYMH 180

Query: 181 PKDFLRQNNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAIS 240
           PKDFLRQNNTGKLLLQVFGVE+TTLCLYGISEHE+IMW+AFKSAGKSKVCCLYPNKNA S
Sbjct: 181 PKDFLRQNNTGKLLLQVFGVEATTLCLYGISEHEDIMWSAFKSAGKSKVCCLYPNKNATS 240

Query: 241 KGVQEAFSSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAISVWGEDIP 300
           K VQEAF SELSTK E  QQ TDGDGILNF+LIDGTWSNSAAMFNRL+EQA SVWGED+P
Sbjct: 241 KSVQEAFISELSTKQERTQQTTDGDGILNFILIDGTWSNSAAMFNRLREQAKSVWGEDLP 300

Query: 301 CISLATGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGEALEDALEV 360
           CISL TGSSAMHKLRPQPSWDRTCTAAAAASLLFELQL+P+ SSV LEKQGEALED+LEV
Sbjct: 301 CISLTTGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLIPRLSSVGLEKQGEALEDSLEV 360

Query: 361 LLEALTTRRIRMGRSITRK 373
           LLEALT+RRIRMGRSITRK
Sbjct: 361 LLEALTSRRIRMGRSITRK 379

BLAST of HG10001002 vs. ExPASy TrEMBL
Match: A0A5A7V8P4 (tRNA-uridine aminocarboxypropyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold89G004010 PE=4 SV=1)

HSP 1 Score: 595.1 bits (1533), Expect = 2.3e-166
Identity = 335/584 (57.36%), Positives = 348/584 (59.59%), Query Frame = 0

Query: 1   MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREAPITLQEWQGWG 60
           MVSTLRAFAPTFNR PSRAFSIKTQM SIT+NPD HS    STSKP E PITL+EWQGWG
Sbjct: 2   MVSTLRAFAPTFNRYPSRAFSIKTQMGSITRNPDAHS----STSKPTETPITLREWQGWG 61

Query: 61  STSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKKHRATYQALGSSEQ 120
           STSPVPTMVTEII+ELKVLEKN+DAQMSFGG GGKLQGYFKTQEDKKHRATY+ALGSSEQ
Sbjct: 62  STSPVPTMVTEIIDELKVLEKNVDAQMSFGGNGGKLQGYFKTQEDKKHRATYKALGSSEQ 121

Query: 121 KLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGARFWLYMHPKDFLRQ 180
           KLQFFSARQIACRLLGSRDYLCQKCWLP EDCMCSRVKH SLW GARFWLYMHPKDFLRQ
Sbjct: 122 KLQFFSARQIACRLLGSRDYLCQKCWLPVEDCMCSRVKHYSLWDGARFWLYMHPKDFLRQ 181

Query: 181 NNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYPNKNAISKGVQEAF 240
           NNTGKLLLQVFGVE+TTL LYGI+EHEEIMWNAFK AGKSKVCCLYPNKNA SKG+QEAF
Sbjct: 182 NNTGKLLLQVFGVEATTLSLYGIAEHEEIMWNAFKLAGKSKVCCLYPNKNATSKGIQEAF 241

Query: 241 SSELSTKLENPQQM---------------------------------------------- 300
            SELSTK EN QQ+                                              
Sbjct: 242 GSELSTKQENTQQLTSVRPPITIIWQYGRRRSKGSACVMGSIVAKAGTHCTSNGALNSFV 301

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 302 TPALLDMAVIREEVENGYLRTYKRLTGELYWEGMKGRCKEIFPQLNRRSAYHPQLDGQIE 361

Query: 361 ------------------------------------------------------------ 373
                                                                       
Sbjct: 362 VVNQSVEAYLHCFYGERPKERERNVTLRVLKEHLRVAQEKMKKSADNGACGVFRIGPVAY 421

BLAST of HG10001002 vs. TAIR 10
Match: AT1G03687.1 (DTW domain-containing protein )

HSP 1 Score: 445.7 bits (1145), Expect = 4.4e-125
Identity = 214/328 (65.24%), Positives = 260/328 (79.27%), Query Frame = 0

Query: 48  EAPITLQEWQGWGSTSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKK 107
           E  I+++EW+ WG  SP P+ V +I+++LKVLE  +D+ + FGG GGKLQG F   EDKK
Sbjct: 49  EGIISVEEWRKWGPVSPFPSAVKQIVDDLKVLECKLDSPIDFGGNGGKLQGPFGAYEDKK 108

Query: 108 HRATYQALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGAR 167
           HRATY+AL   E+K +FFSARQ+ACRLLGSR YLCQKCWL  EDCMCS VK C LW   R
Sbjct: 109 HRATYEALDDPEKKFRFFSARQVACRLLGSRGYLCQKCWLAMEDCMCSYVKPCGLWKRIR 168

Query: 168 FWLYMHPKDFLRQNNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYP 227
           FWLYMHP+DFLRQNNTGKLL Q+FGV+S TLC++GI+E EEIMWN FK AGKS+V CLYP
Sbjct: 169 FWLYMHPRDFLRQNNTGKLLWQIFGVQSATLCVFGIAEDEEIMWNEFKRAGKSQVRCLYP 228

Query: 228 NKNA-ISKGVQEAFSSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAIS 287
           N N+ ++  V++AF S  S    +    TD D  L+F+L+DGTW+NSAAM  RLK+ A S
Sbjct: 229 NHNSEVTFSVKDAFGSSASE--NHVSSTTDEDKTLHFILLDGTWNNSAAMLKRLKDHAKS 288

Query: 288 VWG-EDIPCISLATGSSAMHKLRPQPSWDRTCTAAAAASLLFELQLVPKFSSVELEKQGE 347
           VWG ED+PCISLATG+SAMHKLRPQPSWDRTCTAAAA  LL EL L+P+ SS EL+KQ +
Sbjct: 289 VWGDEDLPCISLATGASAMHKLRPQPSWDRTCTAAAAIGLLSELSLLPQLSSYELDKQAD 348

Query: 348 ALEDALEVLLEALTTRRIRMGRSITRKG 374
           A+E+AL +LL++LT RR+RMGRSITRKG
Sbjct: 349 AVEEALVILLDSLTGRRLRMGRSITRKG 374

BLAST of HG10001002 vs. TAIR 10
Match: AT1G03687.2 (DTW domain-containing protein )

HSP 1 Score: 355.1 bits (910), Expect = 7.9e-98
Identity = 168/263 (63.88%), Positives = 204/263 (77.57%), Query Frame = 0

Query: 48  EAPITLQEWQGWGSTSPVPTMVTEIIEELKVLEKNIDAQMSFGGTGGKLQGYFKTQEDKK 107
           E  I+++EW+ WG  SP P+ V +I+++LKVLE  +D+ + FGG GGKLQG F   EDKK
Sbjct: 49  EGIISVEEWRKWGPVSPFPSAVKQIVDDLKVLECKLDSPIDFGGNGGKLQGPFGAYEDKK 108

Query: 108 HRATYQALGSSEQKLQFFSARQIACRLLGSRDYLCQKCWLPFEDCMCSRVKHCSLWHGAR 167
           HRATY+AL   E+K +FFSARQ+ACRLLGSR YLCQKCWL  EDCMCS VK C LW   R
Sbjct: 109 HRATYEALDDPEKKFRFFSARQVACRLLGSRGYLCQKCWLAMEDCMCSYVKPCGLWKRIR 168

Query: 168 FWLYMHPKDFLRQNNTGKLLLQVFGVESTTLCLYGISEHEEIMWNAFKSAGKSKVCCLYP 227
           FWLYMHP+DFLRQNNTGKLL Q+FGV+S TLC++GI+E EEIMWN FK AGKS+V CLYP
Sbjct: 169 FWLYMHPRDFLRQNNTGKLLWQIFGVQSATLCVFGIAEDEEIMWNEFKRAGKSQVRCLYP 228

Query: 228 NKNA-ISKGVQEAFSSELSTKLENPQQMTDGDGILNFVLIDGTWSNSAAMFNRLKEQAIS 287
           N N+ ++  V++AF S  S    +    TD D  L+F+L+DGTW+NSAAM  RLK+ A S
Sbjct: 229 NHNSEVTFSVKDAFGSSASE--NHVSSTTDEDKTLHFILLDGTWNNSAAMLKRLKDHAKS 288

Query: 288 VWG-EDIPCISLATGSSAMHKLR 309
           VWG ED+PCISLATG+SAMHKLR
Sbjct: 289 VWGDEDLPCISLATGASAMHKLR 309
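
The percentages in each HSP header are simply the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the first NCBI nr hit (354 identical and 362 similar positions out of 372 aligned); the helper name `percent` is an assumption for illustration:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned positions that match, as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts reported for XP_038901524.1 (Benincasa hispida)
identity = percent(354, 372)
positives = percent(362, 372)
print(f"Identity = {identity}%, Positives = {positives}%")
```

The same calculation reproduces the figures for the other hits, for example 214/328 for AT1G03687.1.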

The following BLAST results are available for this feature:

BLAST of HG10001002 vs. NCBI nr
Match Name       E-value     Identity (%)   Description
XP_038901524.1   1.8e-205    95.16          uncharacterized protein LOC120088365 [Benincasa hispida]
XP_004139941.1   5.7e-199    92.74          uncharacterized protein LOC101203963 isoform X1 [Cucumis sativus] >KGN46744.1 hy...
XP_022140651.1   4.2e-194    88.65          uncharacterized protein LOC111011257 [Momordica charantia]
KAG6604796.1     3.0e-192    88.13          hypothetical protein SDJN03_02113, partial [Cucurbita argyrosperma subsp. sorori...
XP_022970715.1   4.4e-191    88.13          uncharacterized protein LOC111469615 [Cucurbita maxima] >XP_022970717.1 uncharac...

BLAST of HG10001002 vs. ExPASy TrEMBL
Match Name       E-value     Identity (%)   Description
A0A0A0KD21       2.7e-199    92.74          tRNA-uridine aminocarboxypropyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G1...
A0A6J1CGP4       2.0e-194    88.65          tRNA-uridine aminocarboxypropyltransferase OS=Momordica charantia OX=3673 GN=LOC...
A0A6J1I3N8       2.1e-191    88.13          tRNA-uridine aminocarboxypropyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111...
A0A6J1G763       8.0e-191    87.60          tRNA-uridine aminocarboxypropyltransferase OS=Cucurbita moschata OX=3662 GN=LOC1...
A0A5A7V8P4       2.3e-166    57.36          tRNA-uridine aminocarboxypropyltransferase OS=Cucumis melo var. makuwa OX=119469...

BLAST of HG10001002 vs. TAIR 10
Match Name       E-value     Identity (%)   Description
AT1G03687.1      4.4e-125    65.24          DTW domain-containing protein
AT1G03687.2      7.9e-98     63.88          DTW domain-containing protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term    IPR Description                              Source        Source Term     Source Description              Alignment
None        No IPR available                             COILS         Coil            Coil                            coord: 337..357
None        No IPR available                             MOBIDB_LITE   mobidb-lite     disorder_prediction             coord: 28..47
None        No IPR available                             PANTHER       PTHR21392:SF4   DTW DOMAIN-CONTAINING PROTEIN   coord: 51..372
IPR005636   DTW                                          SMART         SM01144         DTW_2a                          coord: 138..367; e-value: 4.3E-32; score: 122.6
IPR005636   DTW                                          PFAM          PF03942         DTW                             coord: 140..358; e-value: 8.6E-42; score: 143.0
IPR039262   DTW domain-containing protein DTWD2/YfiP     PANTHER       PTHR21392       UNCHARACTERIZED                 coord: 51..372
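
The `coord` values in the Alignment column are residue positions on the protein sequence. The sketch below shows one way to pull out the annotated region, assuming the positions are 1-based and inclusive (not stated on the page). The function `extract_region` is hypothetical, and only the first 60 residues of the protein are used so the example stays short.

```python
import re


def extract_region(protein: str, coord: str) -> str:
    """Return the residues covered by a 'coord: start..end' annotation.

    Assumes 1-based inclusive positions, which is an assumption, not
    something the source page documents.
    """
    match = re.fullmatch(r"coord:\s*(\d+)\.\.(\d+)", coord.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {coord!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the sequence")
    return protein[start - 1:end]


# First 60 residues of the HG10001002 protein sequence.
protein_prefix = "MVSTLRAFAPTFNRSPSRAFSIKTQMDSITKNPDGHSSRCASTSKPREAPITLQEWQGWG"

# MOBIDB_LITE disorder prediction, coord: 28..47
disordered = extract_region(protein_prefix, "coord: 28..47")
print(disordered, len(disordered))
```

Applied to the full protein, the same call with `coord: 140..358` would return the 219-residue span assigned to the Pfam DTW domain (PF03942).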

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
HG10001002.1    HG10001002.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008033 tRNA processing
molecular_function GO:0016432 tRNA-uridine aminocarboxypropyltransferase activity