Gene/Protein Disease Symptom Drug Enzyme Compound
Pivot Concepts:   Target Concepts:
Query: UNIPROT:P06889 (Mol)
630,302 document(s) hit in 31,850,051 MEDLINE articles (0.00 seconds)

DNA comprising 219 447 bp was sequenced in nine cosmids and verified at > 99.9% precision. Of the standard repetitive elements, 187 Alus make up 20.6% of the sequence, but there were only 27 MERs (2.9%) and 17 L1 fragments (1.6%). This may be characteristic of such high GC (57%) regions. The sequence also includes an 11.3 kb tract duplicated with 99.2% identity at a distance of 38 kb. The region is 80-90% transcribed and 12.5% translated. Thirteen known genes and their exon-intron borders are all accurately predicted at least in part by GRAIL programs, as are six additional genes. From centromere to telomere, the orientation of transcription varies among the first eight genes, then runs centromeric to telomeric for the next five, and is in the opposite sense for the last six. Eighteen of the 19 genes are associated with CpG islands. Two islands are exact copies in the 11.3 kb repeat units, and could thus give rise to double dosage levels of an X-linked gene. Another island is associated with two genes transcribed in opposite directions. From the sequence data, three genes and their exon structure are inferred. One of them, previously associated with HEX2, is shown to be a different gene unrelated to hexokinases; a second gene, previously known by an EST, is plexin, from its 65.5% identity with the Xenopus analog; and a third is a subunit of a vacuolar H-ATPase, and is named VATPS1.
Hum Mol Genet 1996 May
PMID:Long-range sequence analysis in Xq28: thirteen known and six candidate genes in 219.4 kb of high GC DNA between the RCP/GCP and G6PD loci. 873 35

EST 221 derived from human adult testis detects homology to the Drosophila fat facets gene (fat) and has related sequences on both the X and Y chromosomes mapping to Xp11.4 and Yq11.2 respectively. These two loci have been termed DFFRX and DFFRY for Drosophila fat facets related X and Y. The major transcript detected by EST 221 is-8 kb in size and is expressed widely in a range of 16 human adult tissues. RT-PCR analysis of 13 different human embryonic tissues with primers specific for the X and Y sequences demonstrates that both loci are expressed in developing tissues and quantitative RT-PCR of lymphoblastoid cell lines carrying different numbers of X chromosomes reveals that the X-linked gene escapes X-inactivation. The amino acid sequence (2547 residues) of the complete open reading frame of the X gene has 44% identity and 88% similarity to the Drosophila sequence and contains the conserved Cys and His domains characteristic of deubiquitinating enzymes, suggesting its biochemical function may be the hydrolysis of ubiquitin from protein-ubiquitin conjugates. The requirement of faf for normal oocyte development in Drosophila combined with the map location and escape from X-inactivation of DFFRX raises the possibility that the human homologue plays a role in the defects of oocyte proliferation and subsequent gonadal degeneration found in Turner syndrome.
Hum Mol Genet 1996 Nov
PMID:The Drosophila developmental gene fat facets has a human homologue in Xp11.4 which escapes X-inactivation and has related sequences on Yq11.2. 892 96

Analysis of DNA sequences from the 5' end of 239 directionally cloned Toxoplasma gondii RH strain tachyzoite-derived cDNAs revealed significant similarity to several classes of genes/proteins including 24 ribosomal proteins, five metabolic enzymes, four cell-cycle regulators and 15 previously cloned T. gondii genes. The remaining sequences with no significant match include several which were recovered more than once. The variety and redundancy of expressed sequence tags (ESTs GenBank accession numbers T62239-T62475) in this sample suggest that the tachyzoite cDNA library reflects tachyzoite gene expression. A large scale EST effort should uncover many new genes and provide a wealth of information about genes involved with the growth and proliferation of tachyzoites.
Mol Biochem Parasitol 1996 Jan
PMID:Toxoplasma gondii expressed sequence tags: insight into tachyzoite gene expression. 899 16

The Zn2+-finger DNA-binding domain has been identified in several developmental control proteins, transcription factors and gene products associated with diseases, as well as in several RNA-binding proteins. We applied library screening, expressed sequence tagging (EST sequencing), Zn2+-binding assays and Northern blot hybridization, in order to characterize novel cDNA clones of the human cardiovascular system which contain Zn2+-finger motifs. An embryonic (8-10 weeks gestation) heart lambda ZAP Express cDNA library was screened with an oligonucleotide probe deduced from a consensus amino acid sequence which is highly conserved for Zn2+-finger proteins, and approximately 350 positive clones were isolated from 1 x 10(4) plaque-forming units (pfu) initially plated. The isolated clones were classified as known and novel following single pass automated DNA sequencing. Analysis of Northern blot hybridization delineated the tissue specificity of these clones, as well as their association with cardiac growth and development. Existence of Zn2+-finger motifs in the novel clones was confirmed by Zn2+-binding assay. In this report, we present the characterization of eight novel clones, including the complete cDNA sequences of one of these clones (HHZ-123).
J Mol Cell Cardiol 1997 Jan
PMID:Identification of genes encoding zinc finger motifs in the cardiovascular system. 904 43

The Arabidopsis gene Terminal Flower 1 (TFL1) controls inflorescence meristem identity. A terminal flower (tfl1) mutant, which develops a terminal flower at the apex of the inflorescence, was induced by transformation with T-DNA. Using a plant DNA fragment flanking the integrated T-DNA as a probe, a clone was selected from a wild-type genomic library. Comparative sequence analysis of this clone with an EST clone (129D7T7) suggested the existence of a gene encoding a protein similar to that encoded by the cen gene which controls inflorescence meristem identity in Antirrhinum. Nucleotide sequences of the region homologous to this putative TFL1 gene were compared between five chemically induced tfl1 mutants and their parental wild-type ecotypes. Every mutant was found to have a nucleotide substitution which could be responsible for the tfl1 phenotype. This result confirmed that the cloned gene is TFL1 itself. In our tfl1 mutant, no nucleotide substitution was found in the transcribed region of the gene, and the T-DNA-insertion site was located at 458 bp downstream of the putative polyadenylation signal, suggesting that an element important for expression of the TFL1 gene exists in this area.
Mol Gen Genet 1997 Mar 26
PMID:Cloning and molecular analysis of the Arabidopsis gene Terminal Flower 1. 910 81

cDNA clones encoding two new Arabidopsis thaliana peroxidases, ATP 1a and ATP 2a, have been identified by searching the Arabidopsis database of expressed sequence tags (dbEST). They represent a novel branch of hitherto uncharacterized plant peroxidases which is only 35% identical in amino acid sequence to the well characterized group of basic plant peroxidases represented by the horseradish (Armoracia rusticana) isoperoxidases HRP C, HRP E5 and the similar Arabidopsis isoperoxidases ATP Ca, ATP Cb, and ATP Ea. However ATP 1a is 87% identical in amino acid sequence to a peroxidase encoded by an mRNA isolated from cotton (Gossypium hirsutum). As cotton and Arabidopsis belong to rather diverse families (Malvaceae and Crucifereae, respectively), in contrast with Arabidopsis and horseradish (both Crucifereae), the high degree of sequence identity indicates that this novel type of peroxidase, albeit of unknown function, is likely to be widespread in plant species. The atp 1 and atp 2 types of cDNA sequences were the most redundant among the 28 different isoperoxidases identified among about 200 peroxidase encoding ESTs. Interestingly, 8 out of totally 38 EST sequences coding for ATP 1 showed three identical nucleotide substitutions. This variant form is designated ATP 1b. Similarly, six out of totally 16 EST sequences coding for ATP 2 showed a number of deletions and nucleotide changes. This variant form is designated ATP 2b. The selected EST clones are full-length and contain coding regions of 993 nucleotides for atp 1a, and 984 nucleotides for atp 2a. These regions show 61% DNA sequence identity. The predicted mature proteins ATP 1a, and ATP 2a are 57% identical in sequence and contain the structurally and functionally important residues, characteristic of the plant peroxidase superfamily. However, they do show two differences of importance to peroxidase catalysis: (1) the asparagine residue linked with the active site distal histidine via hydrogen bonding is absent; (2) an N-glycosylation site is located right at the entrance to the heme channel. The reverse transcriptase polymerase chain reaction (RT-PCR) was used to identify mRNAs coding for ATP 1a/b and ATP 2a/b in germinating seeds, seedlings, roots, leaves, stems, flowers and cell suspension culture using elongation factor 1alpha (EF-1alpha) for the first time as a positive control. Both mRNAs were transcribed at levels comparable to EF-1alpha in all plant tissues investigated which were more than two days old, and in cell suspension culture. In addition, the mRNA coding for ATP 1a/b was found in two day old germinating seeds. The abundant transcription of ATP 1a/b and ATP 2a/b is in line with their many entries in dbEST, and indicates essential roles for these novel peroxidases.
Plant Mol Biol 1997 Mar
PMID:Sequence and RT-PCR expression analysis of two peroxidases from Arabidopsis thaliana belonging to a novel evolutionary branch of plant peroxidases. 913 61

Translation in eukaryotes does not always start at the first AUG in an mRNA, implying that context information also plays a role. This makes prediction of translation initiation sites a non-trivial task, especially when analysing EST and genome data where the entire mature mRNA sequence is not known. In this paper, we employ artificial neural networks to predict which AUG triplet in an mRNA sequence is the start codon. The trained networks correctly classified 88% of Arabidopsis and 85% of vertebrate AUG triplets. We find that our trained neural networks use a combination of local start codon context and global sequence information. Furthermore, analysis of false predictions shows that AUGs in frame with the actual start codon are more frequently selected than out-of-frame AUGs, suggesting that our networks use reading frame detection. A number of conflicts between neural network predictions and database annotations are analysed in detail, leading to identification of possible database errors.
Proc Int Conf Intell Syst Mol Biol 1997
PMID:Neural network prediction of translation initiation sites in eukaryotes: perspectives for EST and genome analysis. 932 41

Sulfation is an important conjugation reaction for a wide range of endogenous and exogenous compounds in humans, including steroids, bile acids, catecholamine neurotransmitters and thyroid hormones. The cDNA for a distinct human cytosolic sulfotransferase (ST), hST1B2, has been isolated from a human liver lambdaZap cDNA library. The hST1B2 cDNA consists of 1144 bp and contains the coding region for a novel human cytosolic ST that has been termed hST1B2 on the basis of its sequence similarity to a rat sulfotransferase, ST1B1. The hST1B2 cDNA contains an 888-bp open reading frame that encodes a 296-amino acid protein with a calculated molecular mass of 34,897 Da. The hST1B2 cDNA also has a 127-bp 5' untranslated region (UTR) and a 129-bp 3'-UTR, including a 22-bp poly(A)+ tract. The amino acid sequence of hST1B2 is 74%, 53%, 53%, 52%, 56%, and 34% identical to the amino acid sequences of rat ST1B1 and human P-PST-1, P-PST-2, M-PST, EST, and DHEA-ST, respectively. Enzymatically active hST1B2 was expressed in the bacterial expression vector pKK233-2 for kinetic characterization and in the bacterial expression vector pQE-31, which generates a histidine-tagged fusion protein for the generation of antibodies. Expressed hST1B2 sulfates small phenols such as 1-naphthol and p-nitrophenol and thyroid hormones, including 3,3'-diiodothyronine, triiodothyronine, reverse triiodothyronine, and thyroxine. No activity was detected when several steroids or dopamine were tested as substrates. High levels of hST1B2 message were detected by Northern blot analysis in RNA isolated from human liver, colon, small intestine, and blood leukocytes. Immunoblot analysis detected a protein with the same mass as expressed hST1B2 in several human tissues that also possessed hST1B2 message. These results indicate that a novel cytosolic ST is present in human tissues, which may have an important role in thyroid hormone and xenobiotic metabolism.
Mol Pharmacol 1998 Feb
PMID:Expression and characterization of a novel thyroid hormone-sulfating form of cytosolic sulfotransferase from human liver. 946 86

Two cDNA clones which appear to encode different subunits of NAD(+)-dependent isocitrate dehydrogenase (IDH; EC 1.1.1.41) were identified by homology searches from the Arabidopsis EST database. These cDNA clones were obtained and sequenced; both encoded full-length messages and displayed 82.7% nucleotide sequence identity over the coding region. The deduced amino acid sequences revealed preprotein lengths of 367 residues, with an amino acid identity of 86.1%. Genomic Southern blot analysis showed distinct single-copy genes for both IDH subunits. Both IDH subunits were expressed as recombinant proteins in Escherichia coli, and polyclonal antibodies were raised to each subunit. The Arabidopsis cDNA clones were expressed in Saccharomyces cerevisiae mutants which were deficient in either one or both of the yeast NAD(+)-dependent IDH subunits. The Arabidopsis cDNA clones failed to complement the yeast mutations; although both IDH-I and IDH-II were expressed at detectable levels, neither protein was imported into the mitochondria.
Plant Mol Biol 1998 Mar
PMID:NAD(+)-dependent isocitrate dehydrogenase from Arabidopsis thaliana. Characterization of two closely related subunits. 952 1

We have applied exon amplification, GRAIL2 exon prediction and EST database searching to a 2 Mb segment of chromosome 4p16.3. Experimental and computational methods of identifying exons were comparable in efficiency and apparent false positive rate, but were complementary in gene identification, revealing distinct overlapping sets of expressed sequences. EST searching was most powerful when we considered only those ESTs that show evidence of splicing relative to the genomic sequence. The combination of the three gene finding methods produced a transcription map of 30 loci in this segment of 4p16.3 that includes known human genes, homologs of loci identified in rodents and several anonymous transcripts, including a putative novel DNA polymerase and a gene related to Drosophila ash1. While most of the genes in the region have been found, our data suggest that even with the entire DNA sequence available, complete saturation of the transcript map will require additional, focused experimental effort.
Somat Cell Mol Genet 1997 Nov
PMID:Exon trapping and sequence-based methods of gene finding in transcript mapping of human 4p16.3. 966 4


<< Previous 1 2 3 4 5 6 7 8 9 10 Next >>