Gene/Protein Disease Symptom Drug Enzyme Compound
Pivot Concepts:   Target Concepts:
Query: UMLS:C0003873 (rheumatoid arthritis)
53,068 document(s) hit in 31,850,051 MEDLINE articles (0.00 seconds)

The influence of certain alleles of the HLA-DRB1 locus on risk for rheumatoid arthritis has been well established through linkage and association studies. In addition, other loci in the HLA region on 6p21 may also affect an individual's risk profile. Here, we used a method to detect excess identity-by-descent sharing between affected sib pairs conditional on the observed genotypes at the hypothesized causal locus to test for the presence of additional arthritis risk loci in the linked region. We used affected sib pairs from two different studies. Because the test depends heavily on specifying accurate allele frequency estimates at the proposed causal locus, we used HLA-DRB1 allele frequency estimates from a large, population-based sample. We also discuss an alternate form of the test in which we could condition on parental genotypes, thereby eliminating the need for actual allele frequencies. The test showed no evidence for the presence of additional arthritis risk loci in the region in the British or North American samples made available for Genetic Analysis Workshop 15. Given the prior knowledge that there likely are arthritis risk loci other than HLA-DRB1 in the region, it appears the tests may have inadequate power to detect the presence of these loci in certain cases.
BMC Proc 2007
PMID:No evidence for multiple loci affecting rheumatoid arthritis risk on chromosome 6p21. 1846 41

Using parametric and nonparametric techniques, our study investigated the presence of single locus and pairwise effects between 20 markers of the Genetic Analysis Workshop 15 (GAW15) North American Rheumatoid Arthritis Consortium (NARAC) candidate gene data set (Problem 2), analyzing 463 independent patients and 855 controls. Specifically, our work examined the correspondence between logistic regression (LR) analysis of single-locus and pairwise interaction effects, and random forest (RF) single and joint importance measures. For this comparison, we selected small but stable RFs (500 trees), which showed strong correlations (r~0.98) between their importance measures and those by RFs grown on 5000 trees. Both RF importance measures captured most of the LR single-locus and pairwise interaction effects, while joint importance measures also corresponded to full LR models containing main and interaction effects. We furthermore showed that RF measures were particularly sensitive to data imputation. The most consistent pairwise effect on rheumatoid arthritis was found between two markers within MAP3K7IP2/SUMO4 on 6q25.1, although LR and RFs assigned different significance levels.Within a hypothetical two-stage design, pairwise LR analysis of all markers with significant RF single importance would have reduced the number of possible combinations in our small data set by 61%, whereas joint importance measures would have been less efficient for marker pair reduction. This suggests that RF single importance measures, which are able to detect a wide range of interaction effects and are computationally very efficient, might be exploited as pre-screening tool for larger association studies. Follow-up analysis, such as by LR, is required since RFs do not indicate high-risk genotype combinations.
BMC Proc 2007
PMID:Analyses of single marker and pairwise effects of candidate loci for rheumatoid arthritis using logistic regression and random forests. 1846 54

In this report, we compared haplotyping approaches using families and unrelated individuals on the simulated rheumatoid arthritis (RA) data in Problem 3 from Genetic Analysis Workshop (GAW) 15. To investigate these two approaches, we picked two representative programs: PedPhase and fastPHASE, respectively, for each approach. PedPhase is a rule-based method focusing on the haplotyping constraints within each pedigree and solving them using integer linear programming. fastPHASE is a statistical method based on the clustering property of haplotypes in a population over short regions. It is believed that with family information, one can obtain more accurate phasing results with considerably more cost for genotyping additional family members. Our results indicate that, though only relying on the constraints within each family (with four members) individually, PedPhase has better phasing accuracy than fastPHASE, even when the total numbers of genotyped individuals are the same. But for missing genotype imputation, fastPHASE performs better than PedPhase by taking population information into consideration. The relative influence of family constraints and population information on haplotyping accuracy as shown in this report provides some empirical bases on assessing the trade-off of genotyping family data under different settings.
BMC Proc 2007
PMID:Comparison of haplotyping methods using families and unrelated individuals on simulated rheumatoid arthritis data. 1846 55

We used the simulated data set from Genetic Analysis Workshop 15 Problem 3 to assess a two-stage approach for identifying single-nucleotide polymorphisms (SNPs) associated with rheumatoid arthritis (RA). In the first stage, we used random forests (RF) to screen large amounts of genetic data using the variable importance measure, which takes into account SNP interaction effects as well as main effects without requiring model specification. We used the simulated 9187 SNPs mimicking a 10 K SNP chip, along with covariates DR (the simulated DRB1 gentoype), smoking, and sex as input to the RF analyses with a training set consisting of 750 unrelated RA cases and 750 controls. We used an iterative RF screening procedure to identify a smaller set of variables for further analysis. In the second stage, we used the software program CaMML for producing Bayesian networks, and developed complex etiologic models for RA risk using the variables identified by our RF screening procedure. We evaluated the performance of this method using independent test data sets for up to 100 replicates.
BMC Proc 2007
PMID:Two-stage approach for identifying single-nucleotide polymorphisms associated with rheumatoid arthritis using random forests and Bayesian networks. 1846 56

With the development of high-throughput single-nucleotide polymorphism (SNP) technologies, the vast number of SNPs in smaller samples poses a challenge to the application of classical statistical procedures. A possible solution is to use a two-stage approach for case-control data in which, in the first stage, a screening test selects a small number of SNPs for further analysis. The second stage then estimates the effects of the selected variables using logistic regression (logReg). Here, we introduce a novel approach in which the selection of SNPs is based on the permutation importance estimated by random forests (RFs). For this, we used the simulated data provided for the Genetic Analysis Workshop 15 without knowledge of the true model.The data set was randomly split into a first and a second data set. In the first stage, RFs were grown to pre-select the 37 most important variables, and these were reduced to 32 variables by haplotype tagging. In the second stage, we estimated parameters using logReg.The highest effect estimates were obtained for five simulated loci. We detected smoking, gender, and the parental DR alleles as covariates. After correction for multiple testing, we identified two out of four genes simulated with a direct effect on rheumatoid arthritis risk and all covariates without any false positive.We showed that a two-staged approach with a screening of SNPs by RFs is suitable to detect candidate SNPs in genome-wide association studies for complex diseases.
BMC Proc 2007
PMID:Picking single-nucleotide polymorphisms in forests. 1846 59

The Genetic Analysis Workshop 15 Problem 3 simulated rheumatoid arthritis data set provided 100 replicates of simulated single-nucleotide polymorphism (SNP) and covariate data sets for 1500 families with an affected sib pair and 2000 controls, modeled after real rheumatoid arthritis data. The data generation model included nine unobserved trait loci, most of which have one or more of the generated SNPs associated with them. These data sets provide an ideal experimental test bed for evaluating new and old algorithms for selecting SNPs and covariates that can separate cases from controls, because the cases and controls are known as well as the identities of the trait loci. LASSO-Patternsearch is a new multi-step algorithm with a LASSO-type penalized likelihood method at its core specifically designed to detect and model interactions between important predictor variables. In this article the original LASSO-Patternsearch algorithm is modified to handle the large number of SNPs plus covariates. We start with a screen step within the framework of parametric logistic regression. The patterns that survived the screen step were further selected by a penalized logistic regression with the LASSO penalty. And finally, a parametric logistic regression model were built on the patterns that survived the LASSO step. In our analysis of Genetic Analysis Workshop 15 Problem 3 data we have identified most of the associated SNPs and relevant covariates. Upon using the model as a classifier, very competitive error rates were obtained.
BMC Proc 2007
PMID:Detecting disease-causing genes by LASSO-Patternsearch algorithm. 1846 61

Significant alterations of T-cell function, along with activation of the inflammatory response system, appear to be linked not only to treatment-resistant schizophrenia, but also to functional psychoses and mood disorders. Because there is a relatively high comorbidity between rheumatoid arthritis (RA), schizophrenia and major depression, the question arises whether there is a common, genetically modulated inflammatory process involved in these disorders. On the basis of three family studies from the U.S. and Europe which were ascertained through an index case suffering from RA (599 nuclear families, 1868 subjects), we aimed to predict the inter-individual variation of autoantibody IgM levels, as an unspecific indicator of inflammatory processes, through molecular-genetic factors. In a three-stage strategy, we first used nonparametric linkage (NPL) analysis to construct an initial configuration of genomic loci showing a sufficiently high NPL score in all three populations. This initial configuration was then modified by iteratively adding or removing genomic loci such that genotype-phenotype correlations were improved. Finally, neural network analysis (NNA) was applied to derive classifiers that predicted the phenotype from the multidimensional genotype. Our analysis led to an activation model that predicted individual IgM levels from the subjects' multidimensional genotypes very reliably. This allowed us to use the activation model for an analysis of the DNA of an existing sample of 1003 psychiatric patients in order to test, in a first approach, whether a deviant, genetically modulated inflammatory process is involved in the pathogenesis of major psychiatric disorders.
BMC Proc 2007
PMID:Modeling activation of inflammatory response system: a molecular-genetic neural network analysis. 1846 62

Using the North American Rheumatoid Arthritis Consortium (NARAC) candidate gene and genome-wide single-nucleotide polymorphism (SNP) data sets, we applied regression methods and tree-based random forests to identify genetic associations with rheumatoid arthritis (RA) and to predict RA disease status. Several genes were consistently identified as weakly associated with RA without a significant interaction or combinatorial effect with other candidate genes. Using random forests, the tested candidate gene SNPs were not sufficient to predict RA patients and normal subjects with high accuracy. However, using the top 500 SNPs, ranked by the importance score, from the genome-wide linkage panel of 5742 SNPs, we were able to accurately predict RA patients and normal subjects with sensitivity of approximately 90% and specificity of approximately 80%, which was confirmed by five-fold cross-validation. However, in a complete training-testing framework, replication of genetic predictors was less satisfactory; thus, further evaluation of existing methodology and development of new methods are warranted.
BMC Proc 2007
PMID:Classification of rheumatoid arthritis status with candidate gene and genome-wide single-nucleotide polymorphisms using random forests. 1846 63

We performed a genome-wide search for pairs of susceptibility loci that jointly contribute to rheumatoid arthritis in families recruited by the North American Rheumatoid Arthritis Consortium. A complete two-dimensional (2D) non-parametric linkage scan was carried out using 380 autosomal microsatellite markers in 511 families. At each 2D peak we obtained the most likely underlying genetic model explaining the two-locus effects, defining epistasis as a departure from an additive or a multiplicative two-locus penetrance function. The highest peak in the surface identified an epistatic interaction between loci 6p21 and 16p12 (two-locus lod score = 18.02, epistasis P < 0.012). Significant and suggestive two-locus effects were also obtained for region 6p21 in combination with loci 18q21, 8p23, 1q41, and 6p22, while the highest 2D peaks excluding region 6p21 were observed at locus pairs 8p23-18q21 and 1p21-18q21. The 2D peaks were further examined using combined microsatellite and single-nucleotide polymorphism (SNP) marker genotypes in 744 families. The two-locus evidence for linkage increased for region pairs 6p21-18q12, 6p21-16p12, 6p21-8p23, 1q41-6p21, and 6p21-6p22, but decreased for pairs of regions that did not include locus 6p21. In conclusion, we obtained evidence for multi-locus interactions in rheumatoid arthritis that are mediated by the major susceptibility locus at 6p21.
BMC Proc 2007
PMID:A two-dimensional genome scan for rheumatoid arthritis susceptibility loci. 1846 64

When two genes interact to cause a clinically important phenotype, it would seem reasonable to expect that we could leverage genotypic information at one of the loci in order to improve our ability to detect the other. We were therefore interested in extending the posterior probability of linkage (PPL), a class of linkage statistics we have been developing over the past decade, in order to explicitly allow for gene x gene interaction. In this report we utilize a new implementation of the PPL incorporating liability classes (LCs), which provide a direct parameterization of gene x gene interaction by allowing the penetrances at the locus being evaluated to depend upon measured genotypes at a known locus. With knowledge of the generating model for the simulated rheumatoid arthritis (RA) data, we selected two loci for examination: Locus A, which in interaction with the HLA-DR antigen locus affects risk of the dichotomous RA phenotype; and Locus E, which in interaction with DR affects quantitative levels of the anti-CCP phenotype. The data comprised nuclear families of two parents and an affected sib pair (ASP). Our results confirm theoretical work suggesting that gene x gene interactions CANNOT be leveraged to improve linkage detection for dichotomous traits based on affecteds-only data structures. However, incorporation of DR-based LCs did lead to appreciably higher quantitative trait PPLs. This suggests that gene x gene interactions could be effectively used in quantitative trait analyses even when families have been ascertained as ASPs for a related dichotomous trait.
BMC Proc 2007
PMID:Exploiting gene x gene interaction in linkage analysis. 1846 65


<< Previous 1 2 3 4 5 6 7 8 9 10 Next >>