US20120270794A1 - Polymorphisms associated with parkinson's disease - Google Patents

Polymorphisms associated with parkinson's disease Download PDF

Info

Publication number
US20120270794A1
US20120270794A1 US13/452,341 US201213452341A US2012270794A1 US 20120270794 A1 US20120270794 A1 US 20120270794A1 US 201213452341 A US201213452341 A US 201213452341A US 2012270794 A1 US2012270794 A1 US 2012270794A1
Authority
US
United States
Prior art keywords
seq
nucleic acid
gene
allele
disease
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/452,341
Inventor
Nicholas Eriksson
Chuong Do
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
23andMe Inc
Original Assignee
23andMe Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 23andMe Inc filed Critical 23andMe Inc
Priority to US13/452,341 priority Critical patent/US20120270794A1/en
Publication of US20120270794A1 publication Critical patent/US20120270794A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K31/00Medicinal preparations containing organic active ingredients
    • A61K31/70Carbohydrates; Sugars; Derivatives thereof
    • A61K31/7088Compounds having three or more nucleosides or nucleotides
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • A61P25/14Drugs for disorders of the nervous system for treating abnormal movements, e.g. chorea, dyskinesia
    • A61P25/16Anti-Parkinson drugs
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/136Screening for pharmacological compounds
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10TTECHNICAL SUBJECTS COVERED BY FORMER US CLASSIFICATION
    • Y10T436/00Chemistry: analytical and immunological testing
    • Y10T436/14Heterocyclic carbon compound [i.e., O, S, N, Se, Te, as only ring hetero atom]
    • Y10T436/142222Hetero-O [e.g., ascorbic acid, etc.]
    • Y10T436/143333Saccharide [e.g., DNA, etc.]

Definitions

  • the present invention is related to polymorphisms associated with Parkinson's disease. More specifically, the invention is related to compositions, methods and for use in therapeutic and preventative treatment, study, diagnosis and prognosis of Parkinson's disease.
  • sequence listing included in the electronic file submitted herewith as one of the parts of this application, entitled “23MEP023_PD_Sequence_Listing”, is incorporated by reference into this application in its entirety.
  • This sequence listing file is created on Apr. 19, 2012, and is 3 kilobytes in size.
  • Parkinson's disease is a progressive degenerative disease of the central nervous system (CNS). PD is characterized by muscle rigidity, tremors, a slowness of physical movement (bradykinesia), impaired balance and coordination, and, in advanced stages, a loss of physical movement (akinesia). Over one million Americans suffer from Parkinson's disease, with the prevalence of approximately 1 in 272 or 0.37% in the United States (US Census Bureau, Population Estimates, 2004).
  • the PARK1 region at 4q21 contains the alpha-synuclein (SNCA) gene. Certain mutations in this gene confer a rare autosomal dominant form of PD (Duvoisin, R. C. (1996), Recent advances in the genetics of Parkinson's disease, Adv Neurol 69:33-40; Polymeropoulos et al. (1997) Mutation in the alpha - synuclein gene identified in families with Parkinson's disease, Science 276:2045-7; and Kruger et al.
  • SNCA alpha-synuclein
  • PARK3 at 2p13 autosomal dominant
  • PARK4 at 4p15 autosomal dominant; same locus as PARK1
  • PARK5 at 4p14 which contains a gene encoding a neuron-specific C-terminal ubiquitin hydrolase
  • PARK6 at 1p35 autosomal recessive
  • PARK7 at 1p36 which contains the DJ-1 gene; autosomal recessive
  • PARK8 at 12p1.2-q13.1 which contains the LRRK2 gene; autosomal dominant
  • PARK9, PARK10, PARK11, PARK12, PARK13, PARK14, PARK15, and PARK16 have also been linked to PD. While the molecular bases for most cases of PD are unclear, the various genetic regions that have been linked to this disease serve to illustrate the potential that the etiology of PD may involve the interaction of a large number of genetic components.
  • the present application provides compositions, methods and systems for determining increased or decreased risk or susceptibility of an individual to developing Parkinson's disease (PD).
  • the application provides nucleic acid sequences that may be used to determine the presence or absence of nucleotides at polymorphic sites in an individual's RNA or genomic DNA that are associated with susceptibility to or protection from PD.
  • the application provides a method for identifying a human subject having an increased or decreased susceptibility to PD, including the following steps: 1) obtaining a nucleic acid sample from a patient; and ii) detecting in the sample the identity of nucleotide or nucleotides at one or more polymorphic nucleotide positions listed in Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9).
  • methods of identifying a modulator of a PD phenotype include contacting a potential modulator to a gene or gene product, e.g. wherein the gene or gene product comprises or is closely linked to a polymorphism described herein (e.g. in Table 1-1 (SEQ ID NO: 1-8) and/or Table 2-1 (SEQ ID NO: 9)).
  • a polymorphism described herein e.g. in Table 1-1 (SEQ ID NO: 1-8) and/or Table 2-1 (SEQ ID NO: 9).
  • Kits for performing any of the methods herein are another feature of the disclosure in this application.
  • Such kits can include probes or amplicons for detecting any polymorphism herein, appropriate packaging materials, and instructions for practicing the methods.
  • the application also provides systems for generating a prognosis of a human subject's increased or decreased susceptibility to PD based on genotyping data spanning hundreds of thousands of single nucleotide polymorphisms (SNPs).
  • the system may include means for storing a subject's profile comprising a set of patient-specific information including a subject's medical history, the family medical history and the subject's genetic testing results including genotypes at the various polymorphic sites listed in Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9).
  • the invention also provides methods of PD prognosis based on expression profiling. Such methods include determining the expression levels of at least 2 and no more than 5,000 genes in a subject, wherein at least two of the genes are selected from the group consisting of MCCC1, TMEM175, RIT2, GAK, DGKQ, RIN, SYT4, STBD1, SCARB2, HLA-DRB1, HLA-DQA1, LOC729862, PGDB3P2 and LRKK2, wherein the expression levels are used to create an expression profile.
  • the expression levels are used for PD prognosis by comparing expression levels of the genes in a human subject to expression levels of the genes in a control human subject known to have Parkinson's disease (PD) and/or a healthy control human subject known to not have PD, wherein similarity of expression profiles in the subject and the control subject having PD is suggestive that the subject has a higher likelihood of having or developing Parkinson's disease, and similarity of the expression profiles of the subject and a control subject not having PD-is suggestive that the subject has a lower likelihood of having or developing Parkinson's disease.
  • PD Parkinson's disease
  • Any of the above methods can include informing the subject or a relative thereof of the presence of or susceptibility to PD; or further comprising administering a treatment regimen effective to treat or effect prophylaxis of PD in said subject.
  • nucleic acid refers to a deoxyribonucleotide or ribonucleotide, whether singular or in polymers, naturally occurring or non-naturally occurring, double-stranded or single-stranded, coding (e.g. translated gene) or non-coding (e.g. regulatory region), or any fragments, derivatives, mimetics or complements thereof.
  • nucleic acids examples include oligonucleotides, nucleotides, polynucleotides, nucleic acid sequences, genomic sequences, antisense nucleic acids, DNA regions, probes, primers, genes, regulatory regions, introns, exons, open-reading frames, binding sites, target nucleic acids and allele-specific nucleic acids.
  • a nucleic acid can include one or more polymorphisms, variations or mutations (e.g., SNPs, insertions, deletions, inversions, translocations, etc.).
  • a nucleic acid includes analogs (e.g., phosphorothioates, phosphoramidates, methyl phosphonate, chiral-methyl phosphonates, 2-O-methyl ribonucleotides) or modified nucleic acids (e.g., modified backbone residues or linkages) or nucleic acids that are combined with carbohydrates, lipids, protein or other materials, or peptide nucleic acids (PNAs) (e.g., chromatin, ribosomes, transcriptosomes, etc.) or nucleic acids in various structures (e.g., A DNA, B DNA, Z-form DNA, siRNA, tRNA, ribozymes, etc.).
  • a nucleic acid may also include a detectable label.
  • detectable label refers to, for example, a luminescent label, a light scattering label or a radioactive label, or any other form of labeling that can be detected by a physical, chemical, or a biological process.
  • Fluorescent labels include commercially available fluorescein phosphoramidites such as Fluoreprime (Pharmacia), Fluoredite (Millipore) and FAM (ABI).
  • hybridization refers to the process in which two single-stranded polynucleotides bind non-covalently to form a stable double-stranded polynucleotide; triple-stranded hybridization is also theoretically possible.
  • Hybridizations are usually performed under stringent conditions. For example, conditions of 5 ⁇ SSPE (750 mM NaCl, 50 mM NaPhosphate, 5 mM EDTA, pH 7.4) and a temperature of 25-30° C. are suitable for allele-specific probe hybridizations.
  • specific hybridization refers to the ability of a first nucleic acid to bind, duplex or hybridize to a second nucleic acid in a manner such that the second nucleic acid can be identified or distinguished from other components of a mixture (e.g., cellular extracts, genomic DNA, etc.). In certain embodiments, specific hybridization is performed under stringent conditions.
  • hybridization-based assay means any assay that relies on the formation of a stable duplex or triplex between a probe and a target nucleotide sequence for detecting or measuring such a sequence.
  • Hybridization-based assays include, without limitation, assays based on use of oligonucleotides, such as polymerase chain reactions, oligonucleotide ligation reactions, single-base extensions of primers, circularizable probe reactions, allele-specific oligonucleotide hybridizations, either in solution phase or bound to solid phase supports, such as microarrays or microbeads.
  • isolated and purified refer to a material that is substantially or essentially removed from or concentrated in its natural environment.
  • a purified nucleic acid is one that is separated from the nucleic acids that normally flank it or from other biological materials (e.g., other nucleic acids, proteins, lipids, cellular components, etc.) in a sample.
  • linkage disequilibrium refers to the preferential segregation of a particular polymorphic form with another polymorphic form at a different chromosomal location more frequently than expected by chance.
  • modulate refers to a change in expression, lifespan, function or activity of a nucleic acid or a protein. Such changes may include, for example, an increase, decrease, alteration, enhancement or inhibition of expression or activity of a nucleic acid or protein.
  • PD-related nucleic acid refers to a nucleic acid, or fragment, derivative (e.g., RNA), variant, polymorphism, or complement thereof, associated with resistance or susceptibility to PD including, for example, at least one or more PD-associated polymorphisms, genomic regions spanning 10 kb immediately upstream and 10 kb immediately downstream of a PD-associated polymorphism, coding and non-coding regions of an associated gene, and/or genomic regions spanning 10 kb immediately upstream and 10 kb immediately downstream of an associated gene, and nucleotide variants thereof.
  • phenotype is a trait or collection of traits that is/are observable in an individual or population.
  • the trait can be quantitative (a quantitative trait, or QTL) or qualitative.
  • susceptibility to Parkinson's disease is a phenotype that can be identified according to the methods and compositions of the application as described herein.
  • a “PD susceptibility phenotype” is a phenotype that displays a predisposition towards developing Parkinson's disease in an individual.
  • a phenotype that displays a predisposition for PD can, for example, show a higher likelihood that the disease will develop in an individual with the phenotype than in members of a relevant general population under a given set of environmental conditions (diet, physical activity regime, geographic location, etc.).
  • polymorphism refers to a position in a nucleic acid that possesses the quality or character of occurring in several different forms.
  • a nucleic acid may be naturally or non-naturally polymorphic, e.g., having one or more sequence differences (e.g., additions, deletions and/or substitutions) as compared to a reference sequence.
  • a reference sequence may be based on publicly available information (e.g., the U.C.
  • a nucleic acid polymorphism is characterized by two or more “alleles”, or versions of the nucleic acid sequence.
  • an allele of a polymorphism that is identical to a reference sequence is referred to as a “reference allele” and an allele of a polymorphism that is different from a reference sequence is referred to as an “alternate allele”, or sometimes a “variant allele”.
  • the term “major allele” refers to the more frequently occurring allele at a given polymorphic site, and “minor allele” refers to the less frequently occurring allele, as present in the general or study population.
  • risk allele refers to an allele of a genetic polymorphism associated with an increased risk for PD.
  • protecting allele refers to the allele associated with a decreased risk for PD.
  • single nucleotide polymorphism refers to a polymorphic site occupied by a single nucleotide, which is the site of variation between allelic sequences. The site is usually preceded by and followed by highly conserved sequences of the allele (e.g., sequences that vary in less than 1/100 or 1/1000 members of the populations).
  • a single nucleotide polymorphism usually arises due to substitution of one nucleotide for another at the polymorphic site.
  • Single nucleotide polymorphisms can also arise from a deletion of a nucleotide or an insertion of a nucleotide relative to a reference allele.
  • FIG. 1 depicts a Manhattan plot of the GWAS, showing the distribution of p-values along the genome, with chromosomes arranged along the X-axis, and all SNPs associated with p-values under 10e-7 represented with an “x”.
  • FIG. 2 depicts a quantile-quantile plot of the GWAS performed by the applicants. There is no evidence of population structure biasing the results, as the plot shows no inflation.
  • the genomic inflation factor (Devlin and Roeder (1999) Genomic control for association studies, Biometrics 55:997-1004) was calculated to be 1.11.
  • FIG. 3 depicts plots of the p-values surrounding significant SNPs around the MCCC1/LAMP3 region. SNPs with a p-value under 10e-6 are shown.
  • FIG. 4 depicts plots of the p-values surrounding significant SNPs around the GAK/DGKQ/TMEM175 region. SNPs with a p-value under 10 e-6 are shown.
  • FIG. 5 depicts plots of the p-values surrounding significant SNPs around the RIT2 region. SNPs with a p-value under 10 e-6 are shown.
  • FIG. 6 depicts plots of the p-values surrounding significant SNPs around the SCARB2 region. SNPs with a p-value under10 e-6 are shown.
  • FIG. 7 depicts plots of the p-values surrounding significant SNPs around the HLA-DRB1 and HLA-DQA1regions. SNPs with a p-value under 10 e-6 are shown.
  • FIG. 8 depicts plots of the p-values surrounding significant SNPs around the LOC729862 region. SNPs with a p-value under 10e-6 are shown.
  • the invention provides a set of novel polymorphisms associated with Parkinson's disease (PD). Identification of such polymorphisms is useful for the development and design of diagnostic or prognostic assays for PD.
  • the polymorphisms may also have additional applications, including diagnostic and prognostic use in Parkinson's disease-related conditions, therapeutic treatments for Parkinson's disease, genetic linkage analysis and positional cloning.
  • GWAS genome-wide association study
  • the GWAS performed by the applicants identified a total of 12 SNPs that were found to be independently and significantly associated with Parkinson's disease. Of these, 8 SNPs have never before been demonstrated to be associated with PD. In addition, 4 SNPs previously associated with PD replicated in the GWAS performed by the applicants. The presence of these 4 previously identified and well-known PD-associated SNPs in the set of significantly associated SNPs serves as supporting evidence for the validity of the study design and methodology. Finally, the GWAS performed by the applicants identified a novel PD-associated SNP modifying a known PD-associated mutation in the LRKK2 gene (LRRK2 G2019S, rs34637584).
  • Tables 1-1 (SEQ ID NO: 1-8) and 1-2 (SEQ ID NO: 1-8) identify the 8 novel PD-associated SNPs that independently associated with PD. These SNPs were selected on the basis of fulfilling the following criteria: 1) a p-value under 1e-7 for association with PD; 2) replication in the National Institute of Neurological Disorders and Stroke (NINDS) Parkinson's disease dataset; 3) evidence of independent effect (significant after controlling for other SNPs in the list); and 4) no evidence of genotyping error.
  • NINDS National Institute of Neurological Disorders and Stroke
  • SNP rs10513789 (SEQ ID NO: 1) is found in the following sequence: SEQ ID NO: 1: 5′-tgatggtttttcaattttgttatgttgata[t/g]gtactgcatgataccagattacaaacaggg-3′.
  • the major allele is T
  • the minor allele is G
  • the major allele is the risk allele.
  • This association lies in an intron of MCCC1.
  • Another candidate gene in this region is LAMP3, which has been found to be overexpressed in the brains of individuals with PD.
  • SNP rs6599389 (SEQ ID NO: 2) is found in the following sequence:
  • SEQ ID NO: 2 5'-cccgccgtctctgtctcctcactccccgcc[g/a] tttgttgctgcctcagcttctgttttcccc-3'.
  • G The major allele is G
  • A the minor allele is A
  • DGKQ Other candidate genes in this region include GAK and DGKQ.
  • GAK is a promising candidate gene as it is differentially expressed in Parkinson's disease in the substantia nigra (Grünblatt et al (2004) Gene expression profiling of parkinsonian substantia nigra pars compacta: alterations in ubiquitin - proteasome, heat shock protein, iron and oxidative stress regulated proteins, cell adhesion/cellular matrix and vesicle trafficking genes, J Neural Transm 111(12):1543-73).
  • SNP rs873785 (SEQ ID NO: 3) is found in the following sequence:
  • SEQ ID NO: 3 5'-tcgtgatcccccttcgtcttcaacacctc[a/g] gtgtttctcttcccggtgaacatctactcg-3'.
  • the major allele is A
  • the minor allele is G
  • This SNP lies in an intron of GAK.
  • Other candidate genes in this region include TMEM175 and DGKQ.
  • SNP rs11248060 (SEQ ID NO: 4) is found in the following sequence:
  • SEQ ID NO: 4 5'-cttgtgggttcacacatagctgtgaggaaa[c/t] tgaacagaccccctgaagcctttgtcagct-3'.
  • the major allele is C
  • the minor allele is T
  • the minor allele being the risk allele.
  • This SNP lies in an intron of DGKQ.
  • Other candidate genes in this region include TMEM175 and GAK.
  • SNP rs4130047 (SEQ ID NO: 5) is found in the following sequence:
  • SEQ ID NO: 5 5'-atcagaacgattctaggtgataatctttta[t/c] ggtcagagttttcaaaaatggcagaaacat-3'.
  • the major allele is T
  • the minor allele is C
  • the minor allele being the risk allele.
  • SNP rs4130047 lies near two possible candidate genes.
  • the first candidate gene, RIT2 contains rs4130047 (SEQ ID NO: 5) in an intron. RIT2 is similar to RIN, a gene potentially implicated in calcium-mediated signaling within neurons.
  • Rin expression in brain is widespread but variable, with moderate expression in the majority of neurons throughout the diencephalon, midbrain, and hindbrain.
  • the SNP is also close to SYT4 (synaptotagmin IV), which is expressed in the brain.
  • SYT4 expression levels in brain are highest in the hippocampus, with substantial levels also detected in the amygdala and thalamus.
  • SNP rs6812193 (SEQ ID NO: 6) is found in the following sequence:
  • SEQ ID NO: 6 5'-cctactaagttggggaaagctggatttgaa[c/t] cctggtctgtcaaaatccaaaggccatgtt-3'.
  • the major allele is C
  • the minor allele is T
  • SNP rs6812193 lies in an intron of STBD1.
  • SCARB2 Another candidate gene in this region is SCARB2, which binds to GBA in mice (Reczek D.
  • LIMP -2 Is a Receptor for Lysosomal Mannose -6- Phosphate - Independent Targeting of beta - Glucocerebrosidase, Cell November 16; 131(4):770-83).
  • GBA is known to be associated with PD (E. Sidransky (2009) Multicenter Analysis of Glucocerebrosidase Mutations in Parkinson's Disease, NEJM, Volume 361:1651-1661).
  • SNP rs7451962 (SEQ ID NO: 7) is found in the following sequence:
  • SEQ ID NO: 7 5′-gcgctttatttctaggaagggcaggaattc[g/a]gaatccctcatcctgtccccagctacctgt-3′.
  • the major allele is G
  • the minor allele is A
  • the minor allele being the risk allele.
  • SNP rs7451962 (SEQ ID NO: 7) candidate genes include HLA-DRB1 and HLA-DQA1. This SNP helps to tag the HLA type DQA1*0102, which is a risk allele for multiple sclerosis.
  • SNP rs4397141 (SEQ ID NO: 8) is found in the following sequence: SEQ ID NO: 8: 5′-attggtcttttatactgtatttttaatata[c/t]cttttgtatgtttagacatacagatcttta-3′.
  • the major allele C, the minor allele is T, with the major allele being the risk allele.
  • the candidate genes include LOC729862, and PGBD3P2.
  • LOC729862 is similar to Striatin, which is a protein involved in locomotor activity. Down regulation of striatin impairs the growth of dendrites as well as rat locomotor activity. (Castets et al.
  • Zinedin, SG 2 NA, and striatin are calmodulin - binding, WD repeat proteins principally expressed in the brain. J. Biol. Chem. 275: 19970-19977).
  • PGBD3P2 is annotated as a pseudogene of a gene derived from a transposable element.
  • Tables 2-1 (SEQ ID NO: 9) and 2-2 (SEQ ID NO: 9) identify a novel PD-associated SNP rs11755699 (SEQ ID NO: 9) that modifies a known PD-associated mutation in the LRKK2 gene (LRRK2 G2019S, rs34637584).
  • a known PD-associated mutation in the LRKK2 gene LRRK2 G2019S, rs34637584.
  • people with the minor allele C at rs11755699 (SEQ ID NO: 9) have a lower risk of developing Parkinson's disease.
  • SNP rs11755699 (SEQ ID NO: 9) is found in the following sequence: SEQ ID NO: 9: 5′-tgtgtcacagattcaaagctcttcgttcc[t/c]gaagtcctgctactgtgacttggaactctc-3′.
  • the major allele is T
  • the minor allele is C
  • the minor allele being the protective allele.
  • SNP rs11755699 (SEQ ID NO: 9) lies in an intron of SGK1, a gene known to be associated with cell survival in neurodegenerative disease (Stichel et al.
  • Sgk 1 a member of an RNA cluster associated with cell death in a model of Parkinson's disease, Eur J Neurosci. 21(2):301-316; and Schoenebeck et al. (2005) Sgk 1 , a cell survival response in neurodegenerative diseases, Mol Cell Neurosci. 30(2):249-264).
  • Table 3 lists 4 SNPs previously associated with PD that replicated in the GWAS performed by the applicants.
  • the 4 replicated SNPs met the same selection criteria as the 8 novel SNPs identified by the applicants in Tables 1-1 (SEQ ID NO: 1-8) and 1-2 (SEQ ID NO: 1-8), and serve as supporting evidence for the validity of the study design and methodology.
  • Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9) provide only a partial list of candidate genes associated with the novel PD-associated SNPs identified herein.
  • Other candidate genes or genomic regions include, for example, sequences spanning 10 kb immediately upstream and 10 kb immediately downstream of a PD-associated SNP, coding and non-coding regions of an associated gene, and/or genomic regions spanning 10 kb immediately upstream and 10 kb immediately downstream of an associated gene, and nucleotide variants thereof.
  • Alleles refer to major/minor alleles on forward strand.
  • the major/minor allele order is based on allelic frequencies in the study cohort.
  • MAF Minor allele frequency
  • Score P-value of association test at ⁇ Log 10 p-value
  • OR odds ratio per copy of the minor allele
  • CI confidence interval. Odds ratio > 1 means the minor allele is associated with Parkinson's disease.
  • the present invention provides, in part, novel methods for identifying individuals at an increased risk for PD.
  • the SNPs, alleles and associated genes and genomic regions identified in Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9) can be used to identify, isolate and amplify PD-related nucleic acids. Such nucleic acids can be useful for prognostics, diagnostics, prevention, treatment and further study of PD.
  • nucleic acids identified in Tables 1-2 (SEQ ID NO: 1-8) and 2-2 (SEQ ID NO: 9) can specifically hybridize to genomic regions associated with PD. Due to the duplex nature of DNA, it will be clear to one skilled in the art that sequence complementary to those provided in Tables 1-2 (SEQ ID NO: 1-8) to 2-2 (SEQ ID NO: 9) can also specifically hybridize to the genomic regions associated with PD, and are contemplated to be part of the instant invention.
  • Nucleic acids provided in Tables 1-2 (SEQ ID NO: 1-8) and 2-2 (SEQ ID NO: 9) and their complementary sequences can, in some embodiments, specifically hybridize to a genomic sequence containing the polymorphisms identified in Tables 1-1 (SEQ ID NO: 1-8) to 2-1 (SEQ ID NO: 9).
  • the nucleic acids identified herein and those in linkage disequilibrium with them are associated with an increased or decreased risk for PD.
  • the nucleic acids in linkage disequilibrium with those disclosed herein usually occur in the same genes or with 100 kb of the same genes.
  • a nucleic acid associated with increased risk for PD is one that contains the risk allele at the polymorphisms identified at Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9).
  • a nucleic acid associated with decreased risk for PD is one that contains the protective allele at the polymorphisms identified at Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9).
  • Conditions for nucleic acid hybridization vary depending on the buffers used, length of nucleic acids, ionic strength, temperature, etc.
  • stringent conditions refers to the incubation and wash conditions (e.g., conditions of temperature and buffer concentration) that permit hybridization of a first nucleic acid to a second nucleic acid.
  • the first nucleic acid may be perfectly (e.g. 100%) complementary to the second or may share some degree of complementarity, which is less than perfect (e.g., more than 70%, 75%, 85%, or 95%).
  • certain high stringency conditions can be used which distinguish perfectly complementary nucleic acids from those less complementary, even those having only a single base mismatch.
  • the exact conditions which determine the stringency of hybridization depend not only on ionic strength (e.g., 0.2.times.SSC, 0.1.times.SSC), temperature (e.g., room temperature, 42 C, 68 C) and the concentration of destabilizing agents such as formamide or denaturing agents such as SDS, but also on factors such as the length of the nucleic acid sequence, base composition, percent mismatch between hybridizing sequences and the frequency of occurrence of subsets of that sequence within other non-identical sequences. Equivalent conditions can be determined by varying one or more of these parameters while maintaining a similar degree of identity or similarity between the two nucleic acid molecules.
  • probes and primers can be used as probes and primers in various assays.
  • probe and primer refer to nucleic acids that hybridize, in whole or in part, in a sequence-specific manner to a complementary strand.
  • Probes with a detectable label can be used in cloning of full-length cDNA or genomic DNA by screening cDNA or genomic DNA libraries.
  • the term “primer” refers to a single-stranded nucleic acid that can act as a point of initiation of template-directed DNA synthesis, such as in PCR.
  • PCR reactions can be designed based on the human genome sequence and the associated genomic regions or polymorphisms. For example, where a polymorphism is located in an exon, the exon can be isolated and amplified using primers that are complementary to the nucleotide sequences at both ends of the exon. Similarly, where a polymorphism is located in an intron, the entire intron can be isolated and amplified using primers that are complementary to the nucleotide sequences at both ends of the intron.
  • a probe or primer contains a region of at least about 10 contiguous nucleotides, preferably at least about 16 contiguous nucleotides, more preferably about 20 or about 30 or about 50 contiguous nucleotides, that can specifically hybridize to a complementary nucleic acid sequence.
  • a probe or primer is preferably about 100 or fewer nucleotides, more preferably between 6 and 50 nucleotides, and more preferably between 12 and 30 nucleotides in length.
  • a first portion of a probe or primer is perfectly complementary to a target nucleic acid, and a second portion of the probe or primer is not perfectly complementary to the target nucleic acid.
  • the portion that is not perfectly complementary contains a binding site, e.g., for a polypeptide or another probe or primer.
  • Detection of the presence or increased level of one or more nucleic acids, or fragments, derivatives, polymorphisms, variants or complements thereof, associated with resistance or susceptibility to PD-related disease can be used as a prognostic and diagnostic tool for PD.
  • a gene or a genomic region comprising or under the control of a PD-related nucleic acid may be differentially expressed.
  • “Differential expression” as used herein refers to both quantitative and qualitative differences in a gene's expression patterns including, e.g., changes in tissue-specificity or temporal aspects of expression. Detection of such expression patterns may be made by standard techniques well known to those of skill in the art, for example, differential screening (Tedder et al. (1998) Proc. Natl. Acad. Sci. USA 85:208-212), subtractive hybridization (Hedrick et al. (1984) Nature 308:149-153; Lee et al. (1984) Proc. Natl. Acad. Sci. USA 88:2825), differential display (Liang et al., U.S. Pat. No. 5,262,311), reverse transcriptase-(RT-) PCR and/or Northern analysis.
  • a gene comprising, or under the control of a PD-related nucleic acid may exhibit differential allelic expression.
  • “Differential allelic expression” as used herein refers to both qualitative and quantitative differences in the allelic expression of multiple alleles of a single gene present in a cell.
  • a gene displaying differential allelic expression may have one allele expressed at a different time or level as compared to a second allele in the same cell/tissue.
  • Differential allelic expression and analysis methods thereof are disclosed in detail in U.S. patent application Ser. No. 10/438,184, filed May 13, 2003 and U.S. patent application Ser. No. 10/845,316, filed May 12, 2004, incorporated into this application in their entirety herein.
  • Detection of nucleic acids may be made using any method known in the art, for example, Southern or northern analyses, in situ hybridization analyses, single-stranded conformational polymorphism analyses, polymerase chain reaction analyses and nucleic acid microarray analyses, all of which are well known to those of skill in the art.
  • analyses may reveal not only the alleles present in a test sample, but also both quantitative and qualitative aspects of the expression pattern of polypeptides encoded by PD-related nucleic acids.
  • analyses may reveal expression patterns of polypeptides associated with resistance or susceptibility to PD-related disease.
  • a diagnosis or prognosis is made using a test sample containing genomic DNA or RNA obtained from an individual to be tested.
  • the individual can be an adult, child or fetus. In a preferred embodiment, the individual is a human.
  • the terms “individual”, “subject” and “patient” are used interchangeably in the present application.
  • the test sample can be from any source which contains genomic DNA or RNA, including for example, blood, amniotic fluid, cerebrospinal fluid, skin, muscle, buccal or conjunctival mucosa, placenta, gastrointestinal tract or other tissues.
  • a DNA or RNA sample is obtained from neuronal tissue.
  • test sample of DNA from fetal cells or tissue can be obtained by appropriate methods such as by amniocentesis or chorionic villus sampling.
  • the test sample is subjected to one or more tests to identify the presence or absence of a nucleic acid of interest (e.g., a PD-related nucleic acid).
  • a nucleic acid of interest e.g., a PD-related nucleic acid
  • test sample is subjected to purification, isolation and/or amplification techniques, many of which are well known in the art (e.g., Sambrook et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, N.Y.; and Ausubel et al. (eds. 1987-1993), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. New York).
  • purification, isolation and/or amplification techniques many of which are well known in the art (e.g., Sambrook et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, N.Y.; and Ausubel et al. (eds. 1987-1993), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. New York).
  • Southern blot, northern blot or similar analyses methods are used to identify the presence or absence of one or more genomic DNA sequences associated with resistance or susceptibility to PD using complementary nucleic acid probes.
  • the nucleic acid probes are labeled before they are contacted with the test sample; in other embodiments, the nucleic acids in the test sample are labeled before they are contacted with the nucleic acid probes.
  • Alternative diagnostic and prognostic methods employ amplification of target nucleic acids associated with resistance or susceptibility to PD, e.g., by PCR. This is especially useful for target nucleic acids present in very low quantities.
  • amplification of target nucleic acids associated with resistance to PD indicates their presence and is a prognostic and diagnostic of resistance to PD.
  • amplification of target nucleic acids associated with susceptibility to PD indicates their presence and is a prognostic and diagnostic of susceptibility to PD.
  • Microarrays can also be utilized for diagnosis and prognosis of resistance or susceptibility to PD-related disease.
  • Microarrays comprise probes that are complementary to target nucleic acid sequences from an individual.
  • a microarray probe is preferably allele-specific.
  • the microarray comprises a plurality of different probes, each coupled to a surface of a substrate in different known locations and each, capable of binding complementary strands. See, e.g. U.S. Pat. No. 5,143,854 and PCT Publication Nos. WO 90/15070 and WO 92/10092.
  • microarrays can generally be produced using mechanical synthesis methods or light-directed synthesis methods that incorporate a combination of photolithographic methods and solid phase oligonucleotide synthesis methods. See Fodor et al., (1991) Science 251:767-777; and U.S. Pat. No. 5,424,186. Techniques for the mechanical synthesis of microarrays are described in, for example, U.S. Pat. No. 5,384,261.
  • the invention also provides methods of expression profiling by determining levels of expression profiling of one or more genes of the invention, i.e. MCCC1, TMEM175, RIT2, GAK, DGKQ, RIN, SYT4, STBD1, SCARB2, HLA-DRB1, HLA-DQA1, LOC729862, PGDB3P and LRKK2.
  • the methods preferably determine expression levels of at least 2, 5, or 10 or all of the above genes.
  • expression levels of other genes beyond those shown to be associated with PD in the present application are also determined.
  • the polymorphisms listed in Tables 1-2 (SEQ ID NO: 1-8) and 2-2 (SEQ ID NO: 9) may have an effect on the expression profile of several other genes.
  • the expression levels of one or more genes in discrete sample are referred to as an expression profile.
  • the expression profile is compared with an expression profile of the same genes in a control sample.
  • the control sample can be a negative control (e.g., an individual (or population of individuals) not having or susceptible to PD or a positive control (e.g., an individual (or population of individuals) having or susceptible to PD.
  • the controls can be contemporaneous or historical. Individual expression levels in both the test and control samples can be normalized before comparison, e.g., by reference to the levels of a house keeping genes to avoid differences unrelated to the disease.
  • the relative similarity of the expression profile of a test individual to the negative and positive control expression profiles is a measure of the individual's resistance or susceptibility to PD. For example, if an expression profile is determined for seven genes of the invention, and the expression levels in the test subject are more similar to the positive control than the negative control for five of the genes, one can conclude that the test individual has or is susceptible to PD or PD-related disorder.
  • the analysis can be performed at a more sophisticated level by weighting expression level according to where they lie between negative and positive controls.
  • the system will include instructions to compare detected information as to allele sequence with a database that includes correlations between the alleles and the incidence of PD.
  • the system will also include a database containing allelic information for control subjects that have PD, and control subjects that do not have PD.
  • a typical system will further include a database comprising detailed phenotypic information for a subject, medical history of the subject, family medical history and other lifestyle specific information for the subject.
  • the system will also include a set of instructions to analyze the subject's risk of developing PD or identify the presence of PD in the subject based on the genetic profile of the subject and other measurable characteristics as stated above.
  • the system will include links to external databases to allow a user to communicate results of the diagnosis or prognosis to a medical practitioner for effecting suitable treatment for the subject.
  • Agents that modulate the expression, function and/or activity of PD-related disease nucleic acids or polypeptides can be obtained using any of the numerous approaches in combinatorial library methods known in the art, including: biological libraries; natural products libraries; spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the ‘one-bead one-compound’ library method; and synthetic library methods using affinity chromatography selection.
  • biological libraries is largely limited to polypeptide libraries, while the other four approaches are applicable to polypeptide, non-peptide oligomer or small molecule libraries of compounds. See Lam, K. S. (1997) Anticancer Drug Des. 12:145.
  • Non-peptide agents or small molecules are generally preferred because they are more readily absorbed after oral administration and have fewer potential antigenic determinants. Small molecules are also more likely to cross the blood brain barrier than larger protein-based pharmaceuticals. Methods for screening small molecule libraries for candidate protein-binding molecules are well known in the art and may be employed to identify molecules that modulate (e.g., through direct or indirect interaction) one or more of the PD-related disease polypeptides herein. Briefly, PD-related disease polypeptides may be immobilized on a substrate and a solution including the small molecules is contacted with the PD-related disease polypeptide under conditions that are permissive for binding. The substrate is then washed with a solution that substantially reflects physiological conditions to remove unbound or weakly bound small molecules.
  • a second wash may then elute those compounds that are bound strongly to the immobilized polypeptide.
  • the small molecules can be immobilized and a solution of PD-related disease polypeptides can be contacted with the column, filter or other substrate on which the small molecules are immobilized.
  • the ability to detect binding of a PD-related disease polypeptide to a small molecule may be facilitated by labeling (e.g., radio-labeling or chemiluminescence) the polypeptide or small molecule.
  • Systems for screening modulators are also a feature of the application.
  • the systems, genes linked to a polymorphism herein, or an encoded expression products of the gene. can include, e.g.
  • the systems will typically include a detector that measures increased or decreased expression of the gene or gene product in the presence of the modulator; increased or decreased activity of the gene product in the presence of the modulator; or an altered expression pattern of the gene or gene product in the presence of the modulator.
  • the systems can also include fluid handling elements for mixing and aliquotting modulator and/or the gene or product, mixing them, performing laboratory operations (e.g., purification, synthesis, cell culture, etc.).
  • System instructions for recording modulator effects and, optionally, for selecting modulators are also an optional feature of these systems.
  • DNA extraction and genotyping were performed on saliva samples by National Genetics Institute (NGI, Los Angeles, Calif. 90064), a CLIA licensed clinical laboratory and a subsidiary of Laboratory Corporation of America. Every sample that failed to reach 98.5% call rate was re-analyzed. Individuals whose analyses failed repeatedly were re-contacted to provide additional samples. SNPs with a call rate under 98% were excluded from analysis, as were those with minor allele frequency under 0.5% or a p-value for Hardy-Weinberg equilibrium under 1e-10. Both minor allele frequency and Hardy-Weinberg statistics were calculated within the dataset. Due to the two slightly different platforms in the analysis the no-call rate was calculated only among individuals genotyped on a given platform.
  • E. LRKK2 modifying SNP rs11755699 (SEQ ID NO: 9).
  • the LRRK2 G2019S mutation (rs34637584) is known to raise an individual's risk of developing PD by over 20-fold (Healy et al. (2008) Phenotype, genotype, and worldwide genetic penetrance of LRRK 2- associated Parkinson's disease: a case - control study, Lancet Neurol. 7(7):583-590).
  • GWAS genome-wide association study
  • the GWAS was conducted in 60 people over the age of 50 who carried the LRRK2 G2019S mutation. Although by typical standards this may appear to be a small set in terms of a population size for a GWAS, the cohort represents one of the largest collections of G2019S positive people in the world, as the G2019S mutation is quite rare.
  • the most significant association found in this analysis was the mutation rs11755699 (SEQ ID NO: 9), which lies in an intron of SGK1. It had a p-value (using Fisher's exact test) of under 10 ⁇ 5.5, and an odds ratio of over 42 for not developing PD if one carried the C allele (T being the more common allele). Of the 20 people with G2019S and without PD, 10 of them had the C allele, as compared to the 0 or 1 that would be expected.
  • rs11755699 (SEQ ID NO: 9) is intronic in SGK1 gene which is known to be associated with cell survival in neurodegenerative diseases such as PD (Stichel et al. (2005) Sgk 1 , a member of an RNA cluster associated with cell death in a model of Parkinson's disease, Eur J Neurosci. 21(2):301-316; and Schoenebeck et al. (2005) Sgk 1 , a cell survival response in neurodegenerative diseases, Mol Cell Neurosci. 30(2):249-264.
  • PD Stichel et al. (2005) Sgk 1 , a member of an RNA cluster associated with cell death in a model of Parkinson's disease, Eur J Neurosci. 21(2):301-316; and Schoenebeck et al. (2005) Sgk 1 , a cell survival response in neurodegenerative diseases, Mol Cell Neurosci. 30(2):249-264.

Abstract

The invention provides human polymorphisms that are associated with Parkinson's disease (PD). Also disclosed are compositions and methods for use in diagnostics, prognostics, prevention, treatment and/or study of PD.

Description

    CROSS REFERENCE TO OTHER APPLICATIONS
  • This application is a continuation of co-pending U.S. patent application Ser. No. 12/956,525 entitled POLYMORPHISMS ASSOCIATED WITH PARKINSON'S DISEASE filed Nov. 30, 2010 which is incorporated herein by reference for all purposes, which claims priority to U.S. Provisional Patent Application No. 61/265,304 entitled POLYMORPHISMS ASSOCIATED WITH PARKINSON'S DISEASE filed Nov. 30, 2009 and U.S. Provisional Patent Application No. 61/359,769 entitled POLYMORPHISMS ASSOCIATED WITH PARKINSON'S DISEASE filed Jun. 29, 2010 which are incorporated herein by reference for all purposes.
  • FIELD OF THE INVENTION
  • The present invention is related to polymorphisms associated with Parkinson's disease. More specifically, the invention is related to compositions, methods and for use in therapeutic and preventative treatment, study, diagnosis and prognosis of Parkinson's disease.
  • CROSS-REFERENCE TO SEQUENCE LISTING
  • The sequence listing included in the electronic file submitted herewith as one of the parts of this application, entitled “23MEP023_PD_Sequence_Listing”, is incorporated by reference into this application in its entirety. This sequence listing file is created on Apr. 19, 2012, and is 3 kilobytes in size.
  • BACKGROUND OF THE INVENTION
  • Parkinson's disease (PD) is a progressive degenerative disease of the central nervous system (CNS). PD is characterized by muscle rigidity, tremors, a slowness of physical movement (bradykinesia), impaired balance and coordination, and, in advanced stages, a loss of physical movement (akinesia). Over one million Americans suffer from Parkinson's disease, with the prevalence of approximately 1 in 272 or 0.37% in the United States (US Census Bureau, Population Estimates, 2004).
  • There is no known cure for PD. Patients are treated with drugs and physical therapy to control the symptoms, but the disease is a progressive disorder and symptoms continue to worsen throughout life. Four major classes of drugs are used to treat PD: Levodopa, direct dopamine agonists, catechol-O-methyltransferase (COMT) inhibitors and anticholinergics. Other types of drugs include selegiline (an MAO-B inhibitor), amantadine (an antiviral agent), vitamin E and hormone replacement therapy. Although these treatments may provide relief from the symptoms of PD, these noncurative drug treatments are often are accompanied by side effects, such as low blood pressure, nausea, constipation, and various psychiatric or behavioral disorders (e.g., hallucinations, depression, and sleep disorders).
  • While the molecular bases for PD have not been fully elucidated, several genetic regions have been found to be associated with PD. The PARK1 region at 4q21 contains the alpha-synuclein (SNCA) gene. Certain mutations in this gene confer a rare autosomal dominant form of PD (Duvoisin, R. C. (1996), Recent advances in the genetics of Parkinson's disease, Adv Neurol 69:33-40; Polymeropoulos et al. (1997) Mutation in the alpha-synuclein gene identified in families with Parkinson's disease, Science 276:2045-7; and Kruger et al. (1998) Ala30Pro mutation in the gene encoding alpha-synuclein in Parkinson's disease, Nat Genet 18:106-108). The PARK2 region at 6q25-27 contains the parkin gene. The loss of function of both copies of the parkin gene confers an autosomal recessive juvenile form of PD (Abbas et al. (1999) A wide variety of mutation in the parkin gene are responsible for autosomal recessive parkinsonism in Europe, Hum Mol Genet 8:567-574; Lucking et al. (1998) Homozygous deletions in the parkin gene in European and North African families with autosomal recessive juvenile parkinsonism, Lancet 352:1355-1356; and Lucking et al. (2000) Association between early-onset Parkinson's disease and mutations in the parkin gene, N Engl J Med 342:1560-1567). Other regions believed to contain one or more genes associated with PD include PARK3 at 2p13 (autosomal dominant), PARK4 at 4p15 (autosomal dominant; same locus as PARK1), PARK5 at 4p14 (which contains a gene encoding a neuron-specific C-terminal ubiquitin hydrolase), PARK6 at 1p35 (autosomal recessive), PARK7 at 1p36 (which contains the DJ-1 gene; autosomal recessive) and PARK8 at 12p1.2-q13.1 (which contains the LRRK2 gene; autosomal dominant). Additional loci designated PARK9, PARK10, PARK11, PARK12, PARK13, PARK14, PARK15, and PARK16 have also been linked to PD. While the molecular bases for most cases of PD are unclear, the various genetic regions that have been linked to this disease serve to illustrate the potential that the etiology of PD may involve the interaction of a large number of genetic components.
  • SUMMARY OF THE INVENTION
  • The present application provides compositions, methods and systems for determining increased or decreased risk or susceptibility of an individual to developing Parkinson's disease (PD). In one aspect, the application provides nucleic acid sequences that may be used to determine the presence or absence of nucleotides at polymorphic sites in an individual's RNA or genomic DNA that are associated with susceptibility to or protection from PD. In another aspect, the application provides a method for identifying a human subject having an increased or decreased susceptibility to PD, including the following steps: 1) obtaining a nucleic acid sample from a patient; and ii) detecting in the sample the identity of nucleotide or nucleotides at one or more polymorphic nucleotide positions listed in Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9).
  • In an additional aspect, methods of identifying a modulator of a PD phenotype are also provided. The methods include contacting a potential modulator to a gene or gene product, e.g. wherein the gene or gene product comprises or is closely linked to a polymorphism described herein (e.g. in Table 1-1 (SEQ ID NO: 1-8) and/or Table 2-1 (SEQ ID NO: 9)). An effect of the potential modulator on the gene or gene product is detected, thereby identifying whether the potential modulator modulates the phenotype.
  • Kits for performing any of the methods herein are another feature of the disclosure in this application. Such kits can include probes or amplicons for detecting any polymorphism herein, appropriate packaging materials, and instructions for practicing the methods.
  • The application also provides systems for generating a prognosis of a human subject's increased or decreased susceptibility to PD based on genotyping data spanning hundreds of thousands of single nucleotide polymorphisms (SNPs). The system may include means for storing a subject's profile comprising a set of patient-specific information including a subject's medical history, the family medical history and the subject's genetic testing results including genotypes at the various polymorphic sites listed in Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9).
  • The invention also provides methods of PD prognosis based on expression profiling. Such methods include determining the expression levels of at least 2 and no more than 5,000 genes in a subject, wherein at least two of the genes are selected from the group consisting of MCCC1, TMEM175, RIT2, GAK, DGKQ, RIN, SYT4, STBD1, SCARB2, HLA-DRB1, HLA-DQA1, LOC729862, PGDB3P2 and LRKK2, wherein the expression levels are used to create an expression profile. In one aspect the expression levels are used for PD prognosis by comparing expression levels of the genes in a human subject to expression levels of the genes in a control human subject known to have Parkinson's disease (PD) and/or a healthy control human subject known to not have PD, wherein similarity of expression profiles in the subject and the control subject having PD is suggestive that the subject has a higher likelihood of having or developing Parkinson's disease, and similarity of the expression profiles of the subject and a control subject not having PD-is suggestive that the subject has a lower likelihood of having or developing Parkinson's disease.
  • Any of the above methods can include informing the subject or a relative thereof of the presence of or susceptibility to PD; or further comprising administering a treatment regimen effective to treat or effect prophylaxis of PD in said subject.
  • DEFINITIONS
  • Terms and symbols of genetics, molecular biology, biochemistry and nucleic acid used herein follow those of standard treatises and texts in the field, e.g. Kornberg and Baker, DNA Replication, Second Edition (W. H. Freeman, New York, 1992); Lehninger, Biochemistry, Second Edition (Worth Publishers, New York, 1975); Strachan and Read, Human Molecular Genetics, Second Edition (Wiley-Liss, New York, 1999); Eckstein, editor, Oligonucleotides and Analogs: A Practical Approach (Oxford University Press, New York, 1991); Gait, editor, Oligonucleotide Synthesis: A Practical Approach (IRL Press, Oxford, 1984); and the like.
  • All terms are to be understood with their typical meanings established in the relevant art. Without limiting any term, further clarifications for some of the terms are provided below:
  • The term “nucleic acid” refers to a deoxyribonucleotide or ribonucleotide, whether singular or in polymers, naturally occurring or non-naturally occurring, double-stranded or single-stranded, coding (e.g. translated gene) or non-coding (e.g. regulatory region), or any fragments, derivatives, mimetics or complements thereof. Examples of nucleic acids include oligonucleotides, nucleotides, polynucleotides, nucleic acid sequences, genomic sequences, antisense nucleic acids, DNA regions, probes, primers, genes, regulatory regions, introns, exons, open-reading frames, binding sites, target nucleic acids and allele-specific nucleic acids. A nucleic acid can include one or more polymorphisms, variations or mutations (e.g., SNPs, insertions, deletions, inversions, translocations, etc.). A nucleic acid includes analogs (e.g., phosphorothioates, phosphoramidates, methyl phosphonate, chiral-methyl phosphonates, 2-O-methyl ribonucleotides) or modified nucleic acids (e.g., modified backbone residues or linkages) or nucleic acids that are combined with carbohydrates, lipids, protein or other materials, or peptide nucleic acids (PNAs) (e.g., chromatin, ribosomes, transcriptosomes, etc.) or nucleic acids in various structures (e.g., A DNA, B DNA, Z-form DNA, siRNA, tRNA, ribozymes, etc.). A nucleic acid may also include a detectable label. The term “detectable label” as used herein refers to, for example, a luminescent label, a light scattering label or a radioactive label, or any other form of labeling that can be detected by a physical, chemical, or a biological process. Fluorescent labels include commercially available fluorescein phosphoramidites such as Fluoreprime (Pharmacia), Fluoredite (Millipore) and FAM (ABI).
  • The term “hybridization” as used herein refers to the process in which two single-stranded polynucleotides bind non-covalently to form a stable double-stranded polynucleotide; triple-stranded hybridization is also theoretically possible. Hybridizations are usually performed under stringent conditions. For example, conditions of 5× SSPE (750 mM NaCl, 50 mM NaPhosphate, 5 mM EDTA, pH 7.4) and a temperature of 25-30° C. are suitable for allele-specific probe hybridizations. The term “specific hybridization” refers to the ability of a first nucleic acid to bind, duplex or hybridize to a second nucleic acid in a manner such that the second nucleic acid can be identified or distinguished from other components of a mixture (e.g., cellular extracts, genomic DNA, etc.). In certain embodiments, specific hybridization is performed under stringent conditions.
  • The term “hybridization-based assay” means any assay that relies on the formation of a stable duplex or triplex between a probe and a target nucleotide sequence for detecting or measuring such a sequence. Hybridization-based assays include, without limitation, assays based on use of oligonucleotides, such as polymerase chain reactions, oligonucleotide ligation reactions, single-base extensions of primers, circularizable probe reactions, allele-specific oligonucleotide hybridizations, either in solution phase or bound to solid phase supports, such as microarrays or microbeads.
  • The terms “isolated” and “purified” refer to a material that is substantially or essentially removed from or concentrated in its natural environment. For example, a purified nucleic acid is one that is separated from the nucleic acids that normally flank it or from other biological materials (e.g., other nucleic acids, proteins, lipids, cellular components, etc.) in a sample.
  • The term “linkage disequilibrium” refers to the preferential segregation of a particular polymorphic form with another polymorphic form at a different chromosomal location more frequently than expected by chance.
  • The term “modulate” refers to a change in expression, lifespan, function or activity of a nucleic acid or a protein. Such changes may include, for example, an increase, decrease, alteration, enhancement or inhibition of expression or activity of a nucleic acid or protein.
  • The term “PD-related nucleic acid” refers to a nucleic acid, or fragment, derivative (e.g., RNA), variant, polymorphism, or complement thereof, associated with resistance or susceptibility to PD including, for example, at least one or more PD-associated polymorphisms, genomic regions spanning 10 kb immediately upstream and 10 kb immediately downstream of a PD-associated polymorphism, coding and non-coding regions of an associated gene, and/or genomic regions spanning 10 kb immediately upstream and 10 kb immediately downstream of an associated gene, and nucleotide variants thereof.
  • The term “phenotype” is a trait or collection of traits that is/are observable in an individual or population. The trait can be quantitative (a quantitative trait, or QTL) or qualitative. For example, susceptibility to Parkinson's disease is a phenotype that can be identified according to the methods and compositions of the application as described herein.
  • A “PD susceptibility phenotype” is a phenotype that displays a predisposition towards developing Parkinson's disease in an individual. A phenotype that displays a predisposition for PD, can, for example, show a higher likelihood that the disease will develop in an individual with the phenotype than in members of a relevant general population under a given set of environmental conditions (diet, physical activity regime, geographic location, etc.).
  • The terms “polymorphism”, “polymorphic nucleotide”, “polymorphic site” or “polymorphic nucleotide position” refer to a position in a nucleic acid that possesses the quality or character of occurring in several different forms. A nucleic acid may be naturally or non-naturally polymorphic, e.g., having one or more sequence differences (e.g., additions, deletions and/or substitutions) as compared to a reference sequence. A reference sequence may be based on publicly available information (e.g., the U.C. Santa Cruz Human Genome Browser Gateway (genome.ucsc.edu/cgi-bin/hgGateway) or the NCBI website (www.ncbi.nlm.nih.gov)) or may be determined by a practitioner of the present invention using methods well known in the art (e.g., by sequencing a reference nucleic acid). A nucleic acid polymorphism is characterized by two or more “alleles”, or versions of the nucleic acid sequence. Typically, an allele of a polymorphism that is identical to a reference sequence is referred to as a “reference allele” and an allele of a polymorphism that is different from a reference sequence is referred to as an “alternate allele”, or sometimes a “variant allele”. As used herein, the term “major allele” refers to the more frequently occurring allele at a given polymorphic site, and “minor allele” refers to the less frequently occurring allele, as present in the general or study population. The term “risk allele” as used herein refers to an allele of a genetic polymorphism associated with an increased risk for PD. The term “protective allele” as used herein refers to the allele associated with a decreased risk for PD.
  • The term “single nucleotide polymorphism” or “SNP” refers to a polymorphic site occupied by a single nucleotide, which is the site of variation between allelic sequences. The site is usually preceded by and followed by highly conserved sequences of the allele (e.g., sequences that vary in less than 1/100 or 1/1000 members of the populations). A single nucleotide polymorphism usually arises due to substitution of one nucleotide for another at the polymorphic site. Single nucleotide polymorphisms can also arise from a deletion of a nucleotide or an insertion of a nucleotide relative to a reference allele.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 depicts a Manhattan plot of the GWAS, showing the distribution of p-values along the genome, with chromosomes arranged along the X-axis, and all SNPs associated with p-values under 10e-7 represented with an “x”.
  • FIG. 2 depicts a quantile-quantile plot of the GWAS performed by the applicants. There is no evidence of population structure biasing the results, as the plot shows no inflation. The genomic inflation factor (Devlin and Roeder (1999) Genomic control for association studies, Biometrics 55:997-1004) was calculated to be 1.11.
  • FIG. 3 depicts plots of the p-values surrounding significant SNPs around the MCCC1/LAMP3 region. SNPs with a p-value under 10e-6 are shown.
  • FIG. 4 depicts plots of the p-values surrounding significant SNPs around the GAK/DGKQ/TMEM175 region. SNPs with a p-value under 10 e-6 are shown.
  • FIG. 5 depicts plots of the p-values surrounding significant SNPs around the RIT2 region. SNPs with a p-value under 10 e-6 are shown.
  • FIG. 6 depicts plots of the p-values surrounding significant SNPs around the SCARB2 region. SNPs with a p-value under10 e-6 are shown.
  • FIG. 7 depicts plots of the p-values surrounding significant SNPs around the HLA-DRB1 and HLA-DQA1regions. SNPs with a p-value under 10 e-6 are shown.
  • FIG. 8 depicts plots of the p-values surrounding significant SNPs around the LOC729862 region. SNPs with a p-value under 10e-6 are shown.
  • DETAILED DESCRIPTION OF THE INVENTION
  • The invention provides a set of novel polymorphisms associated with Parkinson's disease (PD). Identification of such polymorphisms is useful for the development and design of diagnostic or prognostic assays for PD. The polymorphisms may also have additional applications, including diagnostic and prognostic use in Parkinson's disease-related conditions, therapeutic treatments for Parkinson's disease, genetic linkage analysis and positional cloning.
  • Polymorphisms of the Invention
  • A genome-wide association study (GWAS) was performed to search for novel genetic variants associated with PD. Such studies have proven successful in identifying many hundreds of genetic associations to a wide range of diseases (Hirschhorn, J N (2009) Genomewide Association Studies—Illuminating Biologic Pathways, N Engl J Med 360: 1699-1701). Briefly, a GWAS is performed by collecting genome-wide SNP data on a large number of cases and controls and then testing each of the many (typically over 500,000) SNPs that were typed for significant frequency differences between cases and controls. A significant frequency difference is evidence that the SNP is associated with the disease, however due to the large multiple testing burden incurred by doing hundreds of thousands of tests, the association must be very significant to be considered true. Typical standards (employed by the applicants in the present application) require a p-value of under 1e-7 and replication of the association in an independent sample.
  • The GWAS performed by the applicants identified a total of 12 SNPs that were found to be independently and significantly associated with Parkinson's disease. Of these, 8 SNPs have never before been demonstrated to be associated with PD. In addition, 4 SNPs previously associated with PD replicated in the GWAS performed by the applicants. The presence of these 4 previously identified and well-known PD-associated SNPs in the set of significantly associated SNPs serves as supporting evidence for the validity of the study design and methodology. Finally, the GWAS performed by the applicants identified a novel PD-associated SNP modifying a known PD-associated mutation in the LRKK2 gene (LRRK2 G2019S, rs34637584).
  • Tables 1-1 (SEQ ID NO: 1-8) and 1-2 (SEQ ID NO: 1-8).
  • Tables 1-1 (SEQ ID NO: 1-8) and 1-2 (SEQ ID NO: 1-8) identify the 8 novel PD-associated SNPs that independently associated with PD. These SNPs were selected on the basis of fulfilling the following criteria: 1) a p-value under 1e-7 for association with PD; 2) replication in the National Institute of Neurological Disorders and Stroke (NINDS) Parkinson's disease dataset; 3) evidence of independent effect (significant after controlling for other SNPs in the list); and 4) no evidence of genotyping error.
  • SNP rs10513789 (SEQ ID NO: 1) is found in the following sequence: SEQ ID NO: 1: 5′-tgatggtttttcaattttgttatgttgata[t/g]gtactgcatgataccagattacaaacaggg-3′. The major allele is T, the minor allele is G, with the major allele being the risk allele. This association lies in an intron of MCCC1. Another candidate gene in this region is LAMP3, which has been found to be overexpressed in the brains of individuals with PD.
  • SNP rs6599389 (SEQ ID NO: 2) is found in the following sequence:
  • SEQ ID NO: 2:
    5'-cccgccgtctctgtctcctcactccccgcc[g/a]
    tttgttgctgcctcagcttctgttttcccc-3'.
  • The major allele is G, the minor allele is A, with the minor allele being the risk allele. This SNP lies in an intron of TMEM175. Other candidate genes in this region include GAK and DGKQ. GAK is a promising candidate gene as it is differentially expressed in Parkinson's disease in the substantia nigra (Grünblatt et al (2004) Gene expression profiling of parkinsonian substantia nigra pars compacta: alterations in ubiquitin-proteasome, heat shock protein, iron and oxidative stress regulated proteins, cell adhesion/cellular matrix and vesicle trafficking genes, J Neural Transm 111(12):1543-73).
  • SNP rs873785 (SEQ ID NO: 3) is found in the following sequence:
  • SEQ ID NO: 3:
    5'-tcgtgatcccccttcgtctttcaacacctc[a/g]
    gtgtttctcttcccggtgaacatctactcg-3'.

    The major allele is A, the minor allele is G, with the major allele being the risk allele. This SNP lies in an intron of GAK. Other candidate genes in this region include TMEM175 and DGKQ.
  • SNP rs11248060 (SEQ ID NO: 4) is found in the following sequence:
  • SEQ ID NO: 4:
    5'-cttgtgggttcacacatagctgtgaggaaa[c/t]
    tgaacagaccccctgaagcctttgtcagct-3'.

    The major allele is C, the minor allele is T, with the minor allele being the risk allele. This SNP lies in an intron of DGKQ. Other candidate genes in this region include TMEM175 and GAK.
  • SNP rs4130047 (SEQ ID NO: 5) is found in the following sequence:
  • SEQ ID NO: 5:
    5'-atcagaacgattctaggtgataatctttta[t/c]
    ggtcagagttttcaaaaatggcagaaacat-3'.

    The major allele is T, the minor allele is C, with the minor allele being the risk allele. SNP rs4130047 (SEQ ID NO: 5) lies near two possible candidate genes. The first candidate gene, RIT2, contains rs4130047 (SEQ ID NO: 5) in an intron. RIT2 is similar to RIN, a gene potentially implicated in calcium-mediated signaling within neurons. Rin expression in brain is widespread but variable, with moderate expression in the majority of neurons throughout the diencephalon, midbrain, and hindbrain. The SNP is also close to SYT4 (synaptotagmin IV), which is expressed in the brain. SYT4 expression levels in brain are highest in the hippocampus, with substantial levels also detected in the amygdala and thalamus.
  • SNP rs6812193 (SEQ ID NO: 6) is found in the following sequence:
  • SEQ ID NO: 6:
    5'-cctactaagttggggaaagctggatttgaa[c/t]
    cctggtctgtcaaaatccaaaggccatgtt-3'.

    The major allele is C, the minor allele is T, with the major allele being the risk allele. SNP rs6812193 (SEQ ID NO: 6) lies in an intron of STBD1. Another candidate gene in this region is SCARB2, which binds to GBA in mice (Reczek D. et al (2007) LIMP-2 Is a Receptor for Lysosomal Mannose-6-Phosphate-Independent Targeting of beta-Glucocerebrosidase, Cell November 16; 131(4):770-83). GBA is known to be associated with PD (E. Sidransky (2009) Multicenter Analysis of Glucocerebrosidase Mutations in Parkinson's Disease, NEJM, Volume 361:1651-1661).
  • SNP rs7451962 (SEQ ID NO: 7) is found in the following sequence:
  • SEQ ID NO: 7: 5′-gcgctttatttctaggaagggcaggaattc[g/a]gaatccctcatcctgtccccagctacctgt-3′. The major allele is G, the minor allele is A, with the minor allele being the risk allele. SNP rs7451962 (SEQ ID NO: 7) candidate genes include HLA-DRB1 and HLA-DQA1. This SNP helps to tag the HLA type DQA1*0102, which is a risk allele for multiple sclerosis.
  • SNP rs4397141 (SEQ ID NO: 8) is found in the following sequence: SEQ ID NO: 8: 5′-attggtcttttatactgtatttttaatata[c/t]cttttgtatgtttagacatacagatcttta-3′. The major allele C, the minor allele is T, with the major allele being the risk allele. The candidate genes include LOC729862, and PGBD3P2. LOC729862 is similar to Striatin, which is a protein involved in locomotor activity. Down regulation of striatin impairs the growth of dendrites as well as rat locomotor activity. (Castets et al. (2000) Zinedin, SG2NA, and striatin are calmodulin-binding, WD repeat proteins principally expressed in the brain. J. Biol. Chem. 275: 19970-19977). PGBD3P2 is annotated as a pseudogene of a gene derived from a transposable element.
  • Tables 2-1 (SEQ ID NO: 9) and 2-2 (SEQ ID NO: 9).
  • Tables 2-1 (SEQ ID NO: 9) and 2-2 (SEQ ID NO: 9) identify a novel PD-associated SNP rs11755699 (SEQ ID NO: 9) that modifies a known PD-associated mutation in the LRKK2 gene (LRRK2 G2019S, rs34637584). Among people who carry the LRKK2 G2019S mutation, people with the minor allele C at rs11755699 (SEQ ID NO: 9) have a lower risk of developing Parkinson's disease.
  • SNP rs11755699 (SEQ ID NO: 9) is found in the following sequence: SEQ ID NO: 9: 5′-tgtgtcacagattcaaagctcttcgttccc[t/c]gaagtcctgctactgtgacttggaactctc-3′. The major allele is T, the minor allele is C, with the minor allele being the protective allele. SNP rs11755699 (SEQ ID NO: 9) lies in an intron of SGK1, a gene known to be associated with cell survival in neurodegenerative disease (Stichel et al. (2005) Sgk1, a member of an RNA cluster associated with cell death in a model of Parkinson's disease, Eur J Neurosci. 21(2):301-316; and Schoenebeck et al. (2005) Sgk1, a cell survival response in neurodegenerative diseases, Mol Cell Neurosci. 30(2):249-264).
  • Table 3.
  • Table 3 lists 4 SNPs previously associated with PD that replicated in the GWAS performed by the applicants. The 4 replicated SNPs met the same selection criteria as the 8 novel SNPs identified by the applicants in Tables 1-1 (SEQ ID NO: 1-8) and 1-2 (SEQ ID NO: 1-8), and serve as supporting evidence for the validity of the study design and methodology.
  • It is to be understood that Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9) provide only a partial list of candidate genes associated with the novel PD-associated SNPs identified herein. Other candidate genes or genomic regions, include, for example, sequences spanning 10 kb immediately upstream and 10 kb immediately downstream of a PD-associated SNP, coding and non-coding regions of an associated gene, and/or genomic regions spanning 10 kb immediately upstream and 10 kb immediately downstream of an associated gene, and nucleotide variants thereof.
  • TABLE 1-1
    Novel PD-associated SNPs.
    Other candidate
    SNP ID Chromosome Locus genes Position Alleles MAF Score OR 95% CI
    rs10513789
    3 MCC1 LAMP3 184.242.767 T/G 0.2 9.27 0.801 [0.75-0.86]
    (SEQ ID
    NO: 1)
    rs6599389 4 TMEM175 GAK, DGKQ 929.113 G/A 0.073 7.41 1.316 [1.20-1.45]
    (SEQ ID
    NO: 2)
    rs873785 4 GAK TMEM175, 915.149 A/G 0.37 7.78 0.847 [0.80-0.90]
    (SEQ ID DGKQ
    NO: 3)
    rs11248060 4 DGKQ TMEM175, 954.359 C/T 0.13 7.10 1.220 [1.13-1.32]
    (SEQ ID GAK
    NO: 4)
    rs4130047 18 RIT2 SYT4 38.932.233 T/C 0.31 7.59 1.178 [1.11-1.25]
    (SEQ ID
    NO: 5)
    rs6812193 4 STBD1 SCARB2 77.418.010 C/T 0.36 7.48 0.857 [0.81-0.91]
    (SEQ ID
    NO: 6)
    rs7451962 6 HLA-DRB1 HLA-DQA1 32.690.413 G/A 0.442 6.937 1.15 [1.1-1.2]
    (SEQ ID
    NO: 7)
    rs4397141 5 LOC729862 PGBD3P2 29.647.646 C/T 0.0797 5.998 0.77 [0.69-0.86]
    (SEQ ID
    NO: 8)
    The SNPs listed in the table independently and significantly associated with Parkinson's disease.
    Position refers to NCBI Build 36.
    Alleles refer to major/minor alleles on forward strand, with the risk allele identified in bold. The major/minor allele order is based on allelic frequencies in the study cohort.
    MAF = minor allele frequency,
    Score = P-value of association test at −Log10 p-value,
    OR = odds ratio per copy of the minor allele,
    CI = confidence interval.
    Odds ratio > 1 means the minor allele is associated with Parkinson's disease.
  • TABLE 1-2
    Novel PD-associated SNPs.
    SNP ID Sequence (+)
    rs10513789 TGATGGTTTTTCAATTTTGTTATGTTGATA [T/G]
    (SEQ ID NO: GTACTGCATGATACCAGATTACAAACAGGG
    1)
    rs6599389 CCCGCCGTCTCTGTCTCCTCACTCCCCGCC [G/A]
    (SEQ ID NO: TTTGTTGCTGCCTCAGCTTCTGTTTTCCCC
    2)
    rs873785 TCGTGATCCCCCTTCGTCTTTCAACACCTC [A/G]
    (SEQ ID NO: GTGTTTCTCTTCCCGGTGAACATCTACTCG
    3)
    rs11248060 CTTGTGGGTTCACACATAGCTGTGAGGAAA[C/T]
    (SEQ ID NO: TGAACAGACCCCCTGAAGCCTTTGTCAGCT
    4)
    rs4130047 ATCAGAACGATTCTAGGTGATAATCTTTTA[T/C]
    (SEQ ID NO: GGTCAGAGTTTTCAAAAATGGCAGAAACAT
    5)
    rs6812193 CCTACTAAGTTGGGGAAAGCTGGATTTGAA[C/T]
    (SEQ ID NO: CCTGGTCTGTCAAAATCCAAAGGCCATGTT
    6)
    rs7451962 GCGCTTTATTTCTAGGAAGGGCAGGAATTC[G/A]
    (SEQ ID NO: GAATCCCTCATCCTGTCCCCAGCTACCTGT
    7)
    rs4397141 ATTGGTCTTTTATACTGTATTTTTAATATA[C/T]
    (SEQ ID NO: CTTTTGTATGTTTAGACATACAGATCTTTA
    8)
    Sequences are shown in the forward reference strand, with the risk allele identified in bold.
  • TABLE 2-1
    Novel PD-associated SNP rs111755699.
    Other
    candidate
    SNP ID Chromosome Locus genes Position Alleles MAF Score OR 95% CI
    rs11755699
    6 SGK1 134.640.940 T/C 0.0639 2.57 5.677 [1.67-19.27]
    (SEQ ID NO: 9)
    The SNP rs111755699 modifies a known PD-associated mutation in the LRKK2 gene.
    Position refers to NCBI Build 36. Alleles refer to major/minor alleles on forward strand, with the protective allele identified in bold. The major/minor allele order is based on allelic frequencies in the study cohort.
    MAF = Minor allele frequency,
    Score = P-value of association test at −Log10 p-value,
    OR = odds ratio per copy of the minor allele,
    CI = confidence interval.
    Odds ratio > 1 means the minor allele is associated with Parkinson's disease.
  • TABLE 2-2
    Novel PD-associated SNP rs111755699.
    SNP ID Sequence (+)
    rs11755699 TGTGTCACAGATTCAAAGCTCTTCGTTCCC[T/C]
    (SEQ ID GAAGTCCTGCTACTGTGACTTGGAACTCTC
    NO: 9)
    Sequence is shown in the forward reference strand, with the protective allele identified in bold.
  • TABLE 3
    SNPs previously associated with PD which replicated
    in the GWAS performed by the applicants.
    SNP ID Chromosome Locus Position Alleles MAF Score OR 95% CI
    rs3467584
    12 LRRK2 39.020.459 G/A 0.0029 20.81 5.788 [3.96-8.46]
    GBA N370S 1 GBA 153.472.258 T/C 0.0052 19.75 3.657 [2.77-4.83]
    rs356220 4 SNCA 90.860.363 C/T 0.38 18.94 1.293 [1.22-1.37]
    rs2316765 17 MAPT 41.268.235 T/C 0.21 11.50 0.777 [0.72-0.83]
    Position refers to NCBI Build 36.
    Alleles refer to major/minor alleles on forward strand. The major/minor allele order is based on allelic frequencies in the study cohort.
    MAF = Minor allele frequency,
    Score = P-value of association test at −Log10 p-value,
    OR = odds ratio per copy of the minor allele,
    CI = confidence interval.
    Odds ratio > 1 means the minor allele is associated with Parkinson's disease.
  • II. METHODS OF USE OF POLYMORPHISMS OF THE INVENTION
  • The present invention provides, in part, novel methods for identifying individuals at an increased risk for PD. The SNPs, alleles and associated genes and genomic regions identified in Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9) can be used to identify, isolate and amplify PD-related nucleic acids. Such nucleic acids can be useful for prognostics, diagnostics, prevention, treatment and further study of PD.
  • In one embodiment, the nucleic acids identified in Tables 1-2 (SEQ ID NO: 1-8) and 2-2 (SEQ ID NO: 9) can specifically hybridize to genomic regions associated with PD. Due to the duplex nature of DNA, it will be clear to one skilled in the art that sequence complementary to those provided in Tables 1-2 (SEQ ID NO: 1-8) to 2-2 (SEQ ID NO: 9) can also specifically hybridize to the genomic regions associated with PD, and are contemplated to be part of the instant invention. Nucleic acids provided in Tables 1-2 (SEQ ID NO: 1-8) and 2-2 (SEQ ID NO: 9) and their complementary sequences can, in some embodiments, specifically hybridize to a genomic sequence containing the polymorphisms identified in Tables 1-1 (SEQ ID NO: 1-8) to 2-1 (SEQ ID NO: 9).
  • In a preferred embodiment, the nucleic acids identified herein and those in linkage disequilibrium with them, are associated with an increased or decreased risk for PD. The nucleic acids in linkage disequilibrium with those disclosed herein usually occur in the same genes or with 100 kb of the same genes. A nucleic acid associated with increased risk for PD is one that contains the risk allele at the polymorphisms identified at Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9). A nucleic acid associated with decreased risk for PD is one that contains the protective allele at the polymorphisms identified at Tables 1-1 (SEQ ID NO: 1-8) and 2-1 (SEQ ID NO: 9).
  • Conditions for nucleic acid hybridization vary depending on the buffers used, length of nucleic acids, ionic strength, temperature, etc. The term “stringent conditions” for hybridization refers to the incubation and wash conditions (e.g., conditions of temperature and buffer concentration) that permit hybridization of a first nucleic acid to a second nucleic acid. The first nucleic acid may be perfectly (e.g. 100%) complementary to the second or may share some degree of complementarity, which is less than perfect (e.g., more than 70%, 75%, 85%, or 95%). For example, certain high stringency conditions can be used which distinguish perfectly complementary nucleic acids from those less complementary, even those having only a single base mismatch. High stringency, moderate stringency and low stringency conditions for nucleic acid hybridization are known in the art. Ausubel, F. M. et al., Current Protocols in Molecular Biology (John Wiley & Sons 1998), pages 2.10.1-2.10.16; 6.3.1-6.3.6. The exact conditions which determine the stringency of hybridization depend not only on ionic strength (e.g., 0.2.times.SSC, 0.1.times.SSC), temperature (e.g., room temperature, 42 C, 68 C) and the concentration of destabilizing agents such as formamide or denaturing agents such as SDS, but also on factors such as the length of the nucleic acid sequence, base composition, percent mismatch between hybridizing sequences and the frequency of occurrence of subsets of that sequence within other non-identical sequences. Equivalent conditions can be determined by varying one or more of these parameters while maintaining a similar degree of identity or similarity between the two nucleic acid molecules.
  • Probes and Primers.
  • The nucleic acids described herein can be used as probes and primers in various assays. The terms “probe” and “primer” refer to nucleic acids that hybridize, in whole or in part, in a sequence-specific manner to a complementary strand. Probes with a detectable label can be used in cloning of full-length cDNA or genomic DNA by screening cDNA or genomic DNA libraries.
  • In certain embodiments, the term “primer” refers to a single-stranded nucleic acid that can act as a point of initiation of template-directed DNA synthesis, such as in PCR. PCR reactions can be designed based on the human genome sequence and the associated genomic regions or polymorphisms. For example, where a polymorphism is located in an exon, the exon can be isolated and amplified using primers that are complementary to the nucleotide sequences at both ends of the exon. Similarly, where a polymorphism is located in an intron, the entire intron can be isolated and amplified using primers that are complementary to the nucleotide sequences at both ends of the intron.
  • In preferred embodiments, a probe or primer contains a region of at least about 10 contiguous nucleotides, preferably at least about 16 contiguous nucleotides, more preferably about 20 or about 30 or about 50 contiguous nucleotides, that can specifically hybridize to a complementary nucleic acid sequence. In addition, a probe or primer is preferably about 100 or fewer nucleotides, more preferably between 6 and 50 nucleotides, and more preferably between 12 and 30 nucleotides in length. In certain embodiments, a first portion of a probe or primer is perfectly complementary to a target nucleic acid, and a second portion of the probe or primer is not perfectly complementary to the target nucleic acid. In some aspects, the portion that is not perfectly complementary contains a binding site, e.g., for a polypeptide or another probe or primer.
  • Detection of PD-Related Nucleic Acids.
  • Detection of the presence or increased level of one or more nucleic acids, or fragments, derivatives, polymorphisms, variants or complements thereof, associated with resistance or susceptibility to PD-related disease can be used as a prognostic and diagnostic tool for PD.
  • In some aspects of the present application, a gene or a genomic region comprising or under the control of a PD-related nucleic acid may be differentially expressed. “Differential expression” as used herein refers to both quantitative and qualitative differences in a gene's expression patterns including, e.g., changes in tissue-specificity or temporal aspects of expression. Detection of such expression patterns may be made by standard techniques well known to those of skill in the art, for example, differential screening (Tedder et al. (1998) Proc. Natl. Acad. Sci. USA 85:208-212), subtractive hybridization (Hedrick et al. (1984) Nature 308:149-153; Lee et al. (1984) Proc. Natl. Acad. Sci. USA 88:2825), differential display (Liang et al., U.S. Pat. No. 5,262,311), reverse transcriptase-(RT-) PCR and/or Northern analysis.
  • In other aspect of the present application, a gene comprising, or under the control of a PD-related nucleic acid may exhibit differential allelic expression. “Differential allelic expression” as used herein refers to both qualitative and quantitative differences in the allelic expression of multiple alleles of a single gene present in a cell. As such, a gene displaying differential allelic expression may have one allele expressed at a different time or level as compared to a second allele in the same cell/tissue. Differential allelic expression and analysis methods thereof are disclosed in detail in U.S. patent application Ser. No. 10/438,184, filed May 13, 2003 and U.S. patent application Ser. No. 10/845,316, filed May 12, 2004, incorporated into this application in their entirety herein.
  • Detection of nucleic acids may be made using any method known in the art, for example, Southern or northern analyses, in situ hybridization analyses, single-stranded conformational polymorphism analyses, polymerase chain reaction analyses and nucleic acid microarray analyses, all of which are well known to those of skill in the art. Such analyses may reveal not only the alleles present in a test sample, but also both quantitative and qualitative aspects of the expression pattern of polypeptides encoded by PD-related nucleic acids. In particular, such analyses may reveal expression patterns of polypeptides associated with resistance or susceptibility to PD-related disease.
  • In one example, a diagnosis or prognosis is made using a test sample containing genomic DNA or RNA obtained from an individual to be tested. The individual can be an adult, child or fetus. In a preferred embodiment, the individual is a human. The terms “individual”, “subject” and “patient” are used interchangeably in the present application. The test sample can be from any source which contains genomic DNA or RNA, including for example, blood, amniotic fluid, cerebrospinal fluid, skin, muscle, buccal or conjunctival mucosa, placenta, gastrointestinal tract or other tissues. In a preferred embodiment a DNA or RNA sample is obtained from neuronal tissue. Alternatively, a test sample of DNA from fetal cells or tissue can be obtained by appropriate methods such as by amniocentesis or chorionic villus sampling. The test sample is subjected to one or more tests to identify the presence or absence of a nucleic acid of interest (e.g., a PD-related nucleic acid).
  • In one embodiment, the test sample is subjected to purification, isolation and/or amplification techniques, many of which are well known in the art (e.g., Sambrook et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, N.Y.; and Ausubel et al. (eds. 1987-1993), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. New York).
  • In one embodiment, Southern blot, northern blot or similar analyses methods are used to identify the presence or absence of one or more genomic DNA sequences associated with resistance or susceptibility to PD using complementary nucleic acid probes. In certain embodiments, the nucleic acid probes are labeled before they are contacted with the test sample; in other embodiments, the nucleic acids in the test sample are labeled before they are contacted with the nucleic acid probes.
  • Alternative diagnostic and prognostic methods employ amplification of target nucleic acids associated with resistance or susceptibility to PD, e.g., by PCR. This is especially useful for target nucleic acids present in very low quantities. In one embodiment, amplification of target nucleic acids associated with resistance to PD indicates their presence and is a prognostic and diagnostic of resistance to PD. In a related embodiment, amplification of target nucleic acids associated with susceptibility to PD indicates their presence and is a prognostic and diagnostic of susceptibility to PD.
  • Microarrays can also be utilized for diagnosis and prognosis of resistance or susceptibility to PD-related disease. Microarrays comprise probes that are complementary to target nucleic acid sequences from an individual. A microarray probe is preferably allele-specific. In one embodiment, the microarray comprises a plurality of different probes, each coupled to a surface of a substrate in different known locations and each, capable of binding complementary strands. See, e.g. U.S. Pat. No. 5,143,854 and PCT Publication Nos. WO 90/15070 and WO 92/10092. These microarrays can generally be produced using mechanical synthesis methods or light-directed synthesis methods that incorporate a combination of photolithographic methods and solid phase oligonucleotide synthesis methods. See Fodor et al., (1991) Science 251:767-777; and U.S. Pat. No. 5,424,186. Techniques for the mechanical synthesis of microarrays are described in, for example, U.S. Pat. No. 5,384,261.
  • Other methods to detect polymorphic nucleic acids include, for example, direct manual sequencing (Church and Gilbert, (1988) Proc. Natl. Acad. Sci. USA 81:1991-1995; Sanger, F. et al. (1977) Proc. Natl. Acad. Sci. USA 74:5463-5467; and U.S. Pat. No. 5,288,644); automated fluorescent sequencing; single-stranded conformation polymorphism assays; clamped denaturing gel electrophoresis; denaturing gradient gel electrophoresis (Sheffield, V. C. et al. (1981) Proc. Natl. Acad. Sci. USA 86:232-236), mobility shift analysis (Orita, M. et al. (1989) Proc. Natl. Acad. Sci. USA 86:2766-2770), restriction enzyme analysis (Flavell et al. (1978) Cell 15:25; Geever, et al. (1981) Proc. Natl. Acad. Sci. USA 78:5081); heteroduplex analysis; Tm-shift genotyping (Germer et al. (1999) Genome Research 9:72-78); kinetic PCR (Germer et al. (2000) Genome Research 10:258-266); chemical mismatch cleavage (Cotton et al. (1985) Proc. Natl. Acad. Sci. USA 85:4397-4401); RNase protection assays (Myers, R. M. et al. (1985) Science 230:1242); and use of polypeptides which recognize nucleotide mismatches, such as E. coli mutS protein.
  • The invention also provides methods of expression profiling by determining levels of expression profiling of one or more genes of the invention, i.e. MCCC1, TMEM175, RIT2, GAK, DGKQ, RIN, SYT4, STBD1, SCARB2, HLA-DRB1, HLA-DQA1, LOC729862, PGDB3P and LRKK2. The methods preferably determine expression levels of at least 2, 5, or 10 or all of the above genes. Optionally, expression levels of other genes beyond those shown to be associated with PD in the present application are also determined. The polymorphisms listed in Tables 1-2 (SEQ ID NO: 1-8) and 2-2 (SEQ ID NO: 9) may have an effect on the expression profile of several other genes.
    • See http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1000888
  • PD Prognosis by Expression Profiling
  • The expression levels of one or more genes in discrete sample (e.g. from a particular individual or cell line) are referred to as an expression profile. Typically, the expression profile is compared with an expression profile of the same genes in a control sample. The control sample can be a negative control (e.g., an individual (or population of individuals) not having or susceptible to PD or a positive control (e.g., an individual (or population of individuals) having or susceptible to PD. The controls can be contemporaneous or historical. Individual expression levels in both the test and control samples can be normalized before comparison, e.g., by reference to the levels of a house keeping genes to avoid differences unrelated to the disease. The relative similarity of the expression profile of a test individual to the negative and positive control expression profiles is a measure of the individual's resistance or susceptibility to PD. For example, if an expression profile is determined for seven genes of the invention, and the expression levels in the test subject are more similar to the positive control than the negative control for five of the genes, one can conclude that the test individual has or is susceptible to PD or PD-related disorder. The analysis can be performed at a more sophisticated level by weighting expression level according to where they lie between negative and positive controls. For example, if there is a large difference between negative and positive controls, and an expression level of a particular gene in a test individual lies close to the positive control that expression level is accorded greater weight than an expression level in a gene in which there is a smaller difference in expression levels between negative and positive controls, and the expression level of the test individual lies only slightly above the midpoint of the negative and positive control expression levels.
  • Systems for Prognosis of PD
  • Systems for correlating the presence or absence of an allele with an increased or decreased susceptibility to PD are another aspect of the invention. The system will include instructions to compare detected information as to allele sequence with a database that includes correlations between the alleles and the incidence of PD. In another aspect, the system will also include a database containing allelic information for control subjects that have PD, and control subjects that do not have PD. A typical system will further include a database comprising detailed phenotypic information for a subject, medical history of the subject, family medical history and other lifestyle specific information for the subject. The system will also include a set of instructions to analyze the subject's risk of developing PD or identify the presence of PD in the subject based on the genetic profile of the subject and other measurable characteristics as stated above. In another aspect, the system will include links to external databases to allow a user to communicate results of the diagnosis or prognosis to a medical practitioner for effecting suitable treatment for the subject.
  • Screening for Small Molecules.
  • Agents that modulate the expression, function and/or activity of PD-related disease nucleic acids or polypeptides can be obtained using any of the numerous approaches in combinatorial library methods known in the art, including: biological libraries; natural products libraries; spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the ‘one-bead one-compound’ library method; and synthetic library methods using affinity chromatography selection. The biological library approach is largely limited to polypeptide libraries, while the other four approaches are applicable to polypeptide, non-peptide oligomer or small molecule libraries of compounds. See Lam, K. S. (1997) Anticancer Drug Des. 12:145.
  • Non-peptide agents or small molecules are generally preferred because they are more readily absorbed after oral administration and have fewer potential antigenic determinants. Small molecules are also more likely to cross the blood brain barrier than larger protein-based pharmaceuticals. Methods for screening small molecule libraries for candidate protein-binding molecules are well known in the art and may be employed to identify molecules that modulate (e.g., through direct or indirect interaction) one or more of the PD-related disease polypeptides herein. Briefly, PD-related disease polypeptides may be immobilized on a substrate and a solution including the small molecules is contacted with the PD-related disease polypeptide under conditions that are permissive for binding. The substrate is then washed with a solution that substantially reflects physiological conditions to remove unbound or weakly bound small molecules. A second wash may then elute those compounds that are bound strongly to the immobilized polypeptide. Alternatively, the small molecules can be immobilized and a solution of PD-related disease polypeptides can be contacted with the column, filter or other substrate on which the small molecules are immobilized. The ability to detect binding of a PD-related disease polypeptide to a small molecule may be facilitated by labeling (e.g., radio-labeling or chemiluminescence) the polypeptide or small molecule.
  • Systems for screening modulators are also a feature of the application. The systems, genes linked to a polymorphism herein, or an encoded expression products of the gene. can include, e.g. The systems will typically include a detector that measures increased or decreased expression of the gene or gene product in the presence of the modulator; increased or decreased activity of the gene product in the presence of the modulator; or an altered expression pattern of the gene or gene product in the presence of the modulator. The systems can also include fluid handling elements for mixing and aliquotting modulator and/or the gene or product, mixing them, performing laboratory operations (e.g., purification, synthesis, cell culture, etc.). System instructions for recording modulator effects and, optionally, for selecting modulators are also an optional feature of these systems.
  • III. EXAMPLES Study Design and Statistical Analysis
  • A. Cohort. The subjects with Parkinson's disease were recruited via a series of emails to the mailing lists of Parkinson's disease foundations and clinics. Emails were sent to all patients who had registered with these foundations as PD patients. A limited number of patients were also recruited in person at PD workshops and conferences. All patients were offered the 23andMe Personal Genome Service for $25. Over 75% of enrollees filled out an initial web-based survey that asked detailed questions about diagnosis and symptoms. The criteria for selection were that they were diagnosed with PD by a doctor and did not later change diagnosis. Most cases fit the clinical definitions of PD, including unilateral, gradual onset and response to dopaminurgic therapy. Controls were taken from the 23andMe customer database. All customers who did not say they were affected with PD were used as controls. This is reasonable since the prevalence of PD is under 2% in the general population, well under the typical misdiagnosis rate for PD. The final dataset included 3227 cases and 30,817 controls. The replication group consisted of 939 Parkinson's disease cases and 802 controls collected by NINDS and downloaded from dbGaP (http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000089.v3.p2).
  • B. Genotyping and SNP Quality Control. Samples were genotyped on the Illumina HumanHap550+ BeadChip platform (Illumina, Inc., San Diego, Calif. 92121), which included SNPs from the standard HumanHap550 panel in addition to a custom set of about 25,000 SNPs. A total of 609695 SNPs were genotyped on the 550+ plus custom platform. After quality control a total of 554197 SNPs were used in the analysis.
  • DNA extraction and genotyping were performed on saliva samples by National Genetics Institute (NGI, Los Angeles, Calif. 90064), a CLIA licensed clinical laboratory and a subsidiary of Laboratory Corporation of America. Every sample that failed to reach 98.5% call rate was re-analyzed. Individuals whose analyses failed repeatedly were re-contacted to provide additional samples. SNPs with a call rate under 98% were excluded from analysis, as were those with minor allele frequency under 0.5% or a p-value for Hardy-Weinberg equilibrium under 1e-10. Both minor allele frequency and Hardy-Weinberg statistics were calculated within the dataset. Due to the two slightly different platforms in the analysis the no-call rate was calculated only among individuals genotyped on a given platform. In addition, a total of 1553 SNPs with Mendelian discordance rates (the fraction of trios in which the called SNPs followed an impossible inheritance pattern) of at least 1% were discarded. Since cases and controls for this study were run in different batches, about 1000 additional SNPs were discarded due to the possibility of batch effects.
  • C. Statistical Methods. All p-values were calculated using logistic regression using the formula PD status˜Genotype at SNP i+Sex+Age+PC1+PC2+PC3+PC4+PC5. Phenotype was coded as 1=PD patient, 0=unaffected. Genotypes were coded as 0, 1, and 2, counting the number of minor alleles present. Sex was coded as female=1, male=0. PC1 through PC5 are the first five principle components of genetic variation among all subjects. The reported odds ratio (OR) for each SNP is thus the odds per copy of the minor allele of that SNP present. The log of the odds ratio is the coefficient of the genotype in the above regression. For each novel association, the set of nearby SNPs was evaluated (FIGS. 3-6). In each case, the presence of signal in other nearby SNPs provides substantial evidence that these associations are not due to genotyping error.
  • D. Population Structure. Identity by descent (IBD) was measured for all pairs of participants using a novel algorithm, described in detail in pending application titled “Finding Relatives In a Database”, application Ser. No. 12/644,791, and incorporated by reference into this application in its entirety herein. The aforementioned novel algorithm acts on unphased data by comparing homozygous calls in a window. A set of “unrelated” participants was defined by requiring that no two individuals share over 700 cM IBD, counting both full (diploid) and half (haploid) levels of identity by descent. This level of relatedness (approximately 20% of the genome) corresponds approximately to the minimal expected sharing between first-cousins in an outbred population. Only unrelated (in the aforementioned sense) individuals from this subset were considered during the present analysis.
  • To correct for possible confounding due to ancestry, a subset of individuals having European ancestry was selected using multi-dimensional scaling (MDS) and a collection of ancestry informative markers. There was only very slight inflation present in the statistics (genomic control inflation factor of λ≈1.04, (Devlin and Roeder (1999) Genomic control for association studies, Biometrics 55(4): 997-1004). This indicates very high confidence that these associations are not just due to population. See FIG. 2, where the close concordance of p-values to expected shows no evidence of systematic bias.
  • E. LRKK2 modifying SNP rs11755699 (SEQ ID NO: 9). The LRRK2 G2019S mutation (rs34637584) is known to raise an individual's risk of developing PD by over 20-fold (Healy et al. (2008) Phenotype, genotype, and worldwide genetic penetrance of LRRK2-associated Parkinson's disease: a case-control study, Lancet Neurol. 7(7):583-590). Applicants conducted a genome-wide association study (GWAS) using a population of individuals identified as carriers of this mutation in the 23andMe database in order to find potential genetic modifiers (23andMe, Mountain View, Calif. 94040).
  • The GWAS was conducted in 60 people over the age of 50 who carried the LRRK2 G2019S mutation. Although by typical standards this may appear to be a small set in terms of a population size for a GWAS, the cohort represents one of the largest collections of G2019S positive people in the world, as the G2019S mutation is quite rare. The most significant association found in this analysis was the mutation rs11755699 (SEQ ID NO: 9), which lies in an intron of SGK1. It had a p-value (using Fisher's exact test) of under 10̂−5.5, and an odds ratio of over 42 for not developing PD if one carried the C allele (T being the more common allele). Of the 20 people with G2019S and without PD, 10 of them had the C allele, as compared to the 0 or 1 that would be expected.
  • While this association wasn't significant genome-wide, it was the most significant among all the SNPs tested. Furthermore, rs11755699 (SEQ ID NO: 9) is intronic in SGK1 gene which is known to be associated with cell survival in neurodegenerative diseases such as PD (Stichel et al. (2005) Sgk1, a member of an RNA cluster associated with cell death in a model of Parkinson's disease, Eur J Neurosci. 21(2):301-316; and Schoenebeck et al. (2005) Sgk1, a cell survival response in neurodegenerative diseases, Mol Cell Neurosci. 30(2):249-264.
  • The descriptions provided above are exemplary methods of carrying out the invention and are not intended to limit the scope of the invention in any way. One of ordinary skill in the art will appreciate alternates or modifications that may be made without departure from the breadth and the scope of the invention as set forth in the claims below.

Claims (7)

1. A method for screening a human subject with a LRKK2 G2019S mutation for susceptibility to PD, the method comprising:
obtaining a nucleic acid sample from the human subject; and
determining which allele is present in the sample at the polymorphic nucleotide position of SNP rs11755699 (SEQ ID NO: 9), wherein presence of C at the polymorphic nucleotide position is indicative of a lower risk of developing PD.
2. A purified nucleic acid molecule that specifically hybridizes to a PD-related nucleic acid, for use in diagnostics, prognostics, prevention, treatment or study of PD, wherein said PD-related nucleic acid contains a base at one or more of the polymorphic nucleotide positions identified in Tables 1-2 (SEQ ID NO: 1-8) or 2-2 (SEQ ID NO: 9).
3. A purified nucleic acid molecule of claim 2 that specifically hybridizes to at least 16 contiguous nucleotides of said PD-related nucleic acid.
4. A purified nucleic acid molecule of claim 2 further comprising a detectable label.
5. A method for treating or preventing the development of PD, comprising administering to a subject suffering from or identified at risk for PD an agent that modulates expression or activity of a protein further modulated by a PD-related nucleic acid containing a base at one or more of the polymorphic nucleotide positions identified in Tables 1-2 (SEQ ID NO: 1-8) or 2-2 (SEQ ID NO: 9).
6. A method of identifying a modulator of a PD phenotype, the method comprising contacting a potential modulator to a gene or gene product, wherein the gene or gene product comprises or is associated with or regulated by a PD-related nucleic acid containing a base at one or more of the polymorphic nucleotide positions identified in Tables 1-2 (SEQ ID NO: 1-8) or 2-2 (SEQ ID NO: 9) and, detecting an effect of the potential modulator on the gene or gene product, thereby identifying whether the potential modulator modulates the PD phenotype.
7. The method of claim 6, wherein the effect is selected from: (a) increased or decreased expression of the gene or gene product in the presence of the modulator; (b) increased or decreased activity of the gene product in the presence of the modulator; and (c) an altered expression pattern of the gene or gene product in the presence of the modulator.
US13/452,341 2009-11-30 2012-04-20 Polymorphisms associated with parkinson's disease Abandoned US20120270794A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/452,341 US20120270794A1 (en) 2009-11-30 2012-04-20 Polymorphisms associated with parkinson's disease

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US26530409P 2009-11-30 2009-11-30
US35976910P 2010-06-29 2010-06-29
US12/956,525 US8187811B2 (en) 2009-11-30 2010-11-30 Polymorphisms associated with Parkinson's disease
US13/452,341 US20120270794A1 (en) 2009-11-30 2012-04-20 Polymorphisms associated with parkinson's disease

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US12/956,525 Continuation US8187811B2 (en) 2009-11-30 2010-11-30 Polymorphisms associated with Parkinson's disease

Publications (1)

Publication Number Publication Date
US20120270794A1 true US20120270794A1 (en) 2012-10-25

Family

ID=44067156

Family Applications (2)

Application Number Title Priority Date Filing Date
US12/956,525 Expired - Fee Related US8187811B2 (en) 2009-11-30 2010-11-30 Polymorphisms associated with Parkinson's disease
US13/452,341 Abandoned US20120270794A1 (en) 2009-11-30 2012-04-20 Polymorphisms associated with parkinson's disease

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US12/956,525 Expired - Fee Related US8187811B2 (en) 2009-11-30 2010-11-30 Polymorphisms associated with Parkinson's disease

Country Status (3)

Country Link
US (2) US8187811B2 (en)
CA (1) CA2782207A1 (en)
WO (1) WO2011065982A2 (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014110481A2 (en) * 2013-01-11 2014-07-17 The Trustees Of Columbia University In The City Of New York Rab7l1 interacts with lrrk2 to modify intraneuronal protein sorting and parkinson' s disease risk
US9213944B1 (en) * 2012-11-08 2015-12-15 23Andme, Inc. Trio-based phasing using a dynamic Bayesian network
US9213947B1 (en) * 2012-11-08 2015-12-15 23Andme, Inc. Scalable pipeline for local ancestry inference
US10025877B2 (en) 2012-06-06 2018-07-17 23Andme, Inc. Determining family connections of individuals in a database
US10162880B1 (en) 2011-10-11 2018-12-25 23Andme, Inc. Cohort selection with privacy protection
US10275569B2 (en) 2007-10-15 2019-04-30 22andMe, Inc. Family inheritance
US10432640B1 (en) 2007-10-15 2019-10-01 23Andme, Inc. Genome sharing
US10854318B2 (en) 2008-12-31 2020-12-01 23Andme, Inc. Ancestry finder
US10936626B1 (en) 2011-11-23 2021-03-02 23Andme, Inc. Database and data processing system for use with a network-based personal genetics services platform
US11514627B2 (en) 2019-09-13 2022-11-29 23Andme, Inc. Methods and systems for determining and displaying pedigrees
US11621089B2 (en) 2007-03-16 2023-04-04 23Andme, Inc. Attribute combination discovery for predisposition determination of health conditions
US11783919B2 (en) 2020-10-09 2023-10-10 23Andme, Inc. Formatting and storage of genetic markers
US11817176B2 (en) 2020-08-13 2023-11-14 23Andme, Inc. Ancestry composition determination

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10522240B2 (en) 2006-05-03 2019-12-31 Population Bio, Inc. Evaluating genetic disorders
US7702468B2 (en) 2006-05-03 2010-04-20 Population Diagnostics, Inc. Evaluating genetic disorders
WO2012018387A2 (en) 2010-08-02 2012-02-09 Population Diagnotics, Inc. Compositions and methods for discovery of causative mutations in genetic disorders
CA2851388C (en) 2011-10-10 2023-11-21 The Hospital For Sick Children Methods and compositions for screening and treating developmental disorders
US11180807B2 (en) * 2011-11-04 2021-11-23 Population Bio, Inc. Methods for detecting a genetic variation in attractin-like 1 (ATRNL1) gene in subject with Parkinson's disease
WO2013120018A1 (en) 2012-02-09 2013-08-15 Population Diagnostics, Inc. Methods and compositions for screening and treating developmental disorders
DK2895621T3 (en) 2012-09-14 2020-11-30 Population Bio Inc METHODS AND COMPOSITION FOR DIAGNOSIS, FORECAST AND TREATMENT OF NEUROLOGICAL CONDITIONS
CA2922005A1 (en) 2012-09-27 2014-04-03 Population Diagnostics, Inc. Methods and compositions for screening and treating developmental disorders
US9599605B1 (en) 2013-03-15 2017-03-21 Boston Medical Center Corporation Parkinson's disease model and methods
EP4286517A3 (en) 2013-04-04 2024-03-13 President and Fellows of Harvard College Therapeutic uses of genome editing with crispr/cas systems
GB2558326B (en) 2014-09-05 2021-01-20 Population Bio Inc Methods and compositions for inhibiting and treating neurological conditions
KR20180020125A (en) 2015-03-27 2018-02-27 프레지던트 앤드 펠로우즈 오브 하바드 칼리지 Modified T cells and methods for their manufacture and use
US10907160B2 (en) 2016-01-05 2021-02-02 Ionis Pharmaceuticals, Inc. Methods for reducing LRRK2 expression
US10240205B2 (en) 2017-02-03 2019-03-26 Population Bio, Inc. Methods for assessing risk of developing a viral disease using a genetic test
US11913015B2 (en) 2017-04-17 2024-02-27 University Of Maryland, College Park Embryonic cell cultures and methods of using the same
US11332746B1 (en) 2018-06-27 2022-05-17 Ionis Pharmaceuticals, Inc. Compounds and methods for reducing LRRK2 expression
SI3625368T1 (en) 2018-08-08 2023-04-28 Pml Screening, Llc Methods for assessing the risk of developing progressive multifocal leukoencephalopathy caused by john cunningham virus by genetic testing
AU2019370896A1 (en) 2018-10-31 2021-06-17 Ancestry.Com Dna, Llc Estimation of phenotypes using DNA, pedigree, and historical data
US20230364102A1 (en) * 2020-09-10 2023-11-16 Duke University Compositions and methods for the treatment of neurological disorders
CN115537460A (en) * 2022-06-30 2022-12-30 中南大学湘雅医院 PD risk site detection kit, use method thereof and risk scoring method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2603550A1 (en) 2005-03-31 2006-10-05 The Board Of Trustees Of The Leland Stanford Junior University Compositions and methods for diagnosing and treating neuropsychiatric disorders
US7816083B2 (en) 2007-05-03 2010-10-19 Celera Corporation Genetic polymorphisms associated with neurodegenerative diseases, methods of detection and uses thereof

Cited By (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11621089B2 (en) 2007-03-16 2023-04-04 23Andme, Inc. Attribute combination discovery for predisposition determination of health conditions
US11791054B2 (en) 2007-03-16 2023-10-17 23Andme, Inc. Comparison and identification of attribute similarity based on genetic markers
US11735323B2 (en) 2007-03-16 2023-08-22 23Andme, Inc. Computer implemented identification of genetic similarity
US10516670B2 (en) 2007-10-15 2019-12-24 23Andme, Inc. Genome sharing
US11683315B2 (en) 2007-10-15 2023-06-20 23Andme, Inc. Genome sharing
US11171962B2 (en) 2007-10-15 2021-11-09 23Andme, Inc. Genome sharing
US11170873B2 (en) 2007-10-15 2021-11-09 23Andme, Inc. Genetic comparisons between grandparents and grandchildren
US11875879B1 (en) 2007-10-15 2024-01-16 23Andme, Inc. Window-based method for determining inherited segments
US10275569B2 (en) 2007-10-15 2019-04-30 22andMe, Inc. Family inheritance
US10999285B2 (en) 2007-10-15 2021-05-04 23Andme, Inc. Genome sharing
US10432640B1 (en) 2007-10-15 2019-10-01 23Andme, Inc. Genome sharing
US10841312B2 (en) 2007-10-15 2020-11-17 23Andme, Inc. Genome sharing
US10643740B2 (en) 2007-10-15 2020-05-05 23Andme, Inc. Family inheritance
US11803777B2 (en) 2008-03-19 2023-10-31 23Andme, Inc. Ancestry painting
US11625139B2 (en) 2008-03-19 2023-04-11 23Andme, Inc. Ancestry painting
US10296847B1 (en) 2008-03-19 2019-05-21 23Andme, Inc. Ancestry painting with local ancestry inference
US11531445B1 (en) 2008-03-19 2022-12-20 23Andme, Inc. Ancestry painting
US11049589B2 (en) 2008-12-31 2021-06-29 23Andme, Inc. Finding relatives in a database
US11657902B2 (en) 2008-12-31 2023-05-23 23Andme, Inc. Finding relatives in a database
US11031101B2 (en) 2008-12-31 2021-06-08 23Andme, Inc. Finding relatives in a database
US11776662B2 (en) 2008-12-31 2023-10-03 23Andme, Inc. Finding relatives in a database
US10854318B2 (en) 2008-12-31 2020-12-01 23Andme, Inc. Ancestry finder
US11935628B2 (en) 2008-12-31 2024-03-19 23Andme, Inc. Finding relatives in a database
US11322227B2 (en) 2008-12-31 2022-05-03 23Andme, Inc. Finding relatives in a database
US11468971B2 (en) 2008-12-31 2022-10-11 23Andme, Inc. Ancestry finder
US11508461B2 (en) 2008-12-31 2022-11-22 23Andme, Inc. Finding relatives in a database
US10891317B1 (en) 2011-10-11 2021-01-12 23Andme, Inc. Cohort selection with privacy protection
US11748383B1 (en) 2011-10-11 2023-09-05 23Andme, Inc. Cohort selection with privacy protection
US10162880B1 (en) 2011-10-11 2018-12-25 23Andme, Inc. Cohort selection with privacy protection
US10936626B1 (en) 2011-11-23 2021-03-02 23Andme, Inc. Database and data processing system for use with a network-based personal genetics services platform
US10025877B2 (en) 2012-06-06 2018-07-17 23Andme, Inc. Determining family connections of individuals in a database
US11170047B2 (en) 2012-06-06 2021-11-09 23Andme, Inc. Determining family connections of individuals in a database
US9213944B1 (en) * 2012-11-08 2015-12-15 23Andme, Inc. Trio-based phasing using a dynamic Bayesian network
US10658071B2 (en) 2012-11-08 2020-05-19 23Andme, Inc. Scalable pipeline for local ancestry inference
US9836576B1 (en) 2012-11-08 2017-12-05 23Andme, Inc. Phasing of unphased genotype data
US9977708B1 (en) 2012-11-08 2018-05-22 23Andme, Inc. Error correction in ancestry classification
US10755805B1 (en) 2012-11-08 2020-08-25 23Andme, Inc. Ancestry painting with local ancestry inference
US11521708B1 (en) 2012-11-08 2022-12-06 23Andme, Inc. Scalable pipeline for local ancestry inference
US10699803B1 (en) 2012-11-08 2020-06-30 23Andme, Inc. Ancestry painting with local ancestry inference
US9213947B1 (en) * 2012-11-08 2015-12-15 23Andme, Inc. Scalable pipeline for local ancestry inference
US10572831B1 (en) 2012-11-08 2020-02-25 23Andme, Inc. Ancestry painting with local ancestry inference
WO2014110481A3 (en) * 2013-01-11 2014-09-12 The Trustees Of Columbia University In The City Of New York Rab7l1 interacts with lrrk2 to modify intraneuronal protein sorting and parkinson' s disease risk
WO2014110481A2 (en) * 2013-01-11 2014-07-17 The Trustees Of Columbia University In The City Of New York Rab7l1 interacts with lrrk2 to modify intraneuronal protein sorting and parkinson' s disease risk
US11514627B2 (en) 2019-09-13 2022-11-29 23Andme, Inc. Methods and systems for determining and displaying pedigrees
US11817176B2 (en) 2020-08-13 2023-11-14 23Andme, Inc. Ancestry composition determination
US11783919B2 (en) 2020-10-09 2023-10-10 23Andme, Inc. Formatting and storage of genetic markers

Also Published As

Publication number Publication date
WO2011065982A3 (en) 2011-09-29
US20110130337A1 (en) 2011-06-02
WO2011065982A2 (en) 2011-06-03
US8187811B2 (en) 2012-05-29
CA2782207A1 (en) 2011-06-03

Similar Documents

Publication Publication Date Title
US8187811B2 (en) Polymorphisms associated with Parkinson's disease
US9896729B2 (en) Method for diagnosing a neurodegenerative disease
Proitsi et al. Alzheimer's disease susceptibility variants in the MS4A6A gene are associated with altered levels of MS4A6A expression in blood
US20090098557A1 (en) Identification of genetic markers associated with parkinson disease
WO2008112177A2 (en) Genemap of the human genes associated with schizophrenia
AU2012323345A1 (en) Single nucleotide polymorphisms useful to predict clinical response for glatiramer acetate
Bettens et al. Follow-up study of susceptibility loci for Alzheimer's disease and onset age identified by genome-wide association
EP1673473A1 (en) Use of genetic polymorphisms that associate with efficacy of treatment of inflammatory disease
WO2009094592A2 (en) Genetic basis of alzheimer's disease and diagnosis and treatment thereof
WO2008086579A1 (en) Diagnostic methods and agents
Szigeti et al. Ordered subset analysis of copy number variation association with age at onset of Alzheimer's disease
US20030170678A1 (en) Genetic markers for Alzheimer's disease and methods using the same
Harold et al. Sequence variation in the CHAT locus shows no association with late-onset Alzheimer's disease
Hoenicka et al. From dopaminergic genes to psychiatric disorders
US20050191652A1 (en) Identification of genetic forms of a gene that leads to high risk for parkinson disease
Karpyak et al. Sequence variations of the human MPDZ gene and association with alcoholism in subjects with European ancestry
EP3790994A1 (en) Method for predicting the risk of late-onset alzheimer's diseases
WO2010111080A2 (en) Optimized treatment of schizophrenia
US20060177860A1 (en) Genetic markers in the HLA-DQBI gene associated with an adverse hematological response to drugs
EP2147117A2 (en) Methods of diagnosing alzheimer's disease and markers identified by set association
CA3174572A1 (en) Compositions and methods for assessing the efficacy of inhibitors of neurotransmitter transporters
EP2137539B1 (en) Amiloride sensitive sodium channels associated with panic disorders
EP1255870B1 (en) 5-hydroxytryptamine transporter gene polymorphisms
EP1673472B1 (en) Use of genetic polymorphisms to predict drug-induced hepatotoxicity
Gardner Effects of Dysregulated RNA Editing on Alzheimer Disease Pathogenesis

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: EXPRESSLY ABANDONED -- DURING EXAMINATION