IEEE/ACM transactions on computational biology and bioinformatics, Jan 18, 2017
Biomolecular controlled annotations have become pivotal in computational biology, because they al... more Biomolecular controlled annotations have become pivotal in computational biology, because they allow scientists to analyze large amounts of biological data to better understand test results, and to infer new knowledge. Yet, biomolecular annotation databases are incomplete by definition, like our knowledge of biology, and might contain errors and inconsistent information. In this context, machine-learning algorithms able to predict and prioritize new annotations are both effective and efficient, especially if compared with time-consuming trials of biological validation. To limit the possibility that these techniques predict obvious and trivial high-level features, and to help prioritizing their results, we introduce a new element that can improve accuracy and relevance of the results of an annotation prediction and prioritization pipeline. We propose a novelty indicator able to state the level of "originality" of the annotations predicted for a specific gene to Gene Ontolog...
In frontotemporal dementia (FTD), age at disease onset (AAO) is unpredictable in both early and l... more In frontotemporal dementia (FTD), age at disease onset (AAO) is unpredictable in both early and late-onset cases; AAO variability is found even in autosomal dominant FTD. The present study was aimed at identifying genetic modifiers modulating AAO in a large cohort of Italian FTD patients. We conducted an association analysis on 411 FTD patients, belonging to 7 Italian Centers, and for whom AAO was available. Population structure was evaluated by principal component analysis to infer continuous axes of genetic variation, and single linear regression models were applied. A genetic score (GS) was calculated on the basis of suggestive single nucleotide polymorphisms (SNPs) found by association analyses. GS showed genome-wide significant slope decrease by -3.86 (95% CI: -4.64 to -3.07, p < 2×10-16) per standard deviation of the GS for 6 SNPs mapping to genes involved in neuronal development and signaling, axonal myelinization, and glutamatergic/GABA neurotransmission. An increase of t...
In this paper, we reconstructed the medical history of frontotemporal dementia (FTD) by reviewing... more In this paper, we reconstructed the medical history of frontotemporal dementia (FTD) by reviewing the literature and analyzing papers with the highest impact through citation index. Several research studies and groups involved in FTD have been reviewed. An increasing amount of knowledge has been made available in the last 20 years through a large number of publications, leading to a better definition of the genetic and clinical bases of the disease. A total of 1,436 references (articles and reviews), published in 395 journals, were retrieved through the Scopus database. The two highest publication peaks (i.e., largest number of publications) were found in 2000 and 2008. The most cited papers considering both total citation number and the number of citations within the first two years after publication refer to: (i) the genetic bases of FTD, (ii) the clinical criteria that progressively refined the different FTD phenotypes, and (iii) FTD epidemiology. Advanced neuroimaging techniques, genotype-phenotype heterogeneity, and animal models gave us a broader understanding of various aspects of the disorder. These findings confirm the great interest in FTD research. The analysis of the literature might help in guiding future goals in the field.
Improvement of sequencing technologies and data processing pipelines is rapidly providing sequenc... more Improvement of sequencing technologies and data processing pipelines is rapidly providing sequencing data, with associated high-level features, of many individual genomes in multiple biological and clinical conditions. They allow for data-driven genomic, transcriptomic and epigenomic characterizations, but require state-of-the-art "big data" computing strategies, with abstraction levels beyond available tool capabilities. We propose a high-level, declarative GenoMetric Query Language (GMQL) and a toolkit for its use. GMQL operates downstream of raw data preprocessing pipelines and supports queries over thousands of heterogeneous datasets and samples; as such it is key to genomic "big data" analysis. GMQL leverages a simple data model that provides both abstractions of genomic region data and associated experimental, biological and clinical metadata, and interoperability between many data formats. Based on Hadoop framework and Apache Pig platform, GMQL ensures hig...
Frontotemporal dementia (FTD) is the second most prevalent form of early onset dementia after Alz... more Frontotemporal dementia (FTD) is the second most prevalent form of early onset dementia after Alzheimer&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#39;s disease (AD). We performed a case-control association study in an Italian FTD cohort (n = 530) followed by the novel single nucleotide polymorphisms (SNPs)-to-genes approach and functional annotation analysis. We identified 2 novel potential loci for FTD. Suggestive SNPs reached p-values ∼10(-7) and odds ratio &amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt; 2.5 (2p16.3) and 1.5 (17q25.3). Suggestive alleles at 17q25.3 identified a disease-associated haplotype causing decreased expression of -cis genes such as RFNG and AATK involved in neuronal genesis and differentiation and axon outgrowth, respectively. We replicated this locus through the SNPs-to-genes approach. Our functional annotation analysis indicated significant enrichment for functions of the brain (neuronal genesis, differentiation, and maturation), the synapse (neurotransmission and synapse plasticity), and elements of the immune system, the latter supporting our recent international FTD-genome-wide association study. This is the largest genome-wide study in Italian FTD to date. Although our results are not conclusive, we set the basis for future replication studies and identification of susceptible molecular mechanisms involved in FTD pathogenesis.
IEEE/ACM transactions on computational biology and bioinformatics, Jan 18, 2017
Biomolecular controlled annotations have become pivotal in computational biology, because they al... more Biomolecular controlled annotations have become pivotal in computational biology, because they allow scientists to analyze large amounts of biological data to better understand test results, and to infer new knowledge. Yet, biomolecular annotation databases are incomplete by definition, like our knowledge of biology, and might contain errors and inconsistent information. In this context, machine-learning algorithms able to predict and prioritize new annotations are both effective and efficient, especially if compared with time-consuming trials of biological validation. To limit the possibility that these techniques predict obvious and trivial high-level features, and to help prioritizing their results, we introduce a new element that can improve accuracy and relevance of the results of an annotation prediction and prioritization pipeline. We propose a novelty indicator able to state the level of "originality" of the annotations predicted for a specific gene to Gene Ontolog...
In frontotemporal dementia (FTD), age at disease onset (AAO) is unpredictable in both early and l... more In frontotemporal dementia (FTD), age at disease onset (AAO) is unpredictable in both early and late-onset cases; AAO variability is found even in autosomal dominant FTD. The present study was aimed at identifying genetic modifiers modulating AAO in a large cohort of Italian FTD patients. We conducted an association analysis on 411 FTD patients, belonging to 7 Italian Centers, and for whom AAO was available. Population structure was evaluated by principal component analysis to infer continuous axes of genetic variation, and single linear regression models were applied. A genetic score (GS) was calculated on the basis of suggestive single nucleotide polymorphisms (SNPs) found by association analyses. GS showed genome-wide significant slope decrease by -3.86 (95% CI: -4.64 to -3.07, p < 2×10-16) per standard deviation of the GS for 6 SNPs mapping to genes involved in neuronal development and signaling, axonal myelinization, and glutamatergic/GABA neurotransmission. An increase of t...
In this paper, we reconstructed the medical history of frontotemporal dementia (FTD) by reviewing... more In this paper, we reconstructed the medical history of frontotemporal dementia (FTD) by reviewing the literature and analyzing papers with the highest impact through citation index. Several research studies and groups involved in FTD have been reviewed. An increasing amount of knowledge has been made available in the last 20 years through a large number of publications, leading to a better definition of the genetic and clinical bases of the disease. A total of 1,436 references (articles and reviews), published in 395 journals, were retrieved through the Scopus database. The two highest publication peaks (i.e., largest number of publications) were found in 2000 and 2008. The most cited papers considering both total citation number and the number of citations within the first two years after publication refer to: (i) the genetic bases of FTD, (ii) the clinical criteria that progressively refined the different FTD phenotypes, and (iii) FTD epidemiology. Advanced neuroimaging techniques, genotype-phenotype heterogeneity, and animal models gave us a broader understanding of various aspects of the disorder. These findings confirm the great interest in FTD research. The analysis of the literature might help in guiding future goals in the field.
Improvement of sequencing technologies and data processing pipelines is rapidly providing sequenc... more Improvement of sequencing technologies and data processing pipelines is rapidly providing sequencing data, with associated high-level features, of many individual genomes in multiple biological and clinical conditions. They allow for data-driven genomic, transcriptomic and epigenomic characterizations, but require state-of-the-art "big data" computing strategies, with abstraction levels beyond available tool capabilities. We propose a high-level, declarative GenoMetric Query Language (GMQL) and a toolkit for its use. GMQL operates downstream of raw data preprocessing pipelines and supports queries over thousands of heterogeneous datasets and samples; as such it is key to genomic "big data" analysis. GMQL leverages a simple data model that provides both abstractions of genomic region data and associated experimental, biological and clinical metadata, and interoperability between many data formats. Based on Hadoop framework and Apache Pig platform, GMQL ensures hig...
Frontotemporal dementia (FTD) is the second most prevalent form of early onset dementia after Alz... more Frontotemporal dementia (FTD) is the second most prevalent form of early onset dementia after Alzheimer&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#39;s disease (AD). We performed a case-control association study in an Italian FTD cohort (n = 530) followed by the novel single nucleotide polymorphisms (SNPs)-to-genes approach and functional annotation analysis. We identified 2 novel potential loci for FTD. Suggestive SNPs reached p-values ∼10(-7) and odds ratio &amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt; 2.5 (2p16.3) and 1.5 (17q25.3). Suggestive alleles at 17q25.3 identified a disease-associated haplotype causing decreased expression of -cis genes such as RFNG and AATK involved in neuronal genesis and differentiation and axon outgrowth, respectively. We replicated this locus through the SNPs-to-genes approach. Our functional annotation analysis indicated significant enrichment for functions of the brain (neuronal genesis, differentiation, and maturation), the synapse (neurotransmission and synapse plasticity), and elements of the immune system, the latter supporting our recent international FTD-genome-wide association study. This is the largest genome-wide study in Italian FTD to date. Although our results are not conclusive, we set the basis for future replication studies and identification of susceptible molecular mechanisms involved in FTD pathogenesis.
Uploads
Papers by Fernando Palluzzi