Predicting giant transmembrane β-barrel architecture
Motivation: The β-barrel is a ubiquitous fold that is deployed to accomplish a wide variety of biological functions including membrane-embedded pores. Key influences of β-barrel lumen diameter include the number of β-strands ( n ) and the degree of ...
1001 Proteomes
- Hiren J. Joshi,
- Katy M. Christiansen,
- Joffrey Fitz,
- Jun Cao,
- Anna Lipzen,
- Joel Martin,
- A. Michelle Smith-Moritz,
- Len A. Pennacchio,
- Wendy S. Schackwitz,
- Detlef Weigel,
- Joshua L. Heazlewood
Motivation: The sequencing of over a thousand natural strains of the model plant Arabidopsis thaliana is producing unparalleled information at the genetic level for plant researchers. To enable the rapid exploitation of these data for functional ...
CONTRA
- Jason Li,
- Richard Lupat,
- Kaushalya C. Amarasinghe,
- Ella R. Thompson,
- Maria A. Doyle,
- Georgina L. Ryland,
- Richard W. Tothill,
- Saman K. Halgamuge,
- Ian G. Campbell,
- Kylie L. Gorringe
Motivation: In light of the increasing adoption of targeted resequencing (TR) as a cost-effective strategy to identify disease-causing variants, a robust method for copy number variation (CNV) analysis is needed to maximize the value of this ...
Probabilistic suffix array
Motivation: Markov models are very popular for analyzing complex sequences such as protein sequences, whose sources are unknown, or whose underlying statistical characteristics are not well understood. A major problem is the computational complexity ...
Fulcrum
Motivation: Ultra-high-throughput sequencing produces duplicate and near-duplicate reads, which can consume computational resources in downstream applications. A tool that collapses such reads should reduce storage and assembly complications and ...
A subspace method for the detection of transcription factor binding sites
Motivation: The identification of the sites at which transcription factors (TFs) bind to Deoxyribonucleic acid (DNA) is an important problem in molecular biology. Many computational methods have been developed for motif finding, most of them based on ...
PhyLAT
Motivation: The expansion of DNA sequencing capacity has enabled the sequencing of whole genomes from a number of related species. These genomes can be combined in a multiple alignment that provides useful information about the evolutionary history ...
Fast protein binding site comparisons using visual words representation
Motivation: Finding geometrically similar protein binding sites is crucial for understanding protein functions and can provide valuable information for protein–protein docking and drug discovery. As the number of known protein–protein interaction ...
Matrix eQTL
Fast and accurate inference of local ancestry in Latino populations
- Yael Baran,
- Bogdan Pasaniuc,
- Sriram Sankararaman,
- Dara G. Torgerson,
- Christopher Gignoux,
- Celeste Eng,
- William Rodriguez-Cintron,
- Rocio Chapela,
- Jean G. Ford,
- Pedro C. Avila,
- Jose Rodriguez-Santana,
- Esteban Gonzàlez Burchard,
- Eran Halperin
Motivation: It is becoming increasingly evident that the analysis of genotype data from recently admixed populations is providing important insights into medical genetics and population history. Such analyses have been used to identify novel disease ...
Inferring gene regulatory networks by ANOVA
Motivation: To improve the understanding of molecular regulation events, various approaches have been developed for deducing gene regulatory networks from mRNA expression data.
Results: We present a new score for network inference, η2, that is ...
Improving GO semantic similarity measures by exploring the ontology beneath the terms and modelling uncertainty
Motivation: Several measures have been recently proposed for quantifying the functional similarity between gene products according to well-structured controlled vocabularies where biological terms are organized in a tree or in a directed acyclic ...
AutoLabDB
GECA
Summary: GECA is a fast, user-friendly and freely-available tool for representing gene exon/intron organization and highlighting changes in gene structure among members of a gene family. It relies on protein alignment, completed with the ...
Rknots
Motivation: Rknots is a flexible R package providing tools for the detection and characterization of topological knots in biological polymers. The package is well documented and provides a simple syntax for data import and preprocessing, structure ...
OCAP
DADP
Summary: Anuran tissues, and especially skin, are a rich source of bioactive peptides and their precursors. We here present a manually curated database of antimicrobial and other defense peptides with a total of 2571 entries, most of them in the ...
Metab2MeSH
Summary: Progress in high-throughput genomic technologies has led to the development of a variety of resources that link genes to functional information contained in the biomedical literature. However, tools attempting to link small molecules to ...