-
Lomics: Generation of Pathways and Gene Sets using Large Language Models for Transcriptomic Analysis
Authors:
Chun-Ka Wong,
Ali Choo,
Eugene C. C. Cheng,
Wing-Chun San,
Kelvin Chak-Kong Cheng,
Yee-Man Lau,
Minqing Lin,
Fei Li,
Wei-Hao Liang,
Song-Yan Liao,
Kwong-Man Ng,
Ivan Fan-Ngai Hung,
Hung-Fat Tse,
Jason Wing-Hon Wong
Abstract:
Interrogation of biological pathways is an integral part of omics data analysis. Large language models (LLMs) enable the generation of custom pathways and gene sets tailored to specific scientific questions. These targeted sets are significantly smaller than traditional pathway enrichment analysis libraries, reducing multiple hypothesis testing and potentially enhancing statistical power. Lomics (Large Language Models for Omics Studies) v1.0 is a Python-based bioinformatics toolkit that streamlines the generation of pathways and gene sets for transcriptomic analysis. It operates in three steps: 1) deriving relevant pathways based on the researcher's scientific question, 2) generating valid gene sets for each pathway, and 3) outputting the results as .GMX files. Lomics also provides explanations for pathway selections. Consistency and accuracy are ensured through iterative processes, JSON format validation, and HUGO Gene Nomenclature Committee (HGNC) gene symbol verification. Lomics serves as a foundation for integrating LLMs into omics research, potentially improving the specificity and efficiency of pathway analysis.
Submitted 12 July, 2024;
originally announced July 2024.
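The last two steps described in the Lomics abstract above (symbol verification and .GMX output) can be illustrated with a minimal sketch. This is not the Lomics implementation: the function name `write_gmx`, its parameters, and the `"na"` placeholder description are assumptions made for illustration, and the approved-symbol set stands in for a real HGNC lookup. The layout follows the usual GMX convention of one gene set per tab-separated column, with names in the first row and descriptions in the second.

```python
import csv
import io
from itertools import zip_longest


def write_gmx(gene_sets, descriptions, approved_symbols, handle):
    """Write gene sets column-wise in GMX layout (hypothetical helper).

    gene_sets:        dict mapping pathway name -> list of gene symbols
    descriptions:     dict mapping pathway name -> short description
    approved_symbols: set of symbols treated as valid (stand-in for HGNC)
    handle:           writable text file object
    """
    names = list(gene_sets)
    cleaned = []
    for name in names:
        kept = []
        for symbol in gene_sets[name]:
            symbol = symbol.strip()
            # Keep only approved symbols, dropping duplicates but preserving order.
            if symbol in approved_symbols and symbol not in kept:
                kept.append(symbol)
        cleaned.append(kept)

    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(names)                                        # row 1: set names
    writer.writerow([descriptions.get(n, "na") for n in names])   # row 2: descriptions
    # Remaining rows: one gene per column, padded with empty cells.
    for row in zip_longest(*cleaned, fillvalue=""):
        writer.writerow(row)


buffer = io.StringIO()
write_gmx(
    {"P1": ["TP53", "FAKE1", "BRCA1"], "P2": ["EGFR"]},
    {"P1": "d1"},
    {"TP53", "BRCA1", "EGFR"},
    buffer,
)
gmx_text = buffer.getvalue()
```

Here the unapproved symbol `FAKE1` is dropped before writing, which mirrors the idea of verifying LLM-generated symbols before they reach the enrichment step.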
-
Application of Novel PACS-based Informatics Platform to Identify Imaging Based Predictors of CDKN2A Allelic Status in Glioblastomas
Authors:
Niklas Tillmanns,
Jan Lost,
Joanna Tabor,
Sagar Vasandani,
Shaurey Vetsa,
Neelan Marianayagam,
Kanat Yalcin,
E. Zeynep Erson-Omay,
Marc von Reppert,
Leon Jekel,
Sara Merkaj,
Divya Ramakrishnan,
Arman Avesta,
Irene Dixe de Oliveira Santo,
Lan Jin,
Anita Huttner,
Khaled Bousabarah,
Ichiro Ikuta,
MingDe Lin,
Sanjay Aneja,
Bernd Turowski,
Mariam Aboian,
Jennifer Moliterno
Abstract:
Gliomas with CDKN2A mutations are known to have a worse prognosis, but the imaging features of these gliomas are unknown. Our goal is to identify CDKN2A-specific qualitative imaging biomarkers in glioblastomas using a new informatics workflow that enables rapid analysis of qualitative imaging features with Visually AcceSAble Rembrandt Images (VASARI) for large datasets in PACS. Sixty-nine patients undergoing GBM resection with CDKN2A status determined by whole-exome sequencing were included. GBMs on magnetic resonance images were automatically 3D segmented using deep learning algorithms incorporated within PACS. VASARI features were assessed using FHIR forms integrated within PACS. GBMs without CDKN2A alterations were significantly larger (64% vs. 30%, p=0.007) than tumors with homozygous deletion (HOMDEL) and heterozygous loss (HETLOSS). Lesions larger than 8 cm were four times more likely to have no CDKN2A alteration (OR: 4.3; 95% CI: 1.5-12.1; p<0.001). We developed a novel integrated PACS informatics platform for the assessment of GBM molecular subtypes and show that tumors with HOMDEL are more likely to have radiographic evidence of pial invasion and less likely to have deep white matter invasion or subependymal invasion. These imaging features may allow noninvasive identification of CDKN2A allele status.
Submitted 18 September, 2023;
originally announced September 2023.
-
CloudBrain-NMR: An Intelligent Cloud Computing Platform for NMR Spectroscopy Processing, Reconstruction and Analysis
Authors:
Di Guo,
Sijin Li,
Jun Liu,
Zhangren Tu,
Tianyu Qiu,
Jingjing Xu,
Liubin Feng,
Donghai Lin,
Qing Hong,
Meijin Lin,
Yanqin Lin,
Xiaobo Qu
Abstract:
Nuclear Magnetic Resonance (NMR) spectroscopy has served as a powerful analytical tool for studying molecular structure and dynamics in chemistry and biology. However, the processing of raw data acquired from NMR spectrometers and subsequent quantitative analysis involves various specialized tools, which necessitates comprehensive knowledge of programming and NMR. In particular, emerging deep learning tools are hard to adopt widely in NMR because of their sophisticated computational setup. Thus, NMR processing is not an easy task for chemists and biologists. In this work, we present CloudBrain-NMR, an intelligent online cloud computing platform designed for NMR data reading, processing, reconstruction, and quantitative analysis. The platform is conveniently accessed through a web browser, eliminating the need for any program installation on the user side. CloudBrain-NMR uses parallel computing with graphics processing units and central processing units, resulting in significantly shortened computation time. Furthermore, it incorporates state-of-the-art deep learning-based algorithms offering comprehensive functionalities that allow users to complete the entire processing procedure without relying on additional software. This platform has empowered NMR applications with advanced artificial intelligence processing. CloudBrain-NMR is openly accessible for free usage at https://csrc.xmu.edu.cn/CloudBrain.html
Submitted 12 September, 2023;
originally announced September 2023.
-
A Large Open Access Dataset of Brain Metastasis 3D Segmentations with Clinical and Imaging Feature Information
Authors:
Divya Ramakrishnan,
Leon Jekel,
Saahil Chadha,
Anastasia Janas,
Harrison Moy,
Nazanin Maleki,
Matthew Sala,
Manpreet Kaur,
Gabriel Cassinelli Petersen,
Sara Merkaj,
Marc von Reppert,
Ujjwal Baid,
Spyridon Bakas,
Claudia Kirsch,
Melissa Davis,
Khaled Bousabarah,
Wolfgang Holler,
MingDe Lin,
Malte Westerhoff,
Sanjay Aneja,
Fatima Memon,
Mariam S. Aboian
Abstract:
Resection and whole brain radiotherapy (WBRT) are the standards of care for the treatment of patients with brain metastases (BM) but are often associated with cognitive side effects. Stereotactic radiosurgery (SRS) involves a more targeted treatment approach and has been shown to avoid the side effects associated with WBRT. However, SRS requires precise identification and delineation of BM. While many AI algorithms have been developed for this purpose, their clinical adoption has been limited due to poor model performance in the clinical setting. A major reason for non-generalizable algorithms is the limitations of the datasets used to train the AI network. The purpose of this study was to create a large, heterogeneous, annotated BM dataset for training and validation of AI models to improve generalizability. We present a BM dataset of 200 patients with pretreatment T1, T1 post-contrast, T2, and FLAIR MR images. The dataset includes contrast-enhancing and necrotic 3D segmentations on T1 post-contrast and whole tumor (including peritumoral edema) 3D segmentations on FLAIR. Our dataset contains 975 contrast-enhancing lesions, many of which are subcentimeter, along with clinical and imaging feature information. We used a streamlined approach to database-building leveraging a PACS-integrated segmentation workflow.
Submitted 11 September, 2023; v1 submitted 10 September, 2023;
originally announced September 2023.
-
Single-molecule fluorescence multiplexing by multi-parameter spectroscopic detection of nanostructured FRET labels
Authors:
Jiachong Chu,
Ayesha Ejaz,
Kyle M. Lin,
Madeline R. Joseph,
Aria E. Coraor,
D. Allan Drummond,
Allison H. Squires
Abstract:
Multiplexed, real-time fluorescence detection at the single-molecule level is highly desirable to reveal the stoichiometry, dynamics, and interactions of individual molecular species within complex systems. However, fluorescence sensing is traditionally limited to 3-4 concurrently detected labels, due to low signal-to-noise, high spectral overlap between labels, and the need to avoid dissimilar dye chemistries. We have engineered a palette of several dozen fluorescent labels, called FRETfluors, for spectroscopic multiplexing at the single-molecule level. Each FRETfluor is a compact nanostructure formed from the same three chemical building blocks (DNA, Cy3, and Cy5). The composition and dye-dye geometries create a characteristic Förster Resonance Energy Transfer (FRET) efficiency for each construct. In addition, we varied the local DNA sequence and attachment chemistry to alter the Cy3 and Cy5 emission properties and thereby shift the emission signatures of an entire series of FRET constructs to new sectors of the multi-parameter detection space. Unique spectroscopic emission of each FRETfluor is therefore conferred by a combination of FRET and this site-specific tuning of individual fluorophore photophysics. We show single-molecule identification of a set of 27 FRETfluors in a sample mixture using a subset of constructs statistically selected to minimize classification errors, measured using an Anti-Brownian ELectrokinetic (ABEL) trap which provides precise multi-parameter spectroscopic measurements. The ABEL trap also enables discrimination between FRETfluors attached to a target (here: mRNA) and unbound FRETfluors, eliminating the need for washes or removal of excess label by purification.
Submitted 25 January, 2024; v1 submitted 4 July, 2023;
originally announced July 2023.
-
The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI
Authors:
Ahmed W. Moawad,
Anastasia Janas,
Ujjwal Baid,
Divya Ramakrishnan,
Rachit Saluja,
Nader Ashraf,
Leon Jekel,
Raisa Amiruddin,
Maruf Adewole,
Jake Albrecht,
Udunna Anazodo,
Sanjay Aneja,
Syed Muhammad Anwar,
Timothy Bergquist,
Evan Calabrese,
Veronica Chiang,
Verena Chung,
Gian Marco Conte,
Farouk Dako,
James Eddy,
Ivan Ezhov,
Ariana Familiar,
Keyvan Farahani,
Juan Eugenio Iglesias,
Zhifan Jiang
, et al. (206 additional authors not shown)
Abstract:
The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated, internationally compiled real-world datasets. This study presents the results of the segmentation challenge and characterizes the challenging cases that impacted the performance of the winning algorithms. Untreated brain metastases on standard anatomic MRI sequences (T1, T2, FLAIR, T1PG) from eight contributed international datasets were annotated in a stepwise manner: published UNET algorithms, student, neuroradiologist, final approver neuroradiologist. Segmentations were ranked based on lesion-wise Dice and Hausdorff distance (HD95) scores. False positives (FP) and false negatives (FN) were rigorously penalized, receiving a score of 0 for Dice and a fixed penalty of 374 for HD95. Eight datasets comprising 1303 studies were annotated, with 402 studies (3076 lesions) released on Synapse as publicly available datasets to challenge competitors. Additionally, 31 studies (139 lesions) were held out for validation, and 59 studies (218 lesions) were used for testing. Segmentation accuracy was measured as rank across subjects, with the winning team achieving a LesionWise mean score of 7.9. Common errors among the leading teams included false negatives for small lesions and misregistration of masks in space. The BraTS-METS 2023 challenge successfully curated well-annotated, diverse datasets and identified common errors, facilitating the translation of BM segmentation across varied clinical environments and providing personalized volumetric reports to patients undergoing BM treatment.
Submitted 17 June, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
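The penalty scheme described in the BraTS-METS abstract above (a Dice of 0 and an HD95 of 374 for each false positive or false negative) can be illustrated with a minimal sketch. It assumes that predicted and reference lesions have already been matched, and that the per-case score is a plain mean over matched lesions plus penalized errors; the abstract does not spell out the matching procedure or the exact aggregation, so the function name `lesionwise_mean` and its signature are hypothetical.

```python
DICE_PENALTY = 0.0    # score assigned to each FP/FN lesion for Dice (from the abstract)
HD95_PENALTY = 374.0  # fixed HD95 penalty for each FP/FN lesion (from the abstract)


def lesionwise_mean(matched_scores, n_false_positives, n_false_negatives, penalty):
    """Average a per-lesion metric, counting every FP and FN at the penalty value.

    matched_scores: metric values for lesions matched between prediction and reference
    penalty:        value charged for each unmatched (FP or FN) lesion
    """
    n_errors = n_false_positives + n_false_negatives
    n_total = len(matched_scores) + n_errors
    if n_total == 0:
        raise ValueError("no lesions to score")
    return (sum(matched_scores) + penalty * n_errors) / n_total


# Two matched lesions, one false positive, one missed lesion.
dice = lesionwise_mean([0.8, 0.6], 1, 1, DICE_PENALTY)
hd95 = lesionwise_mean([2.0, 4.0], 1, 1, HD95_PENALTY)
```

In this toy case the two errors halve the mean Dice (0.7 to 0.35) and dominate the mean HD95, which shows why missed small lesions weigh so heavily under a lesion-wise metric.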
-
Thermodynamic force thresholds biomolecular behavior
Authors:
Milo M. Lin
Abstract:
In living systems, collective molecular behavior is driven by thermodynamic forces in the form of chemical gradients. Leveraging recent advances in the field of nonequilibrium physics, I show that increasing the thermodynamic force alone can induce qualitatively new behavior. To demonstrate this principle, general equations governing kinetic proofreading and microtubule assembly are derived. These equations show that new capabilities, including catalytic regulation of steady-state behavior and exponential enhancement of molecular discrimination, are only possible if the system is driven sufficiently far from equilibrium, and can emerge sharply at a threshold force. Regardless of design parameters, these results reveal that the thermodynamic force sets fundamental performance limits on tuning sensitivity, error, and waste. Experimental data show that these biomolecular processes operate at the limits allowed by theory.
Submitted 19 September, 2022;
originally announced September 2022.
-
A fully differentiable ligand pose optimization framework guided by deep learning and traditional scoring functions
Authors:
Zechen Wang,
Liangzhen Zheng,
Sheng Wang,
Mingzhi Lin,
Zhihao Wang,
Adams Wai-Kin Kong,
Yuguang Mu,
Yanjie Wei,
Weifeng Li
Abstract:
Machine learning (ML) and deep learning (DL) techniques are widely recognized as powerful tools for virtual drug screening. Recently reported ML- or DL-based scoring functions have shown exciting performance in predicting protein-ligand binding affinities, with promising application prospects. However, differentiating between highly similar ligand conformations, including the native binding pose (the global energy minimum state), remains challenging; solving this problem could greatly enhance docking. In this work, we propose a fully differentiable framework for ligand pose optimization based on a hybrid scoring function (SF) that combines a multi-layer perceptron (DeepRMSD) with the traditional AutoDock Vina SF. DeepRMSD+Vina, which combines (1) the root mean square deviation (RMSD) of the docking pose with respect to the native pose and (2) the AutoDock Vina score, is fully differentiable and is thus capable of optimizing the ligand binding pose to the lowest-energy conformation. Evaluated on the CASF-2016 docking power dataset, DeepRMSD+Vina reaches a success rate of 95.4%, which is by far the best reported SF to date. Based on this SF, an end-to-end ligand pose optimization framework was implemented to improve the docking pose quality. We demonstrated that this method significantly improves the docking success rate (by 15%) in redocking and crossdocking tasks, revealing the high potential of this framework in drug design and discovery.
Submitted 27 June, 2022;
originally announced June 2022.
-
Inferring, comparing and exploring ecological networks from time-series data through R packages constructnet, disgraph and dynet
Authors:
Anshuman Swain,
Travis Byrum,
Zhaoyi Zhuang,
Luke Perry,
Michael Lin,
William Fagan
Abstract:
Network inference is a major field of interest for the ecological community, especially in light of the high cost and difficulty of manual observation and the easy availability of remote, long-term monitoring data. In addition, comparing similar network structures, especially under spatial, environmental, or temporal variability, and simulating processes on networks to create toy models and hypotheses are topics of considerable interest to researchers. A large number of methods are being developed in the network science community to achieve these objectives, but many either lack available code or have no implementation in R, the language preferred by ecologists and other biologists. We provide a suite of three packages that bring standardized network inference methods from time-series data (constructnet), distance metrics (disgraph), and (process) simulation models (dynet) to the growing R network analysis environment, helping ecologists and biologists to perform and compare methods under one roof. These packages are implemented in a coherent, consistent framework, making comparisons across methods and metrics easier. We hope that these tools in R will help increase the accessibility of network tools to ecologists and other biologists, who use the language for most of their analysis.
Submitted 29 March, 2021;
originally announced March 2021.
-
In situ Measurement of Airborne Particle Concentration in a Real Dental Office: Implications for Disease Transmission
Authors:
Maryam Ravazi,
Zahid Butt,
Mark H. E. Lin,
Helen Chen,
Zhongchao Tan
Abstract:
Recent guidelines by WHO recommend delaying non-essential oral health care amid the COVID-19 pandemic and call for research on aerosols generated during dental procedures. Thus, this study aims to assess the mechanisms of dental aerosol dispersion in dental offices and to provide recommendations based on a quantitative study to minimize infection transmission in dental offices. The spread and removal of aerosol particles generated from dental procedures in a dental office are measured near the source and at the corner of the office. We studied the effects of air purification (on/off), door condition (open/closed), and particle size on the temporal concentration distribution of particles. The results show that in the worst-case scenario it takes 95 min for 0.5 µm particles to settle, and that larger particles settle in a shorter time. The indoor air purifier tested made particle removal at least 6.3 times faster than in the scenario with the air purifier off. Airborne particles may be transported from the source to the rest of the room, even when the particle concentrations in the generation zone return to the background level. These results are expected to be valuable for related policy making and technology development for infectious disease control in dental offices and similar built environments.
Submitted 19 August, 2020;
originally announced August 2020.
-
A neural network model of perception and reasoning
Authors:
Paul J. Blazek,
Milo M. Lin
Abstract:
How perception and reasoning arise from neuronal network activity is poorly understood. This is reflected in the fundamental limitations of connectionist artificial intelligence, typified by deep neural networks trained via gradient-based optimization. Despite success on many tasks, such networks remain unexplainable black boxes incapable of symbolic reasoning and concept generalization. Here we show that a simple set of biologically consistent organizing principles confer these capabilities to neuronal networks. To demonstrate, we implement these principles in a novel machine learning algorithm, based on concept construction instead of optimization, to design deep neural networks that reason with explainable neuron activity. On a range of tasks including NP-hard problems, their reasoning capabilities grant additional cognitive functions, like deliberating through self-analysis, tolerating adversarial attacks, and learning transferable rules from simple examples to solve problems of unencountered complexity. The networks also naturally display properties of biological nervous systems inherently absent in current deep neural networks, including sparsity, modularity, and both distributed and localized firing patterns. Because they do not sacrifice performance, compactness, or training time on standard learning tasks, these networks provide a new black-box-free approach to artificial intelligence. They likewise serve as a quantitative framework to understand the emergence of cognition from neuronal networks.
Submitted 26 February, 2020;
originally announced February 2020.
-
Hepatocellular Carcinoma Intra-arterial Treatment Response Prediction for Improved Therapeutic Decision-Making
Authors:
Junlin Yang,
Nicha C. Dvornek,
Fan Zhang,
Julius Chapiro,
MingDe Lin,
Aaron Abajian,
James S. Duncan
Abstract:
This work proposes a pipeline to predict treatment response to intra-arterial therapy of patients with Hepatocellular Carcinoma (HCC) for improved therapeutic decision-making. Our graph neural network model seamlessly combines heterogeneous inputs of baseline MR scans, pre-treatment clinical information, and planned treatment characteristics and has been validated on patients with HCC treated by transarterial chemoembolization (TACE). It achieves an accuracy of $0.713 \pm 0.075$, an F1 score of $0.702 \pm 0.082$, and an AUC of $0.710 \pm 0.108$. In addition, the pipeline incorporates uncertainty estimation to select hard cases, which mostly align with the misclassified cases. The proposed pipeline arrives at more informed intra-arterial therapeutic decisions for patients with HCC by improving model accuracy and incorporating uncertainty estimation.
Submitted 1 December, 2019;
originally announced December 2019.
-
Application of Deep Learning on Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations
Authors:
Mei Lin,
Vanya Jaitly,
Iris Wang,
Zhihong Hu,
Lei Chen,
Md. Amer Wahed,
Zeyad Kanaan,
Adan Rios,
Andy N. D. Nguyen
Abstract:
We explore how Deep Learning (DL) can be utilized to predict prognosis of acute myeloid leukemia (AML). From the TCGA (The Cancer Genome Atlas) database, 94 AML cases are used in this study. Input data include age, 10 common cytogenetic results, and the 23 most common mutation results; output is the prognosis (diagnosis to death, DTD). In our DL network, autoencoders are stacked to form a hierarchical DL model in which raw data are compressed and organized and high-level features are extracted. The network is written in the R language and is designed to predict prognosis of AML for a given case (DTD of more than or less than 730 days). The DL network achieves an excellent accuracy of 83% in predicting prognosis. As a proof-of-concept study, our preliminary results demonstrate a practical application of DL in future practice of prognostic prediction using next-gen sequencing (NGS) data.
Submitted 30 October, 2018;
originally announced October 2018.
-
Multimodal Cross-registration and Quantification of Metric Distortions in Whole Brain Histology of Marmoset using Diffeomorphic Mappings
Authors:
Brian C. Lee,
Meng Kuan Lin,
Yan Fu,
Junichi Hata,
Michael I. Miller,
Partha P. Mitra
Abstract:
Whole brain neuroanatomy using tera-voxel light-microscopic data sets is of much current interest. A fundamental problem in this field is the mapping of individual brain data sets to a reference space. Previous work has not rigorously quantified the distortions in brain geometry from in-vivo to ex-vivo brains due to the tissue processing, which will be important when computing properties such as local cell and process densities at the voxel level in creating reference brain maps. Further, existing approaches focus on registering uni-modal volumetric data; however, given the increasing interest in the marmoset model for neuroscience research, it is necessary to cross-register multi-modal data sets including MRIs and multiple histological series that can help address individual variations in brain architecture. Here we present a computational approach for same-subject multimodal MRI-guided reconstruction of a histological series, jointly with diffeomorphic mapping to a reference atlas. We quantify the scale change during the different stages of histological processing of the brains using the Jacobian determinant of the diffeomorphic transformations involved. There are two major steps in the histology process with associated scale distortions: (a) brain perfusion and (b) histological sectioning and reassembly. By mapping the final image stacks to the ex-vivo post-fixation MRI, we show that tape-transfer histology can be reassembled accurately into 3D volumes with a local scale change of 2.0 $\pm$ 0.4% per axis dimension. In contrast, the perfusion step, as assessed by mapping the in-vivo MRIs to the ex-vivo post-fixation MRIs, shows a larger local scale change of 6.9 $\pm$ 2.1% per axis dimension. This is the first systematic quantification of the local metric distortions associated with whole-brain histological processing, and we expect that the results will generalize to other species.
Submitted 17 April, 2019; v1 submitted 13 May, 2018;
originally announced May 2018.
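The abstract above reports local scale change "per axis dimension" derived from the Jacobian determinant of the transformations. One way such a figure could be obtained is sketched below, under a stated assumption: the local volume change det(J) is converted to an equivalent isotropic per-axis factor by taking its cube root. The abstract does not state that the authors used this conversion, and the function names are hypothetical.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as three rows of three numbers."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def per_axis_scale_change_percent(jacobian):
    """Convert a local 3D Jacobian into an equivalent isotropic per-axis change.

    Assumption: volume change det(J) is spread evenly over the three axes,
    so the per-axis factor is det(J) ** (1/3).
    """
    det = det3(jacobian)
    if det <= 0:
        raise ValueError("a diffeomorphic map must have a positive Jacobian determinant")
    return (det ** (1.0 / 3.0) - 1.0) * 100.0


# A uniform 2% expansion along every axis: det(J) = 1.02 ** 3.
uniform_expansion = [[1.02, 0.0, 0.0], [0.0, 1.02, 0.0], [0.0, 0.0, 1.02]]
change = per_axis_scale_change_percent(uniform_expansion)
```

Under this convention a 2% per-axis change corresponds to roughly a 6% change in local volume, which gives a sense of the magnitude behind the reported 2.0% and 6.9% figures.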
-
A proposal for a coordinated effort for the determination of brainwide neuroanatomical connectivity in model organisms at a mesoscopic scale
Authors:
Jason W. Bohland,
Caizhi Wu,
Helen Barbas,
Hemant Bokil,
Mihail Bota,
Hans C. Breiter,
Hollis T. Cline,
John C. Doyle,
Peter J. Freed,
Ralph J. Greenspan,
Suzanne N. Haber,
Michael Hawrylycz,
Daniel G. Herrera,
Claus C. Hilgetag,
Z. Josh Huang,
Allan Jones,
Edward G. Jones,
Harvey J. Karten,
David Kleinfeld,
Rolf Kotter,
Henry A. Lester,
John M. Lin,
Brett D. Mensh,
Shawn Mikula,
Jaak Panksepp
, et al. (12 additional authors not shown)
Abstract:
In this era of complete genomes, our knowledge of neuroanatomical circuitry remains surprisingly sparse. Such knowledge is however critical both for basic and clinical research into brain function. Here we advocate for a concerted effort to fill this gap, through systematic, experimental mapping of neural circuits at a mesoscopic scale of resolution suitable for comprehensive, brain-wide coverage, using injections of tracers or viral vectors. We detail the scientific and medical rationale and briefly review existing knowledge and experimental techniques. We define a set of desiderata, including brain-wide coverage; validated and extensible experimental techniques suitable for standardization and automation; centralized, open access data repository; compatibility with existing resources, and tractability with current informatics technology. We discuss a hypothetical but tractable plan for mouse, additional efforts for the macaque, and technique development for human. We estimate that the mouse connectivity project could be completed within five years with a comparatively modest budget.
Submitted 28 January, 2009;
originally announced January 2009.
-
An analysis of the abstracts presented at the annual meetings of the Society for Neuroscience from 2001 to 2006
Authors:
J. M. Lin,
J. W. Bohland,
P. Andrews,
G. Burns,
C. B. Allen,
P. P. Mitra
Abstract:
We extracted and processed abstract data from the SFN annual meeting abstracts during the period 2001-2006, using techniques and software from natural language processing, database management, and data visualization and analysis. An important first step in the process was the application of data cleaning and disambiguation methods to construct a unified database, since the data were too noisy to be of full utility in the raw form initially available. The resulting co-author graph in 2006, for example, had 39,645 nodes (with an estimated 6% error rate in our disambiguation of similar author names) and 13,979 abstracts, with an average of 1.5 abstracts per author, 4.3 authors per abstract, and 5.96 collaborators per author (including all authors on shared abstracts). Recent work in related areas has focused on reputational indices such as highly cited papers or scientists and journal impact factors, and to a lesser extent on creating visual maps of the knowledge space. In contrast, there has been relatively less work on the demographics and community structure, the dynamics of the field over time to examine major research trends, and the structure of the sources of research funding. In this paper we examined each of these areas in order to gain an objective overview of contemporary neuroscience. Some interesting findings include a high geographical concentration of neuroscience research in the northeastern United States, a surprisingly large transient population (60% of the authors appear in only one out of the six studied years), the central role played by the study of neurodegenerative disorders in the neuroscience community structure, and an apparent growth of behavioral/systems neuroscience with a corresponding shrinkage of cellular/molecular neuroscience over the six-year period.
Submitted 16 October, 2007; v1 submitted 12 October, 2007;
originally announced October 2007.
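The co-author graph summary statistics quoted in the abstract above (abstracts per author, authors per abstract, collaborators per author) can be computed from a list of per-abstract author lists, as in the sketch below. It assumes author names have already been disambiguated and counts a collaborator as any distinct co-author on a shared abstract; the function name `coauthor_stats` and the toy data are illustrative, not taken from the paper.

```python
from collections import defaultdict


def coauthor_stats(abstracts):
    """Summarize a co-author graph built from per-abstract author lists."""
    if not abstracts:
        raise ValueError("need at least one abstract")
    collaborators = defaultdict(set)  # author -> set of distinct co-authors
    authorships = 0                   # total (author, abstract) pairs
    for authors in abstracts:
        unique = set(authors)
        authorships += len(unique)
        for author in unique:
            collaborators[author].update(unique - {author})
    n_authors = len(collaborators)
    if n_authors == 0:
        raise ValueError("no authors found")
    return {
        "authors": n_authors,
        "abstracts": len(abstracts),
        "abstracts_per_author": authorships / n_authors,
        "authors_per_abstract": authorships / len(abstracts),
        "collaborators_per_author": sum(len(c) for c in collaborators.values()) / n_authors,
    }


stats = coauthor_stats([["A", "B", "C"], ["A", "D"]])
```

As a consistency check on the reported figures, 13,979 abstracts at 4.3 authors each gives about 60,110 authorships, and dividing by 39,645 authors gives about 1.5 abstracts per author, matching the abstract.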