Quantitative Biology

New submissions

[ total of 21 entries: 1-21 ]

New submissions for Tue, 17 May 22

[1]  arXiv:2205.06834 [pdf, other]
Title: Multi-variant COVID-19 model with heterogeneous transmission rates using deep neural networks
Subjects: Populations and Evolution (q-bio.PE); Machine Learning (cs.LG)

Mutating variants of COVID-19 have been reported across many US states since 2021. In the fight against COVID-19, it has become imperative to study the heterogeneity in the time-varying transmission rates for each variant in the presence of pharmaceutical and non-pharmaceutical mitigation measures. We develop a Susceptible-Exposed-Infected-Recovered mathematical model to highlight the differences in the transmission of the B.1.617.2 delta variant and the original SARS-CoV-2. Theoretical results for the well-posedness of the model are discussed. A deep neural network is utilized and a deep learning algorithm is developed to learn the time-varying heterogeneous transmission rates for each variant. The accuracy of the algorithm for the model is shown using error metrics in the data-driven simulation for COVID-19 variants in the US states of Florida, Alabama, Tennessee, and Missouri. Short-term forecasting of daily cases is demonstrated using a long short-term memory neural network and an adaptive neuro-fuzzy inference system.

[2]  arXiv:2205.06884 [pdf, other]
Title: Baseline control of optimal performance in recurrent neural networks
Comments: 38 pages, 5 figures
Subjects: Neurons and Cognition (q-bio.NC); Biological Physics (physics.bio-ph)

Changes in behavioral state, such as arousal and movements, strongly affect neural activity in sensory areas. Recent evidence suggests that they may be mediated by top-down projections regulating the statistics of baseline input currents to sensory areas, inducing qualitatively different effects across sensory modalities. What are the computational benefits of these baseline modulations? We investigate this question within a brain-inspired framework for reservoir computing, where we vary the quenched baseline inputs to a random neural network. We found that baseline modulations control the dynamical phase of the reservoir network, unlocking a vast repertoire of network phases. We uncover a new zoo of bistable phases exhibiting the simultaneous coexistence of fixed points and chaos, of two fixed points, and of weak and strong chaos. Crucially, we discovered a host of novel phenomena, including noise-driven enhancement of chaos and ergodicity breaking, and neural hysteresis, whereby transitions across a phase boundary retain the memory of the initial phase. Strikingly, we found that baseline control can achieve optimal performance without any fine tuning of recurrent couplings. In summary, baseline control of network dynamics opens new directions for brain-inspired artificial intelligence and provides a new interpretation for the ubiquitously observed behavioral modulations of cortical activity.

[3]  arXiv:2205.07045 [pdf]
Title: How can I investigate causal brain networks with iEEG?
Comments: Forthcoming chapter in "Intracranial EEG for Cognitive Neuroscience"
Subjects: Neurons and Cognition (q-bio.NC); Quantitative Methods (q-bio.QM)

While many human imaging methodologies probe the structural and functional connectivity of the brain, techniques to investigate cortical networks in a causal and directional manner are critical but limited. The use of iEEG enables several approaches to directly characterize brain regions that are functionally connected and in some cases also establish directionality of these connections. In this chapter we focus on the basis, method and application of the cortico-cortical evoked potential (CCEP), whereby electrical pulses applied to one set of intracranial electrodes yield an electrically-induced brain response at local and remote regions. First, CCEPs are contextualized within common brain connectivity methods used to define cortical networks, and we describe how CCEPs add unique information. Second, the practical and analytical considerations when using CCEPs are discussed. Third, we review the neurophysiology underlying CCEPs and the applications of CCEPs including exploring functional and pathological brain networks and probing brain plasticity. Finally, we end with a discussion of limitations, caveats, and directions to improve CCEP utilization in the future.

[4]  arXiv:2205.07258 [pdf, ps, other]
Title: Backward bifurcation, basic reinfection number and robustness of a SEIRE epidemic model with reinfection
Comments: 19 pages, 7 figures
Subjects: Populations and Evolution (q-bio.PE)

Recent evidence shows that individuals who have recovered from COVID-19 can be reinfected. However, this phenomenon has rarely been studied using mathematical models. In this paper, we propose a SEIRE epidemic model to describe the spread of the epidemic with reinfection. We obtain the important thresholds $R_0$ (the basic reproduction number) and $R_c$ (a threshold less than one). Our investigations show that when $R_0 > 1$, the system has an endemic equilibrium, which is globally asymptotically stable. When $R_c < R_0 < 1$, the epidemic system exhibits bistable dynamics. That is, the system has a backward bifurcation and the disease cannot be eradicated. In order to eradicate the disease, we must ensure that the basic reproduction number $R_0$ is less than $R_c$. The basic reinfection number is obtained to measure the reinfection force, which turns out to be a new tipping point for disease dynamics. We also define robustness, a new concept to measure the difficulty of completely eliminating the disease for a bistable epidemic system. Numerical simulations are carried out to verify the conclusions.
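
The three threshold regimes stated in this abstract can be summarized in a small helper. This is only an illustrative sketch: the function name `classify_regime` and the regime labels are hypothetical, the formulas for $R_0$ and $R_c$ are not given in the abstract, and the boundary cases ($R_0 = 1$ or $R_0 = R_c$) are not covered there, so they are reported separately.

```python
def classify_regime(r0: float, rc: float) -> str:
    """Map (R_0, R_c) to the qualitative regime described in the abstract.

    The labels are hypothetical names chosen for this sketch.
    """
    if not 0 < rc < 1:
        # The abstract describes R_c as a threshold less than one.
        raise ValueError("R_c is expected to lie strictly between 0 and 1")
    if r0 > 1:
        return "endemic"            # endemic equilibrium, globally asymptotically stable
    if rc < r0 < 1:
        return "bistable"           # backward bifurcation, disease cannot be eradicated
    if r0 < rc:
        return "eradication-condition-met"  # R_0 below R_c, as required for eradication
    return "boundary"               # R_0 == 1 or R_0 == R_c: not discussed in the abstract


if __name__ == "__main__":
    for r0 in (1.5, 0.8, 0.4):
        print(r0, classify_regime(r0, rc=0.6))
```

The point of the sketch is that pushing $R_0$ below one is not sufficient in the bistable window; the stricter condition $R_0 < R_c$ is what the abstract requires.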

[5]  arXiv:2205.07360 [pdf, other]
Title: Qualitative dynamics of chemical reaction networks: an investigation using partial tropical equilibrations
Comments: 23 pages, 5 figures, submitted to CMSB 2022
Subjects: Molecular Networks (q-bio.MN); Symbolic Computation (cs.SC); Dynamical Systems (math.DS)

We discuss a method to describe the qualitative dynamics of chemical reaction networks in terms of symbolic dynamics. The method, which can be applied to mass-action reaction networks with separated timescales, uses solutions of the partial tropical equilibration problem as proxies for symbolic states. The partial tropical equilibration solutions are found algorithmically. These solutions also provide the scaling needed for slow-fast decomposition and model reduction. Any trace of the model can thus be represented as a sequence of local approximations of the full model. We illustrate the method using a biochemical model of the cell cycle as a case study.

[6]  arXiv:2205.07553 [pdf]
Title: The interactions of SARS-CoV-2 with co-circulating pathogens: Epidemiological implications and current knowledge gaps
Comments: Main text 19 pages including 4 figures. Appendices 1-5 (Table S1-S4, Model details)
Subjects: Populations and Evolution (q-bio.PE)

Despite the availability of effective vaccines, the persistence of SARS-CoV-2 suggests that co-circulation with other pathogens and resulting multi-epidemics -- such as twindemics of COVID-19 and influenza -- will become increasingly frequent. To better forecast and control the risk of such multi-epidemics, it is essential to elucidate the potential interactions of SARS-CoV-2 with other pathogens; these interactions, however, remain poorly defined. Here, we aimed to review the current body of evidence about SARS-CoV-2 interactions. To study pathogen interactions in a systematic way, we first developed a general framework to capture their major components -- namely, sign, strength, symmetry, duration, and mechanism. We then reviewed the experimental evidence from animal models about SARS-CoV-2 interactions. The studies identified demonstrated that SARS-CoV-2 and influenza A virus co-infection increased disease severity compared with mono-infection. By contrast, the effect of previous or co-infection on viral load of either virus was inconsistent across studies. Next, we reviewed the epidemiological evidence about SARS-CoV-2 interactions in human populations. Although numerous studies were identified, only a few were specifically designed to infer interaction and many were prone to bias and confounding. Nevertheless, their results suggested that influenza and pneumococcal conjugate vaccinations were associated with reduced risk, and earlier influenza infection with increased risk, of SARS-CoV-2 infection and severe COVID-19. Finally, we formulated simple transmission models of SARS-CoV-2 co-circulation with a virus or a bacterium, showing how they can naturally incorporate the proposed framework. More generally, we propose that such models, when designed with an integrative and multidisciplinary perspective, will be invaluable tools in studying SARS-CoV-2 interactions with other pathogens.

[7]  arXiv:2205.07656 [pdf, other]
Title: Using a physical model and aggregate data to estimate the spreading of Covid-19 in Israel in the presence of waning immunity and competing variants
Comments: 9 pages, 8 figures
Subjects: Populations and Evolution (q-bio.PE)

In the more than two years since the COVID-19 virus was first detected in China, hundreds of millions of individuals have been infected, and millions have died. Aside from the immediate need for medical solutions (such as vaccines and medications) to treat the epidemic, the Corona pandemic has strengthened the demand for mathematical models that can predict the spread of the pandemic in an ever-changing reality. Here, we present a novel, dynamic particle model based on the basic principles of statistical physics that enables the prediction of the spreading of Covid-19 in the presence of effective vaccines. This particle model enables us to accurately examine the effects of the vaccine on different subgroups of the vaccinated population and on the entire population, and to identify vaccine waning. Furthermore, the particle model can predict the prevalence of two competing variants over time and their associated morbidity.

[8]  arXiv:2205.07673 [pdf, other]
Title: ProNet DB: A proteome-wise database for protein surface property representations and RNA-binding profiles
Comments: 12 pages, 6 figures
Subjects: Quantitative Methods (q-bio.QM); Biomolecules (q-bio.BM); Molecular Networks (q-bio.MN)

The rapid growth in the number of experimental and predicted protein structures, together with their increasing complexity, makes it challenging for users in computational biology to exploit structural information and protein surface property representations. Recently, AlphaFold2 released the comprehensive proteomes of various species, and protein surface property representation plays a crucial role in predicting protein-molecule interactions such as protein-protein, protein-nucleic acid, and protein-compound interactions. Here, we propose the first comprehensive database, namely ProNet DB, which incorporates multiple protein surface representations and RNA-binding landscapes for more than 33,000 protein structures, covering the proteomes from the AlphaFold Protein Structure Database (AlphaFold DB) and experimentally validated protein structures deposited in the Protein Data Bank (PDB). For each protein, we provide the original protein structure; surface property representations including hydrophobicity, charge distribution, hydrogen bonds, and interacting face; and the RNA-binding landscape, such as RNA binding sites and RNA binding preference. To interpret the protein surface property representation and RNA-binding landscape intuitively, we also integrate Mol* and Online 3D Viewer to visualize the representation on the protein surface. The pre-computed features are available to users instantaneously, and their potential applications include molecular mechanism exploration, drug discovery, and novel therapeutics development. The server is now available at https://proj.cse.cuhk.edu.hk/pronet/ and future releases will expand the species and property coverage.

Cross-lists for Tue, 17 May 22

[9]  arXiv:2205.07011 (cross-list from cs.IT) [pdf, other]
Title: ACID: A Low Dimensional Characterization of Markov-Modulated and Self-Exciting Counting Processes
Subjects: Information Theory (cs.IT); Probability (math.PR); Molecular Networks (q-bio.MN)

The conditional intensity (CI) of a counting process $Y_t$ is based on the minimal knowledge $\mathcal{F}_t^Y$, i.e., on the observation of $Y_t$ alone. Prominently, the mutual information rate of a signal and its Poisson channel output is a difference functional between the CI and the intensity that has full knowledge about the input. While the CI of Markov-modulated Poisson processes evolves according to Snyder's filter, self-exciting processes, e.g., Hawkes processes, specify the CI via the history of $Y_t$. The emergence of the CI as a self-contained stochastic process prompts us to bring its statistical ensemble into focus. We investigate the asymptotic conditional intensity distribution (ACID) and emphasize its rich information content. We assume the case in which the CI is determined from a sufficient statistic that progresses as a Markov process. We present a simulation-free method to compute the ACID when the dimension of the sufficient statistic is low. The method is made possible by introducing a backward recurrence time parametrization, which has the advantage of aligning all probability inflow in a boundary condition for the master equation. Case studies illustrate the usage of ACID for three primary examples: 1) the Poisson channel with binary Markovian input (as an example of a Markov-modulated Poisson process), 2) the standard Hawkes process with exponential kernel (as an example of a self-exciting counting process) and 3) the Gamma filter (as an example of an approximate filter to a Markov-modulated Poisson process).
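
To make the phrase "specify the CI via the history of $Y_t$" concrete, here is a minimal sketch of the conditional intensity of a Hawkes process with exponential kernel, in its usual textbook form $\lambda(t) = \mu + \sum_{t_i < t} \alpha e^{-\beta (t - t_i)}$. The parameter names `mu`, `alpha`, `beta` and the function name are assumptions for illustration and are not taken from the paper; the sketch evaluates the intensity only and does not implement the ACID method.

```python
import math
from typing import Sequence


def hawkes_intensity(t: float, event_times: Sequence[float],
                     mu: float, alpha: float, beta: float) -> float:
    """Conditional intensity at time t given the past event times.

    mu    : baseline rate (assumed name)
    alpha : jump added to the intensity by each event (assumed name)
    beta  : exponential decay rate of each event's influence (assumed name)
    """
    # Only events strictly before t contribute to the intensity at t.
    excitation = sum(alpha * math.exp(-beta * (t - s))
                     for s in event_times if s < t)
    return mu + excitation


if __name__ == "__main__":
    history = [0.2, 0.9, 1.1]
    for t in (0.0, 1.0, 2.0):
        print(t, hawkes_intensity(t, history, mu=0.5, alpha=0.8, beta=1.2))
```

With no past events the intensity equals the baseline `mu`; each event raises it by `alpha`, and that contribution then decays at rate `beta`.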

[10]  arXiv:2205.07249 (cross-list from cs.LG) [pdf, other]
Title: Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets
Comments: ICML 2022 accepted
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)

Deep generative models have achieved tremendous success in designing novel drug molecules in recent years. A new thread of work has shown great potential for advancing the specificity and success rate of in silico drug design by considering the structure of protein pockets. This setting poses fundamental computational challenges in sampling new chemical compounds that could satisfy multiple geometrical constraints imposed by pockets. Previous sampling algorithms either sample in the graph space or only consider the 3D coordinates of atoms while ignoring other detailed chemical structures such as bond types and functional groups. To address the challenge, we develop Pocket2Mol, an E(3)-equivariant generative network composed of two modules: 1) a new graph neural network capturing both spatial and bonding relationships between atoms of the binding pockets and 2) a new efficient algorithm which samples new drug candidates conditioned on the pocket representations from a tractable distribution without relying on MCMC. Experimental results demonstrate that molecules sampled from Pocket2Mol achieve significantly better binding affinity and other drug properties such as druglikeness and synthetic accessibility.

[11]  arXiv:2205.07309 (cross-list from cs.LG) [pdf, other]
Title: 3DLinker: An E(3) Equivariant Variational Autoencoder for Molecular Linker Design
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM)

Deep learning has achieved tremendous success in designing novel chemical compounds with desirable pharmaceutical properties. In this work, we focus on a new type of drug design problem -- generating a small "linker" to physically attach two independent molecules with their distinct functions. The main computational challenges include: 1) the generation of linkers is conditional on the two given molecules, in contrast to generating full molecules from scratch in previous works; 2) linkers heavily depend on the anchor atoms of the two molecules to be connected, which are not known beforehand; 3) 3D structures and orientations of the molecules need to be considered to avoid atom clashes, for which equivariance to the E(3) group is necessary. To address these problems, we propose a conditional generative model, named 3DLinker, which is able to predict anchor atoms and jointly generate linker graphs and their 3D structures based on an E(3) equivariant graph variational autoencoder. So far as we know, there are no previous models that could achieve this task. We compare our model with multiple conditional generative models modified from other molecular design tasks and find that our model has a significantly higher rate of recovering molecular graphs and, more importantly, accurately predicts the 3D coordinates of all the atoms.

[12]  arXiv:2205.07375 (cross-list from eess.SY) [pdf, other]
Title: Chetaev Instability Framework for Kinetostatic Compliance-Based Protein Unfolding
Comments: Accepted for Publication in IEEE Control Systems Letters (L-CSS)
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC); Biomolecules (q-bio.BM)

Understanding the process of protein unfolding plays a crucial role in various applications such as design of folding-based protein engines. Using the well-established kinetostatic compliance (KCM)-based method for modeling of protein conformation dynamics and a recent nonlinear control theoretic approach to KCM-based protein folding, this paper formulates protein unfolding as a destabilizing control analysis/synthesis problem. In light of this formulation, it is shown that the Chetaev instability framework can be used to investigate the KCM-based unfolding dynamics. In particular, a Chetaev function for analysis of unfolding dynamics under the effect of optical tweezers and a class of control Chetaev functions for synthesizing control inputs that elongate protein strands from their folded conformations are presented. Based on the presented control Chetaev function, an unfolding input is derived from the Artstein-Sontag universal formula and the results are compared against optical tweezer-based unfolding.

[13]  arXiv:2205.07575 (cross-list from cs.CV) [pdf, other]
Title: An automatic pipeline for atlas-based fetal and neonatal brain segmentation and analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)

The automatic segmentation of perinatal brain structures in magnetic resonance imaging (MRI) is of utmost importance for the study of brain growth and related complications. While different methods exist for adult and pediatric MRI data, there is a lack of automatic tools for the analysis of perinatal imaging. In this work, a new pipeline for fetal and neonatal segmentation has been developed. We also report the creation of two new fetal atlases, and their use within the pipeline for atlas-based segmentation, based on novel registration methods. The pipeline is also able to extract cortical and pial surfaces and compute features, such as curvature, thickness, sulcal depth, and local gyrification index. Results show that the introduction of the new templates together with our segmentation strategy leads to accurate results when compared to expert annotations, as well as better performance when compared to a reference pipeline (developing Human Connectome Project (dHCP)), for both early and late-onset fetal brains.

[14]  arXiv:2205.07582 (cross-list from cs.LG) [pdf]
Title: Chemical transformer compression for accelerating both training and inference of molecular modeling
Authors: Yi Yu, Karl Borjesson
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)

Transformer models have been developed in molecular science with excellent performance in applications including quantitative structure-activity relationship (QSAR) and virtual screening (VS). Compared with other types of models, however, they are large, which results in a high hardware requirement to reduce the time needed for both training and inference. In this work, cross-layer parameter sharing (CLPS) and knowledge distillation (KD) are used to reduce the sizes of transformers in molecular science. Both methods not only have competitive QSAR predictive performance as compared to the original BERT model, but also are more parameter efficient. Furthermore, by integrating CLPS and KD into a two-state chemical network, we introduce a new deep lite chemical transformer model, DeLiCaTe. DeLiCaTe captures general-domain as well as task-specific knowledge, and achieves a 4x faster rate of both training and inference due to a 10- and 3-times reduction of the number of parameters and layers, respectively. Meanwhile, it achieves comparable performance in QSAR and VS modeling. Moreover, we anticipate that the model compression strategy provides a pathway to the creation of effective generative transformer models for organic drug and material design.

Replacements for Tue, 17 May 22

[15]  arXiv:2009.01354 (replaced) [pdf]
Title: Unfolding selection to infer individual risk heterogeneity for optimising disease forecasts and policy development
Comments: 10 pages, 3 figures
Subjects: Populations and Evolution (q-bio.PE)
[16]  arXiv:2012.11665 (replaced) [pdf, ps, other]
Title: A new algebraic approach to genome rearrangement models
Comments: 32 pages. v2 more concise (former Sec. 4 removed)
Journal-ref: J. Math. Biol. 84, 49 (2022)
Subjects: Populations and Evolution (q-bio.PE); Rings and Algebras (math.RA)
[17]  arXiv:2109.08031 (replaced) [src]
Title: Accurately Modeling Biased Random Walks on Weighted Graphs Using $\textit{Node2vec+}$
Comments: The final analysis on gene classification in the previous version was incorrect. A bug in the code for GNN evaluation caused the GNN to have access to part of the testing data during training, and thus significantly biased the true testing evaluation for GNNs. A revision that corrects this error will be released shortly
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG); Molecular Networks (q-bio.MN)
[18]  arXiv:2110.01191 (replaced) [pdf, other]
Title: Molformer: Motif-based Roto-Translation Invariant Transformer on 3D Heterogeneous Molecular Graphs
Subjects: Quantitative Methods (q-bio.QM); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[19]  arXiv:2202.02245 (replaced) [pdf, other]
Title: Personalized visual encoding model construction with small data
Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV)
[20]  arXiv:2204.06071 (replaced) [pdf, other]
Title: Understanding Saliency Prediction with Deep Convolutional Neural Networks and Psychophysical Models
Authors: Qiang Li
Comments: Submitted to IEEE ICIP 2022
Subjects: Neurons and Cognition (q-bio.NC)
[21]  arXiv:2204.07532 (replaced) [pdf, other]
Title: Accurate ADMET Prediction with XGBoost
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)