Stochastic Biological Evolution

Application oriented research topic of WIAS

DE |
EN

This topic is currently not treated at the WIAS.

Evolution on Random Fitness Landscapes

Mutation-selection model describes the evolution on molecular level where the competition between the main driving forces of Darwinian evolution, mutation and selection, occurs at the same time scale. The effects coming from a heterogenous, highly complex fitness landscape have received attraction. In this respect, the study of mathematical evolution models where the fitness landscape is given by a collection of random variables are received as a good starting point. Mathematically, this system is closely related to a branching random walk in a random environment, something that the WIAS intends to explore more in future. Here, the evolution on the sequential space of DNA codes is studied, where the fitness landscape is given by an independent collection of random variables. For a wide range of distributions, which have fat enough upper tails, the time scales are identified upon which the make up of the population makes a shift from a configuration of all aparticles being the type of the starting to a one consists those of the type of the fittest.

Stochastic Encounter-Mating Models

Another type of slective force acting on evolution is due to the availability of a mate to produce offspring with, known as the sexual selection. Evolution of mating preferences, its effects on mating patterns and the role of the encounter mechanism are important aspects of sexual selection. At WIAS recently the study of these questions was taken up by introducing a stochastic encounter-mating model with which it is possible to capture the different relations of these aspects in one model. One of the most important results that was reached is the classification of different mating preferences and encounter mechanisms that lead to panmictic mating patterns, that is, uniform mating. Moreover, in another study it was observed that the dynamics of the mating patterns of infinite populations under individual encounter is related to certain Lotka-Volterra systems.

Other Aspects of Population Genetics

An interesting phenomena observed in some organisms is the creation of seed banks, seeds that are kept in latency, only to be used later, as a protection against possible long lasting hazardeous external conditions. At WIAS one is able to describe the ancestral relations of the populations using moment duality techniques of the mathematical sees-bank model.

Recently, experiments that try to understand the evolution in controlled environments, such as the famous Lenski experiment, have received great attention. At WIAS, a mathematical model for the Lenski experiment is studied. The logarithmic increase of the mean fitness function is proved, which is actually observed in these experiments.

Publications

Monographs

TH. Dickhaus, Simultaneous Statistical Inference, Springer, Berlin et al., 2014, 180 pages, (Monograph Published).

Articles in Refereed Journals

T. Iyer, On a sufficient condition for explosion in CMJ branching processes and applications to recursive trees, Electronic Communications in Probability, 29 (2024), pp. 46/1--46/12, DOI 10.1214/24-ECP616 .
Abstract
We provide sufficient criteria for explosion in Crump-Mode-Jagers branching process, via the process producing an infinite path in finite time. As an application, we deduce a curious phase-transition in the infinite tree associated with a class of recursive tree models with fitness, showing that in one regime every node in the tree has infinite degree, whilst in another, the tree is locally finite, with a unique infinite path. The latter class encompasses many models studied in the literature, including the weighted random recursive tree, the preferential attachment tree with additive fitness, and the Bianconi-Barab´asi model, or preferential attachment tree with multiplicative fitness.
J. Kern, B. Wiederhold, A Lambda-Fleming--Viot type model with intrinsically varying population size, Electronic Journal of Probability, 29 (2024), pp. 125/1--125/28, DOI 10.1214/24-EJP1185 .
Abstract
We propose an extension of the classical ?-Fleming-Viot model to intrinsically varying pop- ulation sizes. During events, instead of replacing a proportion of the population, a random mass dies and a, possibly different, random mass of new individuals is added. The model can also incorporate a drift term, representing infinitesimally small, but frequent events. We investigate el- ementary properties of the model, analyse its relation to the Λ-Fleming-Viot model and describe a duality relationship. Through the lookdown framework, we provide a forward-in-time analysis of fixation and coming down from infinity. Furthermore, we present a new duality argument allowing one to deduce well-posedness of the measure-valued process without the necessity of proving uniqueness of the associated lookdown martingale problem.
B. Jahnel, Ch. Külske, Gibbsianness and non-Gibbsianness for Bernoulli lattice fields under removal of isolated sites, Bernoulli. Official Journal of the Bernoulli Society for Mathematical Statistics and Probability, 29 (2023), pp. 3013--3032, DOI 10.3150/22-BEJ1572 .
Abstract
We consider the i.i.d. Bernoulli field μ _p on Z ^d with occupation density p ∈ [0,1]. To each realization of the set of occupied sites we apply a thinning map that removes all occupied sites that are isolated in graph distance. We show that, while this map seems non-invasive for large p, as it changes only a small fraction p(1-p)^2d of sites, there is p(d) <1 such that for all p ∈ (p(d), 1) the resulting measure is a non-Gibbsian measure, i.e., it does not possess a continuous version of its finite-volume conditional probabilities. On the other hand, for small p, the Gibbs property is preserved.
N. Fountoulakis, T. Iyer, Condensation phenomena in preferential attachment trees with neighbourhood influence, Electronic Journal of Probability, 27 (2022), pp. 76/1--76/49, DOI 10.1214/22-EJP787 .
Abstract
We introduce a model of evolving preferential attachment trees where vertices are assigned weights, and the evolution of a vertex depends not only on its own weight, but also on the weights of its neighbours. We study the distribution of edges with endpoints having certain weights, and the distribution of degrees of vertices having a given weight. We show that the former exhibits a condensation phenomenon under a certain critical condition, whereas the latter converges almost surely to a distribution that resembles a power law distribution. Moreover, in the absence of condensation, we prove almost-sure setwise convergence of the related quantities. This generalises existing results on the Bianconi-BarabÃ¡si tree as well as on an evolving tree model introduced by the second author.
A. Caiazzo, F. Caforio, G. Montecinos, L.O. Müller, P.J. Blanco, E.F. Toro, Assessment of reduced order Kalman filter for parameter identification in one-dimensional blood flow models using experimental data, International Journal of Numerical Methods in Biomedical Engineering, 33 (2017), pp. e2843/1--e2843/26, DOI 10.1002/cnm.2843 .
Abstract
This work presents a detailed investigation of a parameter estimation approach based on the reduced order unscented Kalman filter (ROUKF) in the context of one-dimensional blood flow models. In particular, the main aims of this study are (i) to investigate the effect of using real measurements vs. synthetic data (i.e., numerical results of the same in silico model, perturbed with white noise) for the estimation and (ii) to identify potential difficulties and limitations of the approach in clinically realistic applications in order to assess the applicability of the filter to such setups. For these purposes, our numerical study is based on the in vitro model of the arterial network described by [Alastruey et al. 2011, J. Biomech. bf 44], for which experimental flow and pressure measurements are available at few selected locations. In order to mimic clinically relevant situations, we focus on the estimation of terminal resistances and arterial wall parameters related to vessel mechanics (Young's modulus and thickness) using few experimental observations (at most a single pressure or flow measurement per vessel). In all cases, we first perform a theoretical identifiability analysis based on the generalized sensitivity function, comparing then the results obtained with the ROUKF, using either synthetic or experimental data, to results obtained using reference parameters and to available measurements.
O. Gün, A. Yilmaz, The stochastic encounter-mating model, Acta Applicandae Mathematicae. An International Survey Journal on Applying Mathematics and Mathematical Applications, 148 (2017), pp. 71--102.
A. González Casanova Soberón, N. Kurt, A. Wakolbinger, L. Yuan, An individual-based mathematical model for the Lenski experiment, and the deceleration of the relative fitness, Stochastic Processes and their Applications, 126 (2016), pp. 2211--2252.
J. Blath, A. González Casanova Soberón, N. Kurt, M. Wilke-Berenguer, A new coalescent for seed-bank models, The Annals of Applied Probability, 26 (2016), pp. 857--891.
K. Schildknecht, S. Olek, Th. Dickhaus, Simultaneous statistical inference for epigenetic data, PLOS ONE, 10 (2015), pp. e0125587/1--e0125587/15.
Abstract
Epigenetic research leads to complex data structures. Since parametric model assumptions for the distribution of epigenetic data are hard to verify we introduce in the present work a nonparametric statistical framework for two-group comparisons. Furthermore, epigenetic analyses are often performed at various genetic loci simultaneously. Hence, in order to be able to draw valid conclusions for specific loci, an appropriate multiple testing correction is necessary. Finally, with technologies available for the simultaneous assessment of many interrelated biological parameters (such as gene arrays), statistical approaches also need to deal with a possibly unknown dependency structure in the data. Our statistical approach to the nonparametric comparison of two samples with independent multivariate observables is based on recently developed multivariate multiple permutation tests. We adapt their theory in order to cope with families of hypotheses regarding relative effects. Our results indicate that the multivariate multiple permutation test keeps the pre-assigned type I error level for the global null hypothesis. In combination with the closure principle, the family-wise error rate for the simultaneous test of the corresponding locus/parameter-specific null hypotheses can be controlled. In applications we demonstrate that group differences in epigenetic data can be detected reliably with our methodology.
J. Blath, A. González Casanova Soberón, B. Eldon, N. Kurt, M. Wilke-Berenguer, Genetic variability under the seedbank coalescent, Genetics, 200 (2015), pp. 921--934.
Abstract
We analyze patterns of genetic variability of populations in the presence of a large seedbank with the help of a new coalescent structure called the seedbank coalescent. This ancestral process appears naturally as a scaling limit of the genealogy of large populations that sustain seedbanks, if the seedbank size and individual dormancy times are of the same order as those of the active population. Mutations appear as Poisson processes on the active lineages and potentially at reduced rate also on the dormant lineages. The presence of "dormant" lineages leads to qualitatively altered times to the most recent common ancestor and nonclassical patterns of genetic diversity. To illustrate this we provide a Wright-Fisher model with a seedbank component and mutation, motivated from recent models of microbial dormancy, whose genealogy can be described by the seedbank coalescent. Based on our coalescent model, we derive recursions for the expectation and variance of the time to most recent common ancestor, number of segregating sites, pairwise differences, and singletons. Estimates (obtained by simulations) of the distributions of commonly employed distance statistics, in the presence and absence of a seedbank, are compared. The effect of a seedbank on the expected site-frequency spectrum is also investigated using simulations. Our results indicate that the presence of a large seedbank considerably alters the distribution of some distance statistics, as well as the site-frequency spectrum. Thus, one should be able to detect from genetic data the presence of a large seedbank in natural populations.
TH. Dickhaus, Th. Royen, On multivariate chi-square distributions and their applications in testing multiple hypotheses, Statistics. A Journal of Theoretical and Applied Statistics, (2015), pp. 427--454.
Abstract
We are considered with three different types of multivariate chi-square distributions. Their members play important roles as limiting distributions of vectors of test statistics in several applications of multiple hypotheses testing. We explain these applications and provide formulas for computing multiplicity-adjusted $p$-values under the respective global hypothesis.
M. Hesse, A. Kyprianou, The total mass of super-Brownian motion upon exiting balls and Sheu's compact support condition, Stochastic Processes and their Applications, 124 (2014), pp. 2003--2022.
Abstract
We study the total mass of a d-dimensional super-Brownian motion as it first exits an increasing sequence of balls. The total mass process is a time-inhomogeneous continuous-state branching process, where the increasing radii of the balls are taken as the time-parameter. We characterise its time-dependent branching mechanism and show that it converges, as time goes to infinity, towards the branching mechanism of the total mass of a one-dimensional super-Brownian motion as it first crosses above an increasing sequence of levels.
Our results identify the compact support criterion in Sheu (1994) as Grey's condition (1974) for the aforementioned limiting branching mechanism.
V. Heinrich, T. Kamphans, J. Stange, D. Parkhomchuk, Th. Dickhaus, J. Hecht, P.N. Robinson, P.M. Krawitz, Estimating exome genotyping accuracy by comparing to data from large scale sequencing projects, Genome Medicine, 5 (2013), pp. 69/1--69/11.
I. Türbachova, T. Schwachula, I. Vasconcelos, A. Mustea, T. Baldinger, K.A. Jones, H. Bujard, A. Olek, K. Olek, K. Gellhaus, I. Braicu, D. Könsgen, Ch. Fryer, E. Ravot, A. Hellwag, N. Westerfeld, J.O. Gruss, M. Meissner, M. Hassan, M. Weber, U. Hofmüller, S. Zimmermann, Ch. Loddenkemper, S. Mahner, N. Babel, E. Berns, R. Adams, R. Zeilinger, U. Baron, I. Vergote, T. Maughan, F. Marme, Th. Dickhaus, J. Sehouli, S. Olek, The cellular ratio of immune tolerance (immunoCRIT) is a definite marker for aggressiveness of solid tumors and may explain tumor dissemination patterns, Epigenetics, 8 (2013), pp. 1226--1235.
M. Schreuder, J. Höhne, B. Blankertz, S. Haufe, Th. Dickhaus, M. Tangermann, Optimizing event-related potential based brain-computer interfaces: A systematic evaluation of dynamic stopping methods, Journal of Neural Engineering, 10 (2013), pp. 036025/1--036025/13.
TH. Dickhaus, J. Stange, Multiple point hypothesis test problems and effective numbers of tests for control of the family-wise error rate, Calcutta Statistical Association Bulletin, 65 (2013), pp. 123--144.

Contributions to Collected Editions

J. Blath, E. Bjarki, A. González Casanova Soberón, N. Kurt, Genealogy of a Wright--Fisher model with strong seedbank component, in: XI Symposium of Probability and Stochastic Processes, R.H. Mena, J.C. Pardo, V. Rivero, G. Uribe Bravo, eds., 69 of Birkhäuser Progress in Probability, Springer International Publishing, Switzerland, 2015, pp. 81--100.
H.-J. Mucha, H.-G. Bartel, Resampling techniques in cluster analysis: Is subsampling better than bootstrapping?, in: Data Science, Learning by Latent Structures, and Knowledge Discovery, B. Lausen, S. Krolak-Schwerdt, M. Böhmer, eds., Studies in Classification, Data Analysis and Knowledge Organization, Springer, Berlin et al., 2015, pp. 113--122.

Preprints, Reports, Technical Reports

J. Bäumler, T. Iyer, Permutations in competing growth processes and balls-in-bins, Preprint no. 3220, WIAS, Berlin, 2025, DOI 10.20347/WIAS.PREPRINT.3220 .
Abstract, PDF (393 kByte)
Consider a model of N independent, increasing ℕ₀-valued processes, with random, independent waiting times between jumps. It is known that there is either an emergent `leader', in which a single process possesses the maximal value for all sufficiently large times, or every pair of processes alternates leadership infinitely often. We show that in the latter regime, almost surely, one sees every possible permutation of rankings of processes infinitely often. In the case that the waiting times are exponentially distributed, this proves a conjecture from Spencer (appearing in a paper from Oliveira) on the `balls-in-bins' process with feedback [8, Conjecture1].
H. Shafigh, Dormancy in random environment: Symmetric exclusion, Preprint no. 3166, WIAS, Berlin, 2025, DOI 10.20347/WIAS.PREPRINT.3166 .
Abstract, PDF (368 kByte)
In this paper, we study a spatial model for dormancy in random environment via a two-type branching random walk in continuous-time, where individuals can switch between dormant and active states through spontaneous switching independent of the random environment. However, the branching mechanism is governed by a random environment which dictates the branching rates, namely the simple symmetric exclusion process. We will interpret the presence of the exclusion particles either as emphcatalysts, accelerating the branching mechanism, or as emphtraps, aiming to kill the individuals. The difference between active and dormant individuals is defined in such a way that dormant individuals are protected from being trapped, but do not participate in migration or branching. We quantify the influence of dormancy on the growth resp. survival of the population by identifying the large-time asymptotics of the expected population size. The starting point for our mathematical considerations and proofs is the parabolic Anderson model via the Feynman-Kac formula. In particular, the quantitative investigation of the role of dormancy is done by extending the Parabolic Anderson model to a two-type random walk
T. Iyer, Persistent hubs in CMJ branching processes with independent increments and preferential attachment trees, Preprint no. 3138, WIAS, Berlin, 2024, DOI 10.20347/WIAS.PREPRINT.3138 .
Abstract, PDF (512 kByte)
A sequence of trees (T_n) _{n ∈ N} contains a emphpersistent hub, or displays emphdegree centrality, if there is a fixed node of maximal degree for all sufficiently large n ∈ N. We derive sufficient criteria for the emergence of a persistent hub in genealogical trees associated with Crump-Mode-Jagers branching processes with independent waiting times between births of individuals, and sufficient criteria for the non-emergence of a persistent hub. We also derive criteria for uniqueness of these persistent hubs. As an application, we improve results in the l iterature concerning the emergence of unique persistent hubs in generalised preferential attachment trees, in particular, allowing for cases where there may not be a emphMalthusian parameter associated with the process. The approach we use is mostly self-contained, and does not rely on prior results about Crump-Mode-Jagers branching processes
T. Iyer, Fixation of leadership in non-Markovian growth processes, Preprint no. 3137, WIAS, Berlin, 2024, DOI 10.20347/WIAS.PREPRINT.3137 .
Abstract, PDF (462 kByte)
Consider a model where N equal agents possess `values', belonging to N₀, that are subject to incremental growth over time. More precisely, the values of the agents are represented by N independent, increasing N₀ valued processes with random, independent waiting times between jumps. We show that the event that a single agent possesses the maximum value for all sufficiently large values of time (called `leadership') occurs with probability zero or one, and provide necessary and sufficient conditions for this to occur. Under mild conditions we also provide criteria for a single agent to become the unique agent of maximum value for all sufficiently large times, and also conditions for the emergence of a unique agent having value that tends to infinity before `explosion' occurs (i.e. conditions for `strict leadership' or `monopoly' to occur almost surely). The novelty of this model lies in allowing non-exponentially distributed waiting times between jumps in value. In the particular case when waiting times are mixtures of exponential distributions, we improve a well-established result on the `balls in bins' model with feedback, removing the requirement that the feedback function be bounded from below and also allowing random feedback functions. As part of the proofs we derive necessary and sufficient conditions for the distribution of a convergent series of independent random variables to have an atom on the real line, a result which we believe may be of interest in its own right.
H. Shafigh, A spatial model for dormancy in random environment, Preprint no. 3136, WIAS, Berlin, 2024, DOI 10.20347/WIAS.PREPRINT.3136 .
Abstract, PDF (627 kByte)
In this paper, we introduce a spatial model for dormancy in random environment via a two-type branching random walk in continuous-time, where individuals can switch between dormant and active states through spontaneous switching independent of the random environment. However, the branching mechanism is governed by a random environment which dictates the branching rates. We consider three specific choices for random environments composed of particles: (1) a Bernoulli field of immobile particles, (2) one moving particle, and (3) a Poisson field of moving particles. In each case, the particles of the random environment can either be interpreted as emphcatalysts, accelerating the branching mechanism, or as emphtraps, aiming to kill the individuals. The different between active and dormant individuals is defined in such a way that dormant individuals are protected from being trapped, but do not participate in migration or branching. We quantify the influence of dormancy on the growth resp.,survival of the population by identifying the large-time asymptotics of the expected population size. The starting point for our mathematical considerations and proofs is the parabolic Anderson model via the Feynman-Kac formula. Especially, the quantitative investigation of the role of dormancy is done by extending the Parabolic Anderson model to a two-type random walk.

Talks, Poster

T. Iyer, Fixiation of leadership in growth processes, Probability Seminar, University of Melbourne, School of Mathematics and Statistics, Melbourne, Australia, October 1, 2024.
T. Iyer, Persistent hubs in generalised preferential attachment trees, Seminar Series in Probability and Combinatorics, Uppsala University, Department of Mathematics, Uppsala, Sweden, May 23, 2024.
T. Iyer, Persistent hubs in generalised preferential attachment trees, Seminar Series in Mathematical Statistics, Stockholm University, Department of Mathematics, Stockholm, Sweden, May 22, 2024.
T. Iyer, Persistent hubs in generalised preferential attachment trees, Monash Probability and Statistics Seminar, Monash University, School of Mathematics, Melbourne, Australia, November 3, 2024.
T. Iyer, Persistent hubs in preferential attachment trees, Oberseminar AG Stochastik, Technische Universität Darmstadt, Fachbereich Mathematik, June 20, 2024.
B. Jahnel, First-passage percolation and chase-escape dynamics on random geometric graphs, Stochastic Geometry Days, November 15 - 19, 2021, Dunkerque, France, November 17, 2021.
B. Jahnel, Stochastic geometry for epidemiology (online talk), Monday Biostatistics Roundtable, Institute of Biometry and Clinical Epidemiology (Online Event), Campus Charité, June 14, 2021.
A. González Casanova Soberón, An individual-based model for the Lenski experiment, and the deceleration of the relative fitness, Workshop on Probabilistic Models in Biology, October 24 - 30, 2015, Playa del Carmen, Mexico, October 28, 2015.
A. González Casanova Soberón, Modeling the Lenski experiment, Genealogies in Evolution: Looking Backward and Forward, Workshop of the Priority Program (SPP) 1590 ``Probabilistic Structures in Evolution'', October 5 - 6, 2015, Goethe-Universität Frankfurt, October 6, 2015.
O. Gün, Stochastic encounter-mating model, Mathematical Model in Ecology and Evolution (MMEE 2015), July 7 - 13, 2015, Collège de France, Paris, France, July 8, 2015.
M. Hesse, Asymptotic growth of a branching random walk in a random environment on the hypercube, Friedrich-Alexander-Universität Erlangen-Nürnberg, Department Mathematik, December 4, 2014.
TH. Dickhaus, Simultaneous Bayesian analysis of contingency tables in genetic association studies, Bayesian Biostatistics 2014, July 2 - 5, 2014, University of Zurich, Switzerland, July 2, 2014.
TH. Dickhaus, Simultaneous Bayesian analysis of contingency tables in genetic association studies, International Workshop ``Advances in Optimization and Statistics'', May 15 - 16, 2014, Russian Academy of Sciences, Institute of Information Transmission Problems (Kharkevich Institute), Moscow, May 15, 2014.

Contact

Prof. Dr. Wolfgang König

Contributing Groups of WIAS

Interacting Random Systems