LibCat » Книги » Приключения » unrecognised » Nathalie Peyrard - Statistical Approaches for Hidden Variables in Ecology

Nathalie Peyrard - Statistical Approaches for Hidden Variables in Ecology

Здесь есть возможность читать онлайн «Nathalie Peyrard - Statistical Approaches for Hidden Variables in Ecology» — ознакомительный отрывок электронной книги совершенно бесплатно, а после прочтения отрывка купить полную версию. В некоторых случаях можно слушать аудио, скачать через торрент в формате fb2 и присутствует краткое содержание. Жанр: unrecognised, на английском языке. Описание произведения, (предисловие) а так же отзывы посетителей доступны на портале библиотеки ЛибКат.

Читать книгу

Название:
Statistical Approaches for Hidden Variables in Ecology
Автор:
Nathalie Peyrard
Жанр:
unrecognised / на английском языке
Год:
неизвестен
ISBN:
нет данных
Рейтинг книги:
5 / 5. Голосов: 1
Избранное:

Добавить в избранное
Отзывы:
Написать комментарий
Ваша оценка:
- 100
- 1
- 2
- 3
- 4
- 5

Statistical Approaches for Hidden Variables in Ecology: краткое содержание, описание и аннотация

Предлагаем к чтению аннотацию, описание, краткое содержание или предисловие (зависит от того, что написал сам автор книги «Statistical Approaches for Hidden Variables in Ecology»). Если вы не нашли необходимую информацию о книге — напишите в комментариях, мы постараемся отыскать её.

The study of ecological systems is often impeded by components that escape perfect observation, such as the trajectories of moving animals or the status of plant seed banks. These hidden components can be efficiently handled with statistical modeling by using hidden variables, which are often called latent variables. Notably, the hidden variables framework enables us to model an underlying interaction structure between variables (including random effects in regression models) and perform data clustering, which are useful tools in the analysis of ecological data.<br /><br />This book provides an introduction to hidden variables in ecology, through recent works on statistical modeling as well as on estimation in models with latent variables. All models are illustrated with ecological examples involving different types of latent variables at different scales of organization, from individuals to ecosystems. Readers have access to the data and R codes to facilitate understanding of the model and to adapt inference tools to their own data.

Statistical Approaches for Hidden Variables in Ecology — читать онлайн ознакомительный отрывок

Ниже представлен текст книги, разбитый по страницам. Система сохранения места последней прочитанной страницы, позволяет с удобством читать онлайн бесплатно книгу «Statistical Approaches for Hidden Variables in Ecology», без необходимости каждый раз заново искать на чём Вы остановились. Поставьте закладку, и сможете в любой момент перейти на страницу, на которой закончили чтение.

Тёмная тема

Шрифт:

↓

↑

Сбросить

Интервал:

↓

↑

Закладка:

Сделать

A further question concerns the extent to which activity is influenced by covariates (distance from a point of interest, time of day, etc.). One way of including covariates is to model their impact on the transition between activities (Calenge et al . 2009; Morales et al . 2004; Michelot et al . 2016).

For example, in the model presented here, ℙ ( Zt = j | Zt− 1= i ) is independent of t and takes a value of Π ( i, j ). Let us suppose that at each moment t , p covariates are measured and stored in a line vector x t. Transition probability can be linked to these variables according to a multiclass logistic regression approach:

[1.9] The first equation indicates that the probability of switching to a different - фото 26

The first equation indicates that the probability of switching to a different activity j from a current activity i is connected to external conditions via a linear combination of covariates at time t . β ( i, j ) is the column vector (of dimension p ) of the coefficients corresponding to the influence of each covariate on this probability. The second equation is a constraint equation that ensures that the vector (ℙ ( Zt = 1| Zt− 1= i )) , . . . , ℙ ( Zt = J | Zt− 1= i )) is a probability vector.

It is thus possible to take account of notions such as the fact that an individual will spend a longer period of time actively foraging in a location that is rich in food sources, while in a less favorable environment, it will rapidly switch to a traveling state in order to move to a better location. The inclusion of covariates in this model makes it possible to identify environmental variables, which favor particular states.

1.2.2.4. Example: a three-state HMM with Gaussian emission

Let us illustrate model [ 1.4] using a toy example. Consider an individual with a total of three possible behaviors. Thus, let J = 3 be the number of activities for this individual. Each of these activities is characterized by a different movement pattern: for example, a direct trajectory at high velocity, a more sinuous pattern at a lower speed, and a third, different pattern. As we have seen, these differences may be characterized using different metrics. In this example, we have chosen to model persistent velocity and turning velocity, using equations [ 1.7] and [ 1.8].

Thus, in model [ 1.4], ν 0is a probability vector of size 3, Π is a 3 × 3 matrix such that the sum of the elements in each line is equal to 1, and, for 1 ≤ j ≤ 3, distribution ( θj ) is a distribution ℕ ( μj, Σ j), where:

– μj is a vector of dimension 2 (the mean of V p and V r for activity j);

– Σj is a variance–covariance matrix (of size 2 × 2).

1.2.2.5. Inference

Using the model defined by [ 1.4], inference is used to fulfill two purposes:

– Estimation of activity: to determine the distribution of real activities given the observations, that is, for 0 ≤ t ≤ n, the distribution of the random variable Zt|Y0:n. For each time t, the estimated smoothing distribution gives the probability of being involved in each of the j activities.

– Estimation of parameters: the distribution ν0 and the transition matrix Π characterize the dynamic of activities, and the set of parameters {θj} 1 ≤ j ≤ J indicates the way in which the activity influences the distribution of observations.

In the case of unknown parameters, these two steps are carried out conjointly. Taking a frequentist approach, the EM algorithm may be used, as in section 1.2.1. Once again, it is easy to write an equation, analogous to [ 1.2], giving the full likelihood.

For this model, step E once again consists of calculating the quantity given by [ 1.4]. Again, the difficulty lies in calculating the smoothing distribution. Nevertheless, the discrete character of the hidden dynamic means that explicit calculation is possible. This is carried out iteratively using the forward–backward algorithm. The equations used in this simple and efficient algorithm can be found in Rabiner (1989).

Step M, in which the parameters are updated, is dependent on the nature of the emission distribution. In the Gaussian case, this step has an explicit expression; conversely, this is not the case when using a distribution such as von Mises.

Bayesian estimation may be applied by using Monte Carlo Markov Chain (MCMC) algorithms, which are found in programs such as Stan (Carpenter et al . 2017), Winbugs (Lunn et al . 2000) or NIMBLE (de Valpine et al . 2017).

1.2.2.6. Reconstruction of hidden states

The reconstruction of hidden activities allows us to identify homogeneous phases in behaviors, and is often of considerable interest from an ecological perspective. This hidden Markov model may thus be seen as an unsupervised segmentation/ classification model for movement.

One possibility is to reconstruct the most likely hidden activity for each time increment in turn, taking Statistical Approaches for Hidden Variables in Ecology - изображение 27 such that

[1.10] This method is known as the maximum a posteriori MAP method One possible - фото 28

This method is known as the maximum a posteriori (MAP) method. One possible problem with the estimator [ 1.10] is that it provides no guarantee that the sequence картинка 29 0:nwill be coherent with the transition matrix Π; it may give a result of Statistical Approaches for Hidden Variables in Ecology - изображение 30 t = 1 and t +1= 2 for an estimation , for example.

Using a Bayesian approach, the sampling algorithms used to estimate parameters permit the use of a joint smoothing distribution, that is, samples of Z 0:n| Y 0:ncan be obtained. Each sample produced is thus a possible sequence of activities corresponding to given observations.

Sampling across this distribution can also be carried out in conjunction with a frequentist approach, but the combinatorial level is high and the computational effort involved rapidly becomes prohibitive as n increases. Hidden activities are most commonly reconstructed using the most probable sequence of hidden states, that is, which maximizes the overall a posteriori distribution, or, more formally,

This sequence can be calculated in an efficient manner using the Viterbi - фото 33

This sequence can be calculated in an efficient manner using the Viterbi algorithm, and is the version which is generally returned by libraries offering frequentist estimation. Note that the m most probable sequences can be obtained using the generalized Viterbi algorithm (Guédon 2007).