Confidence intervals are notoriously difficult to understand at first encounter, and thus a standard Monte Carlo experiment in an introductory statistics course is to repeat the above experiment many times and illustrate that, on average, about a $(1-\alpha)$ proportion of such confidence intervals will contain the true mean. That is, for $r = 1, \ldots, n$, we generate an independent sample $Y_{r1}, \ldots, Y_{rm}$, calculate the mean $\bar{Y}_r$ and the sample variance $s_r^2$, and define $\delta_n$ to be
$$\delta_n = \frac{1}{n} \sum_{r=1}^{n} I\!\left( \bar{Y}_r - t_{m-1,\,1-\alpha/2}\, \frac{s_r}{\sqrt{m}} \;\le\; \mu \;\le\; \bar{Y}_r + t_{m-1,\,1-\alpha/2}\, \frac{s_r}{\sqrt{m}} \right),$$
where $I(\cdot)$ is the indicator function. By the law of large numbers, $\delta_n \to 1-\alpha$ with probability 1, as $n \to \infty$, and the following CLT holds:
$$\sqrt{n}\,\bigl(\delta_n - (1-\alpha)\bigr) \;\overset{d}{\to}\; N\bigl(0, \alpha(1-\alpha)\bigr).$$
In conducting this experiment, we must choose the Monte Carlo sample size $n$. A reasonable requirement here is that our estimator $\delta_n$ be accurate up to the second significant digit with roundoff; that is, we may allow a margin of error of 0.005. This implies that $n$ must be chosen so that
$$\sqrt{\frac{\alpha(1-\alpha)}{n}} \le 0.005 \quad \Longleftrightarrow \quad n \ge \frac{\alpha(1-\alpha)}{0.005^2}.$$
That is, to construct, say, a 95% confidence interval ($\alpha = 0.05$), an accurate Monte Carlo study in this simple example requires at least $0.0475/0.005^2 = 1900$ Monte Carlo samples. A higher precision would require an even larger simulation size! This is an example of an absolute-precision stopping rule (Section 5) and is unique in that the limiting variance is known. For further discussion of this example, see Frey [8].
Recall that $F$ is a $d$-dimensional target distribution, and interest is in estimating different features of $F$. In Monte Carlo simulation, we generate samples $X_1, X_2, \ldots, X_n$ either via IID sampling or via a Markov chain that has $F$ as its limiting distribution. For MCMC samples, we assume throughout that a Harris ergodic Markov chain is employed, ensuring convergence of sample statistics to (finite) population quantities (see Roberts and Rosenthal [9] for definitions).
The most common quantity of interest in Monte Carlo simulations is the expectation of a function of the target distribution. Let $\|\cdot\|$ denote the Euclidean norm, and let $g: \mathcal{X} \to \mathbb{R}^p$, so that interest is in estimating
$$\theta = \mathrm{E}_F\,[g(X)] = \int_{\mathcal{X}} g(x)\, F(dx),$$
where we assume $\mathrm{E}_F \|g(X)\| < \infty$. If $g$ is the identity, then the mean of the target is of interest. Alternatively, $g$ can be chosen so that moments or other quantities are of interest. A Monte Carlo estimator of $\theta$ is
$$\hat{\theta}_n = \frac{1}{n} \sum_{t=1}^{n} g(X_t).$$
For IID and MCMC sampling, the ergodic theorem implies that $\hat{\theta}_n \to \theta$ with probability 1 as $n \to \infty$. The Monte Carlo average $\hat{\theta}_n$ is naturally unbiased as long as the samples are either IID or the Markov chain is stationary.
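In the IID case the estimator is simply a sample average of the transformed draws. A minimal sketch, taking $F$ to be (purely for illustration) a Gamma(2, 1) target with $g(x) = x^2$, so that $\theta = \mathrm{E}[X^2] = \mathrm{Var}(X) + (\mathrm{E}[X])^2 = 2 + 4 = 6$:

```python
import numpy as np

rng = np.random.default_rng(7)

# Illustrative target: F = Gamma(shape=2, scale=1), with g(x) = x^2,
# so the true value is theta = E[X^2] = 6.
n = 200_000
x = rng.gamma(shape=2.0, scale=1.0, size=n)  # IID draws X_1, ..., X_n from F

theta_hat = np.mean(x**2)  # Monte Carlo average (1/n) * sum g(X_t)
print(theta_hat)           # close to 6 for large n
```

The same one-line average applies verbatim to MCMC output; only the way `x` is generated changes, and the ergodic theorem then supplies the convergence guarantee.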
Quantiles are of particular interest when constructing credible intervals from Bayesian posterior distributions or making boxplots from Monte Carlo simulations. In this section, we assume that $g$ is one-dimensional (i.e., $p = 1$). Extensions to $p > 1$ are straightforward but notationally involved [10]. For $0 < q < 1$, interest may be in estimating the $q$-quantile of the distribution of $g(X)$ under $F$. Let $F_g$ denote the distribution function of $g(X)$, assumed to be absolutely continuous with a continuous density $f_g$. The $q$-quantile associated with $F_g$ is
$$\xi_q = F_g^{-1}(q) = \inf\{x : F_g(x) \ge q\}.$$
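A natural Monte Carlo estimator of $\xi_q$ is the empirical $q$-quantile of $g(X_1), \ldots, g(X_n)$, which inverts the empirical distribution function in the same way the definition above inverts $F_g$. A minimal sketch, with an illustrative standard normal choice for the distribution of $g(X)$, whose true 0.9-quantile is about 1.2816:

```python
import numpy as np

rng = np.random.default_rng(11)

q = 0.9
n = 500_000
x = rng.standard_normal(n)  # illustrative draws with g(X) ~ N(0, 1)

# Empirical q-quantile of the Monte Carlo sample: a plug-in estimate of
# xi_q = inf{x : F_g(x) >= q}.
xi_hat = np.quantile(x, q)
print(xi_hat)  # close to 1.2816, the true 0.9-quantile of N(0, 1)
```

The absolute continuity of $F_g$ with continuous density $f_g$ assumed above is what makes this estimator well behaved: its asymptotic variance involves $1/f_g(\xi_q)^2$, so precision degrades in regions where the density is small.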