Iain Pardoe - Applied Regression Modeling

Здесь есть возможность читать онлайн «Iain Pardoe - Applied Regression Modeling» — ознакомительный отрывок электронной книги совершенно бесплатно, а после прочтения отрывка купить полную версию. В некоторых случаях можно слушать аудио, скачать через торрент в формате fb2 и присутствует краткое содержание. Жанр: unrecognised, на английском языке. Описание произведения, (предисловие) а так же отзывы посетителей доступны на портале библиотеки ЛибКат.

Applied Regression Modeling: краткое содержание, описание и аннотация

Предлагаем к чтению аннотацию, описание, краткое содержание или предисловие (зависит от того, что написал сам автор книги «Applied Regression Modeling»). Если вы не нашли необходимую информацию о книге — напишите в комментариях, мы постараемся отыскать её.

Master the fundamentals of regression without learning calculus with this one-stop resource The newly and thoroughly revised 3rd Edition of
delivers a concise but comprehensive treatment of the application of statistical regression analysis for those with little or no background in calculus. Accomplished instructor and author Dr. Iain Pardoe has reworked many of the more challenging topics, included learning outcomes and additional end-of-chapter exercises, and added coverage of several brand-new topics including multiple linear regression using matrices.
The methods described in the text are clearly illustrated with multi-format datasets available on the book's supplementary website. In addition to a fulsome explanation of foundational regression techniques, the book introduces modeling extensions that illustrate advanced regression strategies, including model building, logistic regression, Poisson regression, discrete choice models, multilevel models, Bayesian modeling, and time series forecasting. Illustrations, graphs, and computer software output appear throughout the book to assist readers in understanding and retaining the more complex content.
covers a wide variety of topics, like:
Simple linear regression models, including the least squares criterion, how to evaluate model fit, and estimation/prediction Multiple linear regression, including testing regression parameters, checking model assumptions graphically, and testing model assumptions numerically Regression model building, including predictor and response variable transformations, qualitative predictors, and regression pitfalls Three fully described case studies, including one each on home prices, vehicle fuel efficiency, and pharmaceutical patches Perfect for students of any undergraduate statistics course in which regression analysis is a main focus,
also belongs on the bookshelves of non-statistics graduate students, including MBAs, and for students of vocational, professional, and applied courses like data science and machine learning.

Applied Regression Modeling — читать онлайн ознакомительный отрывок

Ниже представлен текст книги, разбитый по страницам. Система сохранения места последней прочитанной страницы, позволяет с удобством читать онлайн бесплатно книгу «Applied Regression Modeling», без необходимости каждый раз заново искать на чём Вы остановились. Поставьте закладку, и сможете в любой момент перейти на страницу, на которой закончили чтение.

Тёмная тема
Сбросить

Интервал:

Закладка:

Сделать

2 1.2 Assume that final scores in a statistics course follow a normal distribution with mean and standard deviation . Find the scores, , that correspond to the:90th percentile (i.e., find such that );99th percentile (i.e., find such that );5th percentile (i.e., find such that ). Suppose represents potential values of repeated sample means from this population for samples of size (e.g., average class scores). Use the normal version of the central limit theorem to find the mean scores, , that correspond to the:90th percentile (i.e., find such that );99th percentile (i.e., find such that );5th percentile (i.e., find such that ).If the bottom 5% of the class fail, what is the cut‐off percentage to pass the class?The university requires the long‐term average class score for this course to be no higher than 75%. Does this requirement seem feasible?

3 1.3 The NBASALARY data file contains salary information for 214 guards in the National Basketball Association (NBA) for 2009–2010 (obtained from the online USA Today NBA Salaries Database).Construct a histogram of the variable, representing 2009–2010 salaries in thousands of dollars [computer help #14].What would we expect the histogram to look like if the data were normal?Construct a QQ‐plot of the variable [computer help #22].What would we expect the QQ‐plot to look like if the data were normal?Compute the natural logarithm of guard salaries (call this variable ) [computer help #6], and construct a histogram of this variable [computer help #14]. Hint: The “natural logarithm” transformation (also known as “log to base‐e,” or by the symbols or ln) is a way to transform (rescale) skewed data to make them more symmetric and normal.Construct a QQ‐plot of the variable [computer help #22].Based on the plots in parts (a), (c), (e), and (f), say whether salaries or log‐salaries more closely follow a normal curve, and justify your response.

4 1.4 A company's pension plan includes 50 mutual funds, with each fund expected to earn a mean, , of 3% over the risk‐free rate with a standard deviation of %. Based on the assumption that the funds are randomly selected from a population of funds with normally distributed returns in excess of the risk‐free rate, find the probability that an individual fund's return in excess of the risk‐free rate is, respectively, greater than 34.1%, greater than 15.7%, or less than %. In other words, if represents potential values of individual fund returns, find:;;. Use the normal version of the central limit theorem to approximate the probability that the pension plan's overall mean return in excess of the risk‐free rate is, respectively, greater than 7.4%, greater than 4.8%, or less than 0.7%. In other words, if represents potential values of repeated sample means, find:;;.

5 1.5 Consider the data on 2009–2010 salaries of 214 NBA guards from Problem 1.3.Calculate a 95% confidence interval for the population mean in thousands of dollars [computer help #23]. Hint: Calculate by hand (using the fact that the sample mean of is 3980.318, the sample standard deviation is 4525.378, and the 97.5th percentile of the t‐distribution with 213 degrees of freedom is approximately 1.971) and check your answer using statistical software.Consider , the natural logarithms of the salaries. The sample mean of is 7.664386. Re‐express this number in thousands of dollars (the original units of salary).Hint: To back‐transform a number in natural logarithms to its original scale, use the “exponentiation” function on a calculator [denoted exp(X) or , where X is the variable expressed in natural logarithms]. This is because exp((Y)) Y.Compute a 95% confidence interval for the population mean in natural logarithms of thousands of dollars [computer help #23].Hint: Calculate by hand (using the fact that the sample mean of is 7.664386, the sample standard deviation of is 1.197118, and the 97.5th percentile of the t‐distribution with 213 degrees of freedom is approximately 1.971) and check your answer using statistical software.Re‐express each interval endpoint of your 95% confidence interval computed in part (c) in thousands of dollars and say what this interval means in words.The confidence interval computed in part (a) is exactly symmetric about the sample mean of . Is the confidence interval computed in part (d) exactly symmetric about the sample mean of back‐transformed to thousands of dollars that you computed in part (b)? How does this relate to quantifying our uncertainty about the population mean salary?Hint: Looking at the histogram from Problem 3 part (a), if someone asked you to give lower and upper bounds on the population mean salary using your intuition rather than statistics, would you give a symmetric or an asymmetric interval?

6 1.6 The FINALSCORES data file contains values of variable , which measures final scores in a statistics course.Calculate the sample mean and sample standard deviation of [computer help #10].Calculate a 90% confidence interval for the population mean of [computer help #23]. Hint: Calculate by hand (using the sample mean and sample standard deviation from part (a), and the 95th percentile of the t‐distribution with 99 degrees of freedom, which is approximately 1.660) and check your answer using statistical software.

7 1.7 Gapminder is a “non‐profit venture promoting sustainable global development and achievement of the United Nations Millennium Development Goals.” It provides related time series data for all countries in the world at the website www.gapminder.org . For example, the COUNTRIES data file contains the 2010 population count (variable in millions) of the 55 most populous countries together with 2010 life expectancy at birth (variable in years).Calculate the sample mean and sample standard deviation of [computer help #10].Briefly say why calculating a confidence interval for the population mean would not be useful for understanding mean population counts for all countries in the world.Consider the variable , which represents the average number of years a newborn child would live if current mortality patterns were to stay the same. Suppose that for this variable, these 55 countries could be considered a random sample from the population of all countries in the world. Calculate a 95% confidence interval for the population mean of [computer help #23]. Hint: Calculate by hand (using the fact that the sample mean of is 69.787, the sample standard deviation is 9.2504, and the 97.5th percentile of the t‐distribution with 54 degrees of freedom is approximately 2.005) and check your answer using statistical software.

8 1.8 Consider the FINALSCORES data file from Problem 1.6.Do a hypothesis test to determine whether there is sufficient evidence at a significance level of 5% to conclude that the population mean of is greater than 66 [computer help #24].Repeat part (a) but test whether the population mean of is less than 73.Repeat part (a) but test whether the population mean of is not equal to 66.

9 1.9 Consider the COUNTRIES data file from Problem 1.7. A journalist speculates that the population mean of is greater than 68 years. Based on the sample of 55 countries, a smart statistics student thinks that there is insufficient evidence to conclude this. Do a hypothesis test to show who is correct based on a significance level of 5% [computer help #24].Hint: Make sure that you lay out all the steps involved—as in Section 1.6.1—and include a short sentence summarizing your conclusion; that is, who do you think is correct, the journalist or the student?

10 1.10Consider the housing market represented by the sale prices in the HOMES1 data file.As suggested in Section 1.3, calculate the probability of finding an affordable home (less than ) in this housing market. Assume that the population of sale prices () is normal, with mean and standard deviation .As suggested in Section 1.5, calculate a 90% confidence interval for the population mean in this housing market. Recall that the sample mean , the sample standard deviation , and the sample size . Check your answer using statistical software [computer help #23].Practice the mechanics of hypothesis tests by conducting the following tests using a significance level of 5%.: versus : ;: versus : ;: versus : ;: versus : .As suggested in Section 1.7, calculate a 90% prediction interval for an individual sale price in this market.

Читать дальше
Тёмная тема
Сбросить

Интервал:

Закладка:

Сделать

Похожие книги на «Applied Regression Modeling»

Представляем Вашему вниманию похожие книги на «Applied Regression Modeling» списком для выбора. Мы отобрали схожую по названию и смыслу литературу в надежде предоставить читателям больше вариантов отыскать новые, интересные, ещё непрочитанные произведения.


Отзывы о книге «Applied Regression Modeling»

Обсуждение, отзывы о книге «Applied Regression Modeling» и просто собственные мнения читателей. Оставьте ваши комментарии, напишите, что Вы думаете о произведении, его смысле или главных героях. Укажите что конкретно понравилось, а что нет, и почему Вы так считаете.

x