LibCat » Книги » Приключения » unrecognised » Douglas C. Montgomery - Introduction to Linear Regression Analysis

Douglas C. Montgomery - Introduction to Linear Regression Analysis

Здесь есть возможность читать онлайн «Douglas C. Montgomery - Introduction to Linear Regression Analysis» — ознакомительный отрывок электронной книги совершенно бесплатно, а после прочтения отрывка купить полную версию. В некоторых случаях можно слушать аудио, скачать через торрент в формате fb2 и присутствует краткое содержание. Жанр: unrecognised, на английском языке. Описание произведения, (предисловие) а так же отзывы посетителей доступны на портале библиотеки ЛибКат.

Читать книгу

Название:
Introduction to Linear Regression Analysis
Автор:
Douglas C. Montgomery
Жанр:
unrecognised / на английском языке
Год:
неизвестен
ISBN:
нет данных
Рейтинг книги:
4 / 5. Голосов: 1
Избранное:

Добавить в избранное
Отзывы:
Написать комментарий
Ваша оценка:
- 80
- 1
- 2
- 3
- 4
- 5

Introduction to Linear Regression Analysis: краткое содержание, описание и аннотация

Предлагаем к чтению аннотацию, описание, краткое содержание или предисловие (зависит от того, что написал сам автор книги «Introduction to Linear Regression Analysis»). Если вы не нашли необходимую информацию о книге — напишите в комментариях, мы постараемся отыскать её.

A comprehensive and current introduction to the fundamentals of regression analysis Introduction to Linear Regression Analysis, 6th Edition The new edition focuses on four key areas of improvement over the fifth edition:
New exercises and data sets New material on generalized regression techniques The inclusion of JMP software in key areas Carefully condensing the text where possible
skillfully blends theory and application in both the conventional and less common uses of regression analysis in today's cutting-edge scientific research. The text equips readers to understand the basic principles needed to apply regression model-building techniques in various fields of study, including engineering, management, and the health sciences.

Introduction to Linear Regression Analysis — читать онлайн ознакомительный отрывок

Ниже представлен текст книги, разбитый по страницам. Система сохранения места последней прочитанной страницы, позволяет с удобством читать онлайн бесплатно книгу «Introduction to Linear Regression Analysis», без необходимости каждый раз заново искать на чём Вы остановились. Поставьте закладку, и сможете в любой момент перейти на страницу, на которой закончили чтение.

Тёмная тема

Шрифт:

↓

↑

Сбросить

Интервал:

↓

↑

Закладка:

Сделать

Figure 27 JMP output for the simple linear regression model for the patient - фото 234

Figure 2.7 JMP output for the simple linear regression model for the patient satisfaction data.

Low values for R 2occur occasionally in practice. The model is significant, there are no obvious problems with assumptions or other indications of model inadequacy, but the proportion of variability explained by the model is low. Now this is not an entirely disastrous situation. There are many situations where explaining 30 to 40% of the variability in y with a single predictor provides information of considerable value to the analyst. Sometimes, a low value of R 2results from having a lot of variability in the measurements of the response due to perhaps the type of measuring instrument being used, or the skill of the person making the measurements. Here the variability in the response probably arises because the response is an expression of opinion, which can be very subjective. Also, the measurements are taken on human patients, and there can be considerably variability both within people and between people. Sometimes, a low value of R 2is a result of a poorly specified model. In these cases the model can often be improved by the addition of one or more predictor or regressor variables. We see in Chapter 3that the addition of another regressor results in considerable improvement of this model.

2.8 DOES PITCHING WIN BASEBALL GAMES?

Part of the never-ending debate about what makes a winning baseball team is the theory that a team cannot win consistently without good pitching. Baseball is a sport deeply rooted in analytics and there is an ever-growing body of data available. Major league baseball now has a system in all stadiums that includes cameras and other sensors to collect data on every pitch, including pitch speed, player positions, launch angle for balls hit in the air, and so forth. Table B.22 contains a summary of performance for 2016 for all National and American League baseball teams. The response variable is the number of games won and among the various statistics listed is the team earned run average (ERA), a standard measure of pitching performance, with low values of team ERA generally attributed to an outstanding pitching staff. Figure 2.8is the JMP output from fitting a linear regression model to wins versus the ERA data. The plots of wins versus ERA and actual versus predicted show that there is definitely a linear relationship between these variables. The model is significant and it seems that one point of team ERA is equivalent to about 18.6 wins. JMP also produces a plot of residuals versus the predicted number of wins. Residual plots such as this are useful in assessing model adequacy. We will discuss this in detail later but this plot indicates that there is no structure in the relationship between the residuals and the predicted number of wins. This is one indication that the model fit is satisfactory. However, the model only explains about 63% of the variability in the response. While this is not bad, it does suggest that there may be other useful explanatory variables.

Figure 28 JMP output for the model relating team wins to team ERA for the 2016 - фото 235

Figure 2.8 JMP output for the model relating team wins to team ERA for the 2016 baseball season.

2.9 USING SAS® AND R FOR SIMPLE LINEAR REGRESSION

The purpose of this section is to introduce readers to SAS and to R. Appendix Dgives more details about using SAS, including how to import data from both text and EXCEL files. Appendix Eintroduces the R statistical software package. R is becoming increasingly popular since it is free over the Internet.

Table 2.7 gives the SAS source code to analyze the rocket propellant data that we have been analyzing throughout this chapter. Appendix Dprovides detail explaining how to enter the data into SAS. The statement PROC REG tells the software that we wish to perform an ordinary least-squares linear regression analysis. The “model” statement specifies the specific model and tells the software which analyses to perform. The variable name to the left of the equal sign is the response. The variables to the right of the equal sign but before the solidus are the regressors. The information after the solidus specifies additional analyses. By default, SAS prints the analysis-of-variance table and the tests on the individual coefficients. In this case, we have specified three options: “p” asks SAS to print the predicted values, “clm” (which stands for confidence limit, mean) asks SAS to print the confidence band, and “cli” (which stands for confidence limit, individual observations) asks SAS to print the prediction band.

Table 2.8 gives the SAS output for this analysis. PROC REG always produces the analysis-of-variance table and the information on the parameter estimates. The “p clm cli” options on the model statement produced the remainder of the output file.

TABLE 2.7 SAS Code for Rocket Propellant Data

`data rocket;`
`input shear age;`
`cards;`
`2158.70`	`15.50`
`1678.15`	`23.75`
`2316.00`	`8.00`
`2061.30`	`17.00`
`2207.50`	`5.50`
`1708.30`	`19.00`
`1784.70`	`24.00`
`2575.00`	`2.50`
`2357.90`	`7.50`
`2256.70`	`11.00`
`2165.20`	`13.00`
`2399.55`	`3.75`
`1779.80`	`25.99`
`2336.75`	`9.75`
`1765.30`	`22.00`
`2053.50`	`18.00`
`2414.40`	`6.00`
`2200.50`	`12.50`
`2654.20`	`2.00`
`1753.70`	`21.50`
`proc reg;`
`model shear=age/p clm cli;`
`run;`

TABLE 2.8 SAS Output for Analysis of Rocket Propellant Data.

SAS also produces a log file that provides a brief summary of the SAS session - фото 236

SAS also produces a log file that provides a brief summary of the SAS session. The log file is almost essential for debugging SAS code. Appendix Dprovides more details about this file.

R is a popular statistical software package, primarily because it is freely available at www.r-project.org. An easier-to-use version of R is R Commander. R itself is a high-level programming language. Most of its commands are prewritten functions. It does have the ability to run loops and call other routines, for example, in C. Since it is primarily a programming language, it often presents challenges to novice users. The purpose of this section is to introduce the reader as to how to use R to analyze simple linear regression data sets.