LibCat » Книги » Приключения » unrecognised » Bhisham C. Gupta - Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP

Bhisham C. Gupta - Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP

Здесь есть возможность читать онлайн «Bhisham C. Gupta - Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP» — ознакомительный отрывок электронной книги совершенно бесплатно, а после прочтения отрывка купить полную версию. В некоторых случаях можно слушать аудио, скачать через торрент в формате fb2 и присутствует краткое содержание. Жанр: unrecognised, на английском языке. Описание произведения, (предисловие) а так же отзывы посетителей доступны на портале библиотеки ЛибКат.

Читать книгу

Название:
Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP
Автор:
Bhisham C. Gupta
Жанр:
unrecognised / на английском языке
Год:
неизвестен
ISBN:
нет данных
Рейтинг книги:
4 / 5. Голосов: 1
Избранное:

Добавить в избранное
Отзывы:
Написать комментарий
Ваша оценка:
- 80
- 1
- 2
- 3
- 4
- 5

Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP: краткое содержание, описание и аннотация

Предлагаем к чтению аннотацию, описание, краткое содержание или предисловие (зависит от того, что написал сам автор книги «Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP»). Если вы не нашли необходимую информацию о книге — напишите в комментариях, мы постараемся отыскать её.

Introduces basic concepts in probability and statistics to data science students, as well as engineers and scientists Aimed at undergraduate/graduate-level engineering and natural science students, this timely, fully updated edition of a popular book on statistics and probability shows how real-world problems can be solved using statistical concepts. It removes Excel exhibits and replaces them with R software throughout, and updates both MINITAB and JMP software instructions and content. A new chapter discussing data mining—including big data, classification, machine learning, and visualization—is featured. Another new chapter covers cluster analysis methodologies in hierarchical, nonhierarchical, and model based clustering. The book also offers a chapter on Response Surfaces that previously appeared on the book’s companion website.
Statistics and Probability with Applications for Engineers and Scientists using MINITAB, R and JMP, Second Edition Features two new chapters—one on Data Mining and another on Cluster Analysis Now contains R exhibits including code, graphical display, and some results MINITAB and JMP have been updated to their latest versions Emphasizes the p-value approach and includes related practical interpretations Offers a more applied statistical focus, and features modified examples to better exhibit statistical concepts Supplemented with an Instructor's-only solutions manual on a book’s companion website
is an excellent text for graduate level data science students, and engineers and scientists. It is also an ideal introduction to applied statistics and probability for undergraduate students in engineering and the natural sciences.

Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP — читать онлайн ознакомительный отрывок

Ниже представлен текст книги, разбитый по страницам. Система сохранения места последней прочитанной страницы, позволяет с удобством читать онлайн бесплатно книгу «Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP», без необходимости каждый раз заново искать на чём Вы остановились. Поставьте закладку, и сможете в любой момент перейти на страницу, на которой закончили чтение.

Тёмная тема

Шрифт:

↓

↑

Сбросить

Интервал:

↓

↑

Закладка:

Сделать

Table 2.3.2Frequency distribution for the data in Table 2.3.1.

		Frequency	Cumulative		Cumulative
Categories	Tally	or count	frequency	Percentage	percentage
1	///// ///// ///// ///// ///// ///	28	28	25.45	25.45
2	///// ///// ///// ///// ///// /	26	54	23.64	49.09
3	///// ///// ///// /////	20	74	18.18	67.27
4	///// ///// ///// /	16	90	14.55	81.82
5	///// ///// ///// /////	20	110	18.18	100.00
Total		110		100.00

Interestingly, we can put technology to work on data in Table 2.3.1to produce Table 2.3.2.

Example 2.3.2(Industrial revenue) Using MINITAB and R, construct a frequency distribution table for the data in Table 2.3.1.

Solution:

MINITAB

1 Enter the data in column C1 of the Worksheet Window and name it Categories.

2 From the Menu bar, select Stat Tables Tally Individual Variables

3 In this dialog box, enter C1 in the box under Variables.

4 Check all the boxes under Display and click OK.

5 The frequency distribution table as shown below appears in the Session window.

This frequency distribution table may also be obtained by using R as follows:

USING R

R has built in ‘table()’ function that can be used to get the basic frequency distribution of categorical data. To get the cumulative frequencies, we can apply built in ‘cumsum()’ function to tabulated frequency data. Then using the ‘cbind()’ function we combine categories, frequencies, cumulative frequencies, and cumulative percentages to build the final distribution table. In addition, we can use the ‘colnames()’ function to name the columns of the final table as needed. The task can be completed running the following R code in R Console window.

#Assign given data to the variable data data = c(4,3,5,3,4,1,2,3,4,3,1,5,3,4,2,1,1,4,5,3,2,5,2,5,2,1,2,3,3,2, 1,5,3,2,1,1,2,1,2,4,5,3,5,1,3,1,2,1,4,1,4,5,4,1,1,2,4,1,4,1,2,4,3,4,1, 4,1,4,1,2,1,5,3,1,5,2,1,2,3,1,2,2,1,1,2,1,5,3,2,5,5,2,5,3,5,2,3,2,3,5, 2,3,5,5,2,3,2,5,1,4) #To get frequencies data.freq = table(data) #To combine necessary columns freq.dist = cbind(data.freq, cumsum(data.freq), 100*cumsum(data.freq)/sum(data.freq)) #To name the table columns colnames(freq.dist) = c(‘Frequency’,‘Cum.Frequency’,‘Cum Percentage’) freq.dist #R output

	Frequency	Cum.Frequency	Cum Percentage
1	28.00	28.00	25.45
2	26.00	54.00	49.09
3	20.00	74.00	67.27
4	16.00	90.00	81.82
5	20.00	110.00	100.00

Note that sometimes a quantitative data set is such that it consists of only a few distinct observations that occur repeatedly. These kind of data are usually summarized in the same manner as the categorical data. The categories are represented by the distinct observations. We illustrate this scenario with the following example.

Example 2.3.3(Hospital data) The following data show the number of coronary artery bypass graft surgeries performed at a hospital in a 24‐hour period for each of the last 50 days. Bypass surgeries are usually performed when a patient has multiple blockages or when the left main coronary artery is blocked. Construct a frequency distribution table for these data.

1	2	1	5	4	2	3	1	5	4	3	4	6	2	3	3	2	2	3	5	2	5	3	4	3
1	3	2	2	4	2	6	1	2	6	6	1	4	5	4	1	4	2	1	2	5	2	2	4	3

Solution:In this example, the variable of interest is the number of bypass surgeries performed at a hospital in a period of 24 hours. Now, following the discussion in Example 2.3.1, we can see that the frequency distribution table for the data in this example is as shown in Table 2.3.3. Frequency distribution table defined by using a single numerical value is usually called a single‐valued frequency distribution table.

Table 2.3.3Frequency distribution table for the hospital data.

		Frequency	Cumulative		Cumulative
Categories	Tally	or count	frequency	Percentage	percentage
1	///// ///	8	8	16.00	16.00
2	///// ///// ////	14	22	28.00	44.00
3	///// ////	9	31	18.00	62.00
4	///// ////	9	40	18.00	80.00
5	///// /	6	46	12.00	92.00
6	////	4	50	8.00	100.00
Total		50		100.00

2.3.2 Quantitative Data

So far, we have discussed frequency distribution tables for qualitative data and quantitative data that can be treated as qualitative data. In this section, we discuss frequency distribution tables for quantitative data.

Let Statistics and Probability with Applications for Engineers and Scientists Using MINITAB R and JMP - изображение 142 be a set of quantitative data values. To construct a frequency distribution table for this data set, we follow the steps given below.

1 Step 1. Find the range of the data that is defined as(2.3.1)

2 Step 2. Divide the data set into an appropriate number of classes. The classes are also sometimes called categories, cells, or bins. There are no hard and fast rules to determine the number of classes. As a rule, the number of classes, say , should be somewhere between 5 and 20. However, Sturges's formula is often used, given by (2.3.2)or (2.3.3)where is the total number of data points in a given data set and log denotes the log to base 10. The result often gives a good estimate for an appropriate number of intervals. Note that since , the number of classes, should always be a whole number, the reader may have to round up or down the value of obtained when using either equation (2.3.2) or (2.3.3).

3 Step 3. Determine the width of classes as follows: (2.3.4)The class width should always be a number that is easy to work with, preferably a whole number. Furthermore, this number should be obtained only by rounding up (never by rounding down) the value obtained when using equation (2.3.4).

4 Step 4. Finally, preparing the frequency distribution table is achieved by assigning each data point to an appropriate class. While assigning these data points to a class, one must be particularly careful to ensure that each data point be assigned to one, and only one, class and that the whole set of data is included in the table. Another important point is that the class at the lowest end of the scale must begin at a number that is less than or equal to the smallest data point and that the class at the highest end of the scale must end with a number that is greater than or equal to the largest data point in the data set.