LibCat » Книги » Приключения » unrecognised » Bhisham C. Gupta - Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP

Bhisham C. Gupta - Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP

Здесь есть возможность читать онлайн «Bhisham C. Gupta - Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP» — ознакомительный отрывок электронной книги совершенно бесплатно, а после прочтения отрывка купить полную версию. В некоторых случаях можно слушать аудио, скачать через торрент в формате fb2 и присутствует краткое содержание. Жанр: unrecognised, на английском языке. Описание произведения, (предисловие) а так же отзывы посетителей доступны на портале библиотеки ЛибКат.

Читать книгу

Название:
Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP
Автор:
Bhisham C. Gupta
Жанр:
unrecognised / на английском языке
Год:
неизвестен
ISBN:
нет данных
Рейтинг книги:
4 / 5. Голосов: 1
Избранное:

Добавить в избранное
Отзывы:
Написать комментарий
Ваша оценка:
- 80
- 1
- 2
- 3
- 4
- 5

Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP: краткое содержание, описание и аннотация

Предлагаем к чтению аннотацию, описание, краткое содержание или предисловие (зависит от того, что написал сам автор книги «Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP»). Если вы не нашли необходимую информацию о книге — напишите в комментариях, мы постараемся отыскать её.

Introduces basic concepts in probability and statistics to data science students, as well as engineers and scientists Aimed at undergraduate/graduate-level engineering and natural science students, this timely, fully updated edition of a popular book on statistics and probability shows how real-world problems can be solved using statistical concepts. It removes Excel exhibits and replaces them with R software throughout, and updates both MINITAB and JMP software instructions and content. A new chapter discussing data mining—including big data, classification, machine learning, and visualization—is featured. Another new chapter covers cluster analysis methodologies in hierarchical, nonhierarchical, and model based clustering. The book also offers a chapter on Response Surfaces that previously appeared on the book’s companion website.
Statistics and Probability with Applications for Engineers and Scientists using MINITAB, R and JMP, Second Edition Features two new chapters—one on Data Mining and another on Cluster Analysis Now contains R exhibits including code, graphical display, and some results MINITAB and JMP have been updated to their latest versions Emphasizes the p-value approach and includes related practical interpretations Offers a more applied statistical focus, and features modified examples to better exhibit statistical concepts Supplemented with an Instructor's-only solutions manual on a book’s companion website
is an excellent text for graduate level data science students, and engineers and scientists. It is also an ideal introduction to applied statistics and probability for undergraduate students in engineering and the natural sciences.

Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP — читать онлайн ознакомительный отрывок

Ниже представлен текст книги, разбитый по страницам. Система сохранения места последней прочитанной страницы, позволяет с удобством читать онлайн бесплатно книгу «Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP», без необходимости каждый раз заново искать на чём Вы остановились. Поставьте закладку, и сможете в любой момент перейти на страницу, на которой закончили чтение.

Тёмная тема

Шрифт:

↓

↑

Сбросить

Интервал:

↓

↑

Закладка:

Сделать

Definition 2.1.6

A list of all sampling units is called the sampling frame .

The most commonly used sample design is the simple random sampling design , which consists of selecting картинка 125 (sample size) sampling units in such a way that each sampling unit has the same chance of being selected. If, however, the population is finite of size картинка 126 , say, then the simple random sampling design may be defined as selecting картинка 127 sampling units in such a way that each possible sample of size картинка 128 has the same chance of being selected. The number of such samples of size картинка 129 that may be formed from a finite population of size картинка 130 is discussed in Section 3.4.3.

Example 2.1.1(Simple random sampling) Suppose that an engineer wants to take a sample of machine parts manufactured during a shift at a given plant. Since the parts from which the engineer wants to take the sample are manufactured during the same shift at the same plant, it is quite safe to assume that all parts are representative. Hence in this case, a simple random sampling design should be appropriate.

The second sampling design is the stratified random sampling design , which may give improved results for the same amount of money spent for simple random sampling. However, a stratified random sampling design is appropriate when a population can be divided into various nonoverlapping groups called strata . The sampling units in each stratum are similar but differ from stratum to stratum. Each stratum is treated as a subpopulation, and a simple random sample is taken from each of these subpopulations or strata.

In the manufacturing world, this type of sampling situation arises quite often. For instance, in Example 2.1.1, if the sample is taken from a population of parts manufactured either in different plants or in different shifts, then stratified random sampling can be more appropriate than simple random sampling. In addition, there is the advantage of administrative convenience. For example, if the machine parts are manufactured in plants located in different parts of the country, then stratified random sampling can be beneficial. Often, each plant (stratum) has a sampling department that can conduct the random sampling within each plant. In order to obtain best results in this case, the sampling departments in all the plants need to communicate with one another before sampling in order to ensure that the same sampling norms are followed. Another example of stratified random sampling in manufacturing occurs when samples are taken of products that are produced in different batches; here, products produced in different batches constitute the different strata.

A third kind of sampling design is systematic random sampling . The systematic random sampling procedure is the easiest one. This sampling scheme is particularly useful in manufacturing processes, when the sampling is done from a continuously operating assembly line. Under this scheme, a first item is selected randomly and thereafter every Statistics and Probability with Applications for Engineers and Scientists Using MINITAB R and JMP - изображение 131 th item manufactured is selected until we have a sample of the desired size ( картинка 133 ). Systematic sampling is not only easy to employ but, under certain conditions, is also more precise than simple random sampling.

The fourth and last sampling design is cluster random sampling . In cluster sampling, each sampling unit is a group of smaller units. In the manufacturing environment, this sampling scheme is particularly useful since it is difficult to prepare a list of each part that constitutes a frame. On the other hand, it may be easier to prepare a list of boxes in which each box contains many parts. Thus, in this case, a cluster random sample is merely a simple random sample of these boxes. Another advantage of cluster sampling is that by selecting a simple random sample of only a few clusters, we can in fact have quite a large sample of smaller units. Such sampling is achieved at minimum cost, since both preparing the frame and taking the sample are much more economical. In preparing any frame, we must define precisely the characteristic of interest or variable, where a variable may be defined as follows:

Definition 2.1.7

A variable is a characteristic of interest that may take different values for different elements.

For example, an instructor is interested in finding the ages, heights, weights, GPA, gender, and family incomes of all the students in her engineering class. Thus, in this example, the variables (characteristics of interest) are ages, heights, weights, GPA, gender, and family incomes.

2.2 Classification of Various Types of Data

In practice, it is common to collect a large amount of nonnumerical and/or numerical data on a daily basis. For example, we may collect data concerning customer satisfaction, comments of employees, or perceptions of suppliers. Or we may track the number of employees in various departments of a company or check weekly production volume in units produced and sales dollars per unit of time, and so on. All the data collected, however, cannot be treated the same way as there are differences in types of data. Accordingly, statistical data can normally be divided into two major categories:

Qualitative

Quantitative

Each of these categories can be further subdivided into two subcategories each. The two subcategories of qualitative data are nominal and ordinal , whereas the two subcategories of quantitative data are interval and ratio . We may summarize this classification of statistical data as in Figure 2.2.1.

The classification of data as nominal, ordinal, interval, and ratio is arranged in the order of the amount of information they can provide. Nominal data provide minimum information, whereas ratio data provide maximum information.

Figure 2.2.1Classifications of statistical data.

2.2.1 Nominal Data

As previously mentioned, nominal data contain the smallest amount of information. Only symbols are used to label categories of a population. For example, production part numbers with a 2003 prefix are nominal data, wherein the 2003 prefix indicates only that the parts were produced in 2003 (in this case, the year 2003 serves as the category). No arithmetic operation, such as addition, subtraction, multiplication, or division, can be performed on numbers representing nominal data. As another example, jersey numbers of baseball, football, or soccer players are nominal. Thus, adding any two jersey numbers and comparing with another number makes no sense. Other examples of nominal data are ID numbers of workers, account numbers used by a financial institution, ZIP codes, telephone numbers, sex, or color.