197 195
198 196
199 197
200 198
201 199
202 200
203 201
204 202
205 203
206 204
207 205
208 206
209 207
210 209
211 210
212 211
213 212
214 213
215 214
216 215
217 216
218 217
219 218
220 219
221 220
222 221
223 222
224 223
225 224
226 225
227 226
228 227
229 228
230 229
231 230
232 231
233 232
234 233
235 234
236 235
237 236
238 237
239 238
240 239
241 241
242 243
243 244
244 245
245 246
246 247
247 248
248 249
249 250
250 251
251 252
252 253
253 255
254 256
255 257
256 258
257 259
258 260
259 261
260 262
261 263
262 264
263 265
264 266
265 267
266 268
267 269
268 270
269 271
270 272
271 273
272 274
273 275
274 276
275 277
276 278
277 279
278 280
279 281
280 282
281 283
282 284
283 285
284 286
285 287
286 288
287 289
288 290
289 291
290 293
291 295
292 296
293 297
294 298
295 299
296 300
297 301
298 302
299 303
300 304
301 305
302 306
303 307
304 308
305 309
306 310
307 311
308 312
309 313
310 314
311 315
312 316
313 317
314 318
315 319
316 320
317 321
318 322
319 323
320 324
321 325
322 326
323 327
324 328
325 329
326 330
327 331
328 332
329 333
330 334
331 335
332 337
333 338
334 339
335 340
336 341
337 342
338 343
339 344
340 345
341 346
342 347
343 348
344 349
345 352
346 353
347 354
348 355
349 356
350 357
351 358
352 359
353 360
354 361
355 362
356 363
WILEY SERIES IN PROBABILITY AND STATISTICS
Established by WALTER A. SHEWHART and SAMUEL S. WILKS
Editors
David J. Balding, Noel A.C. Cressie, Garrett M. Fitzmaurice, Harvey Goldstein, Geert Molenberghs, David W. Scott, Adrian F.M. Smith, and Ruey S. Tsay
Editors Emeriti
Vic Barnett, Ralph A. Bradley, J. Stuart Hunter, J.B. Kadane, David G. Kendall, and Jozef L. Teugels
A complete list of the titles in this series appears at the end of this volume.
Handbook of Regression Analysis With Applications in R
Second Edition
Samprit Chatterjee
New York University, New York, USA
Jeffrey S. Simonoff
New York University, New York, USA

This second edition first published 2020
© 2020 John Wiley & Sons, Inc
Edition History
Wiley‐Blackwell (1e, 2013)
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by law. Advice on how to obtain permission to reuse material from this title is available at http://www.wiley.com/go/permissions.
The right of Samprit Chatterjee and Jeffery S. Simonoff to be identified as the authors of this work has been asserted in accordance with law.
Registered Office
John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, USA
Editorial Office
111 River Street, Hoboken, NJ 07030, USA
For details of our global editorial offices, customer services, and more information about Wiley products visit us at www.wiley.com.
Wiley also publishes its books in a variety of electronic formats and by print‐on‐demand. Some content that appears in standard print versions of this book may not be available in other formats.
Limit of Liability/Disclaimer of Warranty
While the publisher and authors have used their best efforts in preparing this work, they make no representations or warranties with respect to the accuracy or completeness of the contents of this work and specifically disclaim all warranties, including without limitation any implied warranties of merchantability or fitness for a particular purpose. No warranty may be created or extended by sales representatives, written sales materials or promotional statements for this work. The fact that an organization, website, or product is referred to in this work as a citation and/or potential source of further information does not mean that the publisher and authors endorse the information or services the organization, website, or product may provide or recommendations it may make. This work is sold with the understanding that the publisher is not engaged in rendering professional services. The advice and strategies contained herein may not be suitable for your situation. You should consult with a specialist where appropriate. Further, readers should be aware that websites listed in this work may have changed or disappeared between when this work was written and when it is read. Neither the publisher nor authors shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages.
Library of Congress Cataloging‐in‐Publication Data
Names: Chatterjee, Samprit, 1938- author. | Simonoff, Jeffrey S., author.
Title: Handbook of regression analysis with applications in R / Professor
Samprit Chatterjee, New York University, Professor Jeffrey S. Simonoff,
New York University.
Other titles: Handbook of regression analysis
Description: Second edition. | Hoboken, NJ : Wiley, 2020. | Series: Wiley
series in probability and statistics | Revised edition of: Handbook of
regression analysis. 2013. | Includes bibliographical references and
index.
Identifiers: LCCN 2020006580 (print) | LCCN 2020006581 (ebook) | ISBN
9781119392378 (hardback) | ISBN 9781119392477 (adobe pdf) | ISBN
9781119392484 (epub)
Subjects: LCSH: Regression analysis--Handbooks, manuals, etc. | R (Computer
program language)
Classification: LCC QA278.2 .C498 2020 (print) | LCC QA278.2 (ebook) |
DDC 519.5/36--dc23
LC record available at https://lccn.loc.gov/2020006580
LC ebook record available at https://lccn.loc.gov/2020006581
Cover Design: Wiley
Cover Image: © Dmitriy Rybin/Shutterstock
Set in 10.82/12pt AGaramondPro by SPi Global, Chennai, India
Dedicated to everyone who labors in the field of statistics, whether they are students, teachers, researchers, or data analysts.
Preface to the Second Edition
The years since the first edition of this book appeared have been fast‐moving in the world of data analysis and statistics. Algorithmically‐based methods operating under the banner of machine learning, artificial intelligence, or data science have come to the forefront of public perceptions about how to analyze data, and more than a few pundits have predicted the demise of classic statistical modeling.
To paraphrase Mark Twain, we believe that reports of the (impending) death of statistical modeling in general, and regression modeling in particular, are exaggerated. The great advantage that statistical models have over “black box” algorithms is that in addition to effective prediction, their transparency also provides guidance about the actual underlying process (which is crucial for decision making), and affords the possibilities of making inferences and distinguishing real effects from random variation based on those models. There have been laudable attempts to encourage making machine learning algorithms interpretable in the ways regression models are (Rudin, 2019), but we believe that models based on statistical considerations and principles will have a place in the analyst's toolkit for a long time to come.
Читать дальше