Machine Learning For Dummies®, 2nd Edition
Published by: John Wiley & Sons, Inc.,111 River Street, Hoboken, NJ 07030-5774, www.wiley.com
Copyright © 2021 by John Wiley & Sons, Inc., Hoboken, New Jersey
Published simultaneously in Canada
No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning or otherwise, except as permitted under Sections 107 or 108 of the 1976 United States Copyright Act, without the prior written permission of the Publisher. Requests to the Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, (201) 748-6011, fax (201) 748-6008, or online at http://www.wiley.com/go/permissions
.
Trademarks:Wiley, For Dummies, the Dummies Man logo, Dummies.com, Making Everything Easier, and related trade dress are trademarks or registered trademarks of John Wiley & Sons, Inc. and may not be used without written permission. All other trademarks are the property of their respective owners. John Wiley & Sons, Inc. is not associated with any product or vendor mentioned in this book.
LIMIT OF LIABILITY/DISCLAIMER OF WARRANTY: THE PUBLISHER AND THE AUTHOR MAKE NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE ACCURACY OR COMPLETENESS OF THE CONTENTS OF THIS WORK AND SPECIFICALLY DISCLAIM ALL WARRANTIES, INCLUDING WITHOUT LIMITATION WARRANTIES OF FITNESS FOR A PARTICULAR PURPOSE. NO WARRANTY MAY BE CREATED OR EXTENDED BY SALES OR PROMOTIONAL MATERIALS. THE ADVICE AND STRATEGIES CONTAINED HEREIN MAY NOT BE SUITABLE FOR EVERY SITUATION. THIS WORK IS SOLD WITH THE UNDERSTANDING THAT THE PUBLISHER IS NOT ENGAGED IN RENDERING LEGAL, ACCOUNTING, OR OTHER PROFESSIONAL SERVICES. IF PROFESSIONAL ASSISTANCE IS REQUIRED, THE SERVICES OF A COMPETENT PROFESSIONAL PERSON SHOULD BE SOUGHT. NEITHER THE PUBLISHER NOR THE AUTHOR SHALL BE LIABLE FOR DAMAGES ARISING HEREFROM. THE FACT THAT AN ORGANIZATION OR WEBSITE IS REFERRED TO IN THIS WORK AS A CITATION AND/OR A POTENTIAL SOURCE OF FURTHER INFORMATION DOES NOT MEAN THAT THE AUTHOR OR THE PUBLISHER ENDORSES THE INFORMATION THE ORGANIZATION OR WEBSITE MAY PROVIDE OR RECOMMENDATIONS IT MAY MAKE. FURTHER, READERS SHOULD BE AWARE THAT INTERNET WEBSITES LISTED IN THIS WORK MAY HAVE CHANGED OR DISAPPEARED BETWEEN WHEN THIS WORK WAS WRITTEN AND WHEN IT IS READ.
For general information on our other products and services, please contact our Customer Care Department within the U.S. at 877-762-2974, outside the U.S. at 317-572-3993, or fax 317-572-4002. For technical support, please visit https://hub.wiley.com/community/support/dummies
.
Wiley publishes in a variety of print and electronic formats and by print-on-demand. Some material included with standard print versions of this book may not be included in e-books or in print-on-demand. If this book refers to media such as a CD or DVD that is not included in the version you purchased, you may download this material at http://booksupport.wiley.com
. For more information about Wiley products, visit www.wiley.com
.
Library of Congress Control Number: 2020952332
ISBN: 978-1-119-72401-8
ISBN 978-1-119-72406-3 (ebk); ISBN 978-1-119-72405-6 (ebk)
Machine Learning For Dummies®
To view this book's Cheat Sheet, simply go to www.dummies.comand search for “Machine Learning For Dummies Cheat Sheet” in the Search box.
Table of Contents
1 Cover
2 Title Page
3 Copyright
4 Introduction About This Book Foolish Assumptions Icons Used in This Book Beyond the Book Where to Go from Here
5 Part 1: Introducing How Machines Learn Chapter 1: Getting the Real Story about AI Moving beyond the Hype Dreaming of Electric Sheep Overcoming AI Fantasies Considering the Relationship between AI and Machine Learning Considering AI and Machine Learning Specifications Defining the Divide between Art and Engineering Predicting the Next AI Winter Chapter 2: Learning in the Age of Big Data Considering the Machine Learning Essentials Defining Big Data Considering the Sources of Big Data Specifying the Role of Statistics in Machine Learning Understanding the Role of Algorithms Defining What Training Means Chapter 3: Having a Glance at the Future Creating Useful Technologies for the Future Discovering the New Work Opportunities with Machine Learning Avoiding the Potential Pitfalls of Future Technologies
6 Part 2: Preparing Your Learning Tools Chapter 4: Installing a Python Distribution Using Anaconda for Machine Learning Installing Anaconda on Linux Installing Anaconda on Mac OS X Installing Anaconda on Windows Downloading the Datasets and Example Code Chapter 5: Beyond Basic Coding in Python Defining the Basics You Should Know Storing Data Using Sets, Lists, and Tuples Defining Useful Iterators Indexing Data Using Dictionaries Chapter 6: Working with Google Colab Defining Google Colab Getting a Google Account Working with Notebooks Performing Common Tasks Using Hardware Acceleration Viewing Your Notebook Executing the Code Sharing Your Notebook Getting Help
7 Part 3: Getting Started with the Math Basics Chapter 7: Demystifying the Math Behind Machine Learning Working with Data Exploring the World of Probabilities Describing the Use of Statistics Chapter 8: Descending the Gradient Acknowledging Different Kinds of Learning The learning process Optimizing with big data Chapter 9: Validating Machine Learning Considering the Use of Example Data Checking Out-of-Sample Errors Training, Validating, and Testing Optimizing by Cross-Validation Avoiding Sample Bias and Leakage Traps Chapter 10: Starting with Simple Learners Discovering the Incredible Perceptron Growing Greedy Classification Trees Taking a Probabilistic Turn
8 Part 4: Learning from Smart and Big Data Chapter 11: Preprocessing Data Gathering and Cleaning Data Repairing Missing Data Transforming Distributions Creating Your Own Features Delimiting Anomalous Data Chapter 12: Leveraging Similarity Measuring Similarity between Vectors Using Distances to Locate Clusters Tuning the K-Means Algorithm Finding Similarity by K-Nearest Neighbors Chapter 13: Working with Linear Models the Easy Way Starting to Combine Features Mixing Features of Different Types Switching to Probabilities Guessing the Right Features Learning One Example at a Time Chapter 14: Hitting Complexity with Neural Networks Revising the Perceptron Representing the Way of Learning of a Network Introducing Deep Learning Chapter 15: Going a Step Beyond Using Support Vector Machines Revisiting the Separation Problem Explaining the Algorithm Classifying and Estimating with SVM Chapter 16: Resorting to Ensembles of Learners Leveraging Decision Trees Working with Almost Random Guesses Boosting Smart Predictors Averaging Different Predictors
9 Part 5: Applying Learning to Real Problems Chapter 17: Classifying Images Working with a Set of Images Revising the State of the Art in Computer Vision Extracting Visual Features Recognizing Faces Using Eigenfaces Classifying Images Chapter 18: Scoring Opinions and Sentiments Introducing Natural Language Processing Revising the State of the Art in NLP Understanding How Machines Read Using Scoring and Classification Chapter 19: Recommending Products and Movies Realizing the Revolution of E-Commerce Downloading Rating Data Catching the Limits of Behavioral Data Integrating Text and Behaviors Leveraging SVD
10 Part 6: The Part of Tens Chapter 20: Ten Ways to Improve Your Machine Learning Models Studying Learning Curves Using Cross-Validation Correctly Choosing the Right Error or Score Metric Searching for the Best Hyper-Parameters Testing Multiple Models Averaging Models Stacking Models Applying Feature Engineering Selecting Features and Examples Looking for More Data Chapter 21: Ten Guidelines for Ethical Data Usage Obtaining Permission Using Sanitization Techniques Avoiding Data Inference Using Generalizations Correctly Shunning Discriminatory Practices Detecting Black Swans in Code Understanding the Process Considering the Consequences of an Action Balancing Decision Making Verifying a Data Source Chapter 22: Ten Machine Learning Packages to Master Gensim imbalanced-learn OpenCV SciPy SHAP Statsmodels Modin PyTorch Poetry Snorkel
Читать дальше