AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide
eBook - ePub

AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide

The definitive guide to passing the MLS-C01 exam on the very first attempt

Somanath Nanda, Weslley Moura

  1. 338 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide

The definitive guide to passing the MLS-C01 exam on the very first attempt

Somanath Nanda, Weslley Moura

Book details
Book preview
Table of contents
Citations

About This Book

Prepare to achieve AWS Machine Learning Specialty certification with this complete, up-to-date guide and take the exam with confidence

Key Features

  • Get to grips with core machine learning algorithms along with AWS implementation
  • Build model training and inference pipelines and deploy machine learning models to the Amazon Web Services (AWS) cloud
  • Learn all about the AWS services available for machine learning in order to pass the MLS-C01 exam

Book Description

The AWS Certified Machine Learning Specialty exam tests your competency to perform machine learning (ML) on AWS infrastructure. This book covers the entire exam syllabus using practical examples to help you with your real-world machine learning projects on AWS.Starting with an introduction to machine learning on AWS, you'll learn the fundamentals of machine learning and explore important AWS services for artificial intelligence (AI). You'll then see how to prepare data for machine learning and discover a wide variety of techniques for data manipulation and transformation for different types of variables. The book also shows you how to handle missing data and outliers and takes you through various machine learning tasks such as classification, regression, clustering, forecasting, anomaly detection, text mining, and image processing, along with the specific ML algorithms you need to know to pass the exam. Finally, you'll explore model evaluation, optimization, and deployment and get to grips with deploying models in a production environment and monitoring them.By the end of this book, you'll have gained knowledge of the key challenges in machine learning and the solutions that AWS has released for each of them, along with the tools, methods, and techniques commonly used in each domain of AWS ML.

What you will learn

  • Understand all four domains covered in the exam, along with types of questions, exam duration, and scoring
  • Become well-versed with machine learning terminologies, methodologies, frameworks, and the different AWS services for machine learning
  • Get to grips with data preparation and using AWS services for batch and real-time data processing
  • Explore the built-in machine learning algorithms in AWS and build and deploy your own models
  • Evaluate machine learning models and tune hyperparameters
  • Deploy machine learning models with the AWS infrastructure

Who this book is for

This AWS book is for professionals and students who want to prepare for and pass the AWS Certified Machine Learning Specialty exam or gain deeper knowledge of machine learning with a special focus on AWS. Beginner-level knowledge of machine learning and AWS services is necessary before getting started with this book.

]]>

Frequently asked questions

How do I cancel my subscription?
Simply head over to the account section in settings and click on “Cancel Subscription” - it’s as simple as that. After you cancel, your membership will stay active for the remainder of the time you’ve paid for. Learn more here.
Can/how do I download books?
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
What is the difference between the pricing plans?
Both plans give you full access to the library and all of Perlego’s features. The only differences are the price and subscription period: With the annual plan you’ll save around 30% compared to 12 months on the monthly plan.
What is Perlego?
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Do you support text-to-speech?
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Is AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide an online PDF/ePUB?
Yes, you can access AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide by Somanath Nanda, Weslley Moura in PDF and/or ePUB format, as well as other popular books in Informatik & Künstliche Intelligenz (KI) & Semantik. We have over one million books available in our catalogue for you to explore.

Information

Section 1: Introduction to Machine Learning

This section provides details about the AWS Machine Learning Specialty Exam. It also introduces machine learning fundamentals and covers the most important AWS application services for artificial intelligence.
This section contains the following chapters:
  • Chapter 1, Machine Learning Fundamentals
  • Chapter 2, AWS Application Services for AI/ML

Chapter 1: Machine Learning Fundamentals

For many decades, researchers have been trying to simulate human brain activity through the field known as artificial intelligence, AI for short. In 1956, a group of people met at the Dartmouth Summer Research Project on Artificial Intelligence, an event that is widely accepted as the first group discussion about AI as we know it today. Researchers were trying to prove that many aspects of the learning process could be precisely described and, therefore, automated and replicated by a machine. Today, we know they were right!
Many other terms appeared in this field, such as machine learning (ML) and deep learning (DL). These sub-areas of AI have also been evolving for many decades (granted, nothing here is new to the science). However, with the natural advance of the information society and, more recently, the advent of big data platforms, AI applications have been reborn with much more power and applicability. Power, because now we have more computational resources to simulate and implement them; applicability, because now information is everywhere.
Even more recently, cloud services providers have put AI in the cloud. This is helping all sizes of companies to either reduce their operational costs or even letting them sample AI applications (considering that it could be too costly for a small company to maintain its own data center).
That brings us to the goal of this chapter: being able to describe what the terms AI, ML, and DL mean, as well as understanding all the nuances of an ML pipeline. Avoiding confusion on these terms and knowing what exactly an ML pipeline is will allow you to properly select your services, develop your applications, and master the AWS Machine Learning Specialty exam.
The main topics of this chapter are as follows:
  • Comparing AI, ML, and DL
  • Classifying supervised, unsupervised, and reinforcement learning
  • The CRISP-DM modeling life cycle
  • Data splitting
  • Modeling expectations
  • Introducing ML frameworks
  • ML in the cloud

Comparing AI, ML, and DL

AI is a broad field that studies different ways to create systems and machines that will solve problems by simulating human intelligence. There are different levels of sophistication to create these programs and machines, which go from simple, rule-based engines to complex, self-learning systems. AI covers, but is not limited to, the following sub-areas:
  • Robotics
  • Natural language processing
  • Rule-based systems
  • ML
The area we are particularly interested in now is ML.

Examining ML

ML is a sub-area of AI that aims to create systems and machines that are able to learn from experience, without being explicitly programmed. As the name suggests, the system is able to observe its running environment, learn, and adapt itself without human intervention. Algorithms behind ML systems usually extract and improve knowledge from the data that is available to them, as well as conditions (such as hyperparameters), and feed back after trying different approaches to solve a particular problem:
Figure 1.1 – Heirarchy of AI, ML, DL
Figure 1.1 – Heirarchy of AI, ML, DL
There are different types of ML algorithms; for instance, we can list decision tree-based, probabilistic-based, and neural networks. Each of these classes might have dozens of specific algorithms. Most of them will be covered in later sections of this book.
As you might have noticed in Figure 1.1, we can be even more specific and break the ML field down into another very important topic for the Machine Learning Specialty exam: DL.

Examining DL

DL is a subset of ML that aims to propose algorithms that connect multiple layers to solve a particular problem. The knowledge is then passed through layer by layer until the optimal solution is found. The most common type of DL algorithm is deep neural networks.
At the time of writing this book, DL is a very hot topic in the field of ML. Most of the current state-of-the-art algorithms for machine translation, image captioning, and computer vision were proposed in the past few years and are a part of DL.
Now that we have an overview of types of AI, let's look at some of the ways we can classify ML.

Classifying supervised, unsupervised, and reinforcement learning

ML is a very extensive field of study; that's why it is very important to have a clear definition of its sub-divisions. From a very broad perspective, we can split ML algorithms into two main classes: supervised learning and unsupervised learning.

Introducing supervised learning

Supervised algorithms use a class or label (from the input data) as support to find and validate the optimal solution. In Figure 1.2, there is a dataset that aims to classify fraudulent transactions from a bank:
Figure 1.2 – Sample dataset for supervised learning
Figure 1.2 – Sample dataset for supervised learning
The first four columns are known as features, or independent variables, and they can be used by a supervised algorithm to find fraudulent patterns. For example, by combining those four features (day of the week, EST hour, transaction amount, and merchant type) and six observations (each row is technically one observation), you can infer that e-commerce transactions with a value greater than $5,000 and processed at night are potentially fraudulent cases.
Important note
In a real scenario, we should have more observations in order to have statistical support to make this type of inference.
The key point is that we were able to infer a potential fraudulent pattern just because we knew, a priori, what is fraud and what is not fraud. This information is present in the last column of Figure 1.2 and is commonly referred to as a target variable, label, response variable, or dependent variable. If the input dataset has a target variable, you should be able to apply supervised learning.
In supervised learning, the target variable might store different types of data. For instance, it could be a binary column (yes or no), a multi-class column (class A, B, or C), or even a numerical column (any real number, such as a transaction amount). According to the data type of the target variable, you will find which type of supervised learning your problem refers to. Figure 1.3 shows how to classify supervised learning into two main groups: classification and regression algorithms:
Figure 1.3 – Choosing the right type of supervised learning given the target variable
Figure 1.3 – Choosing the right type of supervised learning given the target variable
While classification algorithms predict a class (either binary or multiple classes), regression algorithms predict a real number (either continuous or discrete).
Understanding data types is important to make the right decisions on ML projects. We can split data types into two main categories: numerical and categorical data. Numerical data can then be split into continuous or discrete subclasses, while categorical data might refer to ordinal or nominal data:
  • Numerical/discrete data refers to individual and countable items (for example, the number of students in a classroom or the number of items in an online shopping cart).
  • Numerical/continuous data refers to an infinite number of possible measurements and they often carry decimal points (for example, temperature).
  • Categorical/nominal data refers to labeled variables with no quantitative value (for example, name or gender).
  • Categorical/ordinal data adds the sense of order to a labeled variable (for example, education level or employee title level).
In other words, when choosing an algorithm for your project, you should ask yourself:...

Table of contents

  1. AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide
  2. Contributors
  3. Preface
  4. Section 1: Introduction to Machine Learning
  5. Chapter 1: Machine Learning Fundamentals
  6. Chapter 2: AWS Application Services for AI/ML
  7. Section 2: Data Engineering and Exploratory Data Analysis
  8. Chapter 3: Data Preparation and Transformation
  9. Chapter 4: Understanding and Visualizing Data
  10. Chapter 5: AWS Services for Data Storing
  11. Chapter 6: AWS Services for Data Processing
  12. Section 3: Data Modeling
  13. Chapter 7: Applying Machine Learning Algorithms
  14. Chapter 8: Evaluating and Optimizing Models
  15. Chapter 9: Amazon SageMaker Modeling
  16. Other Books You May Enjoy
Citation styles for AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide

APA 6 Citation

Nanda, S., & Moura, W. (2021). AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide (1st ed.). Packt Publishing. Retrieved from https://www.perlego.com/book/2381900/aws-certified-machine-learning-specialty-mlsc01-certification-guide-the-definitive-guide-to-passing-the-mlsc01-exam-on-the-very-first-attempt-pdf (Original work published 2021)

Chicago Citation

Nanda, Somanath, and Weslley Moura. (2021) 2021. AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide. 1st ed. Packt Publishing. https://www.perlego.com/book/2381900/aws-certified-machine-learning-specialty-mlsc01-certification-guide-the-definitive-guide-to-passing-the-mlsc01-exam-on-the-very-first-attempt-pdf.

Harvard Citation

Nanda, S. and Moura, W. (2021) AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide. 1st edn. Packt Publishing. Available at: https://www.perlego.com/book/2381900/aws-certified-machine-learning-specialty-mlsc01-certification-guide-the-definitive-guide-to-passing-the-mlsc01-exam-on-the-very-first-attempt-pdf (Accessed: 15 October 2022).

MLA 7 Citation

Nanda, Somanath, and Weslley Moura. AWS Certified Machine Learning Specialty: MLS-C01 Certification Guide. 1st ed. Packt Publishing, 2021. Web. 15 Oct. 2022.