Data Science and Machine Learning Interview Questions Using R
eBook - ePub

Data Science and Machine Learning Interview Questions Using R

Data Science and Machine Learning Interview Questions Using R

  1. English
  2. ePUB (mobile friendly)
  3. Available on iOS & Android
eBook - ePub

Data Science and Machine Learning Interview Questions Using R

Data Science and Machine Learning Interview Questions Using R

Book details
Book preview
Table of contents
Citations

About This Book

Get answers to frequently asked questions on Data Science and Machine Learning using R Key Features

  • Understand the capabilities of the R programming language
  • Most of the machine learning algorithms and their R implementation covered in depth
  • Answers on conceptual data science concepts are also covered

  • Description
    This book prepares you for the Data Scientist and Machine Learning Engineer interview w.r.t. R programming language.
    The book is divided into various parts, making it easy for you to remember and associate with the questions asked in an interview. It covers multiple possible transformations and data filtering techniques in depth. You will be able to create visualizations like graphs and charts using your data. You will also see some examples of how to build complex charts with this data. This book covers the frequently asked interview questions and shares insights on the kind of answers that will help you get this job.
    By the end of this book, you will not only crack the interview but will also have a solid command of the concepts of Data Science as well as R programming. What will you learn
  • Get answers to the basics, intermediate and advanced questions on R programming
  • Understand the transformation and filtering capabilities of R
  • Know how to perform visualization using R

  • Who this book is for
    This book is a must for anyone interested in Data Science and Machine Learning. Anyone who wants to clear the interview can use it as a last-minute revision guide. Table of Contents
    1. Data Science basic questions and terms
    2. R programming questions
    3. GGPLOT Questions
    4. Statistics with excel sheet About the Author
    Vishwanathan Narayanan has 18 years of experience in the field of information technology and data analysis. He made many enterprise-level applications with stable output and scalability.Advanced level data analysis for complex problems using both R and Python has been the key area of work for many years. Extreme programmer on Java, Python, R, and many more technologies

Frequently asked questions

Simply head over to the account section in settings and click on β€œCancel Subscription” - it’s as simple as that. After you cancel, your membership will stay active for the remainder of the time you’ve paid for. Learn more here.
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
Both plans give you full access to the library and all of Perlego’s features. The only differences are the price and subscription period: With the annual plan you’ll save around 30% compared to 12 months on the monthly plan.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes, you can access Data Science and Machine Learning Interview Questions Using R by Vishwanathan Narayanan in PDF and/or ePUB format, as well as other popular books in Computer Science & Data Processing. We have over one million books available in our catalogue for you to explore.

Information

Year
2020
ISBN
9789389845846

SECTION 1

Data Science Basic Questions and Terms

Learning objective

In this session, we will learn about data science terminologies and machine learning.

Key points

  • Steps involved in data science
  • Variables and types
  • Machine learning and types
  • Algorithms used in Machine learning
Let us begin!
  1. Explain the steps involved in data science?
    Ans. Following are the steps involved:
    1) Get Data from various Data sources available
    2) Generate research question from data
    3) Identify variables present in data. Also, identify important variables or variables to be analyzed as such
    4) Generate hypothesis
    5) Analyze data using graph data like a histogram for example
    6) Fit a model from analyzed data
    7) Accept or reject the hypothesis
    8) Research question answer found
    Figure 1.1: Steps involved in data science
    Example of above steps:
    1) Get data related to temperature for India reference https://data.gov.in/catalog/annual-and-seasonal-maximum-temperature-india
    A template of data set is as follows:
    β€œYEARβ€œ, β€œANNUALβ€œ, β€œJAN-FEBβ€œ, β€œMAR-MAYβ€œ, β€œJUN-SEPβ€œ, β€œOCT-DECβ€œ
    β€œ1901β€œ, β€œ28.96β€œ, β€œ23.27β€œ, β€œ31.46β€œ, β€œ31.27β€œ, β€œ27.25β€œ
    β€œ1902β€œ, β€œ29.22β€œ, β€œ25.75β€œ, β€œ31.76β€œ, β€œ31.09β€œ, β€œ26.49β€œ
    β€œ1903β€œ, β€œ28.47β€œ, β€œ24.24β€œ, β€œ30.71β€œ, β€œ30.92β€œ, β€œ26.26β€œ
    β€œ1904β€œ, β€œ28.49β€œ, β€œ23.62β€œ, β€œ30.95β€œ, β€œ30.67β€œ, β€œ26.40β€œ
    β€œ1905β€œ, β€œ28.30β€œ, β€œ22.25β€œ, β€œ30.00β€œ, β€œ31.33β€œ, β€œ26.57β€œ
    β€œ1906β€œ, β€œ28.73β€œ, β€œ23.03β€œ, β€œ31.11β€œ, β€œ30.86β€œ, β€œ27.29β€œ
    β€œ1907β€œ, β€œ28.65β€œ, β€œ24.23β€œ, β€œ29.92β€œ, β€œ30.80β€œ, β€œ27.36β€œ
    β€œ1908β€œ, β€œ28.83β€œ, β€œ24.42β€œ, β€œ31.43β€œ, β€œ30.72β€œ, β€œ26.64β€œ
    β€œ1909β€œ, β€œ28.39β€œ, β€œ23.52β€œ, β€œ31.02β€œ, β€œ30.33β€œ, β€œ26.88β€œ
    β€œ1910β€œ, β€œ28.53β€œ, β€œ24.20β€œ, β€œ31.14β€œ, β€œ30.48β€œ, β€œ26.20β€œ
    β€œ1911β€œ, β€œ28.62β€œ, β€œ23.90β€œ, β€œ30.70β€œ, β€œ31.14β€œ, β€œ26.31β€œ
    β€œ1912β€œ, β€œ28.95β€œ, β€œ24.88β€œ, β€œ31.10β€œ, β€œ31.15β€œ, β€œ26.57β€œ
    β€œ1913β€œ, β€œ28.67β€œ, β€œ24.25β€œ, β€œ30.89β€œ, β€œ30.92β€œ, β€œ26.42β€œ
    β€œ1914β€œ, β€œ28.66β€œ, β€œ24.59β€œ, β€œ30.73β€œ, β€œ30.84β€œ, β€œ26.40β€œ
    β€œ1915β€œ, β€œ28.94β€œ, β€œ23.22β€œ, β€œ31.06β€œ, β€œ31.51β€œ, β€œ27.18β€œ
    β€œ1916β€œ, β€œ28.82β€œ, β€œ24.57β€œ, β€œ31.88β€œ, β€œ30.52β€œ, β€œ26.32β€œ
    β€œ1917β€œ, β€œ28.11β€œ, β€œ24.52β€œ, β€œ30.06β€œ, β€œ30.24β€œ, β€œ25.74β€œ
    β€œ1918β€œ, β€œ28.66β€œ, β€œ23.57β€œ, β€œ30.68β€œ, β€œ31.11β€œ, β€œ26.77β€œ
    2) The research question is the annual temperature in India rising?
    3) Variable of interest from the above data set ANNUAL
    4) Hypothesis: Temperature is rising
    5) Analyze data from the above data set:
    Figure 1.2: Graph showing year vs. temperature
    6) Fit the model
    7) Hypothesis accepted or rejected
  2. Define a variable?
    Ans. Anything which keeps on changing is called variable.
  3. Explain different types of variables?
    Ans. Variables are of the following type:
    • Dependant/Outcome: A variable being affected for example annual temperature in the above example.
    • Independent/Predictor: A variable affecting the outcome e.g. deforestation, pollution, and so on in the above example.
  4. Define Categorical measurement?
    Ans. Categorical measurement contains categories i.e. distinct entities. Example of categories of life on earth is plants, animals, and so on.
  5. Define Binary variables?
    Ans. Binary variables are those in which only two classes exist like live or dead male or female on or off.
  6. Define Nominal measurement?
    Ans. Nominal measurements are those of more than two classes. Such categories can be numbers too.
  7. Explain the Ordinal variable?
    Ans. These are nominal variables that have a logical order. Examples include team ranks in cricket or football, merit list of students appearing for grade students.
  8. Define Continuous variables?
    Ans. These are variables that can take can any value on the measurement scale example includes pitch of voice which can take any possible value within the range.
  9. Define Discrete variables?
    Ans. These are variables that can take fixed values in the range. Example number of customers in a bank.
  10. Is it possible to convert continuous values to discrete and vice versa?
    Ans. Yes based upon the motive of study it is possible to convert discrete values to continuous and vice versa. Example Level of water in the tank can take any value in the range and as such a continuous variable.
    But we can approximate the same to three different levels like empty, full, or half empty and this now becomes discrete.
  11. Explain the interval variables?
    Ans. These are variables that are grouped on the interval. E.g. is age can be divided into the range like 1-10, 10-20, 20-30, and so on and the person with a particular age would be placed in one of the above groups. When intervals are equal they represent the difference in the equal property being measured.
  12. Explain the ratio variables?
    Ans. This is a subtype of interval variables where the ratio of scales is used for measurement.
    E.g. Water representation in chemistry is H2O which represent two molecules of hydrogen and one molecule of oxygen. Thus the ratio of elements is 2: 1.
  13. Define Univariate and Bivariate variables?
    Ans. Following are the definitions:
    • Univariate variable: When the variable under consideration is only one then it is called a univariate variable study.
    • Bivariate variable: Involves the study of the relationship between two variables.
  14. Explain the measurement error?
    Ans. The discrepancy between the measured value and actual value in terms of number is called measurement error.
    E.g. While buying fruits from a vendor in kilograms, if we wanted 1 kilogram of fruits and the vendor’s weighing machine showed 1 kilogram when we brought the same. After checking the same in another machine if the measured value show 0.1 kilograms less than expected then this difference is what we call as measurement error.
  15. Define Validity?
    Ans. Validity implies whether an instrument measures what it is supposed to measure.
  16. Explain Reliability?
    Ans. Reliability implies whether the instrument gives consistent results across different conditions.
    E.g. if we test the same value twice on the same entity then the results from the instrument should remain the same if it has to be reliable. Such test is known as test-retest.
  17. Explain different ways to test hypotheses?
    Ans. There are two ways in which hypotheses can be tested
    1) Correlational research:
    • This is also known as cross-sectional research
    • This involves observing the natural pattern or occurrence to test
    • Original occurrences are not manipulated
    2) Experimental research:
    • We select the variabl...

Table of contents

  1. Cover Page
  2. Title Page
  3. Copyright Page
  4. Dedication
  5. About the Author
  6. Foreword
  7. About the Reviewer
  8. Acknowledgement
  9. Preface
  10. Errata
  11. Table of Contents
  12. 1. Data Science Basic Questions and Terms
  13. 2. Programming Questions
  14. 3. GGPLOT Questions
  15. 4. Statistics with Excel Sheet