R for Health Data Science
eBook - ePub

R for Health Data Science

Ewen Harrison, Riinu Pius

  1. 346 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

R for Health Data Science

Ewen Harrison, Riinu Pius

Book details
Book preview
Table of contents
Citations

About This Book

In this age of information, the manipulation, analysis, and interpretation of data have become a fundamental part of professional life; nowhere more so than in the delivery of healthcare. From the understanding of disease and the development of new treatments, to the diagnosis and management of individual patients, the use of data and technology is now an integral part of the business of healthcare. Those working in healthcare interact daily with data, often without realising it. The conversion of this avalanche of information to useful knowledge is essential for high-quality patient care.

R for Health Data Science includes everything a healthcare professional needs to go from R novice to R guru. By the end of this book, you will be taking a sophisticated approach to health data science with beautiful visualisations, elegant tables, and nuanced analyses.

Features



  • Provides an introduction to the fundamentals of R for healthcare professionals


  • Highlights the most popular statistical approaches to health data science


  • Written to be as accessible as possible with minimal mathematics


  • Emphasises the importance of truly understanding the underlying data through the use of plots


  • Includes numerous examples that can be adapted for your own data


  • Helps you create publishable documents and collaborate across teams

With this book, you are in safe hands – Prof. Harrison is a clinician and Dr. Pius is a data scientist, bringing 25 years' combined experience of using R at the coal face. This content has been taught to hundreds of individuals from a variety of backgrounds, from rank beginners to experts moving to R from other platforms.

Frequently asked questions

How do I cancel my subscription?
Simply head over to the account section in settings and click on “Cancel Subscription” - it’s as simple as that. After you cancel, your membership will stay active for the remainder of the time you’ve paid for. Learn more here.
Can/how do I download books?
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
What is the difference between the pricing plans?
Both plans give you full access to the library and all of Perlego’s features. The only differences are the price and subscription period: With the annual plan you’ll save around 30% compared to 12 months on the monthly plan.
What is Perlego?
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Do you support text-to-speech?
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Is R for Health Data Science an online PDF/ePUB?
Yes, you can access R for Health Data Science by Ewen Harrison, Riinu Pius in PDF and/or ePUB format, as well as other popular books in Computer Science & Computer Science General. We have over one million books available in our catalogue for you to explore.

Information

Year
2020
ISBN
9781000226164
Edition
1

Part II

Data analysis

In the second part of this book, we focus specifically on the business of data analysis, that is, formulating clear questions and seeking to answer them using available datasets.
Again, we emphasise the importance of understanding the underlying data through visualisation, rather than relying on statistical tests or, heaven forbid, the p-value alone.
There are five chapters. Testing for continuous outcome variables (6) leads naturally into linear regression (7). We would expect the majority of actual analysis done by readers to be using the methods in chapter 7 rather than 6. Similarly, testing for categorical outcome variables (8) leads naturally to logistic regression (9), where we would expect the majority of work to focus.
Chapters 6 and 8 however do provide helpful reminders of how to prepare data for these analyses and shouldn’t be skipped. time-to-event data (10) introduces survival analysis and includes sections on the manipulation of dates.

6

Working with continuous outcome variables

Continuous data can be measured.
Categorical data can be counted.

6.1 Continuous data

Continuous data is everywhere in healthcare. From physiological measures in patients such as systolic blood pressure or pulmonary function tests, through to population measures like life expectancy or disease incidence, the analysis of continuous outcome measures is common and important.
Our goal in most health data questions, is to draw a conclusion on a comparison between groups. For instance, understanding differences in life expectancy between the year 2002 and 2007 is more useful than simply describing the average life expectancy across all of time.
The basis for comparisons between continuous measures is the distribution of the data. That word, as many which have a statistical flavour, brings on the sweats in many people. It needn’t. By distribution, we are simply referring to the shape of the data.

6.2 The Question

The examples in this chapter all use the data introduced previously from the amazing Gapminder project.1 We will start by looking at the life expectancy of populations over time and in different geographical regions.

6.3 Get and check the data

# Load packages
library(tidyverse)
library(finalfit)
library(gapminder)
# Create object gapdata from object gapminder
gapdata <- gapminder
It is vital that datasets be carefully inspected when first read (for help reading data into R see 2.1). The three functions below provide a clear summary, allowing errors or miscoding to be quickly identified. It is particularly important to ensure that any missing data is identified (see Chapter 11). If you don’t do this you will regret it! There are many times when an analysis has got to a relatively advanced stage before the researcher was hit by the realisation that the dataset was far from complete.
glimpse(gapdata) # each variable as line, variable type, first values
## Rows: 1,704
## Columns: 6
## $ country <fct> Afghanistan, Afghanistan, Afghanistan, Afghanistan, Afgha...
## $ continent <fct> Asia, Asia, Asia, Asia, Asia, Asia, Asia, Asia, Asia, Asi...
## $ year <int> 1952, 1957, 1962, 1967, 1972, 1977, 1982, 1987, 1992, 199...
## $ lifeExp <dbl> 28.801, 30.332, 31.997, 34.020, 36.088, 38.438, 39.854, 4...
## $ pop <int> 8425333, 9240934, 10267083, 11537966, 13079460, 14880372,...
## $ gdpPercap <dbl> 779.4453, 820.8530, 853.1007, 836.1971, 739.9811, 786.113...
missing_glimpse(gapdata) # missin...

Table of contents

  1. Cover
  2. Half Title
  3. Title Page
  4. Copyright Page
  5. Dedication
  6. Preface
  7. About the Authors
  8. I Data wrangling and visualisation
  9. II Data analysis
  10. III Workflow
  11. Appendix
  12. Bibliography
  13. Index
Citation styles for R for Health Data Science

APA 6 Citation

Harrison, E., & Pius, R. (2020). R for Health Data Science (1st ed.). CRC Press. Retrieved from https://www.perlego.com/book/2011631/r-for-health-data-science-pdf (Original work published 2020)

Chicago Citation

Harrison, Ewen, and Riinu Pius. (2020) 2020. R for Health Data Science. 1st ed. CRC Press. https://www.perlego.com/book/2011631/r-for-health-data-science-pdf.

Harvard Citation

Harrison, E. and Pius, R. (2020) R for Health Data Science. 1st edn. CRC Press. Available at: https://www.perlego.com/book/2011631/r-for-health-data-science-pdf (Accessed: 15 October 2022).

MLA 7 Citation

Harrison, Ewen, and Riinu Pius. R for Health Data Science. 1st ed. CRC Press, 2020. Web. 15 Oct. 2022.