Data Analysis for the Social Sciences
eBook - ePub

Data Analysis for the Social Sciences

Integrating Theory and Practice

  1. 664 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

Data Analysis for the Social Sciences

Integrating Theory and Practice

Book details
Book preview
Table of contents
Citations

About This Book

?This book fosters in-depth understanding of the logic underpinning the most common statistical tests within the behavioural sciences. By emphasising the shared ground between these tests, the author provides crucial scaffolding for students as they embark upon their research journey.? — Ruth Horry, Psychology, Swansea University ?This unique text presents the conceptual underpinnings of statistics as well as the computation and application of statistics to real-life situations--a combination rarely covered in one book. A must-have for students learning statistical techniques and a go-to handbook for experienced researchers.?— Barbra Teater, Social Work, College of Staten Island, City University of New York

Accessible, engaging, and informative, this book will help any social science student approach statistics with confidence.

With a well-paced and well-judged integrated approach rather than a simple linear trajectory, this book progresses at a realistic speed that matches the pace at which statistics novices actually learn. Packed with global, interdisciplinary examples that ground statistical theory and concepts in real-world situations, it shows students not only how to apply newfound knowledge using IBM SPSS Statistics, but also why they would want to. Spanning statistics basics like variables, constants, and sampling through to t-tests, multiple regression and factor analysis, it builds statistical literacy while also covering key research principles like research questions, error types and results reliability.

It shows you how to:

  • Describe data with graphs, tables, and numbers
  • Calculate probability and value distributions
  • Test a priori and post hoc hypotheses
  • Conduct Chi-squared tests and observational studies
  • Structure ANOVA, ANCOVA, and factorial designs

Supported by lots of visuals and a website with interactive demonstrations, author video, and practice datasets, this book is the student-focused companion to support students through their statistics journeys.

Frequently asked questions

Simply head over to the account section in settings and click on “Cancel Subscription” - it’s as simple as that. After you cancel, your membership will stay active for the remainder of the time you’ve paid for. Learn more here.
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
Both plans give you full access to the library and all of Perlego’s features. The only differences are the price and subscription period: With the annual plan you’ll save around 30% compared to 12 months on the monthly plan.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes, you can access Data Analysis for the Social Sciences by Douglas Bors in PDF and/or ePUB format, as well as other popular books in Ciencias sociales & Investigación y metodología de las ciencias sociales. We have over one million books available in our catalogue for you to explore.

Part I The Foundations

The materials covered in the chapters that comprise this first part of this book provide the general framework for the specific statistical tests presented in Parts II and III. Part I reveals the plot and the main characters who are encountered throughout the book in the evolving episodes, each a surprising sequel. Despite the increasing twists and turns of each instalment, in Parts II and III, the most general plot and character types remain the same.
In Chapter 1 the logic which applies to all forms of data analysis covered in this book is described. The basic form of the logic revolves around four interrelated questions: What is expected? What is observed? What is the difference between the expected and the observed? How much of a difference between the expected and the observed can be expected due to chance alone? In this first chapter the concept of randomness is introduced along with other key concepts and the most common forms of research design.
In Chapter 2 the pictorial and numerical techniques researchers use to summarize their data are described. An informative summary is the first and, arguably, the most important stage in all forms of data analysis. Most statistical tests can be viewed as estimates of the reliability of the picture painted by the summary.
The coverage of probability theory and its laws in Chapter 3 is the basis for estimating the reliability or replicability of the picture portrayed in the summary. Parts II and III are built upon the foundation constructed in Part I. Each chapter in Parts II and III represents an application of probability theory (Chapter 3) to the summary of specific data (Chapter 2) to be analysed within a variation and adaptation of the general framework (Chapter 1).

1 Overview

Chapter contents

  • 1.1 Purpose 4
  • 1.2 The general framework 4
  • 1.3 Recognizing randomness 8
  • 1.4 Lies, damned lies, and statistics 9
  • 1.5 Testing for randomness 10
  • 1.6 Research design and key concepts 14
  • 1.7 Paradoxes 19
  • 1.8 Chapter summary 20
  • 1.9 Recommended readings 20
Key Concepts: randomness, experiment, quasi-experiment, observational designs, question of difference, question of association, categorical data, measurement data, null hypothesis, Simpson’s paradox, Type I error, Type II error, variables, population, sample, random sample, independent variable, dependent variable.

1.1 Purpose

The first purpose of this chapter is to introduce you to a few concepts and themes that will be present, directly or indirectly, throughout this book. If there is one concept that is omnipresent, if not explicitly then at least implicitly, it is randomness. As will be seen, the concept underlies other phrases used either to refer to the presence of randomness or to its absence. To claim that two groups of people differ in some respect is to say that group membership is not completely random; for example, height is not random with respect to basketball players versus non-basketball players. To say that two groups do not differ in some regard is to say that group membership is random; for example, the maximum speed at which a car can travel is probably unrelated to the car’s colour. To assert that two events are related is to say that they do not occur randomly with respect to each other; for example, tsunamis are associated with earthquakes. To state that two events are unrelated is to say that they occur randomly with respect to each other. Related to our use of randomness are four key questions: What is expected? What is observed? What is the difference between the expected and the observed? How much of a difference can be expected due to chance alone?
The second purpose of this chapter is to review some basic strategies and principles of empirical research. We differentiate the basic forms of research (experimental, quasi-experimental, and observational designs) and review the main characteristics of each.

1.2 The general framework

Statistics, which are the numbers researchers use to describe their data and to test the trustworthiness or replicability of their findings, can feel convoluted and mysterious both for students and for researchers. In this section I offer you a three-part framework; if you use it, it will make the material in this book, and the statistics you encounter in everyday life, more easily understood.
Part 1. As a researcher, you begin with a question (or questions) about the nature of the world, or at least that aspect of the world which interests you. Let us start with the simplest type of research, where there is only one question. Regardless of your topic, your question will take one of two basic forms.
The first form is one of difference. For example, imagine yourself as a political science professor who wishes to know if your students prefer term papers or essay examinations as a means of evaluation. Or think of yourself as a clinical psychologist wishing to know if cognitive behavioural therapy (CBT) reduces your patients’ anxiety symptoms more than does the most commonly prescribed anxiolytic (a medication to reduce anxiety). In both of these examples you suspect that one set of scores will be different from the other: more students will prefer one form of assessment over the other; the CBT group on average will show fewer symptoms than the anxiolytic drug patients.
The second form a research question may take is one of relation or association. For example, you may manage a coffee shop and are interested in customer behaviour. Is the choice of beverage (coffee versus tea) associated with gender (men versus women)? Or if you are an educational psychologist you may suspect that there is an association between the number of hours per week a student works off-campus and his or her grades at the end of term. In both of these examples you suspect that one set of scores will be related (or will predict) the other set of scores. Perhaps a greater proportion of women will prefer tea than will men. Perhaps the more hours a student works off-campus the lower his or her grade point average will tend to be. The type of question – difference versus association – orients you towards appropriate statistical procedures. Questions of differences are linked with one family of statistical tests, and questions of association are linked with another family of tests.
Questions of differences and questions of association are not as dissimilar as they may appear. They are usually two sides of a single coin, with one question implying the other. Furthermore, a research project in psychology and in the social sciences often entails more than one question, and it may involve both questions of differences and questions of associations. For example, you may be a ‘sportologist’ wishing to know why some baseball players hit more home runs than others. You suspect that the taller the player, the more home runs he will hit (this is a question of a possible association between height and the number of home runs). You may also suspect that players who use aluminium bats will hit more home runs than will players who use the old-fashioned wooden bats (this is a question of a possible difference between types of bats).
Part 2. As an empirical researcher you collect data. Regardless of your area of interest, the observations usually take one of two general forms.
The first form your observations can take is that of frequency data or categorical data. Remember, as a political science professor you wished to know if among your students term papers are more popular than essay examinations as a form of evaluation. You are keeping count of the number of students in the two categories: those who prefer a term paper versus those who prefer an essay examination. As a manager of a coffee shop you were keeping track of the frequencies in four categories: the number of women who prefer coffee, the number of women who prefer tea, the number of men who prefer coffee, and the number of men who prefer tea.
The second form your observations can take is that of measurement data. As a clinical psychologist you wished to know if two groups of patients (CBT versus anxiolytic) differ in terms of their average number of anxiety symptoms. You are recording the number of symptoms each patient exhibits. It is possible that no two patients will exhibit the same number of symptoms. As an educational psychologist interested in hours worked and academic performance, you are recording the actual number of hours per week each student works off-campus and his or her grade. It is possible that no two students in your study will have worked the same number of hours or have exactly the same grade.
I need to warn you: the two types of data are not as different as they may at first appear, nor do they encompass all possible types of data. And often one type of data can be transformed or treated as if it were the other type. Examples of this transformation will appear at the end of Chapter 3.
As we will see in Chapter 2, frequency/categorical data and measurement data can be further divided into four types of number scales: nominal, ordinal, interval, and ratio. Where nominal and ordinal number scales are described as being frequency/categorical data, interval and ratio scales are considered as measurement data. As will become apparent in Part II of this book, for purposes of analysis ordinal data (such as percentile scores on an examination) often form an intermediate form of data or are transformed into a type of measurement data called z-scores, which are discussed in detail in Chapter 3.
We now have two basic research questions and two types of data. Earlier we said that each research question is linked with its own family of statistical tests. The same may be said with respect to the two types of data. Frequency data are linked with one family of statistical tests and measurement data are associated with another family of tests.
There are four families of statistical test:
  • Tests for a question of difference with frequency data
  • Tests for a question of relation with frequency data
  • Tests for a question of difference with measurement data
  • Tests for a question of relation with measurement data.
Keep in mind that this framework is not carved in stone, nor are the boundaries between the four categories impermeable. Rather, the framework is a guideline for following the flow of this book. It will help you to cut through what appear to be so many unrelated procedures and formulae and to see the general storyline and character types.
Part 3. We have seen that there are different families of statistical tests which reflect an intersection of the type of question the researcher asks and the type of data he or she has collected. Surprisingly, almost all statistical tests – at least those covered in this book – have the same underlying logic based on a few simple questions.
Question 1: What do you as a researcher expect to find?
You may have taken a course that introduces you to research methodology and know that...

Table of contents

  1. Cover
  2. Half Title
  3. Publisher Note
  4. Title Page
  5. Copyright Page
  6. Contents
  7. Online resources
  8. About the author
  9. Acknowledgements
  10. Preface
  11. Part I The Foundations
  12. 1 Overview
  13. 2 Descriptive Statistics
  14. 3 Probability
  15. Part II Basic Research Designs
  16. 4 Categorical Data and Hypothesis Testing
  17. 5 Testing for a Difference: Two Conditions
  18. 6 Observational Studies: Two Categorical Variables
  19. 7 Observational studies: Two measurement variables
  20. 8 Testing for a Difference: Multiple Between-Subjects Conditions (ANOVA)
  21. 9 Testing for a Difference: Multiple Related Samples
  22. 10 Testing for Specific Differences: Planned and Unplanned Tests
  23. Part III Analysing Complex Designs
  24. 11 Testing for Differences: ANOVA and Factorial Designs
  25. 12 Multiple Regression
  26. 13 Factor Analysis
  27. Appendices
  28. References
  29. Index