- 446 pages
- English
- ePUB (mobile friendly)
- Available on iOS & Android
Data Analysis with IBM SPSS Statistics
About This Book
Master data management & analysis techniques with IBM SPSS Statistics 24About This Bookā¢ Leverage the power of IBM SPSS Statistics to perform efficient statistical analysis of your dataā¢ Choose the right statistical technique to analyze different types of data and build efficient models from your data with easeā¢ Overcome any hurdle that you might come across while learning the different SPSS Statistics concepts with clear instructions, tips and tricksWho This Book Is ForThis book is designed for analysts and researchers who need to work with data to discover meaningful patterns but do not have the time (or inclination) to become programmers. We assume a foundational understanding of statistics such as one would learn in a basic course or two on statistical techniques and methods. What You Will Learnā¢ Install and set up SPSS to create a working environment for analyticsā¢ Techniques for exploring data visually and statistically, assessing data quality and addressing issues related to missing dataā¢ How to import different kinds of data and work with itā¢ Organize data for analytical purposes (create new data elements, sampling, weighting, subsetting, and restructure your data)ā¢ Discover basic relationships among data elements (bivariate data patterns, differences in means, correlations)ā¢ Explore multivariate relationshipsā¢ Leverage the offerings to draw accurate insights from your research, and benefit your decision-makingIn DetailSPSS Statistics is a software package used for logical batched and non-batched statistical analysis. Analytical tools such as SPSS can readily provide even a novice user with an overwhelming amount of information and a broad range of options for analyzing patterns in the data.The journey starts with installing and configuring SPSS Statistics for first use and exploring the data to understand its potential (as well as its limitations). Use the right statistical analysis technique such as regression, classification and more, and analyze your data in the best possible manner. Work with graphs and charts to visualize your findings. With this information in hand, the discovery of patterns within the data can be undertaken. Finally, the high level objective of developing predictive models that can be applied to other situations will be addressed.By the end of this book, you will have a firm understanding of the various statistical analysis techniques offered by SPSS Statistics, and be able to master its use for data analysis with ease.Style and approachProvides a practical orientation to understanding a set of data and examining the key relationships among the data elements. Shows useful visualizations to enhance understanding and interpretation. Outlines a roadmap that focuses the process so decision regarding how to proceed can be made easily.
Frequently asked questions
Information
Principal Components and Factor Analysis
- Choosing between PCA and FA
- Description of PCA example data
- SPSS Code for initial PCA analysis of example data
- Assessing factorability of the data
- Principal components analysis--two-component run
- Description of factor analysis example data
- The reduced correlation matrix and its eigenvalues
- Factor analysis code
- Factor analysis results
Choosing between principal components analysis and factor analysis
- Regarding methods: If you wish to run PCA, there is one method--PCA. If you wish to run factor analysis, the most commonly used methods are principal axis factoring (PAF) and maximum likelihood (ML). Other methods are available in FACTOR, and you should consult a textbook or the following references for more information. Because the default method is PCA, you must explicitly specify a factor method if you intend to do factor analysis and not principal components analysis.
- Regarding communality estimates: The communality, or common variance, of a variable is the amount of variance that is shared among a set of variables that can be explained by a set of common factors. The goal of PCA is to explain the total variance among the set of variables, or at least some fraction of it, while the goal of FA is to explain the common variance among a set of variables.
As indicated in the following PCA example, the initial communality estimates for the variables are ones, while the final communality estimates depends on the order of the solution. If you specify as many components as there are factors, then the final communalities are one, while if you specify fewer components than there are factors, the final communalities are typically less than one.
In FA, SPSS Statistics FACTOR supplies initial communality estimates automatically. Typically, these are squared multiple correlations when the variable in question is regressed on the rest of the variables in the analysis. Final communalities are a byproduct of analysis elements, such as the extraction method used and the specified number of factors. - Regarding the number of components or factors: In PCA, you can extract as many components as there are variables, while in FA, the number of factors is necessarily less than the number of variables. In FA, but not PCA, the closeness of the reproduced correlations (off the main diagonal) to the observed correlations guides the choice of the number of factors to retain.
- Regarding rotation: Principal components have the geometric interpretation of being uncorrelated directions of maximum variations in the data. This interpretation holds only for the unrotated component loadings, so if you do perform rotation on component loadings, you lose this interpretation.
In the case of factor analysis, rotation is often done to aid interpretability. Ideally, after rotation, you can identify sets of variables that go together, as indicated by high loadings on a given factor. Presumably, these sets represent variables that are correlated more with each other than with the rest of the variables.
SPSS Statistics FACTOR provides a number of popular orthogonal and oblique rotation methods. Orthogonal rotations lead to uncorrelated factors, but there is no reason to think a priori that the factors are uncorrelated, so in general, you should consider oblique rotation methods which allow factors to correlate. - Regarding scores: When using PCA, you can compute component scores, and when using FA, you can compute factor scores. When computing component scores, you are mathematically projecting the observations into the space of components. We demonstrate this in the following PCA example. When computing factor scores, technical literature draws attention to the problem of factor indeterminacy (see the Mulaik reference for a discussion). For this reason, some researchers caution against computing and using factor scores.
Table of contents
- Title Page
- Copyright
- Credits
- About the Authors
- Acknowledgement
- About the Reviewers
- www.PacktPub.com
- Customer Feedback
- Preface
- Installing and Configuring SPSS
- Accessing and Organizing Data
- Statistics for Individual Data Elements
- Dealing with Missing Data and Outliers
- Visually Exploring the Data
- Sampling, Subsetting, and Weighting
- Creating New Data Elements
- Adding and Matching Files
- Aggregating and Restructuring Data
- Crosstabulation Patterns for Categorical Data
- Comparing Means and ANOVA
- Correlations
- Linear Regression
- Principal Components and Factor Analysis
- Clustering
- Discriminant Analysis