Data Analysis and Applications 1
eBook - ePub

Data Analysis and Applications 1

Clustering and Regression, Modeling-estimating, Forecasting and Data Mining

  1. English
  2. ePUB (mobile friendly)
  3. Available on iOS & Android
eBook - ePub

Data Analysis and Applications 1

Clustering and Regression, Modeling-estimating, Forecasting and Data Mining

Book details
Book preview
Table of contents
Citations

About This Book

This series of books collects a diverse array of work that provides the reader with theoretical and applied information on data analysis methods, models, and techniques, along with appropriate applications. Volume 1 begins with an introductory chapter by Gilbert Saporta, a leading expert in the field, who summarizes the developments in data analysis over the last 50 years. The book is then divided into three parts: Part 1 presents clustering and regression cases; Part 2 examines grouping and decomposition, GARCH and threshold models, structural equations, and SME modeling; and Part 3 presents symbolic data analysis, time series and multiple choice models, modeling in demography, and data mining.

Frequently asked questions

Simply head over to the account section in settings and click on ā€œCancel Subscriptionā€ - itā€™s as simple as that. After you cancel, your membership will stay active for the remainder of the time youā€™ve paid for. Learn more here.
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
Both plans give you full access to the library and all of Perlegoā€™s features. The only differences are the price and subscription period: With the annual plan youā€™ll save around 30% compared to 12 months on the monthly plan.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, weā€™ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes, you can access Data Analysis and Applications 1 by Christos H. Skiadas, James R. Bozeman, Christos H. Skiadas, James R. Bozeman in PDF and/or ePUB format, as well as other popular books in Mathematics & Probability & Statistics. We have over one million books available in our catalogue for you to explore.

Information

Publisher
Wiley-ISTE
Year
2019
ISBN
9781119597681
Edition
1

PART 1
Clustering and Regression

1
Cluster Validation by Measurement of Clustering Characteristics Relevant to the User

There are many cluster analysis methods that can produce quite different clusterings on the same data set. Cluster validation is about the evaluation of the quality of a clustering; ā€œrelative cluster validationā€ is about using such criteria to compare clusterings. This can be used to select one of a set of clusterings from different methods, or from the same method ran with different parameters such as different numbers of clusters.
There are many cluster validation indexes in the literature. Most of them attempt to measure the overall quality of a clustering by a single number, but this can be inappropriate. There are various different characteristics of a clustering that can be relevant in practice, depending on the aim of clustering, such as low within-cluster distances and high between-cluster separation.
In this chapter, a number of validation criteria will be introduced that refer to different desirable characteristics of a clustering, and that characterize a clustering in a multidimensional way. In specific applications the user may be interested in some of these criteria rather than others. A focus of the chapter is on methodology to standardize the different characteristics so that users can aggregate them in a suitable way specifying weights for the various criteria that are relevant in the clustering application at hand.

1.1. Introduction

The aim of this chapter is to present a range of cluster validation indexes that provide a multivariate assessment covering different complementary aspects of cluster validity. Here, I focus on ā€œinternalā€ validation criteria that measure the quality of a clustering without reference to external information such as a known ā€œtrueā€ clustering. Furthermore, I am mostly interested in comparing different clusterings on the same data, which is often referred to as ā€œrelativeā€ cluster validation. This can be used to select one of a set of clusterings from different methods, or from the same method ran with different parameters such as different numbers of clusters.
In the literature (for an overview, see Halkidi et al. 2016), many cluster validation indexes are proposed. Usually, these are advertised as measures of ...

Table of contents

  1. Cover
  2. Table of Contents
  3. Preface
  4. Introduction: 50 Years of Data Analysis: From Exploratory Data Analysis to Predictive Modeling and Machine Learning
  5. PART 1: Clustering and Regression
  6. PART 2: Models and Modeling
  7. PART 3: Estimators, Forecasting and Data Mining
  8. List of Authors
  9. Index
  10. End User License Agreement