Computational Methods for Data Analysis
eBook - ePub

Computational Methods for Data Analysis

Yeliz Karaca,Carlo Cattani

  1. 395 páginas
  2. English
  3. ePUB (apto para móviles)
  4. Disponible en iOS y Android
eBook - ePub

Computational Methods for Data Analysis

Yeliz Karaca,Carlo Cattani

Detalles del libro
Vista previa del libro
Índice
Citas

Información del libro

This graduate text covers a variety of mathematical and statistical tools for the analysis of big data coming from biology, medicine and economics. Neural networks, Markov chains, tools from statistical physics and wavelet analysis are used to develop efficient computational algorithms, which are then used for the processing of real-life data using Matlab.

Preguntas frecuentes

¿Cómo cancelo mi suscripción?
Simplemente, dirígete a la sección ajustes de la cuenta y haz clic en «Cancelar suscripción». Así de sencillo. Después de cancelar tu suscripción, esta permanecerá activa el tiempo restante que hayas pagado. Obtén más información aquí.
¿Cómo descargo los libros?
Por el momento, todos nuestros libros ePub adaptables a dispositivos móviles se pueden descargar a través de la aplicación. La mayor parte de nuestros PDF también se puede descargar y ya estamos trabajando para que el resto también sea descargable. Obtén más información aquí.
¿En qué se diferencian los planes de precios?
Ambos planes te permiten acceder por completo a la biblioteca y a todas las funciones de Perlego. Las únicas diferencias son el precio y el período de suscripción: con el plan anual ahorrarás en torno a un 30 % en comparación con 12 meses de un plan mensual.
¿Qué es Perlego?
Somos un servicio de suscripción de libros de texto en línea que te permite acceder a toda una biblioteca en línea por menos de lo que cuesta un libro al mes. Con más de un millón de libros sobre más de 1000 categorías, ¡tenemos todo lo que necesitas! Obtén más información aquí.
¿Perlego ofrece la función de texto a voz?
Busca el símbolo de lectura en voz alta en tu próximo libro para ver si puedes escucharlo. La herramienta de lectura en voz alta lee el texto en voz alta por ti, resaltando el texto a medida que se lee. Puedes pausarla, acelerarla y ralentizarla. Obtén más información aquí.
¿Es Computational Methods for Data Analysis un PDF/ePUB en línea?
Sí, puedes acceder a Computational Methods for Data Analysis de Yeliz Karaca,Carlo Cattani en formato PDF o ePUB, así como a otros libros populares de Mathematics y Applied Mathematics. Tenemos más de un millón de libros disponibles en nuestro catálogo para que explores.

Información

Editorial
De Gruyter
Año
2018
ISBN
9783110493603
Edición
1
Categoría
Mathematics

1Introduction

In mathematics and computer science, an algorithm is defined as an unambiguous specification of how to solve a problem, based on a sequence of logical-numerical deductions. Algorithm, in fact, is a general name given to the systematic method of any kind of numerical calculation or a route drawn for solving a problem or achieving a goal utilized in the mentioned fields. Tasks such as calculation, data processing and automated reasoning can be realized through algorithms. Algorithm, as a concept, has been around for centuries. The formation of the modern algorithm started with endeavors exerted for the solution of what David Hilbert (1928) called Entscheidungsproblem (decision problem). The following formalizations of the concept were identified as efforts for the definition of “effective calculability” or “effective method” with Gödel–Herbrand–Kleene recursive functions (1930, 1934 and 1935), lambda calculus of Alonzo Church (1936), “Formulation 1” by Emil Post (1936) and the Turing machines of Alan Turing (1936–37 and 1939). Since then, particularly in the twentieth century, there has been a growing interest in data analysis algorithms as well as their applications to interdisciplinary various datasets.
Data analysis can be defined as a process of collecting raw data and converting it into information that would prove to be useful for the users in their decision-making processes. Data collection is performed and data analysis is done for the purpose of answering questions, testing hypotheses or refuting theories. According to the statistician John Tukey (1961), data analysis is defined as the set of (1) procedures for analyzing data, (2) techniques for interpreting the results of these procedures and (3) methods for planning the gathering of data so that one can render its analysis more accurate and also much easier. It also comprises the entire mechanism and outcomes of (mathematical) statistics, which are applicable to the analyzing of data. Numerous ways exist for the classification of algorithms and each of them has its own merits.
Accordingly, knowledge itself turns into power when it is processed, analyzed and interpreted in a proper and accurate way. With this key motive in mind, our general aim in this book is to ensure the integration of relevant findings in an interdisciplinary approach, discussing various relevant methods, thus putting forth a common approach for both problems and solutions. The main aim of this book is to provide the readers with core skills regarding data analysis in interdisciplinary studies. Data analysis is characterized by three typical features: (1) algorithms for classification, clustering, association analysis, modeling, data visualization as well as singling out the singularities; (2) computer algorithms’ source codes for conducting data analysis; and (3) specific fields (economics, physics, medicine, psychology, etc.) where the data are collected.
This book will help the readers establish a bridge from equations to algorithms’ source codes and from the interpretation of results to draw meaningful information about data and the process they represent. As the algorithms are developed further, it will be possible to grasp the significance of having a variety of variables. Moreover, it will be showing how to use the obtained results of data analysis for the forecasting of future developments, diagnosis and prediction in the field of medicine and related fields. In this way, we will present how knowledge merges with applications.
With this concern in mind, the book will be guiding for interdisciplinary studies to be carried out by those who are engaged in the fields of mathematics, statistics, economics, medicine, engineering, neuroengineering, computer science, neurology, cognitive sciences and psychiatry and so on.
In this book, we will analyze in detail important algorithms of data analysis and classification. We will discuss the contribution gained through linear model and multilinear model, decision trees, naive Bayesian classifier, support vector machines, k-nearest neighbor and artificial neural network (ANN) algorithms. Besides these, the book will also include fractal and multifractal methods with ANN algorithm.
The main goal of this book is to provide the readers with core skills regarding data analysis in interdisciplinary datasets. The second goal is to analyze each of the main components of data analysis:
Application of algorithms to real dataset and synthetic dataset
Specific application of data analysis algorithm in interdisciplinary datasets
Detailed description of general concepts for extracting knowledge from data, which undergird the wide-ranging array of datasets and application algorithms
Accordingly, each component has adequate resources so that data analysis can be developed through algorithms. This comprehensive collection is organized into three parts:
Classification of real dataset and synthetic dataset by algorithms
Singling out singularities features by fractals and multifractals for real dataset and synthetic datasets
Achieving high accuracy rate for classification of singled out singularities features by ANN algorithm (learning vector quantization algorithm is one of the ANN algorithms).
Moreover, we aim to coalesce three scientific endeavors and pave a way for providing direction for future applications to
real dataset and synthetic datasets,
fractals and multifractals for singled out singularities data as obtained from real datasets and synthetic datasets and
data analysis algorithms for the classification of datasets.
Main objectives are as follows:

1.1Objectives

Our book intends to enhance knowledge and facilitate learning, by using linear model and multilinear model, decision trees, naive Bayesian classifier, support vector machines, k-nearest neighbor, ANN algorithms as well as fractal and multifractal methods with ANN with the following goals:
Understand what data analysis means and how data analysis can be employed to solve real problems through the use of computational mathematics
Recognize whether data analysis solution with algorithm is a feasible alternative for a specific problem
Draw inferences on the results of a given algorithm through discovery process
Apply relevant mathematical rules and statistical techniques to evaluate the results of a given algorithm
Recognize several different computational mathematic techniques for data analysis strategies and optimize the results by selecting the most appropriate strategy
Develop a comprehensive understanding of how different data analysis techniques build models to solve problems related to decision-making, classification and selection of the more significant critical attributes from datasets and so on
Understand the types of problems that can be solved by combining an expert systems problem solving algorithm approach and a data analysis strategy
Develop a general awareness about the structure of a dataset and how a dataset can be used to enhance opportunities related to different fields which include but are not limited to psychiatry, neurology (radiology) as well as economy
Understand how data analysis through computational mathematics can be applied to algorithms via concrete examples whose procedures are explained in depth
Handle independent variables that have direct correlation with dependent variable
Learn how to use a decision tree to be able to design a rule-based system
Calculate the probability of which class the samples with certain attributes in dataset belong to
Calculate which training samples the smallest k unit belongs to among the distance vector obtained
Specify significant singled out singularities in data
Know how to implement codes and use them in accordance with computational mathematical principles

1.2Intended audience

Our intended audience are undergraduate, graduate, postgraduate students as well as academics and scholars; however, it also encompasses a wider range of readers who specialize or are interested in the applications of data analysis to real-world problems concerning various fields, such as engineering, medical studies, mathematics, physics, social sciences and economics. The purpose of the book is to provide the readers with the mathematical foundations for some of the main computational approaches to data analysis, decision-making, classification and selecting the significant critical attributes. These include techniques and methods for numerical solution of systems of linear and nonlinear algorithms. This requires making connections between techniques of numerical analysis and algorithms. The content of the book focuses on presenting the main algorithmic approaches and the underlying mathematical concepts, with particular attention given to the implementation aspects. Hence, use of typical mathematical environments, Matlab and available solvers/ libraries, is experimented throughout the chapters.
In writing this text, we directed our attention toward three groups of individuals:
Academics who wish to teach a unit and conduct a workshop or an entire course on essential computational mathematical approac...

Índice

  1. Cover
  2. Title Page
  3. Copyright
  4. Preface
  5. Acknowledgment
  6. Contents
  7. 1 Introduction
  8. 2 Dataset
  9. 3 Data preprocessing and model evaluation
  10. 4 Algorithms
  11. 5 Linear model and multilinear model
  12. 6 Decision Tree
  13. 7 Naive Bayesian classifier
  14. 8 Support vector machines algorithms
  15. 9 k-Nearest neighbor algorithm
  16. 10 Artificial neural networks algorithm
  17. 11 Fractal and multifractal methods with ANN
  18. Index
Estilos de citas para Computational Methods for Data Analysis

APA 6 Citation

Karaca, Y., & Cattani, C. (2018). Computational Methods for Data Analysis (1st ed.). De Gruyter. Retrieved from https://www.perlego.com/book/886463/computational-methods-for-data-analysis-pdf (Original work published 2018)

Chicago Citation

Karaca, Yeliz, and Carlo Cattani. (2018) 2018. Computational Methods for Data Analysis. 1st ed. De Gruyter. https://www.perlego.com/book/886463/computational-methods-for-data-analysis-pdf.

Harvard Citation

Karaca, Y. and Cattani, C. (2018) Computational Methods for Data Analysis. 1st edn. De Gruyter. Available at: https://www.perlego.com/book/886463/computational-methods-for-data-analysis-pdf (Accessed: 14 October 2022).

MLA 7 Citation

Karaca, Yeliz, and Carlo Cattani. Computational Methods for Data Analysis. 1st ed. De Gruyter, 2018. Web. 14 Oct. 2022.