Pattern Recognition And Big Data
eBook - ePub

Pattern Recognition And Big Data

Amita Pal, Sankar K Pal

  1. 876 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

Pattern Recognition And Big Data

Amita Pal, Sankar K Pal

Book details
Book preview
Table of contents
Citations

About This Book

-->

Containing twenty six contributions by experts from all over the world, this book presents both research and review material describing the evolution and recent developments of various pattern recognition methodologies, ranging from statistical, linguistic, fuzzy-set-theoretic, neural, evolutionary computing and rough-set-theoretic to hybrid soft computing, with significant real-life applications.

Pattern Recognition and Big Data provides state-of-the-art classical and modern approaches to pattern recognition and mining, with extensive real life applications. The book describes efficient soft and robust machine learning algorithms and granular computing techniques for data mining and knowledge discovery; and the issues associated with handling Big Data. Application domains considered include bioinformatics, cognitive machines (or machine mind developments), biometrics, computer vision, the e-nose, remote sensing and social network analysis.

--> Editor Amita Pal, Editor Sankar K Pal 0Pattern Recognition, Machine Learning, Image Processing, Computer Vision, Data Mining, Soft Computing, Rough Sets, Fuzzy Sets, Neural Networks, Evolutionary Computation, Granular Computing, Bioinformatics, Biometry, Social Networks, Cognitive Machine, Remote Sensing, Big Data0

Frequently asked questions

How do I cancel my subscription?
Simply head over to the account section in settings and click on ā€œCancel Subscriptionā€ - itā€™s as simple as that. After you cancel, your membership will stay active for the remainder of the time youā€™ve paid for. Learn more here.
Can/how do I download books?
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
What is the difference between the pricing plans?
Both plans give you full access to the library and all of Perlegoā€™s features. The only differences are the price and subscription period: With the annual plan youā€™ll save around 30% compared to 12 months on the monthly plan.
What is Perlego?
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, weā€™ve got you covered! Learn more here.
Do you support text-to-speech?
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Is Pattern Recognition And Big Data an online PDF/ePUB?
Yes, you can access Pattern Recognition And Big Data by Amita Pal, Sankar K Pal in PDF and/or ePUB format, as well as other popular books in Computer Science & Computer Vision & Pattern Recognition. We have over one million books available in our catalogue for you to explore.

Information

Chapter 1

Pattern Recognition: Evolution, Mining and Big Data

Amita Pal1 and Sankar K. Pal2
1 Interdisciplinary Statistical Research Unit Indian Statistical Institute, Kolkata, India
2Machine Intelligence Unit Indian Statistical Institute, Kolkata, India
This chapter traces the evolution of pattern recognition (PR) over the years, from its humble beginnings as an extension of statistical discriminant analysis, to the multidisciplinary approach that it has become now, on account of the continuous import of ideas from various scientific disciplines. It begins with an introduction to the discipline of PR, explaining the basic underlying concepts, different tasks involved, some conventional classification techniques and the subsequent development of various modern methodologies. The evolution has been nurtured and aided by the likes of statistical decision theory, the theory of formal languages (which led to the syntactic or structural approach), followed by the theories of fuzzy sets, artificial neural networks, genetic algorithms, rough sets, granular computing and support vector machines individually (leading to different modern approaches), and finally, their integration into the theory of soft computing. While tracing the journey of pattern recognition along this complex route, significant aspects are highlighted. The chapter also discusses the significance of data mining, which has drawn the attention of many PR researchers world-wide for the past couple of decades. Finally, the challenging issues of Big Data analysis are addressed along with the relevance of PR and machine learning.

1.1.Introduction

Pattern recognition is an activity that humans normally excel in. They do it almost all the time, and without conscious effort. Information received via the various sensory organs is processed almost instantaneously by the human brain so that it is possible to identify the source or content of the information, without any perceptible effort. What is even more impressive is the accuracy with which such recognition tasks can be performed even under non-ideal conditions, for instance, when the information that needs to be processed is vague, imprecise or even incomplete. In fact, most of our day-to-day activities are based on our success in performing various pattern recognition tasks. For example, when we read a book, we recognize the letters, words and, ultimately, concepts and notions, from the visual signals received by our brain, which processes them speedily and probably does a neurobiological implementation of template-matching!
The discipline of Pattern Recognition (PR) or Pattern Recognition by machine essentially deals with the problem of developing algorithms and methodologies/devices that can enable the computer-implementation of many of the recognition tasks that humans normally perform. The motivation is to perform these tasks more accurately, or faster, and perhaps, more economically than humans and, in many cases, to release them from drudgery resulting from performing routine recognition tasks repetitively and mechanically. The scope of PR also encompasses tasks humans are not good at, like reading bar codes. The goal of pattern recognition research is to devise ways and means of automating certain decision-making processes that lead to classification and recognition. PR has been a thriving field of research for the past few decades, as is amply borne out by the numerous books ([1]ā€“[17], for example) and journals devoted exclusively to it. In this regard, mention must be made of the seminal article by Kanal [18] which gives a comprehensive review of the advances made in the field till the early nineteen-seventies. A review article by Jain et al. [19] provides an engrossing survey of the advances made in statistical pattern recognition till the end of the twentieth century.
Though the subject has attained a very high level of maturity in the past five decades or so, it remains evergreen to the researchers due to continuous crossā€“fertilization of ideas from disciplines like computer science, physics, neurobiology, psychology, engineering, statistics, mathematics and cognitive science. Depending on the practical need and demand, various modern methodologies have come into being, which often supplement the classical techniques. The present article gives a birdā€™s-eye view of the different methodologies that have evolved so far including the emergence of data mining. Since any discussion today on pattern recognition remains incomplete without the mention of Big Data, we have included a section describing the ABCs of Big data, challenging issues, and the relevance of pattern recognition, machine learning and data mining. Before we describe them, we explain briefly the basic concept of PR including supervised and unsupervised classification, and feature selection/extraction. Though these were mentioned in the leading chapter of the first edition of the volume [20], the authors repeat some of them here for the convenience of readers in following the remaining chapters.

1.2.The Pattern Recognition Problem

Let Ī© denote the universe of patterns that are of interest, and let X be a vector of p variables (called features) defined on objects in Ī©, which together provide some sort of numerical description for them. Let Ļ‡ āŠ‚ ā„p be the feature space, or the domain of variation of X corresponding to all patterns in Ī©, which contains K categories of objects, where K may or may not be known a priori. Let Ī©1, Ī©2, , . . . , Ī©K denote the corresponding categories or pattern classes. In this setup, the pattern recognition problem is to determine, for any pattern of unknown categorization (or label) from Ī© and having a corresponding feature vector x, which pattern class it belongs to. Essentially, the general approach to solving the problem is to find, in some way, a partition of Ļ‡ into Ļ‡1, Ļ‡2, , . . . , Ļ‡K so that if x āˆˆ Ļ‡r we can infer that the unknown pattern comes from the pattern class Ī©r. Obviously, this is not possible unless some additional information is provided for each of the classes, say, in the form of a set of n patterns, called training samples.

1.2.1.Supervised vs. unsupervised classification

Human pattern recognition capability is mainly learnt from past experiences, though it is certainly not possible to describe the procedure by which the human brain accomplishes this. Thus learning is an indispensable component of pattern recognizers, both human and mechanical. The information contained in the training samples provides the basis for learning in pattern recognition systems. In some cases, learning is accomplished with the help of a teacher, that is, an external agency of some sort that provides the correct labels or classifications of the training samples for building the classifier. The training samples in such cases become representatives of the classes they belong to, and can be processed in a suitable manner so that the class-specific information they carry may be distilled from them. This is referred to as supervised pattern recognition. References [5], [7], [12]ā€“[17], [21] are a few of the many books in which detailed information on this is available.
On the other hand, if no teacher is available for a pattern classification task, that is, the training samples are not labeled, then we have a case of unsupervised pattern recognition. In such cases, learning essentially means discovery of the natural groupings inherent in the training set. The generic name for computational techniques applicable to unsupervised classification is clustering, for which there is no dearth of literature ([1], [2], [22]ā€“[25]).

1.2.2.Feature selection and extraction

Most of the approaches designed for solving PR problems presuppose the representation of patterns by a set of measurements, called features. A judicious selection of features for building classifiers is a very crucial aspect of classifier design, and deserves careful consideration. On one hand, there is certainly nothing to lose in using all available measurements in classifier design. On the other hand, too many features make the classifier increasingly complex (sometimes confusing too), in fact, unnecessarily so, in case some of the measurements are redundant. It is encouraging to see that this aspect of classifier design has indeed been given the importance it deserves, judging from the work reported. References may be found, for example, in [3, 8, 9, 26, 27]. Two broad approaches have been used traditionally. The first is called feature selection, and is essentially the selection of the subset of measurements that optimizes some criterion of separability of classes, since, intuitively, the best set of features should discriminate most efficiently among the classes, that is, enhance the separability ...

Table of contents

  1. Cover Page
  2. Title
  3. Copyright
  4. Contents
  5. Preface
  6. 1. Pattern Recognition: Evolution, Mining and Big Data
  7. 2. Pattern Classification with Gaussian Processes
  8. 3. Active Multitask Learning using Supervised and Shared Latent Topics
  9. 4. Sparse and Low-Rank Models for Visual Domain Adaptation
  10. 5. Pattern Classification using the Principle of Parsimony: Two Examples
  11. 6. Robust Learning of Classifiers in the Presence of Label Noise
  12. 7. Sparse Representation for Time-Series Classification
  13. 8. Fuzzy Sets as a Logic Canvas for Pattern Recognition
  14. 9. Optimizing Neural Network Structures to Match Pattern Recognition Task Complexity
  15. 10. Multi-Criterion Optimization and Decision Making Using Evolutionary Computing
  16. 11. Rough Sets in Pattern Recognition
  17. 12. The Twin SVM Minimizes the Total Risk
  18. 13. Dynamic Kernels based Approaches to Analysis of Varying Length Patterns in Speech and Image Processing Tasks
  19. 14. Fuzzy Rough Granular Neural Networks for Pattern Analysis
  20. 15. Fundamentals of Rough-Fuzzy Clustering and Its Application in Bioinformatics
  21. 16. Keygraphs: Structured Features for Object Detection and Applications
  22. 17. Mining Multimodal Data
  23. 18. Solving Classification Problems on Human Epithelial Type Cells for Anti-Nuclear Antibodies Test: Traditional versus Contemporary Approaches
  24. 19. Representation Learning for Spoken Term Detection
  25. 20. Tongue Pattern Recognition to Detect Diabetes Mellitus and Non-Proliferative Diabetic Retinopathy
  26. 21. Moving Object Detection using Multi-layer Markov Random Field Model
  27. 22. Recent Advances in Remote Sensing Time Series Image Classification
  28. 23. Sensor Selection for E-Nose
  29. 24. Understanding the Usage of Idioms in Twitter Social Network
  30. 25. Sampling Theorems for Twitter: Ideas from Large Deviation Theory
  31. 26. A Machine-mind Architecture and Z-numbers for Real-world Comprehension
  32. Author Index
  33. Subject Index
  34. About the Editors
Citation styles for Pattern Recognition And Big Data

APA 6 Citation

[author missing]. (2016). Pattern Recognition And Big Data ([edition unavailable]). World Scientific Publishing Company. Retrieved from https://www.perlego.com/book/853256/pattern-recognition-and-big-data-pdf (Original work published 2016)

Chicago Citation

[author missing]. (2016) 2016. Pattern Recognition And Big Data. [Edition unavailable]. World Scientific Publishing Company. https://www.perlego.com/book/853256/pattern-recognition-and-big-data-pdf.

Harvard Citation

[author missing] (2016) Pattern Recognition And Big Data. [edition unavailable]. World Scientific Publishing Company. Available at: https://www.perlego.com/book/853256/pattern-recognition-and-big-data-pdf (Accessed: 14 October 2022).

MLA 7 Citation

[author missing]. Pattern Recognition And Big Data. [edition unavailable]. World Scientific Publishing Company, 2016. Web. 14 Oct. 2022.