eBook - ePub

Pattern Recognition And Big Data

Name: Pattern Recognition And Big Data
ISBN: 9789813144569

Amita Pal,

Sankar K Pal,

876 pages
English
ePUB (mobile friendly)
Available on iOS & Android

eBook - ePub

Pattern Recognition And Big Data

Amita Pal,

Sankar K Pal,

About this book

-->

Containing twenty six contributions by experts from all over the world, this book presents both research and review material describing the evolution and recent developments of various pattern recognition methodologies, ranging from statistical, linguistic, fuzzy-set-theoretic, neural, evolutionary computing and rough-set-theoretic to hybrid soft computing, with significant real-life applications.

Pattern Recognition and Big Data provides state-of-the-art classical and modern approaches to pattern recognition and mining, with extensive real life applications. The book describes efficient soft and robust machine learning algorithms and granular computing techniques for data mining and knowledge discovery; and the issues associated with handling Big Data. Application domains considered include bioinformatics, cognitive machines (or machine mind developments), biometrics, computer vision, the e-nose, remote sensing and social network analysis.

--> Editor Amita Pal, Editor Sankar K Pal 0Pattern Recognition, Machine Learning, Image Processing, Computer Vision, Data Mining, Soft Computing, Rough Sets, Fuzzy Sets, Neural Networks, Evolutionary Computation, Granular Computing, Bioinformatics, Biometry, Social Networks, Cognitive Machine, Remote Sensing, Big Data0

Tools to learn more effectively

Saving Books

Keyword Search

Annotating Text

Listen to it instead

Information

Publisher

Year

eBook ISBN

Topic

Subtopic

Computer Vision & Pattern Recognition

Index

Computer Science

Chapter 1 Pattern Recognition: Evolution, Mining and Big Data

Amita Pal¹ and Sankar K. Pal²

¹ Interdisciplinary Statistical Research Unit Indian Statistical Institute, Kolkata, India

[email protected]

²Machine Intelligence Unit Indian Statistical Institute, Kolkata, India

[email protected]

This chapter traces the evolution of pattern recognition (PR) over the years, from its humble beginnings as an extension of statistical discriminant analysis, to the multidisciplinary approach that it has become now, on account of the continuous import of ideas from various scientific disciplines. It begins with an introduction to the discipline of PR, explaining the basic underlying concepts, different tasks involved, some conventional classification techniques and the subsequent development of various modern methodologies. The evolution has been nurtured and aided by the likes of statistical decision theory, the theory of formal languages (which led to the syntactic or structural approach), followed by the theories of fuzzy sets, artificial neural networks, genetic algorithms, rough sets, granular computing and support vector machines individually (leading to different modern approaches), and finally, their integration into the theory of soft computing. While tracing the journey of pattern recognition along this complex route, significant aspects are highlighted. The chapter also discusses the significance of data mining, which has drawn the attention of many PR researchers world-wide for the past couple of decades. Finally, the challenging issues of Big Data analysis are addressed along with the relevance of PR and machine learning.

1.1.Introduction

Pattern recognition is an activity that humans normally excel in. They do it almost all the time, and without conscious effort. Information received via the various sensory organs is processed almost instantaneously by the human brain so that it is possible to identify the source or content of the information, without any perceptible effort. What is even more impressive is the accuracy with which such recognition tasks can be performed even under non-ideal conditions, for instance, when the information that needs to be processed is vague, imprecise or even incomplete. In fact, most of our day-to-day activities are based on our success in performing various pattern recognition tasks. For example, when we read a book, we recognize the letters, words and, ultimately, concepts and notions, from the visual signals received by our brain, which processes them speedily and probably does a neurobiological implementation of template-matching!

The discipline of Pattern Recognition (PR) or Pattern Recognition by machine essentially deals with the problem of developing algorithms and methodologies/devices that can enable the computer-implementation of many of the recognition tasks that humans normally perform. The motivation is to perform these tasks more accurately, or faster, and perhaps, more economically than humans and, in many cases, to release them from drudgery resulting from performing routine recognition tasks repetitively and mechanically. The scope of PR also encompasses tasks humans are not good at, like reading bar codes. The goal of pattern recognition research is to devise ways and means of automating certain decision-making processes that lead to classification and recognition. PR has been a thriving field of research for the past few decades, as is amply borne out by the numerous books ([1]–[17], for example) and journals devoted exclusively to it. In this regard, mention must be made of the seminal article by Kanal [18] which gives a comprehensive review of the advances made in the field till the early nineteen-seventies. A review article by Jain et al. [19] provides an engrossing survey of the advances made in statistical pattern recognition till the end of the twentieth century.

Though the subject has attained a very high level of maturity in the past five decades or so, it remains evergreen to the researchers due to continuous cross–fertilization of ideas from disciplines like computer science, physics, neurobiology, psychology, engineering, statistics, mathematics and cognitive science. Depending on the practical need and demand, various modern methodologies have come into being, which often supplement the classical techniques. The present article gives a bird’s-eye view of the different methodologies that have evolved so far including the emergence of data mining. Since any discussion today on pattern recognition remains incomplete without the mention of Big Data, we have included a section describing the ABCs of Big data, challenging issues, and the relevance of pattern recognition, machine learning and data mining. Before we describe them, we explain briefly the basic concept of PR including supervised and unsupervised classification, and feature selection/extraction. Though these were mentioned in the leading chapter of the first edition of the volume [20], the authors repeat some of them here for the convenience of readers in following the remaining chapters.

1.2.The Pattern Recognition Problem

Let Ω denote the universe of patterns that are of interest, and let X be a vector of p variables (called features) defined on objects in Ω, which together provide some sort of numerical description for them. Let χ ⊂ ℝ^p be the feature space, or the domain of variation of X corresponding to all patterns in Ω, which contains K categories of objects, where K may or may not be known a priori. Let Ω₁, Ω₂, , . . . , Ω_K denote the corresponding categories or pattern classes. In this setup, the pattern recognition problem is to determine, for any pattern of unknown categorization (or label) from Ω and having a corresponding feature vector x, which pattern class it belongs to. Essentially, the general approach to solving the problem is to find, in some way, a partition of χ into χ₁, χ₂, , . . . , χ_K so that if x ∈ χ_r we can infer that the unknown pattern comes from the pattern class Ω_r. Obviously, this is not possible unless some additional information is provided for each of the classes, say, in the form of a set of n patterns, called training samples.

1.2.1.Supervised vs. unsupervised classification

Human pattern recognition capability is mainly learnt from past experiences, though it is certainly not possible to describe the procedure by which the human brain accomplishes this. Thus learning is an indispensable component of pattern recognizers, both human and mechanical. The information contained in the training samples provides the basis for learning in pattern recognition systems. In some cases, learning is accomplished with the help of a teacher, that is, an external agency of some sort that provides the correct labels or classifications of the training samples for building the classifier. The training samples in such cases become representatives of the classes they belong to, and can be processed in a suitable manner so that the class-specific information they carry may be distilled from them. This is referred to as supervised pattern recognition. References [5], [7], [12]–[17], [21] are a few of the many books in which detailed information on this is available.

On the other hand, if no teacher is available for a pattern classification task, that is, the training samples are not labeled, then we have a case of unsupervised pattern recognition. In such cases, learning essentially means discovery of the natural groupings inherent in the training set. The generic name for computational techniques applicable to unsupervised classification is clustering, for which there is no dearth of literature ([1], [2], [22]–[25]).

1.2.2.Feature selection and extraction

Most of the approaches designed for solving PR problems presuppose the representation of patterns by a set of measurements, called features. A judicious selection of features for building classifiers is a very crucial aspect of classifier design, and deserves careful consideration. On one hand, there is certainly nothing to lose in using all available measurements in classifier design. On the other hand, too many features make the classifier increasingly complex (sometimes confusing too), in fact, unnecessarily so, in case some of the measurements are redundant. It is encouraging to see that this aspect of classifier design has indeed been given the importance it deserves, judging from the work reported. References may be found, for example, in [3, 8, 9, 26, 27]. Two broad approaches have been used traditionally. The first is called feature selection, and is essentially the selection of the subset of measurements that optimizes some criterion of separability of classes, since, intuitively, the best set of features should discriminate most efficiently among the classes, that is, enhance the separability ...

Cover Page
Title
Copyright
Contents
Preface
1. Pattern Recognition: Evolution, Mining and Big Data
2. Pattern Classification with Gaussian Processes
3. Active Multitask Learning using Supervised and Shared Latent Topics
4. Sparse and Low-Rank Models for Visual Domain Adaptation
5. Pattern Classification using the Principle of Parsimony: Two Examples
6. Robust Learning of Classifiers in the Presence of Label Noise
7. Sparse Representation for Time-Series Classification
8. Fuzzy Sets as a Logic Canvas for Pattern Recognition
9. Optimizing Neural Network Structures to Match Pattern Recognition Task Complexity
10. Multi-Criterion Optimization and Decision Making Using Evolutionary Computing
11. Rough Sets in Pattern Recognition
12. The Twin SVM Minimizes the Total Risk
13. Dynamic Kernels based Approaches to Analysis of Varying Length Patterns in Speech and Image Processing Tasks
14. Fuzzy Rough Granular Neural Networks for Pattern Analysis
15. Fundamentals of Rough-Fuzzy Clustering and Its Application in Bioinformatics
16. Keygraphs: Structured Features for Object Detection and Applications
17. Mining Multimodal Data
18. Solving Classification Problems on Human Epithelial Type Cells for Anti-Nuclear Antibodies Test: Traditional versus Contemporary Approaches
19. Representation Learning for Spoken Term Detection
20. Tongue Pattern Recognition to Detect Diabetes Mellitus and Non-Proliferative Diabetic Retinopathy
21. Moving Object Detection using Multi-layer Markov Random Field Model
22. Recent Advances in Remote Sensing Time Series Image Classification
23. Sensor Selection for E-Nose
24. Understanding the Usage of Idioms in Twitter Social Network
25. Sampling Theorems for Twitter: Ideas from Large Deviation Theory
26. A Machine-mind Architecture and Z-numbers for Real-world Comprehension
Author Index
Subject Index
About the Editors

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription

No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn how to download books offline

Perlego offers two plans: Essential and Complete

Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.

Both plans are available with monthly, semester, or annual billing cycles.

We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 990+ topics, we’ve got you covered! Learn about our mission

Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more about Read Aloud

Yes! You can use the Perlego app on both iOS and Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app

Yes, you can access Pattern Recognition And Big Data by Amita Pal, Sankar K Pal in PDF and/or ePUB format, as well as other popular books in Computer Science & Computer Vision & Pattern Recognition. We have over one million books available in our catalogue for you to explore.