Analysis of Integrated Data
eBook - ePub

Analysis of Integrated Data

  1. 256 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

Analysis of Integrated Data

Book details
Table of contents
Citations

About This Book

The advent of "Big Data" has brought with it a rapid diversification of data sources, requiring analysis that accounts for the fact that these data have often been generated and recorded for different reasons. Data integration involves combining data residing in different sources to enable statistical inference, or to generate new statistical data for purposes that cannot be served by each source on its own. This can yield significant gains for scientific as well as commercial investigations.

However, valid analysis of such data should allow for the additional uncertainty due to entity ambiguity, whenever it is not possible to state with certainty that the integrated source is the target population of interest. Analysis of Integrated Data aims to provide a solid theoretical basis for this statistical analysis in three generic settings of entity ambiguity: statistical analysis of linked datasets that may contain linkage errors; datasets created by a data fusion process, where joint statistical information is simulated using the information in marginal data from non-overlapping sources; and estimation of target population size when target units are either partially or erroneously covered in each source.



  • Covers a range of topics under an overarching perspective of data integration.


  • Focuses on statistical uncertainty and inference issues arising from entity ambiguity.


  • Features state of the art methods for analysis of integrated data.


  • Identifies the important themes that will define future research and teaching in the statistical analysis of integrated data.

Analysis of Integrated Data is aimed primarily at researchers and methodologists interested in statistical methods for data from multiple sources, with a focus on data analysts in the social sciences, and in the public and private sectors.

Frequently asked questions

Simply head over to the account section in settings and click on “Cancel Subscription” - it’s as simple as that. After you cancel, your membership will stay active for the remainder of the time you’ve paid for. Learn more here.
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
Both plans give you full access to the library and all of Perlego’s features. The only differences are the price and subscription period: With the annual plan you’ll save around 30% compared to 12 months on the monthly plan.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes, you can access Analysis of Integrated Data by Li-Chun Zhang,Raymond L. Chambers in PDF and/or ePUB format, as well as other popular books in Mathematics & Probability & Statistics. We have over one million books available in our catalogue for you to explore.

Information

Year
2019
ISBN
9781351646727
Edition
1

Table of contents

  1. Cover
  2. Half Title
  3. Series Page
  4. Title Page
  5. Copyright Page
  6. Dedication
  7. Contents
  8. Preface
  9. Contributors
  10. 1. Introduction
  11. 2. On secondary analysis of datasets that cannot be linked without errors
  12. 3. Capture-recapture methods in the presence of linkage errors
  13. 4. An overview on uncertainty and estimation in statistical matching
  14. 5. Auxiliary variable selection in a statistical matching problem
  15. 6. Minimal inference from incomplete 2 × 2-tables
  16. 7. Dual- and multiple-system estimation with fully and partially observed covariates
  17. 8. Estimating population size in multiple record systems with uncertainty of state identification
  18. 9. Log-linear models of erroneous list data
  19. 10. Sampling design and analysis using geo-referenced data
  20. Index