Exploring Data in Engineering, the Sciences, and Medicine (Hardcover)


Two recent and ongoing developments have greatly increased both the range of opportunities for exploratory data analysis and the variety of tools to support this type of analysis. First has been the dramatic rise in the number of publicly available datasets available free from the Internet and second has been the similarly dramatic evolution of the Open Source software movement, making powerful analysis packages like R also freely available. The objective of this book is to provide a reasonably thorough introduction to a useful subset of these analysis tools, illustrating what they are, what they do, and when and how they sometimes fail or do something very different than we expect them to. Specific topics covered include descriptive characterizations like summary statistics (mean, median, standard deviation, MAD scale estimate, etc.), graphical techniques like boxplots and nonparametric density estimates, various forms of regression modeling (standard linear regression models, logistic regression, and highly robust techniques like least trimmed squares), and the recognition and treatment of important data anomalies like outliers and missing data. In addition, the book also introduces a variety of dynamic data analysis tools, including autocorrelation analysis, parametric and nonparametric spectrum estimation, and the use of nonlinear data cleaning filters to improve dynamic characterization results. The book assumes familiarity with calculus and linear algebra, but does not assume any prior exposure to probability or statistics. Both simulation-based and real data examples are included and the book is intended either as an introductory textbook for an exploratory data analysis course like ones the author taught at the ETH where some of this material was used, or for self-study. Exercises are included at the end of each chapter and both R code and datasets are available through the associated OUP website.

R5,420

Or split into 4x interest-free payments of 25% on orders over R50
Learn more

Discovery Miles54200
Mobicred@R508pm x 12* Mobicred Info
Free Delivery
Delivery AdviceShips in 10 - 15 working days


Toggle WishListAdd to wish list
Review this Item

Product Description

Two recent and ongoing developments have greatly increased both the range of opportunities for exploratory data analysis and the variety of tools to support this type of analysis. First has been the dramatic rise in the number of publicly available datasets available free from the Internet and second has been the similarly dramatic evolution of the Open Source software movement, making powerful analysis packages like R also freely available. The objective of this book is to provide a reasonably thorough introduction to a useful subset of these analysis tools, illustrating what they are, what they do, and when and how they sometimes fail or do something very different than we expect them to. Specific topics covered include descriptive characterizations like summary statistics (mean, median, standard deviation, MAD scale estimate, etc.), graphical techniques like boxplots and nonparametric density estimates, various forms of regression modeling (standard linear regression models, logistic regression, and highly robust techniques like least trimmed squares), and the recognition and treatment of important data anomalies like outliers and missing data. In addition, the book also introduces a variety of dynamic data analysis tools, including autocorrelation analysis, parametric and nonparametric spectrum estimation, and the use of nonlinear data cleaning filters to improve dynamic characterization results. The book assumes familiarity with calculus and linear algebra, but does not assume any prior exposure to probability or statistics. Both simulation-based and real data examples are included and the book is intended either as an introductory textbook for an exploratory data analysis course like ones the author taught at the ETH where some of this material was used, or for self-study. Exercises are included at the end of each chapter and both R code and datasets are available through the associated OUP website.

Customer Reviews

No reviews or ratings yet - be the first to create one!

Product Details

General

Imprint

Oxford UniversityPress

Country of origin

United States

Release date

February 2011

Availability

Expected to ship within 10 - 15 working days

First published

2011

Authors

Dimensions

234 x 171 x 50mm (L x W x T)

Format

Hardcover - Cloth over boards

Pages

792

ISBN-13

978-0-19-508965-3

Barcode

9780195089653

Categories

LSN

0-19-508965-0



Trending On Loot