Foundations of Agnostic Statistics
Title | Foundations of Agnostic Statistics PDF eBook |
Author | Peter M. Aronow |
Publisher | Cambridge University Press |
Pages | 317 |
Release | 2019-01-31 |
Genre | Mathematics |
ISBN | 1107178916 |
Provides an introduction to modern statistical theory for social and health scientists while invoking minimal modeling assumptions.
Foundations of Data Science
Title | Foundations of Data Science PDF eBook |
Author | Avrim Blum |
Publisher | Cambridge University Press |
Pages | 433 |
Release | 2020-01-23 |
Genre | Computers |
ISBN | 1108617360 |
This book provides an introduction to the mathematical and algorithmic foundations of data science, including machine learning, high-dimensional geometry, and analysis of large networks. Topics include the counterintuitive nature of data in high dimensions, important linear algebraic techniques such as singular value decomposition, the theory of random walks and Markov chains, the fundamentals of and important algorithms for machine learning, algorithms and analysis for clustering, probabilistic models for large networks, representation learning including topic modelling and non-negative matrix factorization, wavelets and compressed sensing. Important probabilistic techniques are developed including the law of large numbers, tail inequalities, analysis of random projections, generalization guarantees in machine learning, and moment methods for analysis of phase transitions in large random graphs. Additionally, important structural and complexity measures are discussed such as matrix norms and VC-dimension. This book is suitable for both undergraduate and graduate courses in the design and analysis of algorithms for data.
Foundations of Statistics
Title | Foundations of Statistics PDF eBook |
Author | D.G. Rees |
Publisher | CRC Press |
Pages | 564 |
Release | 1987-09-01 |
Genre | Mathematics |
ISBN | 9780412285608 |
This text provides a through, straightforward first course on basics statistics. Emphasizing the application of theory, it contains 200 fully worked examples and supplies exercises in each chapter-complete with hints and answers.
Elementary Probability for Applications
Title | Elementary Probability for Applications PDF eBook |
Author | Rick Durrett |
Publisher | Cambridge University Press |
Pages | 255 |
Release | 2009-07-31 |
Genre | Mathematics |
ISBN | 1139480731 |
This clear and lively introduction to probability theory concentrates on the results that are the most useful for applications, including combinatorial probability and Markov chains. Concise and focused, it is designed for a one-semester introductory course in probability for students who have some familiarity with basic calculus. Reflecting the author's philosophy that the best way to learn probability is to see it in action, there are more than 350 problems and 200 examples. The examples contain all the old standards such as the birthday problem and Monty Hall, but also include a number of applications not found in other books, from areas as broad ranging as genetics, sports, finance, and inventory management.
Modern Mathematical Statistics with Applications
Title | Modern Mathematical Statistics with Applications PDF eBook |
Author | Jay L. Devore |
Publisher | Springer Nature |
Pages | 981 |
Release | 2021-04-29 |
Genre | Mathematics |
ISBN | 3030551563 |
This 3rd edition of Modern Mathematical Statistics with Applications tries to strike a balance between mathematical foundations and statistical practice. The book provides a clear and current exposition of statistical concepts and methodology, including many examples and exercises based on real data gleaned from publicly available sources. Here is a small but representative selection of scenarios for our examples and exercises based on information in recent articles: Use of the “Big Mac index” by the publication The Economist as a humorous way to compare product costs across nations Visualizing how the concentration of lead levels in cartridges varies for each of five brands of e-cigarettes Describing the distribution of grip size among surgeons and how it impacts their ability to use a particular brand of surgical stapler Estimating the true average odometer reading of used Porsche Boxsters listed for sale on www.cars.com Comparing head acceleration after impact when wearing a football helmet with acceleration without a helmet Investigating the relationship between body mass index and foot load while running The main focus of the book is on presenting and illustrating methods of inferential statistics used by investigators in a wide variety of disciplines, from actuarial science all the way to zoology. It begins with a chapter on descriptive statistics that immediately exposes the reader to the analysis of real data. The next six chapters develop the probability material that facilitates the transition from simply describing data to drawing formal conclusions based on inferential methodology. Point estimation, the use of statistical intervals, and hypothesis testing are the topics of the first three inferential chapters. The remainder of the book explores the use of these methods in a variety of more complex settings. This edition includes many new examples and exercises as well as an introduction to the simulation of events and probability distributions. There are more than 1300 exercises in the book, ranging from very straightforward to reasonably challenging. Many sections have been rewritten with the goal of streamlining and providing a more accessible exposition. Output from the most common statistical software packages is included wherever appropriate (a feature absent from virtually all other mathematical statistics textbooks). The authors hope that their enthusiasm for the theory and applicability of statistics to real world problems will encourage students to pursue more training in the discipline.
Text as Data
Title | Text as Data PDF eBook |
Author | Justin Grimmer |
Publisher | Princeton University Press |
Pages | 360 |
Release | 2022-01-04 |
Genre | Social Science |
ISBN | 0691207992 |
A guide for using computational text analysis to learn about the social world From social media posts and text messages to digital government documents and archives, researchers are bombarded with a deluge of text reflecting the social world. This textual data gives unprecedented insights into fundamental questions in the social sciences, humanities, and industry. Meanwhile new machine learning tools are rapidly transforming the way science and business are conducted. Text as Data shows how to combine new sources of data, machine learning tools, and social science research design to develop and evaluate new insights. Text as Data is organized around the core tasks in research projects using text—representation, discovery, measurement, prediction, and causal inference. The authors offer a sequential, iterative, and inductive approach to research design. Each research task is presented complete with real-world applications, example methods, and a distinct style of task-focused research. Bridging many divides—computer science and social science, the qualitative and the quantitative, and industry and academia—Text as Data is an ideal resource for anyone wanting to analyze large collections of text in an era when data is abundant and computation is cheap, but the enduring challenges of social science remain. Overview of how to use text as data Research design for a world of data deluge Examples from across the social sciences and industry
Demystifying Causal Inference
Title | Demystifying Causal Inference PDF eBook |
Author | Vikram Dayal |
Publisher | Springer Nature |
Pages | 304 |
Release | 2023-09-29 |
Genre | Business & Economics |
ISBN | 9819939054 |
This book provides an accessible introduction to causal inference and data analysis with R, specifically for a public policy audience. It aims to demystify these topics by presenting them through practical policy examples from a range of disciplines. It provides a hands-on approach to working with data in R using the popular tidyverse package. High quality R packages for specific causal inference techniques like ggdag, Matching, rdrobust, dosearch etc. are used in the book. The book is in two parts. The first part begins with a detailed narrative about John Snow’s heroic investigations into the cause of cholera. The chapters that follow cover basic elements of R, regression, and an introduction to causality using the potential outcomes framework and causal graphs. The second part covers specific causal inference methods, including experiments, matching, panel data, difference-in-differences, regression discontinuity design, instrumental variables and meta-analysis, with the help of empirical case studies of policy issues. The book adopts a layered approach that makes it accessible and intuitive, using helpful concepts, applications, simulation, and data graphs. Many public policy questions are inherently causal, such as the effect of a policy on a particular outcome. Hence, the book would not only be of interest to students in public policy and executive education, but also to anyone interested in analysing data for application to public policy.