Both Traditional Students and Working Professionals Acquire the Skills to Analyze Social Problems. Big Data and Social Science: A Practical Guide to Methods and Tools shows how to apply data science to real-world problems in both research and the practice. The book provides practical guidance on combining methods and tools from computer science, statistics, and social science. This concrete approach is illustrated throughout using an important national problem, the quantitative study of innovation. The text draws on the expertise of prominent leaders in statistics, the social sciences, data science, and computer science to teach students how to use modern social science research principles as well as the best analytical and computational tools. It uses a real-world challenge to introduce how these tools are used to identify and capture appropriate data, apply data science models and tools to that data, and recognize and respond to data errors and limitations. For more information, including sample chapters and news, please visit the author's website.

An Update of the Most Popular Graduate-Level Introductions to Bayesian Statistics for Social Scientists Now that Bayesian modeling has become standard, MCMC is well understood and trusted, and computing power continues to increase, Bayesian Methods: A Social and Behavioral Sciences Approach, Third Edition focuses more on implementation details of the procedures and less on justifying procedures. The expanded examples reflect this updated approach. New to the Third Edition A chapter on Bayesian decision theory, covering Bayesian and frequentist decision theory as well as the connection of empirical Bayes with James–Stein estimation A chapter on the practical implementation of MCMC methods using the BUGS software Greatly expanded chapter on hierarchical models that shows how this area is well suited to the Bayesian paradigm Many new applications from a variety of social science disciplines Double the number of exercises, with 20 now in each chapter Updated BaM package in R, including new datasets, code, and procedures for calling BUGS packages from R This bestselling, highly praised text continues to be suitable for a range of courses, including an introductory course or a computing-centered course. It shows students in the social and behavioral sciences how to use Bayesian methods in practice, preparing them for sophisticated, real-world work in the field.

Taking a practical approach that draws on the authors’ extensive teaching, consulting, and research experiences, Applied Survey Data Analysis provides an intermediate-level statistical overview of the analysis of complex sample survey data. It emphasizes methods and worked examples using available software procedures while reinforcing the principles and theory that underlie those methods. After introducing a step-by-step process for approaching a survey analysis problem, the book presents the fundamental features of complex sample designs and shows how to integrate design characteristics into the statistical methods and software for survey estimation and inference. The authors then focus on the methods and models used in analyzing continuous, categorical, and count-dependent variables; event history; and missing data problems. Some of the techniques discussed include univariate descriptive and simple bivariate analyses, the linear regression model, generalized linear regression modeling methods, the Cox proportional hazards model, discrete time models, and the multiple imputation analysis method. The final chapter covers new developments in survey applications of advanced statistical techniques, including model-based analysis approaches. Designed for readers working in a wide array of disciplines who use survey data in their work, this book also provides a useful framework for integrating more in-depth studies of the theory and methods of survey data analysis. A guide to the applied statistical analysis and interpretation of survey data, it contains many examples and practical exercises based on major real-world survey data sets. Although the authors use Stata for most examples in the text, they offer SAS, SPSS, SUDAAN, R, WesVar, IVEware, and Mplus software code for replicating the examples on the book’s website: http://www.isr.umich.edu/src/smp/asda/

Survey sampling is fundamentally an applied field. The goal in this book is to put an array of tools at the fingertips of practitioners by explaining approaches long used by survey statisticians, illustrating how existing software can be used to solve survey problems, and developing some specialized software where needed. This book serves at least three audiences: (1) Students seeking a more in-depth understanding of applied sampling either through a second semester-long course or by way of a supplementary reference; (2) Survey statisticians searching for practical guidance on how to apply concepts learned in theoretical or applied sampling courses; and (3) Social scientists and other survey practitioners who desire insight into the statistical thinking and steps taken to design, select, and weight random survey samples. Several survey data sets are used to illustrate how to design samples, to make estimates from complex surveys for use in optimizing the sample allocation, and to calculate weights. Realistic survey projects are used to demonstrate the challenges and provide a context for the solutions. The book covers several topics that either are not included or are dealt with in a limited way in other texts. These areas include: sample size computations for multistage designs; power calculations related to surveys; mathematical programming for sample allocation in a multi-criteria optimization setting; nuts and bolts of area probability sampling; multiphase designs; quality control of survey operations; and statistical software for survey sampling and estimation. An associated R package, PracTools, contains a number of specialized functions for sample size and other calculations. The data sets used in the book are also available in PracTools, so that the reader may replicate the examples or perform further analyses.

Multivariable Modeling and Multivariate Analysis for the Behavioral Sciences shows students how to apply statistical methods to behavioral science data in a sensible manner. Assuming some familiarity with introductory statistics, the book analyzes a host of real-world data to provide useful answers to real-life issues. The author begins by exploring the types and design of behavioral studies. He also explains how models are used in the analysis of data. After describing graphical methods, such as scatterplot matrices, the text covers simple linear regression, locally weighted regression, multiple linear regression, regression diagnostics, the equivalence of regression and ANOVA, the generalized linear model, and logistic regression. The author then discusses aspects of survival analysis, linear mixed effects models for longitudinal data, and the analysis of multivariate data. He also shows how to carry out principal components, factor, and cluster analyses. The final chapter presents approaches to analyzing multivariate observations from several different populations. Through real-life applications of statistical methodology, this book elucidates the implications of behavioral science studies for statistical analysis. It equips behavioral science students with enough statistical tools to help them succeed later on in their careers. Solutions to the problems as well as all R code and data sets for the examples are available at www.crcpress.com

Learn the techniques and math you need to start making sense of your data About This Book Enhance your knowledge of coding with data science theory for practical insight into data science and analysis More than just a math class, learn how to perform real-world data science tasks with R and Python Create actionable insights and transform raw data into tangible value Who This Book Is For You should be fairly well acquainted with basic algebra and should feel comfortable reading snippets of R/Python as well as pseudo code. You should have the urge to learn and apply the techniques put forth in this book on either your own data sets or those provided to you. If you have the basic math skills but want to apply them in data science or you have good programming skills but lack math, then this book is for you. What You Will Learn Get to know the five most important steps of data science Use your data intelligently and learn how to handle it with care Bridge the gap between mathematics and programming Learn about probability, calculus, and how to use statistical models to control and clean your data and drive actionable results Build and evaluate baseline machine learning models Explore the most effective metrics to determine the success of your machine learning models Create data visualizations that communicate actionable insights Read and apply machine learning concepts to your problems and make actual predictions In Detail Need to turn your skills at programming into effective data science skills? Principles of Data Science is created to help you join the dots between mathematics, programming, and business analysis. With this book, you'll feel confident about asking—and answering—complex and sophisticated questions of your data to move from abstract and raw statistics to actionable ideas. With a unique approach that bridges the gap between mathematics and computer science, this books takes you through the entire data science pipeline. Beginning with cleaning and preparing data, and effective data mining strategies and techniques, you'll move on to build a comprehensive picture of how every piece of the data science puzzle fits together. Learn the fundamentals of computational mathematics and statistics, as well as some pseudocode being used today by data scientists and analysts. You'll get to grips with machine learning, discover the statistical models that help you take control and navigate even the densest datasets, and find out how to create powerful visualizations that communicate what your data means. Style and approach This is an easy-to-understand and accessible tutorial. It is a step-by-step guide with use cases, examples, and illustrations to get you well-versed with the concepts of data science. Along with explaining the fundamentals, the book will also introduce you to slightly advanced concepts later on and will help you implement these techniques in the real world.

Providing a practical, thorough understanding of how factor analysis works, Foundations of Factor Analysis, Second Edition discusses the assumptions underlying the equations and procedures of this method. It also explains the options in commercial computer programs for performing factor analysis and structural equation modeling. This long-awaited edition takes into account the various developments that have occurred since the publication of the original edition. New to the Second Edition A new chapter on the multivariate normal distribution, its general properties, and the concept of maximum-likelihood estimation More complete coverage of descriptive factor analysis and doublet factor analysis A rewritten chapter on analytic oblique rotation that focuses on the gradient projection algorithm and its applications Discussions on the developments of factor score indeterminacy A revised chapter on confirmatory factor analysis that addresses philosophy of science issues, model specification and identification, parameter estimation, and algorithm derivation Presenting the mathematics only as needed to understand the derivation of an equation or procedure, this textbook prepares students for later courses on structural equation modeling. It enables them to choose the proper factor analytic procedure, make modifications to the procedure, and produce new results.

Devising tests that evaluate a nation’s educational standing and implement efficacious educational reforms requires a careful balance among the contributions of technology, psychometrics, test design, and the learning sciences. Unlike other forms of adaptive testing, multistage testing (MST) is highly suitable for testing educational achievement because it can be adapted to educational surveys and student testing. Computerized Multistage Testing: Theory and Applications covers the methodologies, underlying technology, and implementation aspects of this type of test design. The book discusses current scientific perspectives and practical considerations for each step involved in setting up an MST program. It covers the history of MST, test design and implementation for various purposes, item pool development and maintenance, IRT-based and classical test theory-based methodologies for test assembly, routing and scoring, equating, test security, and existing software. It also explores current research, existing operational programs, and innovative future assessments using MST. Intended for psychologists, social scientists, and educational measurement scientists, this volume provides the first unified source of information on the design, psychometrics, implementation, and operational use of MST. It shows how to apply theoretical statistical tools to testing in novel and useful ways. It also explains how to explicitly tie the assumptions made by each model to observable (or at least inferable) data conditions. Winner of the 2016 AERA Award for Significant Contribution to Educational Measurement and Research Methodology The 2016 American Education Research Association (AERA) Div. D award committee for Significant Contributions to Educational Measurement and Research Methodology has recognized unanimously this collaborative work advancing the theory and applications of computerized MST. This annual award recognizes published research judged to represent a significant conceptual advancement in the theory and practice of educational measurement and/or educational research methodology. The 2016 award was made under the heading: Measurement, Psychometrics, and Assessment. This collective work, published in 2014 as an edited volume titled Computerized Multistage Testing: Theory and Applications, was cited by the committee both for the originality of the conceptual foundations presented in support of multistage testing and for arguing persuasively for its potential impact on the practice of educational measurement.

Walking readers step by step through complex concepts, this book translates missing data techniques into something that applied researchers and graduate students can understand and utilize in their own research. Enders explains the rationale and procedural details for maximum likelihood estimation, Bayesian estimation, multiple imputation, and models for handling missing not at random (MNAR) data. Easy-to-follow examples and small simulated data sets illustrate the techniques and clarify the underlying principles. The companion website (www.appliedmissingdata.com) includes data files and syntax for the examples in the book as well as up-to-date information on software. The book is accessible to substantive researchers while providing a level of detail that will satisfy quantitative specialists.

This textbook provides a comprehensive and reader-friendly introduction to the field of computational social science (CSS). Presenting a unified treatment, the text examines in detail the four key methodological approaches of automated social information extraction, social network analysis, social complexity theory, and social simulation modeling. This updated new edition has been enhanced with numerous review questions and exercises to test what has been learned, deepen understanding through problem-solving, and to practice writing code to implement ideas. Topics and features: contains more than a thousand questions and exercises, together with a list of acronyms and a glossary; examines the similarities and differences between computers and social systems; presents a focus on automated information extraction; discusses the measurement, scientific laws, and generative theories of social complexity in CSS; reviews the methodology of social simulations, covering both variable- and object-oriented models.

The last two decades have seen enormous developments in statistical methods for incomplete data. The EM algorithm and its extensions, multiple imputation, and Markov Chain Monte Carlo provide a set of flexible and reliable tools from inference in large classes of missing-data problems. Yet, in practical terms, those developments have had surprisingly little impact on the way most data analysts handle missing values on a routine basis. Analysis of Incomplete Multivariate Data helps bridge the gap between theory and practice, making these missing-data tools accessible to a broad audience. It presents a unified, Bayesian approach to the analysis of incomplete multivariate data, covering datasets in which the variables are continuous, categorical, or both. The focus is applied, where necessary, to help readers thoroughly understand the statistical properties of those methods, and the behavior of the accompanying algorithms. All techniques are illustrated with real data examples, with extended discussion and practical advice. All of the algorithms described in this book have been implemented by the author for general use in the statistical languages S and S Plus. The software is available free of charge on the Internet.

Drawing on the authors’ extensive research in the analysis of categorical longitudinal data, Latent Markov Models for Longitudinal Data focuses on the formulation of latent Markov models and the practical use of these models. Numerous examples illustrate how latent Markov models are used in economics, education, sociology, and other fields. The R and MATLAB® routines used for the examples are available on the authors’ website. The book provides you with the essential background on latent variable models, particularly the latent class model. It discusses how the Markov chain model and the latent class model represent a useful paradigm for latent Markov models. The authors illustrate the assumptions of the basic version of the latent Markov model and introduce maximum likelihood estimation through the Expectation-Maximization algorithm. They also cover constrained versions of the basic latent Markov model, describe the inclusion of the individual covariates, and address the random effects and multilevel extensions of the model. After covering advanced topics, the book concludes with a discussion on Bayesian inference as an alternative to maximum likelihood inference. As longitudinal data become increasingly relevant in many fields, researchers must rely on specific statistical and econometric models tailored to their application. A complete overview of latent Markov models, this book demonstrates how to use the models in three types of analysis: transition analysis with measurement errors, analyses that consider unobserved heterogeneity, and finding clusters of units and studying the transition between the clusters.

Incorporating a hands-on pedagogical approach, Nonparametric Statistics for Social and Behavioral Sciences presents the concepts, principles, and methods used in performing many nonparametric procedures. It also demonstrates practical applications of the most common nonparametric procedures using IBM’s SPSS software. This text is the only current nonparametric book written specifically for students in the behavioral and social sciences. Emphasizing sound research designs, appropriate statistical analyses, and accurate interpretations of results, the text: Explains a conceptual framework for each statistical procedure Presents examples of relevant research problems, associated research questions, and hypotheses that precede each procedure Details SPSS paths for conducting various analyses Discusses the interpretations of statistical results and conclusions of the research With minimal coverage of formulas, the book takes a nonmathematical approach to nonparametric data analysis procedures and shows students how they are used in research contexts. Each chapter includes examples, exercises, and SPSS screen shots illustrating steps of the statistical procedures and resulting output.

In addition to learning how to apply classic statistical methods, students need to understand when these methods perform well, and when and why they can be highly unsatisfactory. Modern Statistics for the Social and Behavioral Sciences illustrates how to use R to apply both standard and modern methods to correct known problems with classic techniques. Numerous illustrations provide a conceptual basis for understanding why practical problems with classic methods were missed for so many years, and why modern techniques have practical value. Designed for a two-semester, introductory course for graduate students in the social sciences, this text introduces three major advances in the field: Early studies seemed to suggest that normality can be assumed with relatively small sample sizes due to the central limit theorem. However, crucial issues were missed. Vastly improved methods are now available for dealing with non-normality. The impact of outliers and heavy-tailed distributions on power and our ability to obtain an accurate assessment of how groups differ and variables are related is a practical concern when using standard techniques, regardless of how large the sample size might be. Methods for dealing with this insight are described. The deleterious effects of heteroscedasticity on conventional ANOVA and regression methods are much more serious than once thought. Effective techniques for dealing heteroscedasticity are described and illustrated. Requiring no prior training in statistics, Modern Statistics for the Social and Behavioral Sciences provides a graduate-level introduction to basic, routinely used statistical techniques relevant to the social and behavioral sciences. It describes and illustrates methods developed during the last half century that deal with known problems associated with classic techniques. Espousing the view that no single method is always best, it imparts a general understanding of the relative merits of various techniques so that the choice of method can be made in an informed manner.

Statistical Rethinking: A Bayesian Course with Examples in R and Stan builds readers’ knowledge of and confidence in statistical modeling. Reflecting the need for even minor programming in today’s model-based statistics, the book pushes readers to perform step-by-step calculations that are usually automated. This unique computational approach ensures that readers understand enough of the details to make reasonable choices and interpretations in their own modeling work. The text presents generalized linear multilevel models from a Bayesian perspective, relying on a simple logical interpretation of Bayesian probability and maximum entropy. It covers from the basics of regression to multilevel models. The author also discusses measurement error, missing data, and Gaussian process models for spatial and network autocorrelation. By using complete R code examples throughout, this book provides a practical foundation for performing statistical inference. Designed for both PhD students and seasoned professionals in the natural and social sciences, it prepares them for more advanced or specialized statistical modeling. Web Resource The book is accompanied by an R package (rethinking) that is available on the author’s website and GitHub. The two core functions (map and map2stan) of this package allow a variety of statistical models to be constructed from standard model formulas.

Quantitative research in social science research is changing rapidly. Researchers have vast and complex arrays of data with which to work: we have incredible tools to sift through the data and recognize patterns in that data; there are now many sophisticated models that we can use to make sense of those patterns; and we have extremely powerful computational systems that help us accomplish these tasks quickly. This book focuses on some of the extraordinary work being conducted in computational social science - in academia, government, and the private sector - while highlighting current trends, challenges, and new directions. Thus, Computational Social Science showcases the innovative methodological tools being developed and applied by leading researchers in this new field. The book shows how academics and the private sector are using many of these tools to solve problems in social science and public policy.

"We live, today, in world of big data. The amount of information collected on human behavior every day is staggering, and exponentially greater than at any time in the past. At the same time, we are inundated by stories of powerful algorithms capable of churning through this sea of data and uncovering patterns. These techniques go by many names - data mining, predictive analytics, machine learning - and they are being used by governments as they spy on citizens and by huge corporations are they fine-tune their advertising strategies. And yet social scientists continue mainly to employ a set of analytical tools developed in an earlier era when data was sparse and difficult to come by. In this timely book, Paul Attewell and David Monaghan provide a simple and accessible introduction to Data Mining geared towards social scientists. They discuss how the data mining approach differs substantially, and in some ways radically, from that of conventional statistical modeling familiar to most social scientists. They demystify data mining, describing the diverse set of techniques that the term covers and discussing the strengths and weaknesses of the various approaches. Finally they give practical demonstrations of how to carry out analyses using data mining tools in a number of statistical software packages. It is the hope of the authors that this book will empower social scientists to consider incorporating data mining methodologies in their analytical toolkits"--Provided by publisher.

A powerful tool for analyzing nested designs in a variety of fields, multilevel/hierarchical modeling allows researchers to account for data collected at multiple levels. Multilevel Modeling Using R provides you with a helpful guide to conducting multilevel data modeling using the R software environment. After reviewing standard linear models, the authors present the basics of multilevel models and explain how to fit these models using R. They then show how to employ multilevel modeling with longitudinal data and demonstrate the valuable graphical options in R. The book also describes models for categorical dependent variables in both single level and multilevel data. The book concludes with Bayesian fitting of multilevel models. For those new to R, the appendix provides an introduction to this system that covers basic R knowledge necessary to run the models in the book. Through the R code and detailed explanations provided, this book gives you the tools to launch your own investigations in multilevel modeling and gain insight into your research.

This book is designed primarily for upper level undergraduate and graduate level students taking a course in multilevel modelling and/or statistical modelling with a large multilevel modelling component. The focus is on presenting the theory and practice of major multilevel modelling techniques in a variety of contexts, using Mplus as the software tool, and demonstrating the various functions available for these analyses in Mplus, which is widely used by researchers in various fields, including most of the social sciences. In particular, Mplus offers users a wide array of tools for latent variable modelling, including for multilevel data.

Make healthcare analytics work: leverage its powerful opportunities for improving outcomes, cost, and efficiency.This book gives you thepractical frameworks, strategies, tactics, and case studies you need to go beyond talk to action. The contributing healthcare analytics innovators survey the field's current state, present start-to-finish guidance for planning and implementation, and help decision-makers prepare for tomorrow's advances. They present in-depth case studies revealing how leading organizations have organized and executed analytic strategies that work, and fully cover the primary applications of analytics in all three sectors of the healthcare ecosystem: Provider, Payer, and Life Sciences. Co-published with the International Institute for Analytics (IIA), this book features the combined expertise of IIA's team of leading health analytics practitioners and researchers. Each chapter is written by a member of the IIA faculty, and bridges the latest research findings with proven best practices. This book will be valuable to professionals and decision-makers throughout the healthcare ecosystem, including provider organization clinicians and managers; life sciences researchers and practitioners; and informaticists, actuaries, and managers at payer organizations. It will also be valuable in diverse analytics, operations, and IT courses in business, engineering, and healthcare certificate programs.