Probability Calculator

Transcription

1 Chapter 95 Introduction Most statisticians have a set of probability tables that they refer to in doing their statistical wor. This procedure provides you with a set of electronic statistical tables that will let you loo up values for various probability distributions. To run this option, select from the Other menu of the Analysis menu. A window will appear that will let you indicate which probability distribution you want to use along with various input parameters. Select the Calculate button to find and display the results. Many of the probability distributions have two selection buttons to the left of them. The first (left) button selects the inverse probability distribution. An inverse probability distribution is in a form so that when you give it a probability, it calculates the associated critical value. The second (right) button selects the regular probability distribution which is formulated so that when you give it a critical value, it calculates the (left tail) probability. Probability Distributions Beta Distribution The beta distribution is usually used because of its relationship to other distributions, such as the t and F distributions. The noncentral beta distribution function is formulated as follows: Γ( A + B) e L Pr( x A, B, L) = I ( A, B, L) = Γ( A) Γ( B)! < A, < B, L, and x = L A+ B t ( t) dt When the noncentrality parameter (NCP), L, is set to zero, the above formula reduces to the standard beta distribution, formulated as Γ( A + B) Pr(, ) x A B = ( ) ( A) ( B) t A t B Γ Γ dt When the inverse distribution is selected, you supply the probability value and the program solves for. When the regular distribution is selected, you supply and the program solves for the cumulative (left-tail) probability. 95-

2 Binomial Distribution The binomial distribution is used to model the counts of a sequence of independent binary trials in which the probability of a success, P, is constant. The total number of trials (sample size) is N. R represents the number of successes in N trials. The probability of exactly R successes is: N Pr( r = R N, P) = ( ) R P P R N R = N! R!( N R)! The probability of from to R successes is given by: R Pr(, ) = ( ) N r R N P r P P r= r N r Bivariate Normal Distribution The bivariate normal distribution is given by the formula h x + rxy y Pr( x < h, y < r) = exp r π ( r ) x and y follow the bivariate normal distribution with correlation coefficient r. dx dy Chi-Square Distribution The Chi-square distribution arises often in statistics when the normally distributed random variables are squared and added together. DF is the degrees of freedom of the estimated standard error. The noncentral Chi-square distribution function is used in power calculations. The noncentral Chi-square distribution is calculated using the formula: P( df ) = df / df Γ L e Pr( x df, L) = ( + )! P df df / t/ t e dt When the noncentrality parameter (NCP), L, is set to zero, the above formula reduces to the (central) Chi-square distribution. When the inverse distribution is selected, you supply the probability value and the program solves for. When the regular distribution is selected, you supply and the program solves for the cumulative (left-tail) probability. = L 95-

3 Correlation Coefficient Distribution The correlation coefficient distribution is formulated as follows: Pr( r R n, ρ) = r <, ρ <, and R < ( ρ ) n 3 ( )! ( ) ( n )/ ( ) ( n )/ n + i r r n ρ π 3 4 Γ i! R i= F Distribution The F distribution is used in the analysis of variance and in other places the distribution of the ratio of two variances is needed. The degrees of freedom of the numerator variance is DF and the degrees of freedom of the denominator variance is DF. The noncentral-f distribution function is used in power calculations. We calculate the noncentral-f distribution using the following relationship between the F and the beta distribution function. F( df) = F( df ) + df df df Pr( f F df, df, L) = I (,, L) When the noncentrality parameter (NCP), L, is set to zero, the above formula reduces to the standard F distribution When the inverse distribution is selected, you supply the probability value and the program solves for F. When the regular distribution is selected, you supply F and the program solves for the cumulative (left-tail) probability. Hotelling s T Distribution Hotelling s T-Squared distribution is used in multivariate analysis. We calculate the distribution using the following relationship between the F and the T distribution function. ( df + ) ( df ) Pr( x T,, df ) = Pr( x F, +, df ) df df is the number of variables and df is the degrees of freedom associated with the covariance matrix. When the inverse distribution is selected, you supply the probability value and the program solves for T. When the regular distribution is selected, you supply T and the program solves for the cumulative (left-tail) probability. i dr 95-3

4 Gamma Distribution The Gamma distribution is formulated as follows: Pr( g G A, B) = G A / B A B Γ( A) x e dx A x Γ( A) = x e dx < A, < B, and G When the inverse distribution is selected, you supply the probability value and the program solves for G. When the regular distribution is selected, you supply G and the program solves for the cumulative (left-tail) probability. Hypergeometric Distribution The hypergeometric distribution is used to model the following situation. Suppose a sample of size R is selected from a population with N items, M of which have a characteristic of interest. What is the probability that of the items in the sample have this characteristic. The probability of exactly successes is: M N M R Pr( x = N, M, R) = = N! R!( N R)! Maximum(, R-N+M) <= <= Minimum(M, R) Negative Binomial Distribution The negative binomial distribution is used to model the counts of a sequence of independent binary trials in which the probability of a success, P, is constant. The total number of trials (sample size) is N. R represents the number of successes in N trials. Unlie the binomial distribution, the sample size, N, is the variable of interest. The question answered by the negative binomial distribution is: how many tosses of a coin (with probability of a head equal to P) is necessary to achieve R heads and tails. The probability of exactly R successes is: + R Pr( x = R, P) = P R R ( P) = N! R!( N R)! 95-4

5 Normal Distribution The normal distribution is formulated as follows: ( x µ ) Pr( x µ, σ ) = exp πσ σ When the mean is and the variance is, we have the standard normal distribution. The regular normal distribution uses the variable. The standard normal distribution uses the variable Z. Any normal distribution may be transformed to the standard normal distribution using the relationship: z x = µ σ dx Poisson Distribution The Poisson distribution is used to model the following situation. Suppose the average number of accidents at a given intersection is 3.5 per year. What is the probability of having accidents during the next half year? The probability of exactly occurrences with a mean occurrence rate of M is: M Pr( x = M ) = e M! Studentized Range Distribution The studentized range distribution is used whenever the distribution of the ratio of a range and an independent estimate of its standard error is needed. This distribution is used quite often in multiple comparison tests run after an analysis of variance. DF is the degrees of freedom of the estimated standard error (often the degrees of freedom of the MSE). K is the number of items (means) in the sample. The distribution function is given by: df / + df / df Pr(, ) = df s dfs r R df exp ( ) P Rs n dx df Γ P(Rs n) is the probability integral of the range. 95-5

6 Student s t Distribution The t distribution is used whenever the distribution of the ratio of a statistic and its standard error is needed. DF is the degrees of freedom of the estimated standard error. The noncentral-t distribution function is used in power calculations. We calculate the noncentral-t distribution using the following relationship between the t and the beta distribution function. df = df + T Pr( t T df, L) = e = L / ( L / ) df I (,,)! When the noncentrality parameter (NCP), L, is set to zero, the above formula reduces to the (central) Student s t distribution When the inverse distribution is selected, you supply the probability value and the program solves for T. When the regular distribution is selected, you supply T and the program solves for the cumulative (left-tail) probability. Weibull Distribution The Weibull distribution is formulated as follows: ( ) Pr( t T λ, γ ) = exp ( λt ) γ When gamma (γ) equal to one, the distribution simplifies to the exponential distribution. When the inverse distribution is selected, you supply the probability value and the program solves for T. When the regular distribution is selected, you supply T and the program solves for the cumulative (left-tail) probability. 95-6

Chapter 2, part 2 Petter Mostad mostad@chalmers.se Parametrical families of probability distributions How can we solve the problem of learning about the population distribution from the sample? Usual procedure:

Part : Chapter 7 Statistics A Homework 8 Solutions Ryan Rosario. A player throws a fair die and simultaneously flips a fair coin. If the coin lands heads, then she wins twice, and if tails, the one-half

Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

Notes on Continuous Random Variables Continuous random variables are random quantities that are measured on a continuous scale. They can usually take on any value over some interval, which distinguishes

Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous

Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

Chapter 855 Introduction Linear regression is a commonly used procedure in statistical analysis. One of the main objectives in linear regression analysis is to test hypotheses about the slope (sometimes

4 4. Distribution (DIST) There is a variety of different types of distribution, but the most well-known is normal distribution, which is essential for performing statistical calculations. Normal distribution

Chapter 400 Introduction Canonical correlation analysis is the study of the linear relations between two sets of variables. It is the multivariate extension of correlation analysis. Although we will present

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence

Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

Multivariate hypothesis tests for fixed effects Testing homogeneity of level-1 variances In the following sections, we use the model displayed in the figure below to illustrate the hypothesis tests. Partial

ST 371 (VIII): Theory of Joint Distributions So far we have focused on probability distributions for single random variables. However, we are often interested in probability statements concerning two or

Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

Binomial and Poisson Random Variables Solutions STAT-UB.0103 Statistics for Business Control and Regression Models Binomial random variables 1. A certain coin has a 5% of landing heads, and a 75% chance

Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.

3. The Multivariate Normal Distribution 3.1 Introduction A generalization of the familiar bell shaped normal density to several dimensions plays a fundamental role in multivariate analysis While real data

Chapter 250 Introduction The Chi-square test is often used to test whether sets of frequencies or proportions follow certain patterns. The two most common instances are tests of goodness of fit using multinomial

Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics

Chapter 560 Factorial Analysis of Variance Introduction A common task in research is to compare the average response across levels of one or more factor variables. Examples of factor variables are income

1 WHERE DOES THE 10% CONDITION COME FROM? The text has mentioned The 10% Condition (at least) twice so far: p. 407 Bernoulli trials must be independent. If that assumption is violated, it is still okay

Introduction to R I. Using R for Statistical Tables and Plotting Distributions The R suite of programs provides a simple way for statistical tables of just about any probability distribution of interest

ECE32 Spring 26 HW7 Solutions March, 26 Solutions to HW7 Note: Most of these solutions were generated by R. D. Yates and D. J. Goodman, the authors of our textbook. I have added comments in italics where

PRACTICE EXAMINATION NUMBER 6. An insurance company eamines its pool of auto insurance customers and gathers the following information: i) All customers insure at least one car. ii) 64 of the customers

Chapter 4 Hypothesis Testing in Linear Regression Models 41 Introduction As we saw in Chapter 3, the vector of OLS parameter estimates ˆβ is a random vector Since it would be an astonishing coincidence

SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

DISCRETE RANDOM VARIABLES DISCRETE RANDOM VARIABLES Documents prepared for use in course B01.1305, New York University, Stern School of Business Definitions page 3 Discrete random variables are introduced

Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics

Continuous Random Variables The probability that a continuous random variable, X, has a value between a and b is computed by integrating its probability density function (p.d.f.) over the interval [a,b]:

Sampling methods by Agner Fog This document is published at www.agner.org/random, Feb. 008, as part of a software package. Introduction A C++ class library of non-uniform random number generators is available

Notes on the Negative Binomial Distribution John D. Cook October 28, 2009 Abstract These notes give several properties of the negative binomial distribution. 1. Parameterizations 2. The connection between

Gaussian Conjugate Prior Cheat Sheet Tom SF Haines 1 Purpose This document contains notes on how to handle the multivariate Gaussian 1 in a Bayesian setting. It focuses on the conjugate prior, its Bayesian

4.6 I company that manufactures and bottles of apple juice uses a machine that automatically fills 6 ounce bottles. There is some variation, however, in the amounts of liquid dispensed into the bottles

Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

Lecture #10 Chapter 10 Correlation and Regression The main focus of this chapter is to form inferences based on sample data that come in pairs. Given such paired sample data, we want to determine whether

The Binomial Probability Distribution MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2015 Objectives After this lesson we will be able to: determine whether a probability

Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the

Chapter 2 Fundamentals of Probability and Statistics for Reliability Analysis Assessment of the reliability of a hydrosystems infrastructural system or its components involves the use of probability and

Handout 4: Binomial Distribution Reading Assignment: Chapter 5 In the previous handout, we looked at continuous random variables and calculating probabilities and percentiles for those type of variables.