Term
|
Definition
The set that consists of all the subjects of interest |
|
|
Term
|
Definition
|
|
Term
|
Definition
The researcher observes values of the response variable and explanatory variables for the sampled subjects, without anything being done to the subjects (such as imposing a treatment) |
|
|
Term
|
Definition
Type of observational study. A sample survey selects a sample of people from a population and interviews them to collect data. |
|
|
Term
|
Definition
Assigning subjects to certain experimental conditions and then observing outcomes on the response variable. The experimental conditions, which correspond to assigned values of the explanatory variable, are called treatments. |
|
|
Term
Is observational or experimental better? Why? |
|
Definition
An experiment reduces the potential for lurking variables to affect the result. Thus, an experiment gives the researcher more control over outside infuences.
Also, notice that we can not determine cause and effect merely from an observational study. Remember that association does not imply causation. Hence, cause and eect can only be established through rigorous experimentation.
Unfortunately, experiments are not always possible due to ethical reasons, time considerations and other factors. Therefore, it is important to do observational studies properly. |
|
|
Term
|
Definition
List of subjects in the population from which the sample is taken, ideally it lists the entire population of interest |
|
|
Term
|
Definition
The best way of obtaining a sample that is representative of the population |
|
|
Term
Simple random sampling (SRS) |
|
Definition
Each possible sample of that size has the same chance of being selected |
|
|
Term
|
Definition
Tells how well the sample estimate predicts the population percentage (1/sqrt n * 100%) |
|
|
Term
|
Definition
Type of sample survey that is easy to obtain. Some problems with convenience samples are: - Unlikely to be representative of the population. - Often severe biases result from such a sample. - Results apply ONLY to the observed subjects. |
|
|
Term
|
Definition
The subjects volunteer to be part of it. Volunteers do not tend to be representative of the entire population. Type of convenience sample |
|
|
Term
|
Definition
Tendency to systematically favor certain parts of the population over others. |
|
|
Term
|
Definition
Bias resulting from the sampling method such as using nonrandom samples or having under-coverage. |
|
|
Term
|
Definition
Occurs when some sampled subjects cannot be reached or refuse to participate or fail to answer some questions. |
|
|
Term
|
Definition
Occurs when the subject gives an incorrect response or the question is misleading. |
|
|
Term
|
Definition
The subjects of an experiment; the entities that we measure in an experiment. |
|
|
Term
|
Definition
A specific experimental condition imposed on the subjects of the study; the treatments correspond to assigned values of the explanatory variable. |
|
|
Term
|
Definition
Defines the groups to be compared with respect to values on the response variable. |
|
|
Term
|
Definition
The outcome measured on the subjects to reveal the effect of the treatment(s). |
|
|
Term
|
Definition
Allows the researcher to analyze the effectiveness of the primary treatment. |
|
|
Term
|
Definition
It helps to eliminate the effects of lurking variables and confounding, and also helps to reduce researcher bias. |
|
|
Term
|
Definition
Allows us to attribute observed effects to the treatments rather than ordinary variability. |
|
|
Term
Main components of a good experiment |
|
Definition
Randomization, replication, control/comparison group |
|
|
Term
|
Definition
|
|
Term
|
Definition
An improvement in health due not to any treatment but only to the patient's belief that he or she will improve. |
|
|
Term
3 things randomly assigning subjects does |
|
Definition
-Eliminate bias that may result from the researcher assigning the subjects. -Balance the groups on variables known to affect the response. -Balance the groups on lurking variables that may be unknown to the researcher. |
|
|
Term
3 things replication does |
|
Definition
-The difference due to ordinary variation is smaller with larger samples. -We have more confindence that the sample results reflect a true difference due to treatments when the sample size is large. -Since it is always possible that the observed effects were due to chance alone, replicating the experiment also builds confidence in our conclusions. |
|
|
Term
|
Definition
Neither the subjects nor the investigators working with them know which treatment each subject is receiving |
|
|
Term
Double-blind controls what type of bias? |
|
Definition
|
|
Term
3 types of observational study? |
|
Definition
Sample, retrospective, prospective |
|
|
Term
|
Definition
Attempts to take a cross section of a population at the current time |
|
|
Term
|
Definition
|
|
Term
|
Definition
Follows subjects into the future |
|
|
Term
3 forms of randomization sample surveys use |
|
Definition
Simple random sampling, clustered sampling, and stratified random sampling |
|
|
Term
|
Definition
A retrospective observational study in which subjects who have a response outcome of interest (the cases) and subjects who have the other response outcome (the controls) are matched and compared on an explanatory variable. |
|
|
Term
|
Definition
Categorical explanatory variables in an experiment |
|
|
Term
|
Definition
|
|
Term
|
Definition
Uses a single experiment to analyze the effects of two or more explanatory variables on the response. |
|
|
Term
|
Definition
The subjects receiving the two treatments are somehow matched |
|
|
Term
|
Definition
The same individual is used for the two treatments. |
|
|
Term
|
Definition
The generalization of the matched pairs design, extended to three or more treatments. Each set of matched experimental units is then called a block. |
|
|
Term
|
Definition
As the number of (independent) trials increase, the proportion of occurrences of any given outcome approaches a particular number in the "long run". |
|
|
Term
|
Definition
The proportion of times that the outcome would occur in a long run of observations. |
|
|
Term
|
Definition
Different trials of a random phenomenon are independent if the outcome of any one trial is not affected by the outcome of any other. |
|
|
Term
|
Definition
The long run proportion of times that the outcome occurs in a very large number of trials. Unfortunately, this is not always helpful or possible, because sometimes we can not observe "repetitions" of the experiment. |
|
|
Term
|
Definition
Your degree of belief that the outcome will occur based on the information available. |
|
|
Term
|
Definition
A branch of statistics that uses subjective probability as its foundation. |
|
|
Term
|
Definition
Are not in some other event |
|
|
Term
Intersection of two events |
|
Definition
Are in one event and in another event |
|
|
Term
|
Definition
Are in one event or in another event |
|
|
Term
|
Definition
If A and B do not have any common outcomes. That is, if their intersection is empty. |
|
|
Term
|
Definition
A and B, consists of the outcomes that are in A or B. Notice that this include the outcomes that are in both, A and B. We write A U B. |
|
|
Term
|
Definition
P(A or B) = P(A) + P(B) - P(A and B) |
|
|
Term
|
Definition
|
|
Term
Multiplication rule (two independent events intersecting) |
|
Definition
|
|
Term
|
Definition
The probability that one occurs is not affected by the occurrence of the other event. |
|
|
Term
|
Definition
A numerical measurement of the outcome of a random phenomenon |
|
|
Term
Probability distribution of a random variable |
|
Definition
Species its possible values and their respective probabilities |
|
|
Term
Probability distribution of a discrete random variable |
|
Definition
-For each x, the probability P(x) falls between 0 and 1. -The sum of the probabilities for all the possible values of x is equal to 1. |
|
|
Term
Probability density function |
|
Definition
-Each interval has probability between 0 and 1. The probability corresponds to the area under the curve, above the interval. -The interval containing all possible values has probability equal to 1, so the total area under the curve always equals 1. |
|
|
Term
|
Definition
Bell-shaped and characterized by its mean and standard deviation |
|
|
Term
Why normal distribution is important |
|
Definition
-Many distributions have an approximate normal distribution. -Approximates many discrete distributions well when there are a large number of possible outcomes. -Many statistical methods use it even when the data are not bell shaped |
|
|
Term
Z score for a value of x of a random variable |
|
Definition
The number of standard deviations that x falls from the mean. |
|
|
Term
Z score for a value of x of a random variable |
|
Definition
The number of standard deviations that x falls from the mean. |
|
|
Term
|
Definition
Has one of two possible outcome |
|
|
Term
Conditions for the Binomial Distribution |
|
Definition
-Each of n trials has two possible outcomes: The outcome of interest is called success and the other outcome is called failure. -Each trial has the same probability of observing a "success". This probability is denoted by p. -The n trials are independent. |
|
|
Term
When the number of trials n is large enough so that np and n(1 - p) are both at least 15, then the binomial distribution can be well approximated by a normal distribution with mean np and standard deviation sqrt (np(1-p)) |
|
Definition
|
|