Shared Flashcard Set

Details

Title

statistics vocabulary

Description

Semester 1 statistics vocabulary

Total Cards

Subject

Mathematics

Level

Graduate

Created

11/12/2010

Click here to study/print these flashcards.

Create your own flash cards! Sign up here.

Additional Mathematics Flashcards

Cards Return to Set Details

Term

Categorical data

Definition

data that fits into a small number of discrete categories.

data is either non-ordered (nominal) such as gender or city, or ordered (ordinal) such as high, medium, or low temperature.

Term

Box-and-whisker plot

Definition

a diagram constructed from a set of numerical data showing a box that indicates the middle 50% of the marked observations together with lines, sometime called ‘whiskers’, that go out from the quartile to the most extreme data value in that direction which is not more than 1.5 times the Inter Quartile Range from the quartile.

Term

Class intervals

Definition

a subdivision within a range of values. In a histogram, the range of values is divided into sections, known as class intervals, also referred to as “bins.”

Term

Central limit theorem

Definition

it pertains to the convergence in distribution of (normalized) sums of random variables. The distribution of the mean of a sequence of random variables tends to a normal distribution as the number in the sequence increases indefinitely. The most general version of the C.L.T states: Let X1, X2 ,... be a sequence of independent, identically distributed random variables with mean μ and finite variance σ 2 . Then as n increases indefinitely, the distribution of n Z tends to the standard normal distribution.

Term

Confidence interval

Definition

an interval, calculated from a sample, which contains the value of a certain population parameter with a specified probability.

Term

Confidence level

Definition

the probability that the statistician's confidence interval contains the true, unknown population parameter.

Term

Correlation

Definition

is a measure of how closely related two variables are, or how linearly related they are.

It is the measure of the extent to which a change in one random variable tends to correspond to change in the other random variable.

Term

Correlation coefficients

Definition

a measure of how close two random variables are to being perfectly linearly related; computed by dividing the covariance of the random variables by the product of their standard deviation. The correlation coefficient denoted by ρ takes values between -1 and 1; -1 represents a perfect negative correlation while 1 represents a perfect positive correlation.

Term

Cumulative frequency

Definition

the sum of the frequencies of all the values up to a given value. If the values 1 2 , ,..., n x x x, in ascending order, occur with frequencies 1 2 , ,..., n f f f respectively, then the cumulative frequency at xi is f1 + f2 +... fi .

Term

Cumulative relative frequency

Definition

The cumulative frequency in a frequency distribution divided by the total number of data points.

Term

Data

Definition

the observations gathered from an experiment, survey or observational study.

Term

Dependent event

Definition

two events where the occurrence of either affects the probability of the occurrence of the other.

Term

Dispersion

Definition

a way of describing how scattered or spread out the observations in a sample are. Common measures are the range, inter quartile range, variance, and standard deviation.

Term

Distribution

Definition

the way in which the probability of it taking a certain value, or a value within a certain interval is described.

Term

Element

Definition

an object in a set.

Term

Event

Definition

a subset of the sample space.

Term

Experiment

Definition

processes in which there are an observable set of outcomes

Term

Frequency

Definition

the number of times that a particular value occurs as an observation.

Term

Frequency distribution

Definition

the information consisting of the possible values/groups and the corresponding frequencies

Term

Frequency table

Definition

a table giving the number of data points in a data set falling in each of a set of given intervals.

Term

Histogram

Definition

A bar graph presenting the frequencies of occurrence of data points.

Term

Independent event

Definition

two events where the outcome of one event has no effect on the outcome of the other.

Term

Inter-quartile range

Definition

the difference between the first quartile and third quartile of a set of data, (IQR).

Term

Linear regression

Definition

a method for finding an equation for the line that best fits the data set. The method is based on minimizing the sum of the squared vertical distances from the data points to the line of best fit.

Term

Mean

Definition

the average

Term

Measures of central tendency

Definition

measures of the location of the middle or the center of a distribution. The definition of "middle" or "center" is purposely left somewhat vague so that the term can refer to a wide variety of measures. The three most common measures are the mean, median, and mode.

Term

Median

Definition

suppose the observations in a set of numerical data are ranked in ascending order. It is the middle observation if there are an odd number of observations, and is the average of the two middle most observations if there are an even number of observations.

Term

Mode

Definition

the most frequently occurring value in a set of discrete data. There can be more than one if two or more values are equally common.

Term

Nominal scale

Definition

a data set is said to be this if the observations belonging to it can be assigned a code in the form of a number where the numbers are simply labels - cases are classified into categories. You can count but not order or measure this kind of data.

Term

Normal distribution

Definition

a continuous probability distribution (with parameters μ and σ ) whose probability density function f is given They are symmetric about their mean and have bell-shaped density curves.

Term

Ordinal Scale

Definition

a set of data is said to be this if the values / observations belonging to it can be ranked (put in order) or have a rating scale attached.

Term

Outliers of data

Definition

an observation that is deemed to be unusual and possibly erroneous because it does not follow the general pattern of the data in the sample.

Term

Percentile

Definition

is the value n /100 x such that n percent of the population is less than or equal to n /100 x . The 25th, 50th and 75th percentiles are called quantiles.

Term

Population

Definition

the entire set of items from which data can be selected.

Term

Population variable

Definition

collection of related behaviors of a group that are associated in a meaningful way.

Term

Quartile

Definition

for numerical data ranked in ascending order, these are values derived from the data which divide the data into four equal parts.

Term

Random sample

Definition

a set of data chosen from a population in such a way that each member of the population has an equal probability of being selected

Term

Range

Definition

the range of a sample (or a data set) is a measure of the spread or the dispersion of the observations. It is the difference between the largest and the smallest observed value

Term

Relative frequency

Definition

is another term for proportion; it is the value calculated by dividing the number of times an event occurs by the total number of times an experiment is carried out. The probability of an event can be thought of as its long-run relative frequency when the experiment is carried out many times.

Term

Sample

Definition

a subset of a population that is obtained through some process, possibly random selection or selection based on a certain set of criteria, for the purposes of investigating the properties of the underlying parent population

Term

Sample size

Definition

the number of items in a sample.

Term

variance

Definition

the average distance from the mean or

the average of the squared deviation scores

Term

Sampling

Definition

the process of selecting a proper subset of elements from the full population so that the subset can be used to make inference to the population as a whole.

Term

Single-variable data

Definition

data that uses only one unknown.

Term

Skewness

Definition

the degree of asymmetry of a distribution.

Term

Spread of data

Definition

the degree to which data are spread out around their center. Measures of spread include the mean deviation, variance, standard deviation, and interquartile range

Term

Standard deviation

Definition

is a measure of the spread or dispersion of a set of data. It is defined as the square root of the variance.

Term

Standard normal distribution

Definition

a normal distribution with parameters 0(mean) and 1(variance).

Term

Statistics

Definition

the branch of mathematics that deals with the collection, organization, and interpretation of data.

Term

Stem-and-leaf plot

Definition

a semi-graphical method used to represent numerical data, in which the first (leftmost) digit of each data value is a stem and the rest of the digits of the number are the leaves.

Term

Variable

Definition

a quantity that varies. usually represented by letters.

Term

Variance

Definition

a measure of the amount of spread in a set of data; the larger this is, the more scattered the observations on average.

Term

frequency distribution

Definition

an organized tabulation of the number of individuals located in each category on the scale of measurement. For a histogram, vertical bars are drawn above each score so that 1) the height of the bar corresponds to the frequency, & 2) The width of the bar extends to the real limits of the score. A histogram is used when the data are measured on an interval or a ratio scale.

Term

independent variable

Definition

the variable that is manipulated by the researcher.

Term

quasi-experimental method

Definition

examines differences between pre-existing groups of sugjects (for example, men vs. women) or differences between groups of scores obtained at different times (for example, before treatment vs. after treatment).

Term

standard score

Definition

a transformed score that provides information about its location in a distribution. A z-score is an example

Term

t statistic

Definition

used to test hypotheses about m when the value for s2 is not known.

Term

Variability

Definition

provides a quantitiative measure of the degree to which scores in a distribution are spread out or clustered together.

Term

z-score

Definition

specifies the precise location of each X value within a distribution. the distance from the mean by counting the number of standard deviations between X and m.

Term

matched pairs

Definition

type of study in which subjects who are similar in ways not under study may be grouped together and then compared with each other on the variables of interest

Term

convenience sampling

Definition

sampling design where individuals are chosen based on who is easily available

Term

simple random

Definition

sampling design in which each set of n elements in the population has an equal chance of selection

Term

statistically significant

Definition

when an observed difference is too large to believe that it is likely to have occurred naturally

Term

stratified random sampling

Definition

sampling design in which the population is divided into several strata (categories), and random samples are then drawn from each stratum

Term

systematic random sampling

Definition

sample drawn by select an individual from a list and then each of the next N individuals from the sampling frame - each nth person

Term

Discrete Data

Definition

Contains no values between the major divisions, like 30, 31, 32, etc. and not 30.1, 30.2, 30.3, etc. Test scores for a class would be Discrete Data.

Term

Kurtosis

Definition

The amount of peakedness of a distributtion.

Term

Leptokurtic

Definition

Distributions that are high and thin. Many middle scores.

Term

Negatively Skewed

Definition

Long tail goes to the left - many high scores

Term

Positively Skewed

Definition

Long tail goes to the right - many low scores

Term

Platykurtic

Definition

Distribution with few middle scores - flat and looks like a plate.

Term

Homoscedasticity

Definition

Assumption that means that the variance around the regression line is the same for all values of the predictor variable (X)

Term

Cluster Sampling

Definition

Use whole classes to test - in education different classes are taught using different methods

Term

Research Hypothesis

Alternative Hypothesis

Definition

states what the researcher expects to find

Term

Null Hypothesis

Definition

states that there is no expected effect on the DV (dependent variable) due to the IV (independent variable)

Term

Sampling Error

Definition

any deviation due only to the particular cases falling within the samples -

deviation from expectation that is due to mere chance

Term

Significant Difference

Definition

any difference that is greater than would be expected by mere chance

Term

Prototype for any inferential test of statistical significance

Definition

(What did you get - What did you expect)

Raw Score - Sample Mean

____________________________

Standardized random error

Standard Deviation

Term

Confidence

Definition

the probability that the effect we found is real and not due to mere sampling error

Term

Interval Scale

Definition

Distances between adjacent scores are equal and consistent throughout

Term

Ratio Scale

Definition

distance between the scores is equal throughout the distribution however there is an absolute zero

Flashcard Machine - create, study and share online flash cards

Shared Flashcard Set

Details

Additional Mathematics Flashcards

Cards Return to Set Details

My Flashcards

Flashcard Library

Browse

About

Help

Mobile