Term
Statistics for ___
Ex. Studying Highway Safety |
|
Definition
|
|
Term
Statistics for ___
Ex. Weather Forecasts |
|
Definition
|
|
Term
Statistics in ___
Ex. Economics, Politics, Sports |
|
Definition
|
|
Term
A branch of mathematics that transforms numbers into useful information for decision makers. |
|
Definition
|
|
Term
Measures used to describe the population are called ____ |
|
Definition
Parameters |
|
|
Term
Measures computed from sample data are called ____ |
|
Definition
Statistics |
|
|
Term
A __ consists of all the items or individuals about which you want to draw a conclusion. |
|
Definition
Population |
|
|
Term
A __ is the portion of a population selected for analysis. |
|
Definition
Sample |
|
|
Term
A statistic is a numerical measure that describes a characteristic of a __. |
|
Definition
Sample |
|
|
Term
A parameter is a numerical measure that describes a characteristic of a __. |
|
Definition
Population |
|
|
Term
____ - The methods that help collect, summarize, present, and analyze data (tables & charts) |
|
Definition
|
|
Term
___ - The methods that use data collected from a small group to draw conclusions about a larger group. |
|
Definition
|
|
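A minimal Python sketch of the idea behind the cards above: a measure computed on the whole population is a parameter, and the same measure computed on a sample is a statistic used to draw conclusions about the larger group. The commute-time data, the seed, and the sample size of 50 are hypothetical values chosen only for illustration.

```python
import random
import statistics

# Hypothetical population: commute times (minutes) for all 1,000 employees.
rng = random.Random(42)  # fixed seed so the sketch is repeatable
population = [rng.randint(10, 60) for _ in range(1000)]

# Parameter: a measure computed from the whole population.
population_mean = statistics.mean(population)

# Statistic: the same measure computed from a portion of the population.
sample = rng.sample(population, 50)
sample_mean = statistics.mean(sample)

print(f"parameter (population mean): {population_mean:.2f}")
print(f"statistic (sample mean):     {sample_mean:.2f}")
```

The two printed values will usually be close but not identical, which is why conclusions drawn from a sample carry some uncertainty.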
Term
Four Important Uses of Statistics in Business
1. ________
2. ________
3. ________
4. ________ |
|
Definition
1. To visualize and summarize business data
2. To draw conclusions from business data
3. To make reliable forecasts about business activities
4. To improve business processes |
|
|
Term
__ - A characteristic of an item or individual. |
|
Definition
|
|
Term
__ - the different values associated with a variable. |
|
Definition
|
|
Term
Data values are meaningless unless their variables have ___, universally accepted meanings that are clear to all associated with an analysis. |
|
Definition
|
|
Term
__ - a statistical package designed to perform advanced statistical analyses |
|
Definition
|
|
Term
___ - a multi-functional data analysis tool that can perform many functions, but none as well as programs that are dedicated to a single function. |
|
Definition
|
|
Term
Both SPSS and Microsoft Excel use ___ to store data. |
|
Definition
|
|
Term
DCOVA:
D: ___
C: ___
O: ___
V: ___
A: ___ |
|
Definition
Define
Collect
Organize
Visualize
Analyze |
|
|
Term
In DCOVA, you ___ the variables for which you want to reach conclusions. |
|
Definition
Define |
|
|
Term
In DCOVA, you ___ the data from appropriate sources. |
|
Definition
Collect |
|
|
Term
In DCOVA, you ___ the data collected by developing tables. |
|
Definition
Organize |
|
|
Term
In DCOVA, you ___ the data by developing charts. |
|
Definition
Visualize |
|
|
Term
In DCOVA, you ___ the data by examining the appropriate tables and charts to reach conclusions. |
|
Definition
Analyze |
|
|
Term
__ (__) variables have values that represent qualities or attributes. |
|
Definition
Categorical (qualitative) |
|
|
Term
__ (__) variables have values that represent quantities. |
|
Definition
Numerical (quantitative) |
|
|
Term
__ variables can only be placed into categories such as "yes" or "no". |
|
Definition
Nominal
Ex. Marital Status, Gender, Yes/No |
|
|
Term
__ variables indicate a meaningful ordering of categories.
Ex. XS, S, M, L, XL
A, A-, B, B-, C
Fr, So, Jr, Sr |
|
Definition
Ordinal |
|
|
Term
__ variables arise from a counting process. |
|
Definition
Discrete
Ex. Number of Books, Number of Text Messages |
|
|
Term
__ variables arise from a measuring process. |
|
Definition
Continuous
Ex. Waiting Time, Weight |
|
|
Term
__ sources - The data is collected by the researcher directly from his/her own observations and experiences. |
|
Definition
Primary
Ex. Data from a political survey, data collected from an experiment, observed data |
|
|
Term
__ sources - The data is originally collected by someone other than the researcher. |
|
Definition
Secondary
Ex. Analyzing census data, Examining data from print journals or data published on the internet |
|
|
Term
Sources of data fall into four categories:
1. ___
2. ___
3. ___
4. ___ |
|
Definition
1. Distributed by an organization or individual
2. A designed experiment
3. A survey
4. An observational study |
|
|
Term
____ - a table indicating the frequency, amount, or percentage of items in a set of categories so you can see differences between categories. |
|
Definition
|
|
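One possible way to build such a table in Python, sketched with the standard library. The payment-method data is hypothetical; each row holds the category, its frequency, and its percentage of all items.

```python
from collections import Counter

# Hypothetical categorical data: preferred payment method of 10 customers.
payments = ["card", "cash", "card", "mobile", "card",
            "cash", "card", "mobile", "card", "cash"]

counts = Counter(payments)
total = len(payments)

# Each row: category, frequency, percentage of all items.
summary_table = [
    (category, freq, 100 * freq / total)
    for category, freq in counts.most_common()
]

for category, freq, pct in summary_table:
    print(f"{category:<8}{freq:>4}{pct:>8.1f}%")
```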
Term
___ - a sequence of data, in rank order, from the smallest value to the largest value. It shows the range and may help identify outliers. |
|
Definition
Ordered Array |
|
|
Term
____ - a summary table in which the data are arranged into numerically ordered classes. |
|
Definition
Frequency Distribution |
|
|
Term
Why use a frequency distribution?
1. It __ the raw data into a _____
2. It __ for a _____ of the data
3. It ___ the ___ of the ______ of the data set including where the data are ______ |
|
Definition
1. It condenses the raw data into a more useful form
2. It allows for a quick visual representation of the data
3. It enables the determination of the major characteristics of the data set including where the data are concentrated/clustered. |
|
|
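A short Python sketch of condensing raw numerical data into a frequency distribution. The meal costs, the class width of 10, and the starting boundary of 20 are assumptions chosen for illustration; a different width or starting boundary would give a different picture of the same data.

```python
# Hypothetical numerical data: 12 restaurant meal costs in dollars.
costs = [23, 31, 38, 42, 45, 47, 51, 53, 58, 62, 67, 74]

class_width = 10
lower_bound = 20   # lower boundary of the first class
num_classes = 6    # classes: 20 to <30, 30 to <40, ..., 70 to <80

# Count how many values fall into each numerically ordered class.
frequencies = [0] * num_classes
for value in costs:
    index = (value - lower_bound) // class_width
    frequencies[index] += 1

for i, freq in enumerate(frequencies):
    low = lower_bound + i * class_width
    print(f"{low} but less than {low + class_width}: {freq}")
```

The counts show at a glance that the data are concentrated in the 40s and 50s.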
Term
Different _____ may provide different pictures for the same data. |
|
Definition
|
|
Term
___ in __ __ may show up when different class boundaries are chosen. |
|
Definition
Shifts in Data Concentration |
|
|
Term
As the size of the data set ___, the impact of alterations in the selection of class boundaries is greatly reduced. |
|
Definition
|
|
Term
When comparing two or more groups with different sample sizes, you must use either a __ __ or a __ __. |
|
Definition
Relative Frequency or a Percentage Distribution |
|
|
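To see why percentages are needed when sample sizes differ, here is a small Python sketch. The two groups and their class frequencies are hypothetical, and `percentage_distribution` is an illustrative helper name, not a library function.

```python
def percentage_distribution(frequencies):
    """Convert class frequencies to percentages of the group total."""
    total = sum(frequencies)
    return [100 * f / total for f in frequencies]

# Hypothetical class frequencies for two groups of different sizes.
group_a = [5, 10, 25, 10]     # n = 50
group_b = [20, 40, 100, 40]   # n = 200

pct_a = percentage_distribution(group_a)
pct_b = percentage_distribution(group_b)

print(pct_a)
print(pct_b)
```

The raw frequencies look very different, yet both groups have the same percentage distribution.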
Term
Visualizing Categorical Data: Charts
__ Chart - directly compare data
__ Chart - how data fits into the whole
__ Chart - used when most of the data occurs in just a few categories
__ Chart - used when comparing two different categorical variables |
|
Definition
Bar Chart
Pie Chart
Pareto Chart
Side-by-Side Chart |
|
|
Term
___ - a chart where each bar represents a category, the length of which indicates the amount, frequency, or percentage of values falling into a category coming from the summary table of the variable. |
|
Definition
Bar Chart |
|
|
Term
___ - A circle broken up into slices that represent categories. The size of each slice varies according to the percentage in each category. |
|
Definition
Pie Chart |
|
|
Term
__ - a vertical bar chart used to portray categorical data where categories are shown in descending order of frequency, and a cumulative polygon is shown in the same graph. Used to separate the "vital few" from the "trivial many" |
|
Definition
Pareto Chart |
|
|
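The numbers behind such a chart can be sketched in Python as follows: sort the categories in descending order of frequency, then accumulate the percentages for the cumulative polygon. The defect counts are hypothetical, and plotting is left out to keep the sketch free of third-party libraries.

```python
from itertools import accumulate

# Hypothetical defect counts by cause.
defects = {"scratches": 12, "dents": 45, "misalignment": 8,
           "paint": 30, "other": 5}

# Order categories from most to least frequent.
ordered = sorted(defects.items(), key=lambda item: item[1], reverse=True)
total = sum(defects.values())

# Running total of frequencies, expressed as a percentage of the whole.
cumulative_pct = [100 * running / total
                  for running in accumulate(freq for _, freq in ordered)]

for (cause, freq), cum in zip(ordered, cumulative_pct):
    print(f"{cause:<14}{freq:>4}{cum:>8.1f}%")
```

In this made-up data the first two causes account for 75% of all defects, which is the "vital few" pattern the chart is meant to expose.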
Term
___ - organizes data into groups (called __) so that the values within each group (called __) branch out to the right on each row. |
|
Definition
Stem-and-Leaf Display
Stems
Leaves |
|
|
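A minimal Python sketch of the display, assuming two-digit values so that the tens digit serves as the stem and the ones digit as the leaf. The ages are hypothetical.

```python
from collections import defaultdict

# Hypothetical two-digit data: ages of 12 survey respondents.
ages = [21, 24, 24, 26, 27, 30, 32, 38, 41, 41, 45, 53]

stem_and_leaf = defaultdict(list)
for value in sorted(ages):
    stem, leaf = divmod(value, 10)  # tens digit is the stem, ones digit the leaf
    stem_and_leaf[stem].append(leaf)

# Print one row per stem with its leaves branching out to the right.
for stem in sorted(stem_and_leaf):
    leaves = " ".join(str(leaf) for leaf in stem_and_leaf[stem])
    print(f"{stem} | {leaves}")
```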
Term
__ - A vertical bar chart of the data in a frequency distribution in which there are no gaps between adjacent bars. |
|
Definition
Histogram |
|
|
Term
In a histogram, the ___ (or ___) are shown on the horizontal axis. The vertical axis is either __, __, or __. |
|
Definition
Class Boundaries (or Class Midpoints)
Frequency, Relative Frequency, or Percentage |
|
|
Term
___ - formed by having a midpoint of each class represent the data in that class, and then connecting the sequence of midpoints at their respective class percentages. |
|
Definition
|
|
Term
The _____, or __, displays the variable of interest along the X axis, and the cumulative percentages along the Y axis. |
|
Definition
Cumulative Percentage Polygon (or Ogive) |
|
|
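The points plotted on such a polygon can be computed as in this Python sketch. The class boundaries and frequencies are hypothetical; each point pairs an upper class boundary (X axis) with the percentage of values below it (Y axis).

```python
from itertools import accumulate

# Hypothetical frequency distribution: upper class boundaries and frequencies.
upper_boundaries = [30, 40, 50, 60, 70]
frequencies = [2, 4, 8, 4, 2]

total = sum(frequencies)

# Cumulative percentage of values falling below each upper boundary.
cumulative_pct = [100 * running / total for running in accumulate(frequencies)]

# (x, y) points that would be connected to draw the polygon.
ogive_points = list(zip(upper_boundaries, cumulative_pct))

for boundary, pct in ogive_points:
    print(f"less than {boundary}: {pct:.0f}%")
```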
Term
___ - used to study patterns in the values of a numeric variable over time. |
|
Definition
|
|
Term
Principles of Excellent Graphs
1. The graph should not __ the data.
2. The graph should not contain unnecessary ___
3. The scale on the vertical axis should ___
4. All axes should be ___
5. The graph should contain a __
6. The ___ should be used for a given set of data. |
|
Definition
1. distort
2. adornments
3. begin at zero
4. properly labeled
5. title
6. simplest possible graph |
|
|
Term
___ - the extent to which all the data values group around a typical or central value. |
|
Definition
|
|
Term
__ - the amount of dispersion or scattering of values |
|
Definition
|
|
Term
__ - the pattern of the distribution of values from the lowest value to the highest value |
|
Definition
|
|
Term
__ - the most common measure of central tendency, is the sum of all values divided by the number of values. It is affected by extreme values (outliers) |
|
Definition
|
|
Term
In an ordered array, the __ is the "middle" number. It is not affected by extreme values (outliers) |
|
Definition
|
|
Term
__ - the value that occurs most often. It is not affected by extreme values (outliers). Used for either numerical or categorical (nominal) data. |
|
Definition
|
|
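The three measures of central tendency described above can be compared with Python's standard `statistics` module. The data values are hypothetical; the second list replaces the largest value with an extreme one to show which measures move.

```python
import statistics

# Hypothetical data set and a copy with one extreme value (outlier).
values = [4, 5, 5, 6, 7, 8, 14]
with_outlier = [4, 5, 5, 6, 7, 8, 140]

for label, data in (("original", values), ("with outlier", with_outlier)):
    mean = statistics.mean(data)      # sum of values / number of values
    median = statistics.median(data)  # middle value of the ordered array
    mode = statistics.mode(data)      # value that occurs most often
    print(f"{label}: mean={mean}, median={median}, mode={mode}")
```

Only the mean changes (from 7 to 25) when the outlier is introduced; the median stays at 6 and the mode at 5.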
Term
___ - the simplest measure of variation. Equal to the difference between the largest and smallest values. |
|
Definition
|
|
Term
__ - the average (approximately) of squared deviations of values from the mean. |
|
Definition
|
|
Term
____ - most commonly used measure of variation. Shows variation about the mean, is the square root of the variance, and has the same units as the original data. |
|
Definition
Sample Standard Deviation |
|
|
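A short Python sketch of the range, sample variance, and sample standard deviation. The data is hypothetical. The sample variance divides by n - 1 rather than n, which is why the card above calls it only approximately an average of the squared deviations.

```python
import math
import statistics

# Hypothetical sample of five values.
values = [1, 3, 5, 7, 9]
n = len(values)
mean = sum(values) / n

# Range: difference between the largest and smallest values.
data_range = max(values) - min(values)

# Sample variance: sum of squared deviations from the mean, divided by n - 1.
sample_variance = sum((x - mean) ** 2 for x in values) / (n - 1)

# Sample standard deviation: square root of the variance, in the original units.
sample_std = math.sqrt(sample_variance)

print(data_range, sample_variance, round(sample_std, 3))

# The standard library gives the same results.
assert math.isclose(sample_variance, statistics.variance(values))
assert math.isclose(sample_std, statistics.stdev(values))
```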
Term
To compute the ___ of a data value, subtract the mean and divide by the standard deviation. |
|
Definition
Z-Score |
|
|
Term
A data value is considered an extreme ___ if its z-score is less than -3.0 or greater than +3.0 (far away from the mean) |
|
Definition
Outlier |
|
|
Term
__ - the number of standard deviations a data value is from the mean. |
|
Definition
Z-Score |
|
|
Term
The larger the absolute value of the Z-Score, the farther away the data value is from the __. |
|
Definition
Mean |
|
|
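The Z-score cards above can be tried out with a few lines of Python. The data is hypothetical, the sample standard deviation is assumed, and `z_scores` is an illustrative helper name rather than a library function.

```python
import statistics

def z_scores(values):
    """Return (value - mean) / standard deviation for each value."""
    mean = statistics.mean(values)
    std = statistics.stdev(values)  # sample standard deviation (assumed here)
    return [(x - mean) / std for x in values]

# Hypothetical data: 19 values near 50 and one value far from the rest.
values = [48, 49, 50, 50, 51, 52, 49, 50, 51, 50,
          48, 52, 50, 49, 51, 50, 50, 49, 51, 95]

scores = z_scores(values)

# Flag values whose Z-score is less than -3.0 or greater than +3.0.
outliers = [x for x, z in zip(values, scores) if abs(z) > 3.0]
print(outliers)
```

Note that in a very small data set no value can reach a Z-score beyond 3 in absolute value, so this rule is only informative when there are enough observations.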
Term
___ - measures the amount of asymmetry in a distribution. |
|
Definition
|
|
Term
___ - Measures the spread of the middle 50% of the data.
= Q3 - Q1
(Also called the ___)
It is not influenced by outliers or extreme values. |
|
Definition
IQR (Interquartile Range)
Midspread |
|
|
Term
Measures like the median, Q1, Q3, and IQR that are not influenced by outliers are called ____ |
|
Definition
|
|
Term
The five numbers that make up the five-number summary |
Definition
1. Xsmallest
2. First Quartile (Q1)
3. Median (Q2)
4. Third Quartile (Q3)
5. Xlargest |
|
|
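A sketch of the five-number summary and the interquartile range in Python. The data is hypothetical. Quartile conventions differ between textbooks and software; this sketch assumes the "exclusive" method of `statistics.quantiles`, which places the quartiles at ranked positions (n + 1)/4, (n + 1)/2, and 3(n + 1)/4 and interpolates when a position is not a whole number.

```python
import statistics

# Hypothetical data: 11 values, so the quartile positions are 3, 6, and 9.
values = sorted([59, 18, 85, 34, 68, 27, 78, 52, 61, 82, 54])

q1, median, q3 = statistics.quantiles(values, n=4, method="exclusive")

five_number_summary = (min(values), q1, median, q3, max(values))
iqr = q3 - q1  # spread of the middle 50% of the data

print(five_number_summary)
print(iqr)
```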
Term
___ - A graphical display of the data based on the five-number summary. |
|
Definition
|
|
Term
___ - the sum of the values in the population divided by the population size, N |
|
Definition
|
|
Term
___ - Average of the squared deviations of values from the mean. |
|
Definition
|
|
Term
__ - The most commonly used measure of variation. Shows variation about the mean, and is the square root of the population variance. Has the same units as the original data. |
|
Definition
Population Standard Deviation
σ |
|
|
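For comparison with the sample measures, the population versions divide by the population size N instead of n - 1. A minimal Python sketch, treating a hypothetical list of eight values as the entire population:

```python
import math
import statistics

# Hypothetical population of N = 8 values.
population = [2, 4, 4, 4, 5, 5, 7, 9]
N = len(population)

# Population mean: sum of the values divided by N.
mu = sum(population) / N

# Population variance: average of the squared deviations from the mean.
population_variance = sum((x - mu) ** 2 for x in population) / N

# Population standard deviation (sigma): square root of the variance.
sigma = math.sqrt(population_variance)

print(mu, population_variance, sigma)

# The standard library's population functions agree with the manual result.
assert math.isclose(population_variance, statistics.pvariance(population))
assert math.isclose(sigma, statistics.pstdev(population))
```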
Term
When comparing two or more large batches of numerical data, the distributions being developed should use the same ________. |
|
Definition
|
|
Term
The point halfway between the boundaries of each class interval in a grouped frequency distribution is called the ________. |
|
Definition
Class Midpoint |
|
|
Term
In order to compare one large batch of numerical data to another, a ________ distribution must be developed from the frequency distribution. |
|
Definition
relative frequency or percentage |
|
|