$go to Math$

AP Statistics

AP Stats Extravaganza

Studied by 0 people

0.0(0)

get a hint

hint

statistics

1 / 152

Tags and Description

Statistics

AP Statistics

Literally every term that I had in my notes for AP Stats. Doesn't have formulas, just terms and definitions.

153 Terms

statistics

science of data, information gathered into useful figures

New cards

individuals

one singular object described by a set of data

New cards

variables

any characteristics measured or collected in a data set

New cards

categorical variable

places an individual into a category or group

New cards

quantitative variable

measures a specific numerical value that can be used for analysis

New cards

marginal distributions

the frequency distributions of values of cat. variables among all individuals in a two-way table

New cards

conditional distributions

gives the frequency distribution of a specified group

New cards

association

relationship between two variables, value of one variable occurs in combination with values from another variable

New cards

SOCS

shape, outliers, center, spread

New cards

shape

graph's shape, whether it's symmetric or skewed left/right

New cards

outliers

any unusual data that doesn't fit, Q1-1.5IQR and Q3 +1.5IQR

New cards

center

described by mean or median, use median unless the data is symmetric

New cards

spread

variability in data, range & standard deviation

New cards

unimodal

one mode

New cards

bimodal

two modes

New cards

multimodal

multiple modes

New cards

uniform

no distinct mode

New cards

resistance

how much a measure is influenced by extreme values, median is resistant, mean, s.d. and range are not

New cards

five-number summary

minimum, first quartile, median, third quartile, maximum

New cards

standard deviation

average distance of a value from the mean

New cards

variance (r^2)

average squared distance from the mean

New cards

skew

data skewed to the right have the "tail" of the data on the right and vice versa

New cards

z-score

how many standard deviations you are away from the mean

New cards

percentiles

the nth percentile is the lowest score that is greater than a certain percentage

New cards

adding/subtracting numbers from data

changes the mean but not the standard deviation or range

New cards

multiplying/dividing data by numbers

changes range, mean, median, standard deviation

New cards

measures of center

median, mean, quartiles, percentiles

New cards

measures of spread

standard deviation, range

New cards

density curve

a curve on or above the horizontal axis, total area underneath is equal to one, 100% of observations

New cards

normal distribution

shown by a normal density curve with the mean, median, and mode at the center of the curve

New cards

68-95-99.7 rule/empirical rule

68% of values within one standard deviation of the mean, 95% within two, 99.7% within three

New cards

way to describe normal distribution

N(u, o) a.k.a. (mean, standard deviation)

New cards

standard normal probabilities table

a table of areas under the standard normal curve, table entry for each value of z is the area under the curve to the left of the z-score

New cards

central limit theorem

when n is large, the sampling distribution of the sample mean is approximately normal, n is greater than or equal to 30

New cards

bivariate data

quantitative data that has two variables, often represented w/ a scatterplot

New cards

explanatory variables

independent variable, used to explain or to predict changes in values of another variable

New cards

response variable

dependent variable, measures the outcome in response to the explanatory variable

New cards

correlation

if a graph has a negative/positive "slope", ranges from -1 to 1, r-value

New cards

DOFS

direction, outliers, form, strength - all things that should be addressed when describing the relationship between two quantitative variables on a scatterplot

New cards

regression line

a line describing how the response variable changes as the explanatory variable changes

New cards

least-squares regression

the line that makes the sum of the squared residuals as small as possible

New cards

extrapolation

a pitfall of statistical prediction, use of a regression line to predict values outside the data interval

New cards

residual

the difference between the observed value of the response variable and the value predicted by the regression, outliers have a large residual

New cards

influential point

any point that, if removed, changes the relationship/regression significantly, a regression will change up or down if you remove/replace influential points

New cards

LSRL

least squares regression line

New cards

residual plot

a graph showing the residuals on the vertical axis and the explanatory variable on the horizontal axis, you want it to be random to show a linear relationship

New cards

coefficient of determination

r^2, tells what percent of the variation in data values is explained by the regression line

New cards

linear transformations

preserve linear relationship, e.g. addition, subtraction, multiplication, division

New cards

non-linear transformations

don't preserve linear relationship, e.g. roots, exponents, logarithms

New cards

goal of transformations

to increase linear relationship

New cards

procedure for transformations

conduct standard regression
construct residual plot and transform it if the plot has a pattern
evaluate r^2
choose transformation method
transform one or both variables
conduct another regression analysis
find the new r^2 and it should be higher than the original r^2

New cards

population

the entire collection of objects or individuals about which information is desired

New cards

sample

a subset of the population being studied

New cards

census

a survey that collects information from every member of a population

New cards

observational study

observe and measure variables

New cards

experiment

manipulate variables & see results

New cards

inferential statistics

statistical data from a sample that are used to draw conclusions about the entire population

New cards

things well-designed surveys do

define the population, tell what the researcher wants to measure, show how data members of a sample set are chosen, represent the population accurately in the sample, are free from bias

New cards

sampling errors

biased design, convenience sample, voluntary response, undercoverage, nonresponse, response bias

New cards

bias

over/under estimating the desired response in a survey consistently, similar to accuracy

New cards

convenience

choosing only individuals for a survey who are easy to access

New cards

voluntary response

when sample members are allowed to volunteer

New cards

undercoverage

when members of the population are represented inadequately in a sample

New cards

nonresponse

when the individual selected for the sample is left out or refuses to participate

New cards

representative sampling

a group or set chosen to replicate characteristics of a larger population

New cards

random sample

a group or set chosen in a random manner that allows for each member of the population to have an equal chance at being selected

New cards

SRS

simple random sample, every individual has the same chance of being selected and every possible sample has the same chance of being selected

New cards

stratified random sampling

divide population into subgroups called strata then conduct SRS from each subgroup

New cards

systematic sampling

form of SRS, first person is random & the rest are chosen systematically

New cards

cluster sampling

select clusters randomly from population, selecting groups randomly, not individuals from groups

New cards

experiments

observe responses to variables, administer a treatment to observe response, attempt to determine causation

New cards

observational studies

only observe responses, don't attempt to influence responses

New cards

key aspects of experimental design

replication, randomization, control

New cards

replication

assigning treatment to many experimental units to lower variability

New cards

randomization

using chance to assign groups, use a random number tables/number generators

New cards

control

control for confounding variables or outside influences

New cards

blinding

not telling subjects or researcher what treatments subjects are receiving to eliminate bias

New cards

sample survey

analyzes data from a subset of a population, can be costly & time-consuming

New cards

cross-sectional study

analyzes data from a certain point in time, focused type of sample survey

New cards

blocking

method of dividing subjects into subgroups called blocks so the variability in blocks is less than the variability between blocks, controls for variables you know (randomization controls for variables you don't know)

New cards

matched pairs design

experimental method where subjects are grouped into pairs based on a blocking variable, one is control and one is treatment

New cards

lurking variable

variables other than independent/dependent variables that may affect experimental outcomes

New cards

placebo effect

a subject's positive response to receiving a placebo when no treatment has actually been applied

New cards

confounding variables

variables that affect the response variable under consideration

New cards

causation

cause and effect relationship between or among variables, experiments can determine causation

New cards

ethical research

participants must give informed consent, data collected must be private if it's personal info, risks to participants must be minimized

New cards

simpson's paradox

a paradox in statistics in which a trend appears in different groups of data but is reversed when those groups are combined

New cards

probability

the likelihood that an event will occur, the mathematics of chance

New cards

law of large numbers

as we observe more and more repetitions of any chance process, the proportion of times a specific outcome occurs approaches a single value

New cards

empirical probability

probability in actual trials

New cards

theoretical probability

probability calculated with a formula, not based on observed sample

New cards

independent event

when the occurrence of one event does not change the probability that the other event will happen

New cards

probability rules

always between zero and 1
sum of possible outcomes of a trial must equal 1
complement rule: the probability of an event occurring is one minus the probability that it doesn't occur

New cards

event

any outcome or collection of outcomes that is a subset of the sample space

New cards

determining the number of outcomes in a sample space

raise the number of outcomes in one trail to the power of the total number of trials

New cards

simulation

a random process of numerous trails used to estimate probability and imitate chance behavior

New cards

steps to conduct a simulation

state the problem
state the assumptions
describe the process for one repetition, including possible outcomes, assigned representations, & measured variables
simulate many repetitions
state conclusions

New cards

complement

the probability that the event won't occur

New cards

disjoint/mutually exclusive

two events have no outcomes in common

New cards

100

conditional probability

the probability of an event given another event has occurred

New cards