Statistics Flashcards UNIT 1

studied byStudied by 3 people
5.0(1)
get a hint
hint

Individual

1 / 66

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

67 Terms

1

Individual

the objects described by a set of data

New cards
2

variable

any characteristic of an individual (can take different values for different individuals)

New cards
3

categorical variable or qualitative data

places an individual into one of several groups or categories (example: male vs. female). This data is COUNTABLE

New cards
4

quantitative data

takes numerical values for which arithmetic operations make sense (measurable)

New cards
5

distribution of a variable

tells us what values the variable takes and how often it takes these values - pattern of variation (table, bar, graph, pie chart, etc.)

New cards
6

outlier

an extreme value that differs greatly from other values in a set of values

New cards
7
<p>dotplot</p>

dotplot

a graph of qualitative data; a quick way to visualize a set of data

New cards
8
<p>stemplot</p>

stemplot

the digit(s) in the greatest place value(s) of the data values are the stems. the digits in the next greatest are the leaves.

use key

4|2 = 42

New cards
9
<p>split stem plot</p>

split stem plot

eac stem is listed more than once

  • first stem number: 0-4’s

  • second stem number: 5-9’s

New cards
10

Shape

Outliers

Center

Spread

describing or interpreting quantative data distributions.

*All of these with context —use question to answer the question! (WRITE THEM ALL OUT)

  • example

    • S - skewed to the left; unimodal

    • O - maybe at 40

    • C - median 85

    • range 60

    • The distribution of quiz scores is unimodal and skewed to the left. There is a possible outlier of 40. The center of the distribution is the median at 85 and the range of the quiz scores is 60.

New cards
11
<p>shape</p>

shape

a visual description of what the distribution looks like

  • a distribution is symmetric if the right and left sides of the histogram are approximately mirror images

  • is it skewed?

  • clusters? (unimodal, bimodal, trimodal?)

New cards
12
<p>skewed to the right</p>

skewed to the right

if a distribution extends much further from the right side than the left

***The tail is to the right

New cards
13
<p>skewed to the left</p>

skewed to the left

if a distribution extends much further from the left side than the right

***The tail is to the left

New cards
14

unimodal

one cluster

New cards
15

bimodal

two clusters

New cards
16

multimodal

more than two clusters

New cards
17

“potential or possible outlier”

If looking at a histogram with no data, use words like

New cards
18

IQR Method

IQR = Q3-Q1

  • used to determine if an outlier exists

New cards
19

outlier < Q1-1.5(IQR)

outlier>Q3+1.5(IQR)

if you have a histogram with data

New cards
20

center

a value that divides the observations so that about half takes longer larger value and about half take smaller values

USE MEDIAN

New cards
21

mean

the arithmetic average of a data set

  • the sum of all the values divided by the number of values

New cards
22

median

***USE THIS FOR CENTER

the middle value of a data set; the equal areas part, where 50% of the data are at or below this value and 50% of the data are at or above this value

New cards
23

spread

describes the variability of the data

  • range (histogram with no data)

histogram with data

  • variance

  • standard deviation

  • IQR

New cards
24

range

(max-min)

***ONLY USED THIS FOR HISTOGRAM WITH NO DATA

New cards
25

frequency

the count of how often something occurs

New cards
26
<p>relative frequency</p>

relative frequency

percentage or proportion of the whole number of data

  • frequency/total number

New cards
27
<p>histogram</p>

histogram

breaks the range of values of a variable into intervals and displays only the count or percent of the observations that fall into each interval

  1. divide the data into classes (intervals) of equal width

    1. need to specify classes so that each individual falls into one class

    2. usually will need between 5 and 7 intervals

  2. each bar of the histogram can include only one of its endpoinrs

  3. intervals should NEVER overlap

  4. LABEL AND SCALE YOUR AXIS

  5. !!!!!!!title your graph!!!!!

class width = (max-min)/number of groups

New cards
28

making a histogram (calculator)

  1. enter the data into L1

  2. 2ND stat plot (above y=) → plot1 → ENTER

  3. Turn on → choose symmetric histogram picture → graph → zoom9

  4. window → fix xscale=_____

  5. Hit TRACE for interval

New cards
29
<p>heartbeat </p>

heartbeat

if not starting at zero when its on a graph

New cards
30

4 decimal places

round to

New cards
31

time plot

plot each observation against the time at which it was measured - time is always on the z-axis

New cards
32

Five number summary

STAT → CALC → 1 Var Stats

Min

Q1

Med

Q3

Max

New cards
33

IQR

Inner Quartile Range

Q3-Q1

New cards
34

checking for outliers

  1. find IQR (Q3-Q1)

  2. Q1-1.5(1=IQR); Q3+1.5(IQR) → [ __ , __ ]

  3. Any number outside this interval is an outlier

New cards
35

resistance

a statistic is resistant if adding an extreme value does NOT change the value of the statistic much

  • A mean is NOT resistant, a MEDIAN is

New cards
36

symetric distibution of a boxplot

DO NOT USE BOXPLOTS FOR A SHAPE, alway use histograms

  • Boxplots are good fo five number summary

<p>DO NOT USE BOXPLOTS FOR A SHAPE, alway use histograms</p><ul><li><p>Boxplots are good fo five number summary</p></li></ul>
New cards
37

negative distribution of a boxplot

DO NOT USE BOXPLOTS FOR A SHAPE, alway use histograms

  • Boxplots are good fo five number summary

<p>DO NOT USE BOXPLOTS FOR A SHAPE, alway use histograms</p><ul><li><p>Boxplots are good fo five number summary</p></li></ul>
New cards
38

positive distribution of a boxplot

DO NOT USE BOXPLOTS FOR A SHAPE, alway use histograms

  • Boxplots are good fo five number summary

<p>DO NOT USE BOXPLOTS FOR A SHAPE, alway use histograms</p><ul><li><p>Boxplots are good fo five number summary</p></li></ul>
New cards
39

The mean and median of a roughly symmetric distribuion are

close together

New cards
40

Don’t confuse the “average” value of a variable (the mean) with its

“typical” value, which we migh describe by the median

New cards
41

If he distribution is excatly symmetric

the mean and median are exactly the same

New cards
42

In a skewed distribution, the mean is

usually further out in the long tail than the median is

  • The mean is pulled toward the skew and outliers

New cards
43

modified boxplots

The whiskers only extend to adjacent values, not outliers

<p>The whiskers only extend to adjacent values, not outliers</p>
New cards
44

modified boxplot on calculator

  • Hitting TRACE will show all points

<ul><li><p>Hitting TRACE will show all points</p></li></ul>
New cards
45

Standard deviation

One of the most common measures of spread. It looks as how far each deviation is from the mean.

  • Measuring spread: STANDARD DEVIATION

New cards
46

deviation

=observation-mean

New cards
47

mean equation

add all the numbers and divide by the sum

New cards
48

how to find standard deviation

  1. Calculate the mean

    1. add all the numbers and divide by the sum

  2. Calculate each deviation

    1. deviation=observation-mean

  3. Square each deviation

  4. Find the “average” squared deviation

    1. Calculate the sum of the squared deviations divided by the degrees of freedom (n-1) - This is called the VARIANCE

  5. Calculate the square root of the variance, this is the STANDARD DEVIATION

New cards
49

variance

The average square distance

  • (sum of square deviations)/(n-1)

New cards
50

Sx

measures the spread about the mean

is the is sample standard deviation

NOT RESISTANT

  • ALWAYS greater than or equal to 0

  • Sx=0 ONLY when there is no variability

New cards
51

Sx is always greater >than or equal =/≥ to

0

New cards
52

Sx is ONLY equal to 0 when

there is no variability

New cards
53

Sx is more meaningful with data that

has a symmetric shape

New cards
54

Is Sx resistant?

No, Sx is not resistant. It is even less resistant than mean to extreme outliers.

New cards
55

divide by n if you are looking for a

parameter (population SD)

New cards
56

divide by n-1

if you are looking for a statistic (sample SD)

New cards
57

measures of center and spread

  • mean and standard deviation

or

  • median and IQR

New cards
58

median and IQR

are usually better than the mean and standard deviation for describing a SKEWED DISTRIBUTION or a distribution with outliers

New cards
59

use mean and standard deviation only for

symmetric distibutions that don’t have outliers

  • Note: numerical

New cards
60

mean > median

the distribution is skewed to the right

New cards
61

measuring position: percentiles

knowing the mean or median is helpful, but sometimes you want to know where something falls with respect to everything else…

  • The pth pecentile of a distribution is the value with p percent of observations at or below it

  • [(number of values below x)/(number of total values)] x 100

  • If it says “justify your answer”, use 1.5IQR method

New cards
62

equation for p percentile

[(number of values below x)/(number of total values)] x 100

New cards
63

Cumulative Relative Frequency Graphs (Ogaves)

A graph that displays the cumulative relative frequency of each class of a frequency distribution

  • (cumulative frequency)/sample size

New cards
64

Measuring posistion: z-scores

Tells use how many standard deviations from the mean an observation falls and in what direction

  • If x is an observation from a distribution that has a known mean and standard deviation, the standardized value of x is:

    • z=(x-mean)/SD

New cards
65

we can use z-scores to compare the posistion of

individuals in different distributions

New cards
66

z-score facts

  • not measured in the same units as the original data

  • it is the number of standard deviations away from the mean

  • positive z-scores are above the mean

  • negative z-scores are below the mean

New cards
67
New cards

Explore top notes

note Note
studied byStudied by 18 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 4 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 16 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 31 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 11 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 9 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 20082 people
Updated ... ago
4.7 Stars(231)

Explore top flashcards

flashcards Flashcard86 terms
studied byStudied by 13 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard108 terms
studied byStudied by 56 people
Updated ... ago
5.0 Stars(2)
flashcards Flashcard42 terms
studied byStudied by 9 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard69 terms
studied byStudied by 17 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard41 terms
studied byStudied by 1 person
Updated ... ago
5.0 Stars(1)
flashcards Flashcard92 terms
studied byStudied by 31 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard69 terms
studied byStudied by 6 people
Updated ... ago
4.0 Stars(1)
flashcards Flashcard76 terms
studied byStudied by 316 people
Updated ... ago
5.0 Stars(7)