Vocab Unit 6

12 Terms

1

Subsets

Groups within a larger data set that differ in some specific way and so should often be considered separately when creating linear models

2

Extrapolation

Using a linear model to predict values beyond the range of x-values found in the data. Such predictions can be highly unreliable, because they assume that the linear relationship and all other conditions still hold outside that range

3

Influential Point

A point that results in a very different slope for a regression model if it is removed
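
One way to check whether a point is influential is to fit the line twice, with and without it, and compare the slopes. The sketch below does this with a hand-written least-squares fit; the data and the helper name `fit_line` are made up for illustration.

```python
from statistics import mean

def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line through the points."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

# Hypothetical data: five points on y = 2x plus one unusual point at (20, 5).
xs = [1, 2, 3, 4, 5, 20]
ys = [2, 4, 6, 8, 10, 5]

_, slope_with = fit_line(xs, ys)
_, slope_without = fit_line(xs[:-1], ys[:-1])  # drop the last point

print(f"slope with the point:    {slope_with:.3f}")
print(f"slope without the point: {slope_without:.3f}")
```

In this made-up data set the slope changes drastically when the last point is removed, so that point would count as influential.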

4

Outlier

A point whose y-value is far from its predicted value, resulting in a large residual. It may or may not be an influential point

5

Leverage

A data point whose x-value is far from the mean of x has high leverage; it is like an outlier in the x direction rather than the y direction. It may or may not be an influential point
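
The card does not give a formula, but for a one-predictor regression the standard leverage of point i is h_i = 1/n + (x_i - x̄)² / Σ(x_j - x̄)². The sketch below computes it for made-up x-values; the function name `leverages` is hypothetical.

```python
from statistics import mean

def leverages(xs):
    """Leverage of each point in a simple (one-predictor) linear regression."""
    n = len(xs)
    x_bar = mean(xs)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return [1 / n + (x - x_bar) ** 2 / sxx for x in xs]

# Hypothetical x-values: five clustered together and one far from the mean.
xs = [1, 2, 3, 4, 5, 20]
h = leverages(xs)

for x, h_i in zip(xs, h):
    print(f"x = {x:2d}  leverage = {h_i:.3f}")
```

Only the x-values enter the calculation, which matches the idea that leverage describes how unusual a point is in the x direction.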

6

Lurking Variable

A hidden variable that simultaneously affects both variables in an association, accounting for the correlation that may appear between the two

7

Residuals Plot

A scatterplot of the residuals versus the x-values of an association, with the horizontal axis marking a residual of 0. A residuals plot with no apparent pattern (a shapeless blob), with points scattered above and below the horizontal axis, suggests that the least-squares regression (LSR) line is an appropriate model
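
As a rough illustration of what a residuals plot reveals, the sketch below fits a line to deliberately curved, made-up data (y = x²) and lists the residuals (observed minus predicted). The helper name `residuals` is hypothetical.

```python
from statistics import mean

def residuals(xs, ys):
    """Residuals (observed y minus predicted y) from the least-squares line."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return [y - (intercept + slope * x) for x, y in zip(xs, ys)]

# Hypothetical curved data: y = x^2, which a straight line cannot describe well.
xs = [1, 2, 3, 4, 5, 6, 7]
ys = [x ** 2 for x in xs]

res = residuals(xs, ys)
for x, r in zip(xs, res):
    print(f"x = {x}  residual = {r:+.1f}")
```

Here the residuals are positive at both ends and negative in the middle, a U-shaped pattern rather than a shapeless blob, which signals that a linear model is not appropriate for this data.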

8

Re-Expression

Transforming a variable by applying the logarithm, square root, reciprocal, or some other mathematical operation to ALL of its values, to make the data more suitable for linear regression
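
A minimal sketch of re-expression, assuming made-up exponential data y = 3 · 2^x: taking the natural log of every y-value turns the curve into a straight line, since ln(y) = ln(3) + x · ln(2). The helper name `fit_line` is hypothetical.

```python
from math import log
from statistics import mean

def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line through the points."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

# Hypothetical exponential data: y = 3 * 2^x.
xs = [0, 1, 2, 3, 4, 5]
ys = [3 * 2 ** x for x in xs]

# Re-express by taking the log of ALL y-values, then fit a line to (x, ln y).
log_ys = [log(y) for y in ys]
intercept, slope = fit_line(xs, log_ys)

print(f"slope     = {slope:.4f}  (ln 2 = {log(2):.4f})")
print(f"intercept = {intercept:.4f}  (ln 3 = {log(3):.4f})")
```

The fitted slope and intercept on the log scale recover ln 2 and ln 3, so the re-expressed data is well described by a line.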

9

Nearly Normal Residuals Condition

To perform inference for regression, the residuals must be approximately Normally distributed [check that the normal probability plot of the residuals is roughly linear]

10

Straight Enough Condition

To perform inference for regression, the association (scatterplot) studied must be approximately linear [check that the residuals plot shows no curved pattern]

11

Equal Variance Condition

To perform inference for regression, the variability of y must be approximately constant for all values of x; check that the spread of the residuals in the residuals plot stays about the same across all x-values

12

Standard Error of the Slope

A measure of how much the regression slope varies from sample to sample (sampling variability), which is influenced by three factors: spread about the line (se), spread of the x-values (sx), and sample size (n)
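
The three factors combine in the standard formula SE(b1) = se / (sx · sqrt(n - 1)), where se = sqrt(SSE / (n - 2)) is the standard deviation of the residuals. The sketch below computes it by hand for made-up data; all variable names are illustrative.

```python
from math import sqrt
from statistics import mean, stdev

# Hypothetical data, roughly following y = 2x.
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
n = len(xs)

# Least-squares slope and intercept.
x_bar, y_bar = mean(xs), mean(ys)
sxx = sum((x - x_bar) ** 2 for x in xs)
slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sxx
intercept = y_bar - slope * x_bar

# Spread about the line: standard deviation of the residuals (n - 2 df).
sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
s_e = sqrt(sse / (n - 2))

# Spread of the x-values: sample standard deviation of x.
s_x = stdev(xs)

se_slope = s_e / (s_x * sqrt(n - 1))
print(f"slope = {slope:.3f}, SE(slope) = {se_slope:.4f}")
```

Reading the formula: more scatter about the line (larger se) increases the standard error, while a wider spread of x-values (larger sx) or a larger sample (larger n) decreases it.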
