COMM550/Midterm
From Driscollwiki
Contents |
Descriptive stats
Testing hypotheses
p-value: the probability that we will obtain the observed outcome when the null is assumed as true, which we cannot determine before the analysis
- Decrease by half for one-tailed test
alpha: criterion we set to decide whether or not to reject the null before the analysis.
- 0.5 is the conventional alpha
- Equal to the probability of Type I error
If the p-value obtained is smaller than the alpha, then we can reject the null hypothesis.
beta: is typically hard to calculate
- Set prior to the analysis
- Equal to the probability of a Type II error
power: is the probability that we correctly reject the false null hypothesis
- 1-beta
Increasing power
- Increase sample size
- Increase alpha level
- Use a one-tailed test (if appropriate)
- Select reliable measures for the study
- Develop a strong research design
- Use all information available to identify an appropriate population and recruit
participants for the study
Poong's fantastic example:
Here's an example for better understanding of "power" and "alpha." hopefully.
I got a result from a public opinion poll: 50% of people support Obama with 95% of confidence level and +/- 3% points. This means that the proportion of population who support Obama would be between 47% and 53% with a prob of 95%. and that the true proportion would be NOT within the interval with a prob of 5% (Type I error).
Because I don't want the type I error, I reset the alpha level as .0000...00000001%. Then, the result would be changed like this: 50% of people support Obama with 99.9999...9999999% of confidence level and +/- 49% points (hypothetically). This means that the true proportion of population who support Obama would be between 1% and 99% with a prob of 99.999...99%. This statement is, in fact, meaningless (or "powerless"), even though the chance to be wrong is almost zero. Power is the degree to which the stat analysis provides meaningful results.
t-test
- Most important to reproduce formula from Chap 7.1
Degrees of freedom (d.f.)
d.f. = (n1 + n2 − 2)
- Sum of samples in each group minus number of groups
Computing t-test
- t-test:
ANOVA
One-way ANOVA
- One IV
- Two (or more) groups/conditions
Degrees of freedom (d.f.)
dfb + dfw = dftotal
(k − 1) + (N − K) = (N − 1)
- k: number of groups, conditions
- N: total number of subjects under observation
F
Two-way ANOVA
Factorial ANOVA
- Two or more IVs
- Two or more groups, conditions, levels
Degrees of freedom (d.f.)
SSbetween = SSfirst IV + SSsecond IV + ... + SSlast IV
- SSfirst IV = SSmain effect of first IV = (a − 1)
- a: the number of levels in first IV
- SSsecond IV = SSmain effect of second IV = (b − 1)
- b: the number of levels in second IV
- SSinteraction effect between first and second IV = (a − 1)(b − 1)
- SSerror = SSwithin = N − ab
- SStotal = N − 1
F
- Compare against F based on your alpha (0.01 or 0.05), using dfamong,within
t
- Compare against t tabke, in order to find your p using df
Partitioning of Variance
(For both single- and multi-factor ANOVA)
SStotal = SSbetween/among + SSwithin
- SStotal: distances of the individual scores from the grand mean
- SSbetween/among: distances of the sub-group means from grand mean
- SSwithin: distances of the individual scores from their sub-group means
See pp368-372 of Hayes
F table
- Williams, Chp 9
- TODO locate calculations
The logic of statistics
- Error is always within
Why is stats a ratio?
Hypothesis testing
Working toward rejecting the null hypothesis
- Cannot prove your research hypothesis
Mantras
- Minimize error variance
- Control for extraneous variance
- Maximize systemic variance
Research & experimental design
Types of experiments
Types of design
Experimental design
X = IV (manipulated)
(X) = IV (not manipulated)
~X = manipulable, not manipulated
Y = DV
R = random
m = matched
Defective design
Two groups, no control
See: FBR Chp 19
Sampling
APA style (citations, bias, formatting)
Control for extraneous variance
- IV is constant
- Randomization [best!]
- Matching participants.
Note: Can also statistically control (like with ANCOVA) or work the IV into the design (as with a factorial design).
Threats to validity
- Internal - regarding conduct of the experiment (people, materials)
- History
- Instrumentation
- Maturation
- Measurement
- Statistical regression towards the mean
- Sample begins to lose its breadth
- Attrition
- Selection
- Interaction
- External (FBR p477) - generalizability
- Reactive or interaction effects in testing
- Selection bias, Interaction of selection and IV
- Effect of being observed, Reactive effects of experimental arrangements
- Effect of previous trials on later trials
- Construct
Partitioning of variance
Scales
Gutman scale
Each element in the scale builds on the previous
Would you,
- Permit Armenians to enter the country
- Grant Armenians residents citizenship
- Hire an Armenian to work in your home
- Marry an Armenian
Semantic scale
Pairs of opposites
- Good . _ . _ . _ . bad
- Strong . _ . _ . _ . weak
- Hot . _ . _ . _ . cold
Likert scale (TODO check spelling)
- Strongly disagree
- Disagree
- Neutral (Neither disagree nor agree)
- Agree
- Strongly Agree

