[QUOTE=PReed]
Thank you all so much again! I knew I could count on you guys! There is noway I could have figured this out without your help, especially in such a short time!
I came across this wonderful thing called SPSS.
This is what I understand so far:
In SPSS, run a Kruskal-Wallis test first to check if the “category of poster” affects “# of replies”.
Analyze -> Nonparametric Tests -> K Independent Sample
Use “# of replies” as Test Variable; use “category of poster” as Grouping Variable.
Output: Asymp. Sig: .032 -> so “category of poster” statistically affects “# of replies”
Is this right so far?
Could I then use Mann-Whitney:
Nonparametric Tests -> 2 Independent Samples
for each pair of “category of poster” to find out if one category has higher # of replies to another?
So, a Mann-Whitney for between Category 1&2, 1&3, 1&4, 1&5, 2&3, 2&4, 2&5, 3&4, 3&5, 4&5?
I do need other levels of parsing (yes, gender of posters happens to one of them!).
If I want to find out if a combination of characteristics would result in more replies, do I use:
Descriptive Statistics -> Crosstabs -> then use Chi-Square for this?
Is there any reason that I have to transform the data into normal distribution instead of just using nonparametric tests?
[/QUOTE]
You don’t want to pick and choose which test to use like this. In theory, you should design an experiment with the test statistic clearly in mind.
I’ve never done a chi-square in spss. I usually just wimp out and use the free contingency tables that are available on-line. So you’d set up the data like this:
CAT________1___2__3
Male________xx xx xx
Female______xx xx xx
where xx equals total number of replies (not average).
You could also consider a two-way Kruskal Wallis (don’t know if spss has this function). But since you have vastly different sample sizes per category (9 versus 214), you probably shouldn’t be doing a KW at all. If I were your fried, I wouldn’t use category 5. I would either lump it in with a similar category, or I would drop it completely. Such a low sample size (relative to the others) is going to skew your results. And likewise, category 4 is too large to fairly compare to the others. You shouldn’t have one sample that’s less than half the size of the others.
Yes, you can use the Mann-Whitney as a post-hoc test after a KW test.
For a chi-square, you could reduce the table to a 2X2 (by leaving out categories) until you find the differences. But what most people do is just eyeball the table and guess at the ones that look different. Not as rigorous, but it generally works.
As the study design stands now, I’m leaning more towards the chi-square than the Kruskal Wallis.
I wouldn’t bother with transformation. Nonparametric tests are just as good as parametric tests. In fact, depending on the nature of the data, they can be more powerful. People tend to stay away from them out of fear of the unknown (kind of like how we treat lesser known name brands at the grocery store), but this an irrational fear.