Simple Statistics Question - Multiple Regression

usar_jag
12-04-2006, 09:57 AM
I have a friend who is taking a stats course in grad school. I took the same course moons ago, but I can't remember my statistics well enough to answer her question.

She is running a multiple regression on a survey of job satisfaction. The variables range from 1 to 5. In running the regression, her group got a constant of -0.25. She wants to know if that's possible, given that the variables are all positive numbers. I know it's possible, especially given that the constant is so close to zero, but I'm having a tough time with the reasoning.

Any quick thoughts?

Thanks!

Harriet the Spry
12-04-2006, 10:22 AM
Yes. The regression is determining the slope of the line and projecting where it would cross the y axis, that is, the predicted value when every predictor equals zero. Since that is only a projection, and zero apparently isn't a valid value for her variables, nothing stops the intercept from coming up less than zero. At least the way I'm being taught, you can't really do multiple regression (at least not without more know-how than I'm getting in this class) on a variable that is not continuous. So she may be violating an assumption by treating a 1-5 categorical variable as continuous.
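
A quick way to see the projection argument is to fit a line by hand. The sketch below uses made-up numbers (not the friend's survey data) with one predictor for simplicity, and the helper name fit_line is only illustrative. Every x and y lies between 1 and 5, yet the line extended back to x = 0 lands below zero.

```python
# Illustrative only: invented 1-5 ratings, one predictor, ordinary least squares.

def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line through the points."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    # The intercept is the fitted value at x = 0, which is outside the data range.
    intercept = y_mean - slope * x_mean
    return intercept, slope

xs = [1, 2, 3, 4, 5]   # predictor, all values in 1..5
ys = [1, 1, 3, 4, 5]   # outcome, all values in 1..5

intercept, slope = fit_line(xs, ys)
fitted = [intercept + slope * x for x in xs]

print(f"intercept = {intercept:.3f}, slope = {slope:.3f}")
print("fitted values at the observed x:", [round(v, 2) for v in fitted])
```

The fitted values at the observed x stay positive; only the extrapolation to x = 0, a point no respondent can occupy, dips below zero.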

Maeglin
12-04-2006, 10:46 AM
The short answer: yes, you can do multiple regression with ordinal variables, and yes, an intercept south of the origin is certainly possible. If this makes no intuitive sense to your friend, her stats package should let her suppress the intercept, but forcing the fit through the origin will generally change the slope estimates and the fit statistics, so it is not a purely cosmetic change.
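
What suppressing the intercept does to a fit can be checked directly. This is a minimal sketch on made-up 1-5 data with a single predictor (the variable names are assumptions, not anything from a particular stats package): it fits the same points once with a constant and once forced through the origin, then compares the slope and the sum of squared errors.

```python
# Illustrative only: invented 1-5 ratings, one predictor.
xs = [1, 2, 3, 4, 5]
ys = [1, 1, 3, 4, 5]
n = len(xs)

# Fit 1: least squares with a constant term.
x_mean = sum(xs) / n
y_mean = sum(ys) / n
slope_full = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
              / sum((x - x_mean) ** 2 for x in xs))
intercept_full = y_mean - slope_full * x_mean

# Fit 2: least squares with the constant suppressed (line forced through 0).
slope_origin = sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)

def sse(intercept, slope):
    """Sum of squared errors of the line on the sample data."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))

sse_full = sse(intercept_full, slope_full)
sse_origin = sse(0.0, slope_origin)

print(f"with constant:    slope = {slope_full:.4f}, SSE = {sse_full:.4f}")
print(f"without constant: slope = {slope_origin:.4f}, SSE = {sse_origin:.4f}")
```

On these numbers the slope moves and the squared error goes up once the constant is removed. The two fits coincide only when the estimated constant is exactly zero.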

ultrafilter
12-04-2006, 11:26 AM
Think of it this way: you're creating a model Y = b0X + b1, with the values of b0 and b1 chosen to minimize the sum of squared errors in predicting Y from X. In this particular case, b0X overshoots Y by a bit, so the fit adjusts downward with a negative value of b1.
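
The same idea carries over to several predictors. The sketch below is a bare-bones least-squares fit via the normal equations, written in plain Python so it runs anywhere; the helper names (solve, fit_ols) and the data are invented for illustration. The outcome values were deliberately constructed, without noise, as -0.25 + 0.5*x1 + 0.75*x2, so the fit should recover a constant of -0.25 even though every value in the data lies between 1 and 5.

```python
# Illustrative only: noise-free, made-up data with two predictors.

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        known = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - known) / m[r][r]
    return x

def fit_ols(rows, ys):
    """Return [constant, slope_1, slope_2, ...] minimizing squared error."""
    design = [[1.0] + list(row) for row in rows]  # leading 1 carries the constant
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * y for d, y in zip(design, ys)) for i in range(k)]
    return solve(xtx, xty)

# (x1, x2) pairs, all in 1..5
rows = [(1, 1), (2, 1), (1, 2), (3, 2), (2, 3), (4, 3), (3, 4), (5, 3), (4, 4)]
# Outcomes built as -0.25 + 0.5*x1 + 0.75*x2, all in 1..5
ys = [-0.25 + 0.5 * x1 + 0.75 * x2 for x1, x2 in rows]

constant, slope_1, slope_2 = fit_ols(rows, ys)
print(f"constant = {constant:.4f}, slopes = {slope_1:.4f}, {slope_2:.4f}")
print(f"outcome range: {min(ys)} to {max(ys)}")
```

Real survey data would include noise, and a stats package would use a more numerically careful method than the normal equations, but the constant plays the same role: it shifts the whole fitted surface up or down so the slopes can match the data.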

There are some issues related to doing regression on categorical data, but this isn't one of them.

nivlac
12-04-2006, 12:03 PM
Sure, it's possible, as others have already replied. What your friend should do is check the statistical significance of the constant (intercept) term by looking at its associated p-value. If that value is large, the constant is not significantly different from zero, and she could consider excluding the intercept from the model, keeping in mind that doing so changes the other coefficients. The software should have a simple option to do that.
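
For anyone curious what the software is doing behind that p-value, here is a rough sketch for the one-predictor case on made-up 1-5 data. It computes the t statistic for the constant and compares it with the two-sided 5% critical value for 3 degrees of freedom, which is hard-coded here (about 3.182) because the Python standard library has no t distribution. A real stats package reports the exact p-value instead.

```python
import math

# Illustrative only: invented 1-5 ratings, one predictor.
xs = [1, 2, 3, 4, 5]
ys = [1, 1, 3, 4, 5]
n = len(xs)

x_mean = sum(xs) / n
y_mean = sum(ys) / n
sxx = sum((x - x_mean) ** 2 for x in xs)
slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / sxx
intercept = y_mean - slope * x_mean

# Residual variance uses n - 2 degrees of freedom (two estimated parameters).
sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
resid_var = sse / (n - 2)

# Standard error of the constant, then its t statistic against zero.
se_intercept = math.sqrt(resid_var * (1 / n + x_mean ** 2 / sxx))
t_stat = intercept / se_intercept

T_CRIT_DF3 = 3.182  # two-sided 5% critical value of Student's t, 3 d.f.
significant = abs(t_stat) > T_CRIT_DF3

print(f"constant = {intercept:.3f}, std. error = {se_intercept:.3f}, t = {t_stat:.3f}")
print("significantly different from zero at 5%:", significant)
```

A t statistic that is small in magnitude goes with a large p-value, meaning the data cannot distinguish the constant from zero.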

usar_jag
12-04-2006, 10:06 PM
Thanks to all who replied. Muchas gracias!
