e Rating Service Gender Raise 8 23 1.000 23 32 90 9 1 5.8 10 22 0.956 23 30 80 7 1 4.7 11 23 1.000 23 41 100 19 1 4.8 14 24 1.043 23 32 90 12 1 6 15 24 1.043 23 32 80 8 1 4.9 23 23 1.000 23 36 65 6 1 3.3 26 24 1.043 23 22 95 2 1 6.2 31 24 1.043 23 29 60 4 1 3.9 35 24 1.043 23 23 90 4 1 5.3 36 23 1.000 23 27 75 3 1 4.3 37 22 0.956 23 22 95 2 1 6.2 42 24 1.043 23 32 100 8 1 5.7 3 34 1.096 31 30 75 5 1 3.6 18 36 1.161 31 31 80 11 1 5.6 20 34 1.096 31 44 70 16 1 4.8 39 35 1.129 31 27 90 6 1 5.5 7 41 1.025 40 32 100 8 1 5.7 13 42 1.050 40 30 100 2 1 4.7 22 57 1.187 48 48 65 6 1 3.8 24 50 1.041 48 30 75 9 1 3.8 45 55 1.145 48 36 95 8 1 5.2 17 69 1.210 57 27 55 3 1 3 48 65 1.140 57 34 90 11 1 5.3 28 75 1.119 67 44 95 9 1 4.4 43 77 1.149 67 42 95 20 1 5.5 19 24 1.043 23 32 85 1 0 4.6 25 24 1.043 23 41 70 4 0 4 40 25 1.086 23 24 90 2 0 6.3 2 27 0.870 31 52 80 7 0 3.9 32 28 0.903 31 25 95 4 0 5.6 34 28 0.903 31 26 80 2 0 4.9 16 47 1.175 40 44 90 4 0 5.7 27 40 1.000 40 35 80 7 0 3.9 41 43 1.075 40 25 80 5 0 4.3 5 47 0.979 48 36 90 16 0 5.7 30 49 1.020 48 45 90 18 0 4.3 1 58 1.017 57 34 85 8 0 5.7 4 66 1.157 57 42 100 16 0 5.5 12 60 1.052 57 52 95 22 0 4.5 33 64 1.122 57 35 90 9 0 5.5 38 56 0.982 57 45 95 11 0 4.5 44 60 1.052 57 45 90 16 0 5.2 46 65 1.140 57 39 75 20 0 3.9 47 62 1.087 57 37 95 5 0 5.5 49 60 1.052 57 41 95 21 0 6.6 50 66 1.157 57 38 80 12 0 4.6 6 76 1.134 67 36 70 12 0 4.5 9 77 1.149 67 49 100 10 0 4 21 76 1.134 67 43 95 13 0 6.3 29 72 1.074 67 52 95 5 0 5.4 Degree Gender1 Grade 0 F A The ongoing question that the weekly assignments will focus on is: Are males and females paid the same for equal work (under the Equal Pay Act)? 0 F A Note: to simplfy the analysis, we will assume that jobs within each grade comprise equal work. 0 F A 0 F A The column labels in the table mean: 0 F A ID Employee sample number Salary Salary in thousands 1 F A Age Age in years Performance Rating Appraisal rating (Employee evaluation score) 1 F A Service Years of service (rounded) Gender: 0 = male, 1 = female 0 F A Midpoint salary grade midpoint Raise percent of last raise 1 F A Grade job/pay grade Degree (0= BS\BA 1 = MS) 1 F A Gender1 (Male or Female) Compa - salary divided by midpoint 1 F A 0 F A 0 F B 1 F B 1 F B 1 F B 0 F C 1 F C 0 F D 1 F D 0 F D 0 F E 1 F E 1 F F 1 F F 1 M A 0 M A 0 M A 0 M B 0 M B 1 M B 0 M C 1 M C 0 M C 1 M D 0 M D 0 M E 1 M E 0 M E 1 M E 0 M E 1 M E 1 M E 1 M E 0 M E 0 M E 1 M F 1 M F 1 M F 0 M F 10 The ongoing question that the weekly assignments will focus on is: Are males and females paid the same for equal work (under the Equal Pay Act)? Note: to simplfy the analysis, we will assume that jobs within each grade comprise equal work. Salary Salary in thousands Performance Rating Appraisal rating (Employee evaluation score) Gender: 0 = male, 1 = female Raise percent of last raise Degree (0= BS\BA 1 = MS) Compa - salary divided by midpoint Week 1. Measurement and Description - chapters 1 and 2 1 Measurement issues. Data, even numerically coded variables, can be one of 4 levels - nominal, ordinal, interval, or ratio. It is important to identify which level a variable is, as this impact the kind of analysis we can do with the data. For example, descriptive statistics such as means can only be done on interval or ratio level data. Please list under each label, the variables in our data set that belong in each group. Nominal Ordinal Interval Ratio Gender ID Degree Salary Gender1 Compa Grade Mid point Performance Servics raise b. For each variable that you did not call ratio, why did you make that decision? Ratio scales are the ultimatenirvanawhen it comes to measurement scales because they tell us about the order, they tell us the exact value between units No one variable is ratio because no variable values tells about the order among them so they are ratio variables. 2 The first step in analyzing data sets is to find some summary descriptive statistics for key variables. For salary, compa, age, performance rating, and service; find the mean, standard deviation, and range for 3 groups: overall sample, Females, and Males. You can use either the Data Analysis Descriptive Statistics tool or the Fx =average and =stdev functions. (the range must be found using the difference between the =max and =min functions with Fx) functions. Note: Place data to the right, if you use Descriptive statistics, place that to the right as well. Salary Compa Age Perf. Rat. Service Overall Mean 45.0 1.0625 35.7 85.9 9.0 Standard Deviation 19.20 0.08 8.25 11.41 5.72 Range 55 0.34 30 45 21 Female Mean 38.0 1.0687 32.5 84.2 7.9 Standard Deviation 18.29 0.07 6.88 13.59 4.91 Range 55 0.254 26 45 18 Male Mean 52.0 1.0562 38.9 87.6 10.0 Standard Deviation 17.78 0.08 8.39 8.67 6.36 Range 53 0.305 28 30 21 3 What is the probability for a: a. Randomly selected person being a male in grade E? b. Randomly selected male being in grade E? Note part b is the same as given a male, what is probabilty of being in grade E? c. Why are the results different? 4 For each group (overall, females, and males) find: a. The value that cuts off the top 1/3 salary in each group. b. The z score for each value: c. The normal curve probability of exceeding this score: d. What is the empirical probability of being at or exceeding this salary value? e. The value that cuts off the top 1/3 compa in each group. f. The z score for each value: g. The normal curve probability of exceeding this score: h. What is the empirical probability of being at or exceeding this compa value? i. How do you interpret the relationship between the data sets? What do they mean about our equal pay for equal work question? Answer: we will find the correlation matrix to find the relationship among the variables. Equal pay for equal work means the correlation of salaries with the remaining variable in the data set is high, actually thy are dependent to each other. 5. What conclusions can you make about the issue of male and female pay equality? Are all of the results consistent? What is the difference between the sal and compa measures of pay? The salary male and females are not equal Yes, all of the result is consistent The means of salaries and Compa are not equal. Conclusions from looking at salary results: looking at the salaries the male and femaly payments are not equal Conclusions from looking at compa results: Looking at the Compa result the payments are not equal Do both salary measures show the same results? Yes, in both the case we see that the the payments are not equal for the male and female. Can we make any conclusions about equal pay for equal work yet? No, because in both the case we see that male and females payments according to salary and compa are not equal therefore we canot say that equal pay for equal work Ratio scales are the ultimatenirvanawhen it comes to measurement scales because they tell us about the order, they tell us the exact value between units No one variable is ratio because no variable values tells about the order among them so they are ratio variables. The first step in analyzing data sets is to find some summary descriptive statistics for key variables. For salary, compa, age, performance rating, and service; find the mean, standard deviation, and range for 3 groups: overall sample, Females, and Males. You can use either the Data Analysis Descriptive Statistics tool or the Fx =average and =stdev functions. (the range must be found using the difference between the =max and =min functions with Fx) functions. Probability 0.4 0.83 The results are different because population and samples are different for both the cases. In the first case male is the population and we are choosing those males who got grade E In the second case among the grade E we choose thos emales who are male. Overall Female Male 41 24 40 -0.208 -1.094 -0.260 0.583 0.863 0.603 0.583 0.778 0.750 1.025 1.043 1.075 -0.488 -0.366 0.224 0.687 0.643 0.411 0.687 0.643 0.411 How do you interpret the relationship between the data sets? What do they mean about our equal pay for equal work question? Equal pay for equal work means the correlation of salaries with the remaining variable in the data set is high, actually thy are dependent to each other. What conclusions can you make about the issue of male and female pay equality? Are all of the results consistent? No, because in both the case we see that male and females payments according to salary and compa are not equal therefore we canot say that equal pay for equal work The results are different because population and samples are different for both the cases. In the first case male is the population and we are choosing those males who got grade E Week 2 Testing means In questions 2 and 3, be sure to include the null and alternate hypotheses you will be testing. In the first 3 questions use alpha = 0.05 in making your decisions on rejecting or not rejecting the null hypothesis. 1 Below are 2 one-sample t-tests comparing male and female average salaries to the overall sample mean. (Note: a one-sample t-test in Excel can be performed by selecting the 2-sample unequal variance t-test and making the second variable = Ho value -- see column S) Based on our sample, how do you interpret the results and what do these results suggest about the population means for male and female average salaries? Males Females Ho: Mean salary = 45 Ho: Mean salary = 45 Ha: Mean salary =/= 45 Ha: Mean salary =/= 45 Note: While the results both below are actually from Excel's t-Test: Two-Sample Assuming Unequal Variances, having no variance in the Ho variable makes the calculations default to the one-sample t-test outcome - we are tricking Excel into doing a one sample test for us. Male Ho Female Mean 52 45 Mean 38 Variance 316 0 Variance 334.667 Observations 25 25 Observations 25 Hypothesized Mean Difference 0 Hypothesized Mean Difference 0 df 24 df 24 t Stat 1.96890383 t Stat -1.9132 P(T<=t) one-tail 0.03030785 P(T<=t) one-tail 0.03386 t Critical one-tail 1.71088208 t Critical one-tail 1.71088 P(T<=t) two-tail 0.0606157 P(T<=t) two-tail 0.06772 t Critical two-tail 2.06389856 t Critical two-tail 2.0639 Is this a 1 or 2 tail test? Is this a 1 or 2 tail test? - why? - why? P-value is: P-value is: Is P-value > 0.05? Is P-value > 0.05? Why do we not reject Ho? Why do we not reject Ho? Interpretation: 2 Based on our sample data set, perform a 2-sample t-test to see if the population male and female average salaries could be equal to each other. (Since we have not yet covered testing for variance equality, assume the data sets have statistically equal variances.) Conclusion: Do not reject Ho; mean equals 45 Conclusion: Do not reject Ho; mean equals 45 Ho: Ha: Test to use: Place B43 in Outcome range box.
P-value is: Is P-value < 0.05? Reject or do not reject Ho: Meaning of effect size measure: Interpretation: b. Since the one and two tail t-test results provided different outcomes, which is the proper/correct apporach to comparing salary equality? Why? 3 Based on our sample data set, can the male and female compas in the population be equal to each other? (Another 2-sample t-test.) Ho: Ha: Statistical test to use: Place B75 in Outcome range box. If the null hypothesis was rejected, what is the effect size value: What is the p-value: Is P-value < 0.05? Reject or do not reject Ho: Meaning of effect size measure: Interpretation: 4 Since performance is often a factor in pay levels, is the average Performance Rating the same for both genders? Ho: Ha: Test to use: Place B106 in Outcome range box. If the null hypothesis was rejected, what is the effect size value: What is the p-value: Is P-value < 0.05? Do we REJ or Not reject the null? Meaning of effect size measure: Interpretation: 5 If the salary and compa mean tests in questions 2 and 3 provide different results about male and female salary equality, which would be more appropriate to use in answering the question about salary equity? Why? What are your conclusions about equal pay at this point? If the null hypothesis was rejected, what is the effect size In questions 2 and 3, be sure to include the null and alternate hypotheses you will be testing. In the first 3 questions use alpha = 0.05 in making your decisions on rejecting or not rejecting the null hypothesis. Below are 2 one-sample t-tests comparing male and female average salaries to the overall sample mean. (Note: a one-sample t-test in Excel can be performed by selecting the 2-sample unequal variance t-test and making the second variable = Ho value -- see column S) Based on our sample, how do you interpret the results and what do these results suggest about the population means for male and female average salaries? Note: While the results both below are actually from Excel's t-Test: Two-Sample Assuming Unequal Variances, having no variance in the Ho variable makes the calculations default to the one-sample t-test outcome - we are tricking Excel into doing a one sample test for us. Ho 45 0 25 Based on our sample data set, perform a 2-sample t-test to see if the population male and female average salaries could be equal to each other. (Since we have not yet covered testing for variance equality, assume the data sets have statistically equal variances.) Conclusion: Do not reject Ho; mean equals 45 Since the one and two tail t-test results provided different outcomes, which is the proper/correct apporach to comparing salary equality? Why? Based on our sample data set, can the male and female compas in the population be equal to each other? (Another 2-sample t-test.) Since performance is often a factor in pay levels, is the average Performance Rating the same for both genders? If the salary and compa mean tests in questions 2 and 3 provide different results about male and female salary equality, which would be more appropriate to use in answering the question about salary equity? Why? Q3 Ho Female Male Female 45 34 1.017 1.096 45 41 0.870 1.025 45 23 1.157 1.000 45 22 0.979 0.956 45 23 1.134 1.000 45 42 1.149 1.050 45 24 1.052 1.043 45 24 1.175 1.043 45 69 1.043 1.210 45 36 1.134 1.161 45 34 1.043 1.096 45 57 1.000 1.187 45 23 1.074 1.000 45 50 1.020 1.041 45 24 0.903 1.043 45 75 1.122 1.119 45 24 0.903 1.043 45 24 0.982 1.043 45 23 1.086 1.000 45 22 1.075 0.956 45 35 1.052 1.129 45 24 1.140 1.043 45 77 1.087 1.149 45 55 1.052 1.145 45 65 1.157 1.140 Week 3 At this point we know the following about male and female salaries. a. Male and female overall average salaries are not equal in the population. b. Male and female overall average compas are equal in the population, but males are a bit more spread out. c. The male and female salary range are almost the same, as is their age and service. d. Average performance ratings per gender are equal. Let's look at some other factors that might influence pay - education(degree) and performance ratings. 1 Last week, we found that average performance ratings do not differ between males and females in the population. Now we need to see if they differ among the grades. Is the average performace rating the same for all grades? (Assume variances are equal across the grades for this ANOVA.) Null Hypothesis: Alt. Hypothesis: Place B17 in Outcome range box. Interpretation: What is the p-value: Is P-value < 0.05? Do we REJ or Not reject the null? Meaning of effect size measure: What does that decision mean in terms of our equal pay question: If the null hypothesis was rejected, what is the effect size value (eta squared): 2 While it appears that average salaries per each grade differ, we need to test this assumption. Is the average salary the same for each of the grade levels? (Assume equal variance, and use the analysis toolpak function ANOVA.) Use the input table to the right to list salaries under each grade level. Null Hypothesis: Alt. Hypothesis: Place B55 in Outcome range box. What is the p-value: Is P-value < 0.05? Do you reject or not reject the null hypothesis: Meaning of effect size measure: Interpretation: 3 The table and analysis below demonstrate a 2-way ANOVA with replication. Please interpret the results. BA MA Ho: Average compas by gender are equal If the null hypothesis was rejected, what is the effect size value (eta squared): Male 1.017 1.157 Ha: Average compas by gender are not equal 0.870 0.979 Ho: Average compas are equal for each degree 1.052 1.134 Ho: Average compas are not equal for each degree 1.175 1.149 Ho: Interaction is not significant 1.043 1.043 Ha: Interaction is significant 1.074 1.134 1.020 1.000 Perform analysis: 0.903 1.122 0.982 0.903 Anova: Two-Factor With Replication 1.086 1.052 1.075 1.140 SUMMARY BA MA 1.052 1.087 Male Female 1.096 1.050 Count 12 12 1.025 1.161 Sum 12.349 12.9 1.000 1.096 Average 1.02908333 1.075 0.956 1.000 Variance 0.00668645 0.00652 1.000 1.041 1.043 1.043 Female 1.043 1.119 Count 12 12 1.210 1.043 Sum 12.791 12.787 1.187 1.000 Average 1.06591667 1.065583 1.043 0.956 Variance 0.00610245 0.004213 1.043 1.129 1.145 1.149 Total Count 24 24 Sum 25.14 25.687 Average 1.0475 1.070292 Variance 0.00647035 0.005156 ANOVA Source of Variation SS df Sample 0.00225502 1 Columns 0.00623352 1 Interaction 0.00641719 1 Within 0.25873675 44 Total 0.27364248 47 Interpretation: Ha: Average compas by gender are not equal What is the p-value: Is P-value < 0.05? For Ho: Average compas by gender are equal Do you reject or not reject the null hypothesis: Meaning of effect size measure: Ha: Average salaries are not equal for all grades What is the p-value: Is P-value < 0.05? Do you reject or not reject the null hypothesis: Meaning of effect size measure: For: Ho: Interaction is not significant Ha: Interaction is significant What is the p-value: Do you reject or not reject the null hypothesis: Meaning of effect size measure: What do these decisions mean in terms of our equal pay question: 4 Many companies consider the grade midpoint to be the "market rate" - what is needed to hire a new employee. Does the company, on average, pay its existing employees at or above the market rate? Null Hypothesis: Alt. Hypothesis: Statistical test to use: Place the cursor in B160 for correl. If the null hypothesis was rejected, what is the effect size value (eta squared): If the null hypothesis was rejected, what is the effect size value (eta squared): For Ho: Average salaries are equal for all grades If the null hypothesis was rejected, what is the effect size value (eta squared): What is the p-value: Is P-value < 0.05? Do we REJ or Not reject the null? Meaning of effect size measure: NA Interpretation: 5. Using the results up thru this week, what are your conclusions about gender equal pay for equal work at this point? If the null hypothesis was rejected, what is the effect size value: Since the effect size was not discussed in this chapter, we do not have a formula for it - it differs from the non-paired t. Male and female overall average compas are equal in the population, but males are a bit more spread out. Last week, we found that average performance ratings do not differ between males and females in the population. Now we need to see if they differ among the grades. Is the average performace rating the same for all grades? A B C D E F What does that decision mean in terms of our equal pay question: While it appears that average salaries per each grade differ, we need to test this assumption. Is the average salary the same for each of the grade levels? (Assume equal variance, and use the analysis toolpak function ANOVA.) A B C D E F Do you reject or not reject the null hypothesis: The table and analysis below demonstrate a 2-way ANOVA with replication. Please interpret the results. Ho: Average compas by gender are equal Ha: Average compas by gender are not equal Ho: Average compas are equal for each degree Ho: Average compas are not equal for each degree Total 24 25.249 1.052042 0.006866 24 25.578 1.06575 0.004933 MS F P-value F crit 0.002255 0.383482 0.538939 4.061706 (This is the row variable or gender.) 0.006234 1.060054 0.30883 4.061706 (This is the column variable or Degree.) 0.006417 1.091288 0.301892 4.061706 0.00588 Ha: Average compas by gender are not equal Do you reject or not reject the null hypothesis: Ha: Average salaries are not equal for all grades Do you reject or not reject the null hypothesis: Do you reject or not reject the null hypothesis: What do these decisions mean in terms of our equal pay question: Many companies consider the grade midpoint to be the "market rate" - what is needed to hire a new employee. Midpoint Salary Does the company, on average, pay its existing employees at or above the market rate? Using the results up thru this week, what are your conclusions about gender equal pay for equal work at this point? Week 4 Confidence Intervals and Chi Square (Chs 11 - 12) For questions 3 and 4 below, be sure to list the null and alternate hypothesis statements. Use .05 for your significance level in making your decisions. For full credit, you need to also show the statistical outcomes - either the Excel test result or the calculations you performed. 1 Using our sample data, construct a 95% confidence interval for the population's mean salary for each gender. Interpret the results. How do they compare with the findings in the week 2 one sample t-test outcomes (Question 1)? Mean St error t value Low to Males Females <Reminder: standard error is the sample standard deviation divided by the square root of the sample size.> Interpretation: 2 Using our sample data, construct a 95% confidence interval for the mean salary difference between the genders in the population. How does this compare to the findings in week 2, question 2? Difference St Err. T value Low to Yes/No Can the means be equal? Why? How does this compare to the week 2, question 2 result (2 sampe t-test)? a. Why is using a two sample tool (t-test, confidence interval) a better choice than using 2 one-sample techniques when comparing two samples? 3 We found last week that the degrees compa values within the population. do not impact compa rates. This does not mean that degrees are distributed evenly across the grades and genders. Do males and females have athe same distribution of degrees by grade? (Note: while technically the sample size might not be large enough to perform this test, ignore this limitation for this exercise.) What are the hypothesis statements: Ho: Ha: Note: You can either use the Excel Chi-related functions or do the calculations manually. Data input tables - graduate degrees by gender and grade level OBSERVED A B C D E F Total M Grad Fem Grad Male Und Female Und EXPECTED M Grad Fem Grad Male Und Female Und Interpretation: What is the value of the chi square statistic: What is the p-value associated with this value: Is the p-value <0.05? Do you reject or not reject the null hypothesis: If you rejected the null, what is the Cramer's V correlation: What does this correlation mean? What does this decision mean for our equal pay question: 4 Based on our sample data, can we conclude that males and females are distributed across grades in a similar pattern within the population? What are the hypothesis statements: Ho: Ha: A B C D E F OBS COUNT - m OBS COUNT - f EXPECTED What is the value of the chi square statistic: What is the p-value associated with this value: Is the p-value <0.05? Do you reject or not reject the null hypothesis: If you rejected the null, what is the Phi correlation: What does this correlation mean? What does this decision mean for our equal pay question: 5. How do you interpret these results in light of our question about equal pay for equal work? For questions 3 and 4 below, be sure to list the null and alternate hypothesis statements. Use .05 for your significance level in making your decisions. For full credit, you need to also show the statistical outcomes - either the Excel test result or the calculations you performed. Using our sample data, construct a 95% confidence interval for the population's mean salary for each gender. Interpret the results. How do they compare with the findings in the week 2 one sample t-test outcomes (Question 1)? High <Reminder: standard error is the sample standard deviation divided by the square root of the sample size.> Using our sample data, construct a 95% confidence interval for the mean salary difference between the genders in the population. High Why is using a two sample tool (t-test, confidence interval) a better choice than using 2 one-sample techniques when comparing two samples? do not impact compa rates. This does not mean that degrees are distributed evenly across the grades and genders. (Note: while technically the sample size might not be large enough to perform this test, ignore this limitation for this exercise.) Do manual calculations per cell here (if desired) A B C D E F M Grad Fem Grad Male Und Female Und Sum = For this exercise - ignore the requirement for a correction for expected values less than 5. What is the value of the chi square statistic: What is the p-value associated with this value: Do you reject or not reject the null hypothesis: If you rejected the null, what is the Cramer's V correlation: What does this decision mean for our equal pay question: Based on our sample data, can we conclude that males and females are distributed across grades in a similar pattern Do manual calculations per cell here (if desired) A B C D E F M F Sum = What is the value of the chi square statistic: What is the p-value associated with this value: Do you reject or not reject the null hypothesis: If you rejected the null, what is the Phi correlation: Week 5 Correlation and Regression 1. Create a correlation table for the variables in our data set. (Use analysis ToolPak or StatPlus:mac LE function Correlation.) a. Reviewing the data levels from week 1, what variables can be used in a Pearson's Correlation table (which is what Excel produces)? b. Place table here (C8 in Output range box): c. Using r = approximately .28 as the signicant r value (at p = 0.05) for a correlation between 50 values, what variables are significantly related to Salary? To compa? d. Looking at the above correlations - both significant or not - are there any surprises -by that I mean any relationships you expected to be meaningful and are not and vice-versa? e. Does this help us answer our equal pay for equal work question? 2 Below is a regression analysis for salary being predicted/explained by the other variables in our sample (Midpoint, age, performance rating, service, gender, and degree variables. (Note: since salary and compa are different ways of expressing an employees salary, we do not want to have both used in the same regression.) Plase interpret the findings. Ho: The regression equation is not significant. Ha: The regression equation is significant. Ho: The regression coefficient for each variable is not significant Note: technically we have one for each input variable. Ha: The regression coefficient for each variable is significant Listing it this way to save space. Sal SUMMARY OUTPUT Regression Statistics Multiple R 0.99155907 R Square 0.9831894 Adjusted R Square 0.98084373 Standard Error 2.65759257 Observations 50 ANOVA df SS MS F Significance F Regression 6 17762.3 2960.38 419.1516 1.812E-36 Residual 43 303.7003 7.0628 Total 49 18066 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept -1.7496212 3.618368 -0.48354 0.631166 -9.046755 5.5475126 Midpoint 1.21670105 0.031902 38.1383 8.66E-35 1.1523638 1.2810383 Age -0.004628 0.065197 -0.07098 0.943739 -0.136111 0.1268547 Performace Rating -0.0565964 0.034495 -1.64071 0.108153 -0.126162 0.0129695 Service -0.0425004 0.084337 -0.50394 0.616879 -0.212582 0.1275814 Gender 2.42033721 0.860844 2.81159 0.007397 0.6842792 4.1563952 Degree 0.27553341 0.799802 0.3445 0.732148 -1.337422 1.8884885 Note: since Gender and Degree are expressed as 0 and 1, they are considered dummy variables and can be used in a multiple regression equation. Interpretation: For the Regression as a whole: What is the value of the F statistic: What is the p-value associated with this value: Is the p-value <0.05? Do you reject or not reject the null hypothesis: What does this decision mean for our equal pay question: For each of the coefficients: Intercept Midpoint Age What is the coefficient's p-value for each of the variables: Is the p-value < 0.05? Do you reject or not reject each null hypothesis: What are the coefficients for the significant variables? Using only the significant variables, what is the equation? Salary = Is gender a significant factor in salary: If so, who gets paid more with all other things being equal? How do we know? 3 Perform a regression analysis using compa as the dependent variable and the same independent variables as used in question 2. Show the result, and interpret your findings by answering the same questions. Note: be sure to include the appropriate hypothesis statements. Regression hypotheses Ho: Ha: Coefficient hypotheses (one to stand for all the separate variables) Ho: Ha: Put C94 in output range box Interpretation: For the Regression as a whole: What is the value of the F statistic: What is the p-value associated with this value: Is the p-value < 0.05? Do you reject or not reject the null hypothesis: What does this decision mean for our equal pay question: For each of the coefficients: Intercept Midpoint Age What is the coefficient's p-value for each of the variables: Is the p-value < 0.05? Do you reject or not reject each null hypothesis: What are the coefficients for the significant variables? Using only the significant variables, what is the equation? Compa = Is gender a significant factor in compa: If so, who gets paid more with all other things being equal? How do we know? 4 Based on all of your results to date, do we have an answer to the question of are males and females paid equally for equal work? If so, which gender gets paid more? How do we know? Which is the best variable to use in analyzing pay practices - salary or compa? Why? What is most interesting or surprising about the results we got doing the analysis during the last 5 weeks? 5 Why did the single factor tests and analysis (such as t and single factor ANOVA tests on salary equality) not provide a complete answer to our salary equality question? What outcomes in your life or work might benefit from a multiple regression examination rather than a simpler one variable test? Create a correlation table for the variables in our data set. (Use analysis ToolPak or StatPlus:mac LE function Correlation.) Reviewing the data levels from week 1, what variables can be used in a Pearson's Correlation table (which is what Excel produces)? Using r = approximately .28 as the signicant r value (at p = 0.05) for a correlation between 50 values, what variables are Looking at the above correlations - both significant or not - are there any surprises -by that I Below is a regression analysis for salary being predicted/explained by the other variables in our sample (Midpoint, age, performance rating, service, gender, and degree variables. (Note: since salary and compa are different ways of expressing an employees salary, we do not want to have both used in the same regression.) Note: technically we have one for each input variable. Listing it this way to save space. Lower 95.0% Upper 95.0% -9.046755043 5.547512618 1.152363828 1.281038273 -0.136110719 0.126854699 -0.126162375 0.012969494 -0.212582091 0.127581377 0.684279192 4.156395232 -1.337421655 1.888488483 Note: since Gender and Degree are expressed as 0 and 1, they are considered dummy variables and can be used in a multiple regression equation. What does this decision mean for our equal pay question: Perf. Rat. Service Gender Degree What is the coefficient's p-value for each of the variables: Do you reject or not reject each null hypothesis: What are the coefficients for the significant variables? If so, who gets paid more with all other things being equal? Perform a regression analysis using compa as the dependent variable and the same independent variables as used in question 2. Show the result, and interpret your findings by answering the same questions. What does this decision mean for our equal pay question: Perf. Rat. Service Gender Degree What is the coefficient's p-value for each of the variables: Do you reject or not reject each null hypothesis: What are the coefficients for the significant variables? If so, who gets paid more with all other things being equal? Based on all of your results to date, do we have an answer to the question of are males and females paid equally for equal work? Which is the best variable to use in analyzing pay practices - salary or compa? Why? What is most interesting or surprising about the results we got doing the analysis during the last 5 weeks? Why did the single factor tests and analysis (such as t and single factor ANOVA tests on salary equality) not provide a complete answer to our salary equality question? What outcomes in your life or work might benefit from a multiple regression examination rather than a simpler one variable test?