Using the Bernoulli model to analyze the distribution of course withdrawals at UPR-Bayamón

Horacio Matos-Díaz

resúmenes

secciones

referencias

imágenes

Abstract: Using arichpanel data comprising 39,337 courses offered in the UPR-Bayamón during forty-one consecutive terms,this paper analyses the distribution of course withdrawals, estimating four parameters per course: the proportion of withdrawals and its variance, as well as the coefficients of skewness and kurtosis. Evidence suggests that the characteristics of courses, students, and, particularly, unobservable faculty heterogeneity exert a strong and statistically significant effect on these parameters over time and within academic fields. Faculty members and students engage in a shopping-around process where both parties improve their well-being at the expense of the academic standards and the quality of the education provided.

Keywords: course withdrawals, Bernoulli model, moment-generating function, skewness, kurtosis.

Resumen: Usando un archivo longitudinal de los 39,337 cursos ofrecidos en la UPR-Bayamón durante 41 semestres consecutivos, se analiza la distribución de bajas parciales por curso a través de los primeros cuatro momentos: media, varianza, asimetría, y curtosis. Las características de los cursos, de los estudiantes, y muy particularmente, la heterogeneidad inobservable de los profesores, ejercen una fuerte y significativa influencia sobre el comportamiento de los momentos a través del tiempo. Parecería, que profesores y estudiantes están involucrados en un proceso de ir de compras que les beneficia mutuamente a expensas de los estándares académicos y de la calidad de la educación provista.

Palabras clave: bajas parciales por curso, modelo de Bernoulli, función generatriz de momentos, asimetrías, curtosis.

Carátula del artículo

Using the Bernoulli model to analyze the distribution of course withdrawals at UPR-Bayamón

Uso del modelo de Bernoulli para analizar la distribución de bajas parciales por curso en la UPR-Bayamón

Horacio Matos-Díaz horacio.matos1@upr.edu

University of Puerto Rico at Bayamón, Estados Unidos de América

Forum Empresarial, vol. 29, núm. 1, pp. 45-82, 2024
Universidad de Puerto Rico

Recepción: Junio 28, 2024

Revisado: Octubre 23, 2024

Aprobación: Noviembre 07, 2024

Introduction

One of the most significant challenges that institutions of higher education face is the establishment of a selective admission policy which allows them to identify and admit, from a total pool of applicants, the candidates most able and likely to academically succeed: the greater the institutional economic shortage, the greater the urgency. Suppose that, to reach such an objective, the university administrators consider designing an ideal standardized test. According to Rothstein (2004), the ideal test should be able to predict as accurately as possible which applicants would be most successful if admitted. That is, all applicants whose performance in the test exceeds a determinate threshold after admission would be likely to (a) succeed academically and (b) fulfill all academic requisites during the allotted time. Should this be the case, the institutional admission policy would be quite simple and efficient. However, designing and implementing such a test is extremely difficult, if not impossible. Many diverse factors influence student academic achievement, which are difficult to identify and measure, and most are beyond student and institutional control.

Since the academic year 1979-80, the University of Puerto Rico (UPR) —the country’s public university system composed of eleven campuses across the Island— has adopted as its official admission policy a standardized test administered by the Puerto Rico (PR) and Latin America Office of the College Entrance Examination Board (CEEB), named the General Admission Index (GAI).[1] Every year, each of the UPR’s eleven campuses establishes the minimum GAI required by its different academic programs in response to trends in enrollment demand and the program's capacity.[2] The fact that the GAI required for each program is made public every year has led from its inception to a self-inclusion/exclusion process by which students themselves decide whether to apply to the UPR (and to a particular program), based on their GAI and the minimum established by the program. Students’ admissions to each academic program follow a strict order entirely defined by their GAI. It is expected that these students will be able to sort themselves in such a way that more (less) academically able are admitted to the highest (lowest) selective programs with more (less) inherent difficult content.

Thus, the role of the ideal test described earlier has been ascribed to the GAI. The issue to be settled is whether the GAI satisfies conditions (a) and (b)previously mentioned. Of course, the answer is no. Because of the inaccuracies of GAI, there are several endemic academic problems whose incidence varies among and within the eleven UPR campuses. For instance, to be admitted to the UPR at Bayamón (UPR-Bayamón), many students whose GAI is below the minimum apply to non-desired programs for which they qualify, looking for an eventual possible transfer to their desired program. The strategy consists of enrolling in courses in their desired program even though they are officially admitted to a different one. Because the academic requirements and contents of the programs could differ significantly, the likelihood of course failures and withdrawals increases. Moreover, such a strategy lengthens time until graduation, increasing the opportunity cost of schooling. Eventually, many such students withdraw from the institution because of failure to be admitted to their desired programs. Furthermore, it should be mentioned that these endemic problems are also prevalent even among students who were admitted to their desired programs from the beginning. Therefore, the official admission policy generates undesirable by-products like academic failures, too many applications for program transfers, as well as total and partial withdrawals. Among these problems, this study seeks to analyze the distribution of withdrawals (W) observed in the 39,337 courses offered during 41 consecutive terms (including summer sessions) at UPR-Bayamón from fall 1995-96 to fall 2015-16. To the best of my knowledge, the extant literature lacks studies devoted to analyzing the implications and consequences of the proportion of course withdrawals on the education process; this paper seeks to fill such a gap.

To this end, this paper adopts the Bernoulli probability model and derives the moment-generating functions around its origin and mean. For each course, the objective is to calculate the following four parameters: first, the proportion of withdrawals, which equals the first moment around the origin; second, the variance of its distribution, which is equal to the second moment around the mean; third, the coefficient of skewness, using the third moment around the mean; and fourth, the coefficient of kurtosis, using the fourth moment around the mean. Using different econometric specifications, including random- and fixed-effects models, allows the modeling of these four parameters. This paper uses a rich and detailed panel data containing time-varying variables describing faculty, student, and course characteristics to fulfill this objective.

This study contributes to the extant literature by (a) being the first to analyze in detail the distribution of withdrawals and its key moments at the course level, (b) using a rich panel data comprising all courses offered at UPR-Bayamón during 41 consecutive terms, (c) including time-varying variables describing in detail the faculty characteristics, and showing that courses, faculty and students characteristics exert strong and significant effects on the estimated models, and (d) presenting empirical evidence pointing to the conclusion that faculty and students engage in a symbiotic relationship where both parties improve their well-being at the expense of diminishing academic standards.

The remainder of the paper is organized as follows. Section 2 justifies adopting the Bernoulli model based on its simplicity and statistical properties. Section 3 describes the nature of the data and the specification of the statistical models to be estimated. Section 4 discusses the empirical results, as well as their policy implications. Finally, Section 5 closes the study with a summary and conclusions.

Motivating the Adoption of the Bernoulli Probability Model

When and why do students decide to withdraw from a course? Although the answer to this question has dramatic policy implications for students and universities since withdrawals entail significant cost consequences, the research published on this topic is limited. Wollman and Lawrence (1984), Adams and Becker (1990), Dunwoody and Frank (1995), and Miller (1997) constitute notable exceptions. However, inspired by the original work of Spady (1970, 1971) and Tinto (1975), there is an extensive and diverse body of published research related to the withdrawals of students from college. This is the first research to analyze the determinants of the distribution of withdrawals and their key moments at the course level.

The study of the distribution of course withdrawals among and within academic programs and across time is relevant for several reasons. Withdrawals can increase student time until graduation and the total cost of the degree. Moreover, they could predict or signal total withdrawals and attrition from college, decreasing college graduation rates. At one time, Dunwoody and Frank (1995) raised the issue that individual course withdrawal could have the highest impact on overall retention, attrition, and institutional success. For some researchers (e.g., Zwick & Sklar, 2005), the best criterion to measure an institution’s academic success is, precisely, the proportion of students who complete their degrees in the allotted time. In this context, low graduation rates negatively impact institutional rankings and, consequently, their ability to attract students with more significant academic potential. Moreover, student attrition represents a fiscal cost to institutions in terms of lost revenues from tuition, room and board, and alum donations (Raisman, 2013; Schuch, 2005). Attrition also constitutes a problem for society in general by reducing the availability of college-educated workers in the labor market (Bound et al., 2007). It also negatively impacts lower tax receipts for federal and state governments (Schneider & Yin, 2011). Although these considerations are beyond the scope of this research, they illustrate how important it is to model the determinants of the distribution of withdrawals at the course level.

From Adams and Becker (1990), it will be hypothesized that students want to maximize their utility function subject to the constraints imposed by their economic and academic environment. Students derive utility from their present and future stream of consumption of the goods and services they will be able to buy in the market as a product of their investment in human capital through education. Education is costly in terms of money and opportunity cost. Therefore, withdrawing from courses would entail a waste of money and increase the opportunity cost of schooling by lengthening the time until graduation. It seems reasonable to posit that the disutility derived by students directly varies with the intrinsic course difficulty level. However, such a concept is relative and unobservable. Thus, it will be assumed that a student will withdraw whenever the disutility (dissatisfaction) derived from the course is greater than the disutility induced by the cost of withdrawing it.

While student disutility or dissatisfaction is not observable, their actions of withdrawing or remaining in the course are. The indicator variable defined in (5) allows us to consider these actions.

Usually, the determination of the number of total courses offered by academic fields (AFs) and the number of students enrolled in each one occurs at the beginning of each term. Likewise, by the end of the term, each academic department head knows with certainty the number of students who withdraw from each course and those who remain in it. Suppose each academic unit adopts a coding system such that code “1” represents the students who withdraw and code “0” represents those who do not withdraw. Thus, expression (1) defines the random variable W.

Formulas

Formula 6

The Moment-Generating Functions: Interesting Analytical Results

.The first moment around the origin is equal to π. This calculation requires taking the first derivative with respect to t (evaluated at t = 0) from the corresponding moment-generating function.[6] The superior moments around the origin are all equal to πThe first moment around the origin is equal to π. This calculation requires taking the first derivative with respect to t (evaluated at t = 0) from the corresponding moment-generating function.6The superior moments around the origin are all equal to π. How-ever, moments around the mean are more interesting. Let μ1,μ2,μ3, and μ4 be the first four moments around the mean. To find them, it is necessary to take the first, second, third, and fourth derivatives with respect to t and to evaluate each one at t = 0; Table 1 reports their values, as well as the coefficients of skewness (CS) and kurtosis (CK). The units of measurement of μ3 and μ4 influence their respective size. Therefore, considered alone, they are poor measures of skewness and kurtosis, respectively. Such dimensionality disappears, defining each coefficient as a relative measure, as done in (

Formula 7

As expected, μ1 = 0. The second moment is equal to the variance. It is an open downward parabola, which reaches its absolute maximum (0.25) at π = 0.5. Obviously, it is zero in the extremes, at

Table 1
Key Parameters of the Bernoulli Probability Model

or at π = 1 (→ –∞ ). It is a positive decreasing convex function (C”S> 0) on π∈(0; 0.5), reaches its inflection point at π = 0.5, and continues decreasing as a negative concave function (C”S<0) on π∈(0.5; 1). The coefficient of kurtosis (CK) is a positive U-shaped convex function symmetric with respect to the line π = 0.5. The coefficient, as well as its first and second derivatives are undefined at points π = 0 and π = 1. The coefficient decreases on π∈(0; 0.5), reaches its absolute minimum equal to 1 on π = 0.5, and increases unbounded on π∈(0.5; 1). As mentioned earlier, π determines, completely and uniquely, the four parameters of interest for this study. Three different points are of key interest in the range of π: π = 0, π = 0.5 and π = 1. There are 11,206 (28.49%) courses where π = 0, and none where π = 1 (see Section 4). For all courses where π = 0, σ2 (W) = 0; however, CS and CK are undefined. In all courses where π = 0.5, CS = 0 , implying a symmetric distribution of withdrawals, while σ2 (W) and CK reach their maximum and minimum values (0.25 and 1, respectively). On the other hand, the distribution of withdrawals is skewed to the left in all courses, where π > 0.5, since CS <0. Thus, once πj is known, it is straightforward to characterize the degrees of skewness and kurtosis prevailing in course j according to these analytical results. Moreover, the mean and variance of CS and CK distributed by specific courses or AFs can be computed and depicted over the 41 terms considered in this study for analytical comparison. Therefore, the simplicity of the Bernoulli model allows us to easily analyze the distribution of course withdrawals looking for policy measures that improve the academic process.

Data Description

The UPR-Bayamón is an autonomous unit of the UPR system. Accredited by the Middle States Association of Colleges and Secondary Schools, it offers associate and bachelor’s degrees, as well as articulated transfer programs to the Río Piedras, Mayagüez, and Medical Sciences campuses. In the fall of 2024, total enrollment at UPR-Bayamón was 2,852, including 2,520 full-time students.

For each one of the 39,337 courses offered in UPR-Bayamón from 1995 to 2015, the following variables are available: enrollment; instructor who taught the course; letter grade distribution (As, Bs, Cs, Ds, Fs, and W); GPA; variance of the GPA, and AFs (21 dummies). As proxies to account for student quality at the course level, this research uses the mean and variance of the following variables: high school graduation GPA (HS-GPA), GAI, and the score on each of the five sections of the CEEB.[8] Furthermore, the proportions of students by gender and type of high school (public or private) are available for each offered course. Dummies control for academic schedules (weekdays and hours) and for summer terms. For each faculty member in the sample, the following time-varying variables are available: age, rank, degree, and tenure status. Dummies account for the instructor’s gender and the presence of courses subject to student evaluations of teaching (SET). The inclusion of a set of forty-one dummies, identifying term/year, allows for capturing time effects.[9] Table 2 describes the variables used.

Models for Estimation

The preceding discussion suggests that the model specified in (8) is appropriate to estimate the equations that predict the proportion of withdrawals and the distribution variance, as well as the coefficients of skewness and kurtosis observed in course j, taught by professor f. The matrices Xj,Zf, and Ms, consist of course (j), faculty (f), and student’s (s) characteristics, respective-ly. The vectors β, F,and c represent parameters to be estimated and ɛjf is the composite error term. The inclusion of random- and fixed-effects models allows to account for unobservable faculty heterogeneity (UFH = γf)

Table 2
Sample Statistics

Results and Discussion

Stylized Facts of the Parameters Across Time

Table 3 reports the mean values of the four parameters computed over the 41 terms analyzed in this study distributed by AFs: π, σ2 (W) CS, and CK. AFs are ordered using the values of π, from smallest to largest, as reference. The parameters π and σ2(W)move in the same direction until π = 0.5, and the former is always greater than the latter. The values of π are not randomly distributed by AFs. For instance, for all courses offered by the Marketing and Physical Education departments, the respective figures are the lowest: 2.69% and 4.6%. However, for the Economics & Statistics and Mathematics courses, the figures are the highest: 18.31% and 29.22%, respectively. To the extent that π values were directly related to the inherent difficulty of the course, evidence points to the conclusion that Chemistry (14.93%), Economics & Statistics (18.31%), and Mathematics (29.22%) are the most challenging courses. Likewise, Marketing (2.69%), Physical Education (4.6%), and Management (5.42%) are the easiest ones. A cautionary note is in order here. Higher π values would signal increased course inherent difficulty levels to the extent that academic standards do not decrease over time. According to the leniency hypothesis (Gump, 2007), faculty members can buy higher SET ratings, recruit more students, improve their teaching schedules, or even become more popular by relaxing their academic standards through leniency grading. If so, GPA will in-crease (implying grade inflation), and withdrawals will decrease among and within courses across time. To test for such a conjecture, all the econometric models include a dummy variable that takes on the value 1 if SET were conducted in the course and 0 otherwise. The coefficients of skewness and kurtosis move in opposite directions to the course’s inherent difficulty level. Marketing exhibits the highest figures: CS = 4.24 and CK = 20.14; while Mathematics exhibits the lowest: 1.21 and 4.08, respectively. Over the period covered in the study, on average, all AFs exhibit skewed

Table 3
Parameters of the Distribution of Withdrawals by AFs

to the right and leptokurtic distributions. However, there are 194 courses where 0.49 <π < 0.51. For all of them, σ2(W) reaches its absolute maximum (0.25). Furthermore, CS tends to zero, im-plying symmetric distributions; while CK tends to one, implying platykurtic distributions. A total of 106 (≈ 55%) of those courses belongs to Mathematics. Thus, to the extent that π tends to zero, CS and CK increase unbounded implying distributions exhibiting higher skewed to the right and greater leptokurtosis degrees, respectively. Finally, there are 686 courses where π > 0.5; out of this number, 465 (≈ 68%)belong to Mathematics. On the other hand, there are 11,206 courses where π = 0, but only 125 (1%) be-long to Mathematics. Thus, according to student withdrawal decisions, Mathematics courses are the most difficult, independent of the criteria used to measure difficulty

Figures 1 and 2 clearly depict the growth path of the student quality proxies and the four parameters under study. Four proxies account for student quality at the course level: GAI (see footnote 1), HS-GPA, mathematics, and verbal aptitude. Although GAI tends to increase over time, it should be mentioned that such a tendency is pushed by the self-sustained growth path of HS-GPA, which tends to increase over time (implying grade inflation). However, according to mathematics and verbal aptitude figures, student quality decreases over time. It should be mentioned that this decreasing tendency is consistent with empirical evidence documented at the international level, particularly evidence from Norway.10 Given that, on average, students are academically less able each term, two results should be expected by course: dimin-ishing GPA and increasing π

The last two columns of Table 4 report the GPAs for the full sample (39,337 courses), as well as the GPAs for the subsample where W > 1 (28,131 courses) distributed by terms. Each GPA in the first column is greater than its counterpart in the second one, and both series increase over time, implying grade inflation since simultaneously, student quality is diminishing. On the other hand, during the forty-one terms studied, the overall π is 10.89%. As shown in the sixth column of Table 4 and clearly depicted in the first graph of Figure 1, π decreased over time, from 13.21% in the fall 1995 term to 9.88% in the fall 2015 term. Therefore, contrary to what should be expected, evidence points to increasing GPAs and decreasing π.11

Figure 1
Students Quality Proxies’ Growth-Path
These results are consistent with an academic environment characterized by diminishing standards and grade inflation. Several recent studies conducted in the institution have documented such a problematic. For details, refer to Matos-Díaz (2012, 2014, 2018) and Matos-Díaz & García-Vázquez (2014).

Figure 2
Growth Path of the Key Parameters of the Distribution of the Withdrawals

It should be emphasized that this inverse relationship be-tween GPA and π documented across time is a robust one observed among and within AFs. Table 5 reports the evidence. Once again, AFs are ordered using the values of π, from smallest to largest, as reference. Conversely, the respective GPAs reported in the three columns run from largest to smallest. Moreover, for each academic field, the GPAs observed in courses where W> 1 is lower than the respective one observed in the full sample, where W> 0, which in turn is lower than the one observed in courses where W = 0. Therefore, either over time or between and within AFs, GPA and π move in opposite directions. This result is at odds with that reported in the extant literature (Matos-Díaz, 2018). Figure 2 also depicts the growth-paths of σ2 (W) CS, and CK. Like π, σ2 (W) decreases over time. Conversely, CS and CK exhib-it an increasing tendency over time. Thus, the distributions of course withdrawals become more skewed to the right and more leptokurtic, implying greater academic homogeneity among and within courses over time.

Table 4
Stylized Facts of Withdrawals by Terms

Table 4(continued)

Table 5

Table 6 reports several key facts of π by AFs. The service de-partments responsible for offering the highest number of courses were English (3,922), Mathematics (3,784), Spanish (2,948), and Humanities (2,903). The value in Mathematics was the greatest (29.22%), while the respective figures in the English, Humani-ties, and Spanish courses were 7.57%, 7.08%, and 5.19%. Two other service programs exhibiting high π values were Economics & Statistics (17.38%) and Chemistry (15.14%). The last column of Table 6 transforms withdrawals (W) into equivalent courses by AFs. The exercise requires dividing each value of W by the aver-age course size of the respective AFs. The total W (98,940) observed during the period would require offering 4,239 equivalent courses to satisfy future demand.

Estimating the costs will be necessary to gauge withdrawals’ eco-nomic and academic consequences. The approach suggested by Matos-Díaz (2018), assuming that equivalent courses were offered by part-time faculty, paid through the mechanism of additional compensation ($2,000 per course), allows estimating the lower-bound monetary cost of the total withdrawals of around $8.48 mil-lion (4,239 x $2,000 = $8,478,000). However, their actual cost might be significantly higher. The 4,239 equivalent courses are more than the total courses offered by service departments such as English (3,922) and Mathematics (3,784) and more than all the courses offered jointly by six different programs.12 That is, withdrawals entail a waste of resources greater than the whole budget assigned to and spent by such programs during 20.5 consecutive academic years. This is, indeed, a significant waste of scarce resources.

Predicting π, σ2 (W) CS ,and CS

Thus far, the discussion has centered on the characteristics of the parameters distributed by AFs and over time. This section is devoted to discussing the results of the estimated models and their policy implications. It was shown that π and σ2 (W) move in the same direction until π =0.5; then, σ2 (W) decreases for all π > 0.5. Likewise, CS is an entirely decreasing function of π, while CK decreases until π = 0.5, and then increases unbound-edly. Based on these analytical results, the coefficients of Models 1 and 2 in Table 7 should be expected to share the same pattern of signs. Likewise, the coefficients of Models 3 and 4 should also share the same signs. However, the pattern of signs of Models 1 and 2 should be the opposite of Models 3 and 4, and vice versa, except for values of π > 0.5 in the two mentioned cases. All semi-continuous covariates, as well as almost all the dummies, satisfy this condition of consistency in the pattern of signs.

Table 6
Stylized Facts of Withdrawals by AFs

The baseline model estimates the equation described in (8) as a first approximation, using the following covariates: summer, SET, GAI, the variance of GAI, the proportion of private high school students, the proportion of female students, and the constant term.13 The adjusted R-squared for the four models were 0.03, 0.03, 0.05, and 0.04, respectively. When the models were re-estimated accounting for UFH,14 through fixed-effects models, the coefficients increased to 0.44, 0.41, 0.41, and 0.34, respective-ly. That is, the total variation around the means explained by the models increased by a factor of 14.67, 13.67, 8.2, and 8.5 times, re-spectively (results are available upon request). Thus, UFH plays a significant role in the student’s decision process related to course withdrawal.

Later, the four models were estimated using all the covariates in Table 7 without accounting for UFH. Almost all AFs’ covariates are statistically significant and exhibit the expected pattern of signs previously discussed. Nonetheless, the adjusted R-squared coefficients are 0.34, 0.31, 0.32, and 0.26, respectively. These coefficients are even smaller than those reported for the baseline models after accounting for UFH (0.44, 0.41, 0.41, and 0.34, respectively). However, the process of re-estimation of the models accounting for UFH gives rise to the statistical insignificance of a great proportion of the AFs’ estimated coefficients. As Table 7 reports, the adjusted R-squared coefficients were 0.46, 0.43, 0.43, and 0.36, respectively. This result illustrates the superiority of UFH over AFs covariates.15 Table 7 reports (in parentheses) the standard errors, corrected for heteroscedasticity and contemporaneous correlation, of all models; however, for space limitations, the table does not report the AFs’ coefficients, even though they were included in the four models.16

Table 7
Predicting the Parameters of the Distribution of Withdrawals

Among the dummies controlling for faculty characteristics, the associate professor exhibits the correct pattern of signs, but it is marginally significant and positive only in Model 2. Professor co-variate shares the correct pattern of signs (positive in Models 1 and 2 and negative in Models 3 and 4) in all estimated models. It is significant in Models 1, 2, and 3 but insignificant in Model 4. On the other hand, assistant professor, probation, and tenure are insignificant in all models; while doctorate exhibits the appropriate pattern of signs in all models, it is significant only in Models 1 and 3

The professor’s age covariate could capture the effects of two different scenarios. On the one hand, the course withdrawals may be significant and directly related to the young faculty’s lack of teaching skills. If so, they should tend to diminish to the extent that faculty members improve their teaching skills over their aca-demic career life cycle. On the other hand, it might be the case that withdrawals were significant and directly related to intrinsic course difficulty level rather than to a lack of faculty teaching skills. Under such a scenario, it should be expected that during their first years of teaching, new faculty members were subject to pressure from students (through SET) and administrators to grade more leniently. However, such pressure tends to diminish to the extent that faculty get tenure and promotions to higher ranks. If so, course withdrawals and the professor’s age will be expected to move in the same direction. Evidence points to the conclusion that the second scenario prevails at UPR-Bayamón be-cause the covariate is significant and exhibits the correct pattern of signs in all models (positive in Models 1 and 2 and negative in Models 3 and 4). Increases of one standard deviation in this covariate will induce increases of 2.32 and 1.42 points in π and σ2 (W), as well as decreases of 0.38 and 2.25 points in CS and CK, respectively.

Almost all the covariates that define the section characteristics, such as course size, hour, and weekdays, as well as summer and SET, are statistically significant. Compared to the reference group (13 to 29 students per course), π and σ2 (W)decrease by 2.14 and 1.79 points, while CS and CK decrease by 0.45 and 2.79 points in smaller courses, respectively. This result is at odds with the expected pattern of signs since the signs of CS and CK should be the opposite. One plausible explanation is that the smallest courses have been designed to accommodate students with special aca-demic needs. There are 3,939 courses with enrollment less than or equal to 12 students. Among them, there are 2,256 where π = 0, and 1,683 where average π = 18.24%. If the first set consists of academically lagging students enrolled in remedial courses, while the second set is composed primarily of advanced students placed in small groups of the most difficult or advanced courses, then it will be very unlikely that the models could disentangle the relationship between course size and academic achievement.

On the other hand, in bigger courses (30 or more students), the pattern of signs is consistent (positive in Models 1 and 2 and negative in Models 3 and 4), and the covariate is statistically in-significant only in Model 1. Furthermore, compared to the reference group, σ2 (W) tends to decrease by 0.17 points, while CS and CKtend to increase by 0.26 and 2.21 points, respectively. There-fore, π and course size move in opposite directions. This result has policy implications since the institution would be able to de-sign strategies to identify in advance students with high probabilities to withdraw from determinate courses and try to place them in smaller courses with academic support.

Compared to courses offered in the afternoon CS and CK tend to increase by 0.04 and 0.19 points in morning courses, respectively. Meantime, π and σ2 (W) move in the opposite direction, but their coefficients are statistically insignificant. On the other hand, in the evening courses, π decreases by 0.81 points and CSincreases by 0.08 points; however, the covariate was insignificant in the case of Model 2 (σ2 (W)) and Model 4 (CK). One possible explanation for such results could be the traffic congestion con-fronted by students enrolled in courses scheduled early in the morning or the lack of sufficient parking spaces. Both situations could increase late arrivals to classes and absenteeism among students, which in turn would increase π. If such problems have occurred, their frequency seems to be significantly smaller for evening courses. If so, the problem could be mitigated by improving the schedule of the academic offering according to students’ needs.

A great proportion (14/20 = 70%) of the weekday dummy co-variates is statistically significant. Nonetheless, the pattern of signs of the estimated coefficients is inconsistent. To shed more light on this issue, it would be convenient to increase the specificity level of the analysis considering interactions among hours, week-days, and level of courses by AFs. Such a task will require further research.

To empirically test the leniency hypothesis, attention is placed on the SET estimated coefficients. According to this conjecture, faculty members will get better SET ratings if they reduce aca-demic standards and course difficulty levels through leniency grading. Such a symbiotic relationship between students and faculty has been proposed in the literature for a long time with-out direct statistical testing.17 If so, it should be expected that in courses where SET = 1, the difficulty level diminishes, the GPA increases, and π decreases. The SET estimated coefficients are statistically significant and exhibit the expected pattern of signs in all models. For instance, π and σ2 (W) decrease by 0.52 and 0.33 points; meantime, CSand CK increase by 0.11 and 0.63 points if SET were conducted in the course. Other things being equal, π significantly diminishes whenever SET = 1. Therefore, according to students’ criteria, inherent difficulty significantly decreases just for the simple reason that the course is under SET. This result is consistent with the symbiotic relationship conjectured in the leniency hypothesis. Among the available student quality proxies, GAI is the most relevant because it constitutes the institution’s admission policy criterion. Therefore, it should be expected that both GAI and GAI variance exert a significant effect on the four dependent variables under study. Table 7 reports the estimated coefficients showing that such is the case. For example, the estimated coefficients of the GAI covariate are highly significant (with the correct pattern of signs) in the four estimated models. Other things being equal, increases of one standard deviation in student quality (GAI) will induce decreases of 1.56 and 0.86 points in π and σ2(W), respectively. However, CS and CK are expected to increase by 0.17 and 0.85 points, respectively. On the other hand, increases of one standard deviation on the GAI variance covariate will induce reductions of 0.18 and 0.09 points in π and σ2 (W), respectively. Contrariwise, CS and CK are expected to increase by 0.02 and 0.08 points, respectively.

The observed inverse relationship between GAI and π is what should be expected under normal academic circumstances. Not-withstanding, the pattern of signs exhibited by the estimated coefficients of GAI variance covariate needs some further explanations. The heterogeneity of student quality, proxied by this co-variate, might have different effects on the dependent variables under study depending on the professor’s attitude toward risk. For instance, faced with courses of highly heterogeneous students, a risk-averse professor would relax the academic standards to allow students belonging to the lower bound of the quality distribution to exceed threshold GPA values that induce them to not withdraw from the course. Thus, relaxing academic standards would improve the distribution of grades, reduce π, which, in turn, would increase the probability of better SET ratings for the professor teaching the course. Under such scenario, π and GAI variance should move in opposite directions. Evidence points to the conclusion that this is the case prevailing at UPR-Bayamón. Thus, both variables behave as expected.

However, their policy implications are difficult to achieve. For example, other things being equal, to induce a reduction of 3.12 points in π observed in Mathematics courses, it would be necessary to admit new entrance students with GAI two standard deviations above the mean. That is, it would require recruiting students with a GAI of about 333 points. Usually, students with such credentials apply and obtain admission to programs more competitively offered by other campuses of the UPR system or by U.S. universities. In the case of GAI variance covariate, it will be difficult, if not impossible, to control it. Hence, given the institutional official admission policy (GAI), the increases in student quality required to partially offset the observed π by AFs are unfeasible. However, other things being equal, π is expected to diminish by 8.68 points just for the simple reason that the course will be during the summer session. There-fore, the structure of incentive mechanisms prevailing among faculty members and students during summer sessions deserves further research

Two other student characteristics that could contribute to ex-plaining the variance around the dependent variables of the models are the private high school and female student proportions. Both proportions significantly vary among programs. For the full sample, they are equal to 47% and 53%, respectively. However, for Office Systems, the figures are 32% and 94%, respectively. On the other hand, the respective female proportion in programs such as Education and Biology are 85% and 71%, but in Electronics, it is only 7%. Therefore, students are not randomly distributed among programs.

The female proportion covariate is highly significant in all the models. Other things being equal, increases of one standard de-viation on it will be associated with reductions of 1.57 and 0.89 points in π and σ2 (W), respectively. Meanwhile, it is expected that CS and CK increase by 0.18 and 0.82 points, respectively. On the other hand, increases of the same magnitude in the proportion of private high school students will induce a decrease of 0.14 points in π, as well as increases of 0.02 and 0.08 points in CS and CK, respectively. The coefficient is statistically insignificant in the case of Model 2 (σ2 (W)). Thus, to the extent that both proportions tend to increase, π decreases significantly. Given that the control of both variables is beyond institutional reach, there is no space to use them as a policy mechanism design.

The inclusion of a set of forty time-varying dummies, which uses the first term as the reference group, allows us to capture the effect of time on the dependent variables of the models. The purpose was to evaluate whether the estimated models might mimic the growth path exhibited by the key parameters depicted in Figure 2. Although Table 7 does not report the estimated coefficients, a significant proportion is statistically significant and exhibits the expected pattern of signs in all models. For instance, nineteen out of 40 (48%) of the estimated coefficients of Model 1 and 25 out of 40 (62.5%) of Model 2 were significant, and their pattern of signs is the expected one (negative), according to Figure 2. On the other hand, the respective proportion for Models 3 and 4 is 65% for each one (26/40), and the pattern of signs is the expected one (positive), according to Figure 1. Thus, the time-varying coefficients of the four estimated models mimicked the exhibited growth path of the dependent variables very well.

Summary and Conclusions

Using a rich panel containing detailed information on the 39,337 courses offered during forty-one consecutive terms, this study analyzed the distribution of course withdrawals and its key moments at the UPR-Bayamón. Overall, the fit of the estimated models is very good. Evidence shows that courses, faculty, and students’ characteristics exert a strong and significant influence on π, σ2 (W), CS and CK. UFH, captured through random- and fixed-effects models, explains a significant proportion of the variation observed around the dependent variable of each estimated model. Empirical evidence does not allow rejection of the symbiotic relationship between faculty members and students, conjectured in the literature, since the estimated coefficients of the SET co-variate were highly significant and exhibited the correct pattern of signs in all models. That is, π and σ2 (W) tend to decrease, while CS and CK tend to increase for the simple reason that the SET was conducted in the course.

A similar result was observed in the case of summer covariate. Its estimated coefficients were highly significant in all estimated models. As discussed previously, π is expected to diminish by 8.8 points if the course is offered during the summer session. However, under the unlikely scenario that the institution would be able to recruit new entrant students with a GAI two standard deviations above the mean (GAI about 333 points), π would decrease by only 3.1 points. Hence, offering a Mathematics course during the summer session would have an expected effect on π equivalent to ad-mitting new entrant students with GAI 5.64 standard deviations above the mean, which is impossible. Therefore, the signs and significance of the coefficients of GAI, SET, summer covariates, and UFH have important implications for the institution’s academic policy mechanism design. Empirical evidence points to the conclusion that at UPR-Bayamón, there exists an environment where faculty members and students engage in a shopping-around process where both parties improve their well-being at the expense of re-ductions in academic standards and the quality of the education provided. Under such a scenario, it might be possible to explain the contradictions observed in the institution where, even though the indicators of student quality are consistently decreasing over time, the GPAs are increasing and π is decreasing simultaneously

Material suplementario

References

Adams, J. L., & Becker, W.E. (1990). Course withdrawals: a probit model and policy

Bound, J., Lovenheim, M. F., & Turner, S. (2007). Understanding the decrease in college completion rates and the increased time to the baccalaureate degree (Research Report No. 07-626). Population Studies Center, University of Michigan. https://users.nber.org/~confer/2006/HIEDs06/bound.pdf

Bratsberg, B., & Rogeberg, O. (2018). Flinn effect and its reversal are both environmentally caused. PNAS, 115(26), 6674–6678. https://doi.org/10.1073/pnas.1718793115

Chow, Y.-L. (1989). Statistical analysis for business and economics. Elsevier.

Dilts, D. A. (1980). A statistical interpretation of student evaluation feedback. TheJournal of Economic Education, 11(2), 10–15. https://doi.org/10.1080/00220485.1980.10844949

Dunwoody, P.T., & Frank, M.L. (1995). Why students withdraw from classes. The Journal ofPsychology: Interdisciplinary and Applied, 129(5), 553–558. https://doi.org/10.1080/00223980.1995.9914927

Gump, S. E. (2007). Student evaluation of teaching effectiveness and the leniency hypothesis: a literature review. Educational Research Quarterly, 30, 55–68. https://files.eric.ed.gov/fulltext/EJ787711.pdf

Isely, P., & Singh, H. (2005). Do higher grades lead to favorable student evaluations? TheJournalof Economic Education, 36(1), 29–42. https://doi.org/10.3200/JECE.36.1.29-42

Krautmann, A. C., & Sander, W. (1999). Grades and student evaluations of teachers. Economicsof Education Review, 18(1), 59–63. https://doi.org/10.1016/S0272-7757(98)00004-1

Matos-Díaz, H. (2012). Student evaluation of teaching, formulation of grade expectations, and instructor choice: explorations with random-effects ordered probability models. EasternEconomic Journal, 38, 296–309. https://doi.org/10.1057/eej.2011.7

Matos-Díaz, H. (2014). Measuring grade inflation and grade divergence accounting for student quality, Cogent Economics &Finance, 2(1), 1–16. http://dx.doi.org/10.1080/23322039.2014.915756

Matos-Díaz, H. (2018). Estimating the effects of partial withdrawals on GPA through time: evidence from the University of Puerto Rico, EconomiA, 19(1), 90–114. https://doi.org/10.1016/j.econ.2017.10.004

Matos-Díaz, H., & García-Vázquez, D. (2014). Modeling college graduation GPA considering equity in admissions: evidence from the University of Puerto Rico. Education Policy Analysis Archives22(96), 1–33. http://dx.doi.org/10.14507/epaa.v22n96.2014

McPherson, M. A. (2006). Determinants of how students evaluate teachers. TheJournal ofEconomic Education, 37(1), 3–20. https://doi.org/10.3200/JECE.37.1.3-20

Miller, J. C. (1997). Variables affecting the decision to withdraw from Liberal Arts and Sciences courses. Community College Review, 25(3), 39–54. https://doi.org/10.1177/009155219702500304

Nelson, J. P., & Lynch, K. A. (1984). Grade inflation, real income, simultaneity, and teaching evaluations. TheJournal of Economic Education, 15(1), 21–37. https://doi.org/10.1080/00220485.1984.10845044

Quantitative Micro Software. (2009). EViews (versión 7) [Software]. IHS Global Inc. https://www.eviews.com

Raisman, N. A. (2013). The cost of college attrition at four-year colleges and universities: an analysis of 1,669 U.S. institutions. Educational Policy Institute. http://hdl.handle.net/10919/83250.

Rice, J. A. (1995). Mathematical statistics and data analysis (3ra ed.). Duxbury Press.

Rothstein, J. M. (2004). College performance predictions and the SAT. Journal of Econometrics, 121(1-2), 297–317. https://doi.org/10.1016/j.jeconom.2003.10.003

Schneider, M., & Yin, L. (2011). The high cost of low graduation rates: how much doesdropping out of college reallycost? American Institutes for Research. AIR_High_Cost_of_Low_Graduation_Aug2011_0.pdf

Schuh, J. H. (2005). Finances and retention: trends and potential implications. In A. Seidman (Ed.), College student retention: formula for student success (pp. 89–106). Praeger.

Seiver, D. A. (1983). Evaluations and grades: a simultaneous framework. TheJournal of Economic Education, 14(3), 32–38. https://doi.org/10.1080/00220485.1983.10845024

Spady, W. G. (1970). Dropouts from higher education: an interdisciplinary review and synthesis. Interchange, 1, 64–85. https://doi.org/10.1007/BF02214313

Spady, W. G. (1971). Dropouts from higher education: toward an empirical model. Interchange, 2, 38–62. https://doi.org/10.1007/BF02282469

Tinto, V. (1975). Dropout from higher education: a theoretical synthesis of recent research. Review of Educational Research, 45(1), 89–125. https://doi.org/10.3102/00346543045001089

Wollman, W., & Lawrenz, F. (1984). Identifying potential “dropouts” from college physics classes. Journal of Research in Science Teaching, 21(4), 385–390. https://doi.org/10.1002/tea.3660210406

Zangenehzadeh, H. (1988). Grade inflation: a way out. TheJournal of EconomicEducation, 19(3), 217–226. https://doi.org/10.1080/00220485.1988.10845263

Zwick, R., & Sklar, J. C. (2005). Predicting college grades and degree completion using high school grades and SAT scores: the role of student ethnicity and first language. AmericanEducational Research Journal, 42(3), 439–464. https://doi.org/10.3102/00028312042003439

Notas