Abstract
Introduction: Mexico uses hydrological models to determine floods, evaluate land use change scenarios, evaluate climate change scenarios, and define federal zones, among other applications. However, the models are rarely calibrated beforehand, which increases uncertainty in the design of structures and hydraulic standards.
Objective: To build a hydrological model for the watershed of the Fuerte River, Mexico, of extreme rainfall events occurred in 2009, 2011, 2015, 2016 and 2017.
Methodology: Five extreme rainfall events were considered for this study. The hydrologic model was design using the HEC-HMS program, and calibrated at the Tubares hydrometric station. The runoff curve number methodology and the Clark unit hydrograph were used.
Results: The results collected in four of the five events were positive; the Nash-Sutcliffe efficiency (NSE) ranged between 0.22 and 0.52. The temporal behavior of the river, at times when it moved away from the flow value, preserved the variation trend.
Limitations of the study: The study reaches the Tubares hydrometric station, Chihuahua, without including the downstream dams in Sinaloa.
Originality: There are few hydrological studies that generate a calibrated and, therefore, reliable hourly model.
Conclusions: The hourly hydrologic model had an acceptable performance in four of the five predicted events in terms of NSE and root mean square error (RMSE).
Keywords: Clark unit hydrograph, runoff curve number, time of concentration with the California Culvert Practice equation, calibration and validation of surface hydrologic models.
Resumen
Introducción: En México se utilizan los modelos hidrológicos para determinar avenidas, evaluar escenarios de cambio de uso de suelo, evaluar escenarios de cambio climático, definir zonas federales, entre otras aplicaciones. Sin embargo, rara vez los modelos se calibran previamente, lo que incrementa la incertidumbre en el diseño de estructuras y normas hidráulicas.
Objetivo: Construir un modelo hidrológico para la cuenca del río Fuerte, México, de eventos de lluvia extremos ocurridos en 2009, 2011, 2015, 2016 y 2017.
Metodología: Se consideraron cinco eventos de lluvia extremos. El modelo hidrológico se construyó con el programa HEC-HMS, y se calibró en la estación hidrométrica Tubares. Se utilizó la metodología de número de curva de escurrimiento y el hidrograma unitario de Clark.
Resultados: Los resultados obtenidos en cuatro de los cinco eventos fueron buenos; la eficiencia de Nash-Sutcliffe (NSE) osciló entre 0.22 y 0.52. El comportamiento temporal del cauce, en momentos en que se alejaba del valor del caudal, conservaba la tendencia de variación.
Limitaciones del estudio: El estudio llega hasta la estación hidrométrica Tubares, Chihuahua, sin incluir las presas que están aguas abajo en Sinaloa.
Originalidad: Existen pocos estudios hidrológicos que generen un modelo horario calibrado y, por lo tanto, confiable.
Conclusiones: El modelo hidrológico horario construido tuvo un desempeño aceptable en cuatro de los cinco eventos modelados en términos del NSE y la raíz del cuadrado medio del error (RMSE).
Palabras clave: hidrograma unitario de Clark, número de curva de escurrimiento, tiempo de concentración con la ecuación California Culvert Practice, calibración y validación de modelos hidrológicos superficiales.
Scientific article
Hourly hydrologic modeling in the upper basin of the Fuerte River, Sinaloa, Mexico
Modelación hidrológica horaria en la cuenca alta del río Fuerte, Sinaloa, México
Received: 23 November 2020
Accepted: 29 April 2021
Hydrological phenomena are complex and difficult to understand, and in the absence of exact knowledge they can be represented in a simplified form by means of the system concept (Weber et al., 2012). A system can be understood as a set of elements or components related to each other by different processes (Rodríguez et al., 2006).
Due to the need to determine the effects of climate variation, climate change, land use changes and water resource management, several hydrological simulation models have been developed (Van Liew et al., 2005). The United States Department of Agriculture (USDA, 1986) mentions that hydrological studies should be based on historical flow records, but these are rarely available and statistical analysis becomes inaccurate due to land use changes. Therefore, Aparicio-Mijares (2008) indicates that it is necessary to have methods that estimate runoff from the characteristics of the watershed and precipitation, which are known as rainfall-runoff methods.
Although there are different programs for hydrologic modeling, Estrada-Sifontes and Pacheco-Moya (2012) mention that the Hydrologic Modeling System of the Hydrologic Engineering Center of the US Army Corps of Engineers (HEC-HMS) is a flexible program that allows selecting different methods for the calculation of losses, hydrographs, and river traffic; in addition, it allows simulating at the event level and on a continuous basis. Magaña-Hernández et al. (2013) predicted with HEC-HMS, the Escondido river watershed in Coahuila, with an area of 3 242 km2, for three hydrometeorological events, which are among the few with radar data (NEXRAD from NOAA of the USA) for hourly rainfall. These authors had almost perfect Nash-Sutcliffe coefficients (NSE), which was due to the fact that, at the time of calibrating the model, they left the range in which the hydrological parameters could move very flexible. The NSE measures the deviation with respect to the unit of the ratio between the mean square error and the variance of the observations (Singh, 2017).
In the hydrological region of the Fuerte River, Mexico, there is an important infrastructure for storing and supplying water for different uses, and for generating electricity and controlling floods. However, the incidence of extreme phenomena (such as hurricanes and droughts) is present, being unpredictable and with effects that can cause very serious economic damages (Comisión Nacional del Agua [ CONAGUA ], 2015).
Therefore, the objective of this paper was to build a calibrated and validated hydrological model, at hourly scale in subbasins belonging to the hydrological region of the Fuerte River, for extreme rainfall events of 2009, 2011, 2015, 2016 and 2017.
The hydrological subregion of the Fuerte River is located in northwestern Mexico, in the hydrological region number 10 of Sinaloa. Figure 1 shows how its surface area is distributed in Sinaloa, Chihuahua, Sonora, and Durango.

In the upper part of the watershed there are Tarahumara settlements, whose poverty levels are high, and in the lower part there are three dams (Miguel Hidalgo y Costilla, Luis Donaldo Colosio and Josefa Ortiz de Domínguez) for electricity generation and irrigation purposes (Castillo-Castillo et al., 2017). The dam system supplies water to two irrigation districts (RD): DR075 El Fuerte and DR076 El Carrizo, with a total area of 231 699 and 77 657 ha, respectively (CONAGUA, 2020). In addition, according to CONAGUA (2015), this subregion is exposed to the incidence of hurricanes in summer.
Once the available information on the watershed was reviewed, it was delimited with the HEC Geo-HMS 10.2 extension of ArcGis and based on the digital elevation model of the Instituto Nacional de Estadística, Geografía e Informática (INEGI, 2019a), up to station 25009 Bocatoma Sufragio of the Servicio Meteorológico Nacional (SMN, 2019a). With the same extension, the characteristics of the subbasins were obtained: slope, river length, lag time and HMS hydrological scheme (scheme or drawing with icons). The study area was delimited up to the Tubares station, which covers an area of 17 145.02 km2. The Huites, Miguel Hidalgo and Josefa Ortiz de Dominguez dams are outside the study area, immediately downstream of the Tubares hydrometric station. Tubares is in Chihuahua, approximately 39 km from the Huites Dam, Sinaloa.
Information from automatic weather stations (AWS) was provided by the Comisión Federal de Electricidad (CFE, 2019) and the SMN (2019b). The months reported were from June to October 2009, 2011, 2015, 2016 and 2017. Twenty-four h rainfall information was also requested from the SMN (Table 1). Figure 1 shows the location and spatial distribution of meteorological and hydrometric stations.

Once the information was sorted and knowing that the series of hourly flow rates is continuous over time, the linear interpolation method was used to estimate the missing data, as long as there were no more than six continuous data. Gordon et al. (2004) mention that for short periods, linear interpolation can be used. For precipitation series, since they are not continuous (neither temporally nor spatially), the inverse of the square of the distance method was used.
The model was developed on an hourly scale, since the flow data measured by the CFE allowed it, taking into account that a large part of the watershed is operated by this institution, at least up to the dams where electric power is generated. It is important to clarify that the dams were located downstream of the calibration point (Tubares, Chihuahua) of the hydrological model.
Hourly rainfall data from eight automatic weather stations were used: four belonging to the CFE and four to the SMN. In addition, daily rainfall data from five SMN stations were used for a better spatial representation of rainfall in those areas that were not covered by the eight AWS. In the stations with cumulative rainfall values every 24 h, the proportional distribution of precipitation in the same period as the nearest AWS was used. This data management was used by Haberlandt et al. (2008).
To complete this part of the process, the spatial distribution of rainfall was determined using the Thiessen polygon method. The information on the predicted events is shown in Table 2.

The World Meteorological Organization (WMO, 2018) considers that rainfall measurement at a given point only represents a limited area, which is bounded according to the accumulation period, physiographic homogeneity of the region, topography, and the different processes that favor precipitation. Espinosa-López et al. (2020) predicted the Huaynamota river watershed with hourly data from the CFE AWS and hourly satellite rainfall images. The model with AWS data gave better results in terms of fitting coefficient at the time of calibration and validation; therefore, data from the AWS were used in the present study.
Hydrologic modeling was performed using the HEC-HMS 4.2.1 program, with the U.S. Soil Conservation Service (SCS) method to transform rainfall into runoff, and the Clark unit hydrograph (U.S. Army Corps of Engineers [ USACE ], 2000) was used to calculate the hydrographs at the outflow of the subbasins. The modeling time interval was 1 h. The importance of building an hourly model lies in the relevance of using this model as a forecasting tool for civil protection purposes and to be more certain of the time when the highest flow will occur. It is also important to have an hourly model at those points in the watershed where the runoff concentration time is less than 24 h, since otherwise the model would not be useful for forecasting the time of the maximum flow. The calibrated model, in hourly terms, would be useful for decision making.
According to McCuen (2016), the equation for estimating the runoff is as follows:
where Q is the runoff or excess rainfall (mm), P is the precipitated rainfall (mm), S is the maximum retention potential of the watershed (mm) and Ia is the initial abstraction (mm). Empirical evidence leads to the assumption that Ia = 0.2S; considering this, Equation 1 is conditioned to P ≥ Ia , otherwise Q = 0. Furthermore, the maximum retention potential is related to the curve number (CN) by the following equation:
According to USDA (1986), the runoff curve number is mainly a function of the hydrological group of the soil (A, B, C or D), type of cover and antecedent runoff condition. Curve number values were obtained from the same source for the different watershed conditions. CNA (1987) explains how to choose the hydrologic group according to the soil unit reported in the edaphological map of INEGI (2019b).
To calculate the runoff curve number, a curve number raster layer was first created using the vector data of soil type series II, and land use and vegetation series VI; both were downloaded from the INEGI digital platform (2019c). The curve number raster was performed in the HEC-GeoHMS extension, as indicated by USACE (2013) in the user manual of that extension. Once the curve numbers were calculated, the correction for slope was performed according to Neitsch et al. (2011):
where CN2S is the curve number for condition II (CN2 ) adjusted for slope, i.e., the condition when moisture content is considered average, and slp is the average slope of the watershed (%). USACE (2013) defines three antecedent moisture conditions: I dry (wilting point), II average moisture and III field capacity.
For each storm, the correction was made for antecedent rainfall accumulated five days before (Aparicio-Mijares, 2008) (Table 3). The conditions used for this purpose were: 1) correction A: when the antecedent rainfall is lower than 25 mm, 2) no correction: when the antecedent rainfall is greater than 25 and lower than 50 mm, and 3) correction B: when the antecedent rainfall is greater than 50 mm. A practical way to apply Equations 3 and 4 is shown in Table 3.

Source: Aparicio-Mijares (2008).
Synthetic unit hydrographs (SUH) can be used to calculate the maximum flow and runoff volume in a hydrograph, especially in ungauged watersheds. This study used the Clark unit hydrograph, which represents two processes of transformation of excess rainfall to runoff: 1) translation of excess rainfall from its origin through the drainage network to the outflow point and 2) attenuation of the amount of discharge once the excess is stored in the watershed (USACE, 2000). Straub et al. (2000) mention that flow attenuation can be represented as a linear reservoir, where storage is related to the outflow as follows:
where S is the storage in the watershed (m3), R is the storage coefficient (h) and O is the outflow (m3·s-1). From this relationship, the basic Clark unit hydrograph equation can be derived, where I is added as the inflow (m3·s-1) and C as the coefficient that is a function of the modeling step (h):
Hydrological models commonly use a time-related parameter, and the most used parameter is time of concentration (Campos-Aranda, 2010). Smith-Quintero and Velásquez-Henao (1995) indicate that the time of concentration is the time required for water to reach from the farthest point of the watershed to the exit point. Grimaldi et al. (2012) mention that, in practi/ce, empirical formulas are used to determine this parameter; one is that of Giandotti, which is highly used in Italy, another is that of Kirpich and that of NRCS, which are used in the USA. Table 4 shows different formulas to calculate this parameter.

Using the HEC-GeoHMS extension, delay time can be calculated with the following expression (USACE, 2013):
Where Lag is the lag time (h), CN is the curve number, L is the hydraulic length of the watershed (m) and y is the average slope of the watershed (%). Hereafter, this method will be called SCS. It is important to remember that Lag = 0.6Tc , where Tc is the time of concentration. L is the length that corresponds to the time it takes for the water to travel from the hydraulically farthest point, this in terms of distance and slope, since water takes more time to reach the exit.
There are different studies that try to find the best formula to calculate the time of concentration, such as Sharifi and Hosseini (2011) and Vélez-Upegui and Botero-Gutiérrez (2011). The events with different times of concentration were predicted in this study, and the one with the best fitting coefficients in an initial modeling was chosen, specifically the one with higher NSE coefficients and lower root mean square error (RMSE) values for the same event.
Regarding the Clark unit hydrograph (Equation 5), the USACE (1967) recommends using a storage coefficient (R) with a value of 0.8 times the time of concentration. Zimmermann (2003) mentions that, in the absence of flow data, formulas such as R = cTc should be used, where c varies from 0.5 to 0.8. Magaña-Hernández et al. (2013) used a c of 0.75 when modeling a watershed in northern Mexico. This study used a value of 0.75 times the time of concentration.
Streamflow was determined using the Muskingum method (Equation 9). According to Karamouz et al. (2013), in that method the outflow hydrograph in the lower part of a river is calculated for a hydrograph determined in the upper part of the river.
where K is the travel time constant, and X is the weighting factor that varies from 0 to 1. According to Aparicio-Mijares (2008), K is equal to:
where L is the length of the section (m) and ꙍ is the wave velocity (m·s-1). Aparicio-Mijares (2008) mentions that K can be related as 1.5 times the mean water velocity in the channel. This study considered a reference velocity of 2.8 m·s-1, which is higher than the average velocity of natural channels not well defined on slopes of 12 to 15 % (2.4 m·s-1) according to Aparicio-Mijares (2008). The same author considers that, in the absence of information, it is recommended to use a value of 0.2 for the weighting factor. According to Heggen (1984), the value of X commonly varies from 0.2 to 0.3; therefore, the value used was 0.2.
Finally, a self-calibration was performed in the 2009 event, which allowed a maximum fit of 20 % in curve numbers, concentration times and, therefore, in the storage coefficient R. This was carried out with the optimization trial mechanism in HEC-HMS. Subsequently, validation was performed on the 2011, 2015, 2016 and 2017 events. Calibration and validation were carried out in three hydrometeorological stations of the CFE: Urique, Guerachic and Tubares; however, it was not successful in the first two (corresponding to runoff from subbasins of the upper part), so fitting in the Tubares station was considered as a reference. It is worth mentioning that the optimization trial mechanism follows the procedure shown in Figure 2, where the univariate gradient method was selected, and the objective function was evaluated with the NSE coefficient.
According to USACE (2000), the univariate gradient algorithm performs consecutive corrections to the estimated parameters. If xk is the parameter estimated by the objective function f(xk) at iteration k, the search defines a new xk+1 at iteration k + 1 as:
where Δxk is the parameter correction and should be such that the estimates tend toward the minimum value of the objective function. The objective function can be the goodness-of-fit measured with the NSE coefficient, RMSE or others.
The most used indicators to assess model fitting are NSE coefficient, RMSE, coefficient of determination (R2 ) and mean squared error (MSE) (Vargas-Castañeda et al., 2015). The present study used NSE and RMSE. No reference of a good hourly hydrological model was found. Moriasi et al. (2007) present a general classification of recommended statistics to evaluate hydrological simulations with monthly step size, which is divided as follows: very good 0.75 < NSE ≤ 1, good 0.65 < NSE ≤ 0.75, satisfactory 0.50 < NSE ≤ 0.65 and unsatisfactory NSE ≤ 0.50. These authors also mention that values between 0 and 1 are considered acceptable levels of fit, since an NSE > 0 means that, at least, the model is a better predictor than simply using the arithmetic mean. Moriasi et al. (2007) show a general classification of recommended statistics for evaluating hydrologic simulations with monthly step size, which is divided as follows: very good 0.75 < NSE ≤ 1, good 0.65 < NSE ≤ 0.75, satisfactory 0.50 < NSE ≤ 0.65 and unsatisfactory NSE ≤ 0.50. These authors also mention that values between 0 and 1 are considered acceptable levels of fit, since an NSE > 0 means that, at least, the model is a better predictor than using the arithmetic mean.
Figure 3 shows the subbasins corresponding to the modeling carried out, and their characteristics are shown in Table 5. The subbasins, up to the Tubares hydrometric station, have an average slope of 40.47 % and an average curve number of 79.04.


Concentration times calculated with different formulas are shown in Table 6. The maximum values are those calculated with the Giandotti and California Culvert formula, and the minimum values are those obtained with the SCS formula. Regarding the above, the events were predicted using these three times of concentration.

Table 7 shows the NSE coefficients derived from different concentration times. In 14 predicted events a better fit was obtained when the California Culvert method was used, three were better with SCS and two with Giandotti. The above coincides with the RMSE estimated, so the time of concentration calculated with the California Culvert Practice formula was used in this study.

The values calculated in the 2009 event calibration are shown in Table 8, which were applied to each event and resulted in the curve numbers shown in Table 9.


Fittings on the curve numbers showed three cases: 1) decrease in eight subbasins (two of them runoff directly into the Guerachic station and the others directly into Tubares), 2) four subbasins had no changes and 3) two subbasins had direct runoff into Tubares, and there was increase in the Batopilas subbasin, in two subbasins of Guerachic and in three subbasins of Tubares. On average, the decrease was 1.76 % and the increase was 7.63 %.
With respect to time of concentration, an average decrease of 9.56 % was recorded in 15 of the 18 subbasins, and an increase of 5.04 % in three subbasins (Batopilas, one in Tubares and one in Guerachic), which represent 12 % of the total study area.
The NSE coefficients and RMSE for calibration and validations are shown in Table 10. Figures 4, 5, and 6 show three of the five predicted events. The results show an adequate fit at the Tubares station, with coefficients above zero. A negative value was obtained in the 2016 storm; however, the predicted hydrograph is temporally similar to that measured (Figure 6).




The maximum flow rates measured and calculated are shown in Table 11. To contrast the visual interpretation (Figures 4, 5 and 6) and the model performance efficiencies (Table 10), Figures 7, 8, 9 and 10 show the behavior of that measured versus that predicted flow rates of the 2009, 2015, 2016 and 2017 events. Although the 2016 event had an NSE of -0.28, Figure 9 shows that the R2 is 0.59. This is important because if there were no error between that measured and predicted the trend of this relationship would be a straight line; therefore, the modeling of that 2016 event is not as negative as the NSE would suggest.

Figures 7, 8, 9 and 10 show the goodness-of-fit for three hydrometeorological events from 2009, 2015, 2016 and 2017, respectively.




The model fit would be expected to have a relationship with the maximum flow measured; however, the results show no influence. For example, in the calibration of the 2009 event, an NSE coefficient of 0.42 was obtained for a maximum discharge measured of 1 283.70 m3·s-1 (Figure 7), while the 2015 validation had an NSE of 0.52, corresponding to a maximum flow measured of 506.50 m3·s-1 (60.54 % lower) (Figure 8), and the 2017 event, with a flow of 1 499.90 m3·s-1 (16.84 % higher), had an NSE of 0.22 (Figure 10). Moriasi et al. (2007) consider, for a calibration and validation on a daily scale, a value greater than 0.4 as satisfactory.
If we compare the NSE of 0.4 of this study with the 0.95 of the hourly hydrological model of Magaña-Hernández et al. (2013), for the Escondido river watershed, Coahuila, it would seem that this study is at a disadvantage. However, in the Escondido river model, too much correction margin was given to the hydrological parameters at the time of calibration. That is, the hydrological parameters were allowed to vary up to any value and at any level of increase or decrease, regardless of the reality of their values, which gave them an almost perfect model. Another aspect is that in that study they had some radar images of rainfall provided by the USA due to the proximity to Texas. It is convenient to improve the hourly hydrological modeling due to its importance in civil protection and in those points where the time of concentration of the watershed is less than 24 h. However, modeling should be done preserving the principles of the modeler representing the reality it intends to simulate.
The visual analysis of the Tubares station shows temporal similarity between the predicted and measured hydrographs. Considering the coefficients obtained, it can be said that the model found at the outlet of the Tubares subbasin represents, at an acceptable level (Moriassi, et al, 2007), the hydrometeorological phenomenon in the study area, because it shows a satisfactory fit in calibration and in one of the validations.
An attempt was also made to calibrate the model at two upstream hydrometric stations but was unsuccessful. One possible explanation is that rainfall varies more spatially in mountain areas. Moreover, WMO (1972) recommends one station per 100 to 250 km2 for mountainous areas, and in the study watershed there are 14 stations (eight AWS and four pluviometric stations with daily data) distributed over 17 145 km2; that is, one station covers an average of 1 224.64 km2, which leads us to think that rainfall may not be well represented in this area.
On the other hand, good fitting at the outlet is attributed to two aspects: 1) the transit of the flow from the upstream stations until reaching the outlet and 2) the contributions of 11 subbasins (lower part), which are not calibrated individually (as is the case of the upstream subbasins Urique and Batopilas), but up to the Tubares station, which allows compensating the bad fitting caused by the contributions of the upstream subbasins.
Haberlandt et al. (2008) mention that simple hydrological models provide an objective global estimate, but at the cost of smoothing the hydrograph, which leads to underestimation or overestimation; this is observed in the hydrographs measured in the present study. Perhaps this model could be improved by considering a distributed model that uses information from satellite images of hourly rainfall and that is calibrated with data from the CFE. Another improvement to the model would be to generate measured unit hydrographs, instead of the synthetic Clark unit hydrographs used in this study.
Although one might believe that it is not worthwhile to build hourly hydrological models in Mexico because there is not enough measured rainfall and hourly flow data, it is important to prepare for the future and strive to develop a sufficient and reliable hydro-meteorological network.
The hydrologic model developed at the outlet of the Tubares watershed using the curve number method to calculate losses and Clark's synthetic unit hydrograph method for hydrographs, is acceptable based on the Nash-Sutcliffe coefficients.
The time of concentration that yielded the best modeling results is the one calculated with the California Culvert Practice formula, which overestimates the correct value by about 10 %. An alternative to improve modeling could be the use of an aggregated model that uses information from satellite or radar images to estimate rainfall. It is also recommended to model individually the upper subbasins with different methods than those used in the present study, for example, a different unit hydrograph or a different spatial distribution of rainfall. All this to obtain a general model that properly represents hydrology in the entire watershed.
The authors thank the Consejo Nacional de Ciencia y Tecnología (CONACyT) for funding this study, the Comisión Federal de Electricidad and the Servicio Meteorológico Nacional for providing hydrometeorological information from their databases
*Corresponding author: libanezc@chapingo.mx, tel. 595 12 55 581.



















