Tweedie Model for Predicting Factors Associated with Distance Traveled to Access Inpatient Services in Kenya

Aim . This study aims to examine which factors inﬂuence the distance traveled by patients for inpatient care in Kenya. Methods . We used data from the fourth round of the Kenya Household Health Expenditure and Utilization survey. Our dependent variable was the self-reported distance traveled by patients to access inpatient care at public health facilities. As the clustered data were correlated, we used the generalized estimating equations approach with an exchangeable correlation under a Tweedie distribution. To select the best-ﬁt covariates for predicting distance, we adopted a variable selection technique using the QIC u and R 2 criteria, wherein the lowest (highest) value for the former (latter) is preferred. Results . Using data for 451 participants from 47 counties, we found that three-ﬁfths were admitted between 1 and 5 days, two-thirds resided in rural areas, and 90% were satisﬁed with the facilities’ service. Wealth quintiles were evenly distributed across respondents. Most admissions (81%) comprised < 15, > 65, and 25–54 years. Many households were of medium size (4–6 members) and had low education level (48%), and nine-tenths had no access to insurance. While two-thirds reported employment-based income, the same number reported not having cash to pay for inpatient services; 6 out of 10 paid over 3000 KES. Thus, diﬀerences in employment, ability to pay, and household size inﬂuence the distance traveled to access government healthcare facilities in Kenya. Interpretation . Low-income individuals more likely have large households and live in rural areas and, thus, are forced to travel farther to access inpatient care. Unlike the unemployed, the employed may have better socioeconomic status and possibly live near inpatient healthcare facilities, thereby explaining their short distances to access these services. Policymakers must support equal access to inpatient services, prioritize rural areas, open job opportunities, and encourage smaller families.


Introduction
Inpatient care is a case wherein an individual is hospitalized for more than 24 hours and reflects a more serious health complaint. An estimated 1.2 million Kenyans required these services in 2013. is number is predicted to increase exponentially in the upcoming decade [1].
Among those seeking care, various factors are key in predicting the distance traveled to access healthcare facilities. For example, differences in wealth may determine the distance traveled. Wealthier individuals are more likely to possess the capabilities to access any facility of their choice, compared to those in the lower quintiles. e former can better afford to pay the healthcare costs through out-of-pocket or pay insurance premiums that can assist in securing admission at the required facilities in case of inpatient care need. In contrast, those in the lower-income quintiles have fewer options regarding the type of healthcare facility they can access due to financial constraints.
Government healthcare facilities have previously been a choice for many who are seeking inpatient care because they offer the needed care at affordable cost. However, despite being preferred among the majority population, such inpatient facilities precisely for inpatient care remain inaccessible [2].
To mitigate this, the Kenyan government has made efforts to increase the number of inpatient care facilities. is includes building new hospitals or upgrading the lower cadre facilities that are currently offering outpatient care.
is is achieved by purchasing equipment required for operating an inpatient service, as well as deploying personnel by hiring more medical and nonmedical staff to manage these facilities [3].
For a developing country like Kenya with a struggling economy and whose health budget, compared to the total government revenue (7%) [4], is still slightly below halfway in meeting the 15% Abuja declaration [5], this implies that the number of facilities will continue to decrease. One major challenge is expensive and sophisticated medical equipment that is often not locally available and must be imported [6]. Another challenge is ensuring that the machines get uninterrupted electricity and having a good road network to access such healthcare facilities [7].
A pandora box opens when we explore electricity distribution in Kenya: the latest numbers from the Kenya Power and Lighting Company [8] indicate that more urban areas (77.6%) have electricity connections than rural areas (48.36%). Moreover, urbanites have better road networks than ruralites [7]. Conversely, the rural population (15 million) was twice that of the urban (33 million), according to the 2019 Census [9]. is implies that road networks and electricity coverage in urban areas are denser than those in rural areas. erefore, inpatient facilities are more likely to favor urban than rural areas [6]. us, ruralites will be forced to travel to urban areas to access the required healthcare services.
Facility categorization in Kenya is based on the type of service they provide, with those offering inpatient care categorized from levels 3 to 6 [10]. According to the 2020 Economic Survey report, there was only a 3.2% increase in the number of government-owned facilities with inpatient care from 2015 to 2019 [3], compared to the approximately 10% increase in the population (5 million Kenyans) within the same period [11]. us, the facilities are seemingly already overstretched. In fact, according to the 2019 Kenya Master Health Facility List, the national average inpatient bed density was 13.3, or half of the targeted density of 25 [12].
Long distances remain the most prominent physical obstacle that has led to the underutilization of inpatient care services in the country. Data suggest low access to any physical health facility coverage in Kenya at one facility per 50-200 km [13]. is has been accompanied by an increase in poor health outcome: more hospitalization days in a facility due to an increase in the severity of a condition and nonattendance at follow-ups [14], including at fatalities [15]. e influence of distance on access in such scenarios includes the following aspects: (1) the extensive distance to be traveled to access a facility itself and (2) extensive distances being costly for a patient in need.
Distance, in general, when accessing healthcare facilities has been investigated [16,17]. However, the influence of distance in accessing inpatient care is unclear. rough a literature review, we found studies focusing on women seeking delivery services and children below the age of five years that need hospital admissions. Notably, there has been an increase in the number of women seeking such services (46% delivered within a public facility), with 6 out of 10 availing the services at the facilities. is reduced maternal mortality rate (362 maternal deaths per 100,000 live births) and neonatal mortality (21 deaths per 1000 live births) in Kenya in 2014. 22% of the women aged 15-49 reported that distance was the most serious problem in accessing health care [18].
Studies on distances for inpatient care have found that mothers in a rural coastal area in the country were still delivering at home due to long distances to access inpatient facilities [19]. e same was observed in a rural western setting in another study [20]. ese sets of authors reported that long distances to facilities were a hindrance to access.
Kukla et al. [21] examined children suffering from diarrheal illness and requiring inpatient care in Kenya and found out that there were challenges in accessing public facilities due to long distances and high transport cost. is implies that, for those living far away from public facilities, their access to care is determined by their financial capabilities.
Mochida et al. [22] investigated access for inpatient care for children suffering from malaria, the most lethal killer disease in Sub-Saharan Africa [23]; the delay in seeking treatment affects disease management, progression, and outcomes. e authors found that distance played a critical role in deciding on whether to utilize a healthcare service. Further, they established that those who need to travel longer distances were less likely to use the facilities. is situation contributed to morbidities and worse-case mortalities among children. Such studies provide evidence of a distance decay in the utilization of healthcare facilities [24], with those far away associated with under-use and those closer with better use.
In summary, most studies have focused on mothers' and children's access to healthcare facilities and others focused on general access, combining both inpatient and outpatient care. us, a research gap exists on inpatient access to public facilities in the general population in Kenya.

Methodology.
Tweedie distributions are members of the exponential dispersion models (EDMs) [25] defined as follows: where ϕ > 0 is the dispersion parameter and θ is the natural parameter. e mean and variance for the random variable Y are given by E(Y) � μ � c ′ (θ) and Var(y) � c ″ (θ)ϕ, respectively. Tweedies, as a special case of EDMs, have a meanpower variance relationship expressed as Var(μ) � ϕμ p such that p ∉ (0, 1). Tweedie distributions have been widely applied in modeling nonnormal response data with a discrete mass of zero because they can incorporate skewness without any data transformation. Most methods suggested in the literature for the analysis of such data mainly consider data 2 Journal of Probability and Statistics transformation [26], two-way analysis [27], and Bayesian methods [28]. However, these methods are not efficient for our approach because of the correlational nature of our data within clusters. Clustered data naturally have depedence due to the reason that they share common property [29]; for example, people who come from the same region have shared culture and traditions and mostly share common medical facilities.
Approaches for analyzing nonnormal data in the generalized linear model framework may ignore the correlation among subjects who belong to the same cluster. Our data were clustered by counties; thus, an exchangeable correlation structure is the most flexible in modeling the association through the generalized estimation equations (GEE) approach. Notably, both independent [30] and autoregressive (AR) (1) [31,32] correlation structures under nonnormal responses have been considered. e independent structure assumes a weak correlation among the subjects in a cluster, and the number of clusters is small. Meanwhile, the AR (1) assumes a time dependence order for association in a repeated measure. Such measurements taken far apart have a weaker correlation compared to those that are taken closer.
Previously mentioned evidence alluded to the influence of covariates on distance to healthcare facilities. However, a research gap on the selection of the best-fit covariates remains.
is study aims to determine the combination of covariates that influence the distance that a Kenyan citizen will travel to seek inpatient care. is also extends the application of the Tweedie distribution to understand the influence of a given set of covariates on distance.

Data.
A cross-sectional survey to determine healthcare expenditure for both inpatient and outpatient for Kenyans was conducted from 9 April to 19 May 2018 across the 47 counties. A multistage sampling design was employed such that the stratified cluster sampling selected 1500 clusters spread evenly through the country. In the second stage, 25 households were uniformly sampled within each cluster, with 527 (923) located in urban (rural) areas.
e Kenya Household Health Expenditure and Utilization Survey (KHHEUS) questionnaire [2] was designed by a technical working committee comprising members drawn from the Ministry of Health (MoH), the World Bank (WB), and the Kenya National Bureau of Statistics (KNBS). is was then administered to all sampled household through computer-assisted personal interviews designed using CSpro software [33].
A total of 37,500 households were sampled, with an overall response rate of 95%. ere were 4822 individuals who sought inpatient care in the country, with 55.49% seeking care in government-owned facilities. Complete information was available for only 451 individuals and is used here. e missing mechanism was mostly Missing Not at Random (MNAR); therefore, no imputation was performed to the missing values as it may result in biased estimates [34].

Selection of Variables.
e inpatient data from the KHHEUS were obtained, cleaned, and coded. We investigated several covariates against our dependent variable, that is, the distance to a health facility traveled by Kenyans seeking inpatient services. Variable selection was based on the descriptive report of KHHEUS 2018 [2] and the Demographic and Health Survey 2014 [18], as well as published literature [17,[35][36][37][38][39][40][41].

Dependent Variable.
Our dependent variable was the distance traveled by people to access inpatient care facilities. e values ranged from minimum of 0 km (for those living within reach of the facilities) up to a maximum of 700 km (those living far from the facilities). Zero, which is a nonnegative response, is often common because of nontravelers, while the positive responses, which are right skewed, are due to users.
Education level was categorized using the methods from Rippin et al. [43]: those who never went to school (those under 3 years of age and who responded "Never went to school") or have low-level education (pre-primary, primary, and informal (madrassa)), intermediate-level education (secondary, vocational, and college), and higher-level education (university degree or higher). Age groupings for employment followed those defined by the Organisation for Economic Cooperation and Development (OECD): those aged below 15 and above 65 who are considered as unable to work; those aged 15-24 who are entering the labor market after an education; those aged 25-54 who are in their prime working years; and those aged 55-64 who are passing the peak of their career and approaching retirement [44].
We divided household sizes into three categories: small (1-3 members), medium (4-6 members), and large (7 + members). Access to insurance was classified as yes or no, while employment status (i.e., formal, informal/casuals, or self-employed) was classified as either employed or unemployed.
Amounts paid for healthcare were categorized into three groups: low (1-3,000 KES), medium (3,001-10,000 KES), and high (10,000 + KES). e cost included the amount paid for the service received at a facility. Further, satisfaction was categorized into two: satisfied and not satisfied. Finally, if they had cash to pay for the service, it was categorized into either yes or no.

Tweedie Distributions in Modeling Distance for Inpatient
Care. We used a Tweedie distribution to model the predictors of distance for inpatient care. e justification to use the Tweedie distribution is provided in Figure 1: distance, as the dependent variable, has several characteristics that are interesting to model including (1) positive skewness and nonnormality of the distance, (2) the range of positive distance, and (3) excess outliers for those who covered more distance than the average (Table 1).
We analyzed the distance with the covariates to determine the best combinations to explain any existing association.
Let R i be the distance recorded for a Kenyan traveling to seek inpatient care. We can assume that the distances traveled within a county during the survey period N follow a Poisson distribution with mean λ so that the following holds: if there is no distance covered, then N � 0; Y represents total distance covered and equals the Poisson sum of the gamma random variables such that Y � R 1 +, . . . , +R N . e resulting distribution is called the Poisson gamma and is explained as follows.

Notations.
Let y it be a vector of responses with a set of corresponding r covariates X it , where i indexes K units of analysis i � 1, 2, . . . , K; and t indexes the time points t � 1, 2, . . . , n i for each unit. us, the number of clusters observed is K. Further, N � n i and is the total number of observations across all units. For inclusion of an intercept, the first element of X it is set to 1.
Further, let y i � [y i1 , y i2 , . . . , y in i ] denote the corresponding column vector of observations on the response variables for unit i and X i � X i1 , X i2 , . . . , X in i indicate the n i × r matrix covariates for unit i.
In our case application to model distance data, we define the following: (i) e response variable y it which is the distance traveled in a given county by any given individual seeking inpatient care. (ii) With data available for all 47 counties, then K � 47 for i � 1, . . . , 47. (iii) e linear predictor. (iv) A link function used to relate the response to the linear combination of the covariates as g(μ it ) � η it .

Exchangeable or Symmetrical Correlation for Imbalanced
Data. Accounting for correlation in any model is a key principle to avoid biased results. Observations from an individual tend to be correlated when referring to people in the same cluster. erefore, to let our results be more efficient, we need to specify the correlation structure described by Liang and Zeger [45]. Our preferred correlation structure is the exchangeable or symmetrical correlation structure which assumes that there is a common correlation (R i (α)) within the observation in a given county, such that R u,v � 1 u � v α otherwise . Following Hardin and Hilbe [46], the exchangeable correlation structure uses the Pearson residuals from the current fit of the model to estimate the exchangeable correlation parameter, An important property of Tweedie under GEE is its ability to accommodate both the correlation and the right skewness, which is a characteristic of our continuous data. We use this approach here. is complements the work of Swan [31], who used the AR (1) correlation structure. Tweedie regression models allow relation of the mean of distance to the selected covariates. is allows the mean of distance to be modeled as a linear function of covariates using the log link given by where β j vectors are regression coefficients that correspond to the x j vectors of covariates, all fitted based on Tweedie EDM. e means are calculated to assess the relationship between covariates and the distance.
To fit the models, we need to estimate the index parameter p and the βs from the Tweedie distribution using the GLM framework. is can be computationally difficult, but the R packages Tweedie [47] and statmod [48] facilitate a flexible fit. e estimated index parameter with the 95% CI, using a set of covariates by employing cubic spline smoothing interpolation, is shown in Figure 2.  To estimate the βs, we apply the approach of McLuggar and Nelder [49] referred to as the iterative reweighted least square (IRLS) method in the GLMs. ese β s are then updated under the GEE framework.
Following Hardin and Hilbe [46], the quasi-likelihood estimating equations for GLM without any preference for mean and variance function with 47 clusters (counties) are expressed as follows: for k � 1, 2, . . . , 47, where T is a diagonal matrix of derivatives zμ i /zη i . V(μ i ) is an n i × n i diagonal matrix which can be decomposed into e observations in the clusters are treated as independent, but our focus is on the population average for which both the mean and the variance are averaged over all clusters. Following Liang and Zeger [45], the identity matrix in the above is replaced with a more general correlation matrix, since the variance matrix for data which are correlated does not have a diagonal form, as follows: where the correlation matrix CM i (α) is estimated by vector α. Proper specification of the CM i (α) means that the β are consistent and asymptotically normal. To achieve more efficiency, it is necessary to include a hypothesized correlation structure within the clusters. e structure that is suitable for our clustered data without any time dependence is the exchangeable given by the following: We explored the regression model fit to select the best model using the quasi-likelihood based information criterion (QIC u ) proposed by Hardin and Hilbe [46], which is an extension of the QIC proposed by Pan [50].
e QIC u imposes a penalty based on model complexity to ensure that only a few covariates are used to achieve model parsimony.
is criterion is used as a guide for selecting models when no scientific knowledge is available for the researcher to rely on. Once the values are computed for different models, the one with the lowest values is usually considered.
Finally, Table 2 shows the deviance and degrees of freedom for the residuals, considering logarithmic and canonical link functions. e logarithm link function produces the lowest residual deviance (5251.198) against canonical deviance (5280.799).
is indicates that it was the most suitable link function to use when modeling distance to access inpatient care in Kenya. R statistical software version 3.6.3 [51] was used in the analysis.

Results
Exploratory analysis of the data is presented in Table 3, from which we select the covariates that predict distance traveled to access inpatient care. Overall, complete information was available for 451 patients across 47 counties in Kenya who visited public health facilities in 2018 and were admitted more than 24 hours. Furthermore, 3 out of 5 patients were admitted between 1 and 5 days, two-thirds resided in rural areas, and 90% were satisfied with the service received at the facility. e wealth quintiles were evenly distributed across the respondents. Most admissions (81%) comprised < 15, > 65, and 25-54 years. e majority of households were of medium size (4-6 members) and had lower education (48%), while 9 out of 10 did not have access to insurance.
Two-thirds of members reported income through employment. e same number reported not having cash available to pay for the inpatient service, with 6 out of 10 paying over 3000 KES for the hospital bills and more than  three-quarters of all inpatient cases sought services in level 4 hospitals. Table 4 presents 10 competing models for distance that demonstrates the best-fitting model with the lowest QIC u .
We added the covariates into the model and computed their QIC u and R 2 . We then removed the covariates one by one and checked whether the changes improved the model fit.
Model 7, with covariates ability to pay, employed, and household size, was selected as the most parsimonious as it had the lowest value for QIC u . Considering the intercept, the model shows that, on average, a Kenyan seeking inpatient care traveled 22.04 km in 2018. e model with covariates of choice can be expressed as follows: where μ is the expected distance traveled to access inpatient care. In this model, the amount paid for healthcare (medium and high) equals 0 or 1 depending on which category is being assessed. e low group is the reference category. Employment equals 1 if the respondent is employed. Finally, household size (medium and large) equals 0 or 1 depending on what is being assessed. is model resulted in ϕ � 6.12, α � 0.045, R 2 � 9.96%, and QIC u � 13158.23, with p � 1.64, 95% CI (1.59, 1.68). To interpret the coefficients which are captured in logarithmic form, we need to take the exponential.
Compared to those who paid the lowest amounts for healthcare (1-3,000 KES), citizens in the middle pay category (3,001-10,000 KES) traveled 1.24 times the distance to a healthcare facility, whereas those who paid the most traveled 3.40 times the distance. e employed traveled half the distance to a healthcare facility for inpatient care than the unemployed (0.59 times). Compared to small households (1-3 members), medium-sized households (4-6 members) traveled 1.096 times the distance to a healthcare facility, while the largest (7+) traveled 1.60 times the distance compared to small households.
We calculated probabilities based on Dunn and Smyth [52] to demonstrate the applicability of the Tweedie distributions in modeling the distance traveled to access inpatient care. When 1 < p < 2, then the Tweedie parameters (μ, p, ϕ) can be parameterized to Poisson and gamma parameters (λ, c, α) which we use for estimation. ese are given in the following equation: where λ is the average distance traveled, c is the shape, and αc is the mean. Considering our best fitting model, the parameter index p is 1.64, μ � 22.04, and ϕ � 6.12. When we reparametrize to gamma and Poisson, it gives the predicted mean cost spend per month calculated as and c � 6.12(1.64 − 1)22.04 (1.64− 1) � 28. Finally, e mean distance to access inpatient care is αc � 0.56 * 28 � 15.75.
Following Dunn and Smyth [52], the probability of incurring zero cost on outpatient care by the household (in other words, the probability of not seeking outpatient care) is given by the following: . (14) e probability of households covering zero distance is given by exp(−1.38) � 0.25, meaning that 25% of household is within reach to an inpatient facility.

Discussion
is study identified factors associated with the distance travelled for inpatient care in Kenya. e results indicate that only one-quarter (25%) of the Kenyan population is within reach of inpatient services. Healthcare costs at the facility, household sizes, and being employed were predictors of distances. ese results should be interpreted with caution because the data were MNAR, wherein no sufficient imputation method would be appropriate for imputing such a problem [34].
e amount paid at the facility can be a good indicator for how far a person traveled for inpatient care. Here, higher costs at a facility were associated with longer distances traveled.
is implies seeking a service that may not be available at a closer facility. When a healthcare provider at a low-level hospital refers a patient to a higher-level hospital due to a lack of skills or equipment, this is referred to as a referral [53]. Complex cases call for referrals, and the higher facilities are few and typically found in major urban areas. A study in Asembo, Kenya, found that more children with severe illnesses traveled further [24] as facilities to treat their illness were not available.
Most complex cases become worse with time. is implies that if a case is handled early before it progresses, then it can be averted. A case example is cervical cancer, which is preventable through vaccination and screening once detected early [54,55]. However, acquiring diagnostics and treatments in Kenya is impossible due to long travel distances [56]. is indicates that conditions may deteriorate, cost more in later treatment, and lead to impoverishment of the already poor households [57]. In Kenya, most referrals to far facilities were observed in rural (10.4%) versus urban areas (7.1%) [2]. is is because the facilities that can handle these complexities are mostly found in urban areas, wherein infrastructure such as roads [7] and services like electricity [6] exist.
However, being employed helps reduce the hindrance to access inpatient service, similar to the finding by Allin et al. [58]. e majority who have jobs likely live in urban areas or places where social amenities such as water and electricity are typically available [59]. In these areas, one may expect inpatient facilities. Studies have shown that health insurance, which cushions people against catastrophic spending, is mostly acquired by the employed. is indicates that the employed enjoy the double benefits of living close to the facilities and incurring minimum expenses when they seek care. ey can also afford the fee as they have income, which indirectly influences access to inpatient care as reported in Kenya [19,20], Ghana [60], Zambia, and Malawi [61].
Members from large families are more likely to travel long distances to access inpatient health facilities. ese facilities are mostly found in urban areas [62,63]. However, large families are more associated with poverty [64,65], which influences healthcare utilization due to access capability. Further studies have shown that large family sizes are poor purchasers of insurance [66], thereby exposing large families to catastrophic spending.
Healthcare system performance can be assessed according to the healthcare service distribution, access, and utilization [67]. Access is mostly determined by cost and distance. us, irrespective of the availability of a service in a hospital, if a service is not utilized by the target group, its full utility cannot be actualized. Kenya's health system continues to suffer due to inadequate infrastructure and personnel in its healthcare system. Due to an insufficient number of personnel [68], specialized doctors from Cuba [69] were invited to fill in this gap in main county referral hospitals (level 4). is will strengthen primary health care and facilitates achieving Universal Health Care (UHC). However, these facilities may not fully meet the need of the majority as they are located in major towns; thus, those who are very far have limited access. is is specific for the rural population as reported by similar studies [70,71]. Strengthening a healthcare system is a step-by-step process which requires a good framework provided by the government through the Ministry of Health [12]. e aim is to accelerate the achievement of UHC by providing a structure to ensure that infrastructure will be improved and increase the number of healthcare personnel to serve the entire population. In addition, an improvement in the health structural development was reported by [72,73], who pointed out proper management at lower levels through devolution. However, they also cited challenges emanating from sources of finance. us, if UHC has to be attained anytime soon, then funding should not be an issue.
One of the key aspects toward strengthening the healthcare system is upgrading lower-level facilities and providing them with materials necessary to offer inpatient services. Admission by facilities that are more able to offer outpatient care has been reported, such as level 3 hospitals [2] wherein 8.42% were admitted in 2018. Although they are not able to handle extreme cases, they are well-equipped to handle many cases such as childbirth. is is likely to have a positive change in terms of facility deliveries and, thus, reduce the maternal and neonatal mortality rates. However, during childbirth complications, such as the need for caesarean services, patients may need to be referred to a larger facility. erefore, major conditions still need referrals to large hospitals, and access for inpatient care for these services at lower-level hospitals remains a challenge.
A reasonable distribution of health facilities at a facility density of 2.2 per 10,000 population, which is above the World Health Organization's target of 2 per 10,000 [12], has been found as well. However, one may still argue that these facilities are out of reach for the majority of Kenyans. erefore, the distance issue must be addressed if we ever hope to achieve Sustainable Development Goal 3.
A drawback of previous studies is the analysis of distance using summary statistics because researchers did not dig deeper into the data and only reported averages. Using Table 1 as a guide, for example, we could have reported a median of 10 km, which may be misleading, as we did not factor in skewness and correlations that exist in the data, which could provide more insight. is shows that advanced statistical analysis provides a more meaningful interpretation of the data by factoring in both skewness and correlations.
Finally, regarding policy implications, policies that target having more government healthcare facilities in rural areas and slums should be undertaken because large populations without adequate access to healthcare exist in these areas. However, setting up these facilities will require improved infrastructure (such as roads) and services (such as water and electricity), which are crucial components when setting up an inpatient facility.
In addition, policies that advocate smaller families should be encouraged to ensure that people can afford better healthcare. Job availability will also increase flexibility in terms of selecting a facility. Sophisticated services should be brought closer to low-income families to ensure that they do not travel long distances for much-needed services.

Limitations
is study has some limitations. First, the data were missing a significant amount of information, which made it difficult to use them as they were MNAR. erefore, we only used complete data as opposed to the available imputation techniques. is aspect can be examined in future studies to determine if there may be any differences in the findings. Furthermore, we only considered public facilities in the analysis. We hope other researchers will consider private facilities and examine the differences in the findings. Lastly, our research does not provide for statistical significance of the variables, but rather focused on overall model selection.
is is a limitation that other researchers can consider, to further research on analysis of nonnormal clustered data.
Data Availability e datasets are freely available from the Kenya National Bureau of Statistics by registering at https://statistics.knbs. or.ke/nada. e authors confirm that others can access these data in the same manner as the authors and that they did not have any special access privileges. e Reproducible R code can be accessed in the GitHub repository https://github. com/samwenda/Tweedie-with-Exchengable-Correlation.
Ethical Approval e study did not require any approvals as it was a secondary analysis.

Conflicts of Interest
e authors declare that they have no conflicts of interest.