Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Modelling Under-Five Mortality among Hospitalized Pneumonia Patients in Hawassa City, Ethiopia: A Cross-Classified Multilevel Analysis

Modelling Under-Five Mortality among Hospitalized Pneumonia Patients in Hawassa City, Ethiopia: A... Ann. Data. Sci. (2018) 5(2):111–132 https://doi.org/10.1007/s40745-017-0121-4 Modelling Under-Five Mortality among Hospitalized Pneumonia Patients in Hawassa City, Ethiopia: A Cross-Classified Multilevel Analysis Tariku Tessema Received: 11 April 2017 / Revised: 7 June 2017 / Accepted: 13 July 2017 / Published online: 28 July 2017 © The Author(s) 2017. This article is an open access publication Abstract Community acquired pneumonia refers to pneumonia acquired outside of hospitals or extended health facilities and it is a leading infectious disease. This study aims to model mortality of hospitalized under-5 year child pneumonia patients and investigate potential risk factors associated with child mortality due to pneumonia. The study was a retrospective study on 305 sampled under-five hospitalized patients of community acquired pneumonia. A cross-classified multilevel logistic regression was employed with resident and hospital classified at the second level. Bayesian estimation method was applied in which the posterior distribution was simulated via Markov Chain Monte Carlo. The variability attributable to hospital was found to be larger than variability attributable to residence. The odds of dying from the community acquired pneumonia was higher among patients who were; diagnosed in spring season, complicated with malaria, AGE and AFI, in a neonatal age group, diagnosed late (more than a week). The risk of mortality was also found high for lower nurse: patient and physician: patients’ ratios. Keywords Cross-classified · MCMC · Sub-city · Hospital · CAP · Child 1 Introduction Pneumonia is a form of acute respiratory tract infection (ARTI) that affects the lungs [1]. It is among the main killers of children under the age five worldwide [2]. Com- munity acquired pneumonia (CAP), which refers to pneumonia acquired outside of B Tariku Tessema tarikutessema1@gmail.com Department of Statistics, College of Computing and Informatics, Haramaya University, Haramaya, Ethiopia 123 112 Ann. Data. Sci. (2018) 5(2):111–132 hospitals or extended health facilities, is a leading infectious disease requiring hospital admission in both developed and developing countries including Ethiopia. According to World Health Organization (WHO), pneumonia is the second leading cause of death among the under-fives next to preterm birth complication [3]. It killed nearly to one million children under the age of five in 2015, accounting for 16% of all deaths of children under 5 years old [4]. Fifteen countries that are home for 55% of world’s children shares 72% of the global burdens of pneumonia and diahrrea child deaths [4]. WHO [2] estimated that, about half of the deaths in children under five occurred in 2015 were caused by infectious diseases and conditions such as pneumonia, diarrhoea, malaria, meningitis, tetanus, HIV and measles. Ethiopia was among 15 countries contributing to burden of child mortality due to pneumonia and diarrhea between 2015 and 2016 [4]. In Ethiopia, the child mortality rate is 67 deaths per 1000 live births [5]. Pneumonia has been the main killer of children under age of 5. In 2012 UNICEF [6] reported a 21% of the total deaths among children under five was due to pneumonia. A similar Pattern was observed in the 2000 Ethiopian Demographic and Health Survey (EDHS). The infant mortality rate in the 5 years preceding survey is 77 and under-five mortality is 123 deaths per 1000 live births for the same period. This means that one in every thirteen Ethiopian children dies before reaching age one, while one in every eight does not survive to the fifth birthday. However, these figures declined to 68 deaths per 1000 live births where pneumonia accounts for 18% of the total deaths [7]. Various studies have been expended to identify important risk factors of mortality and morbidity from the Pneumonia patients. Some of the literatures have used spatial and non-spatial analytic techniques to examine patterns of the illnesses, and the fac- tors that determine it [8]. There is also literature that have used multivariate logistic regression to determine the factors that predict in-hospital mortality among patients who require hospitalization for the treatment of CAP [9]. These statistical method- ologies are not capable to consider variability that could be observed in groups and does not take into account the dependency of observation in the same groups/cluster. Therefore, it is necessary to use different approach that may explore the important determinants of child mortality due to pneumonia at patient, hospital and patient’s residence level. This study is, therefore, intended to model the variability of under-five child mor- tality (U5M) of hospitalized pneumonia patients observed at resident or hospital, and identify risk factors associated with U5M due to pneumonia using cross-classified multilevel model. A cross-classified model is a multilevel model where individual or lower level units are nested to two different groups [10]. In this model individual is nested into a classification of two groups, that is, patients admitted to one hospital come from different sub-city, likewise patient from one sub-city admitted on different hospitals. Hence, source of variation at individual/patients and group/hospital, sub- city levels can be considered. This study also contributes to the existing literature by providing methodological demonstration of cross-classified multilevel models in hospitalized patients of community acquired pneumonia. 123 Ann. Data. Sci. (2018) 5(2):111–132 113 Table 1 Population distribution No. Sub-cities Male Female Total across sub-cities of Hawassa City administration. Source 1 Addis Ketema 13,162 13,158 26,320 Hawassa city administration (11) 2 Bahil Aderash 11,007 10,919 21,926 3 Hawalla-Tulla 65,080 61,809 126,889 4 Hayek Dar 13,221 12,442 25,663 5 Mehal Ketema 10,982 10,638 21,620 6 Menehariya 14,642 13,977 28,619 7 Misrak 17,979 16,278 34,257 8 Tabor 33,877 30,464 64,341 2 Materials and Methodology 2.1 Description of Study Area and Data This study had been carried out at Hawassa City, capital of Southern Nations Nation- alities and People’s Region (SNNPR), Ethiopia. The city, which is the economic and cultural hub of the region, has a total area of about 157.2km divided in to 8 sub- cities (Kifle-ketema). These sub-cities (see Table 1) are divided mainly based on the geographical structure. Under Hawassa City Administration, there are a total of five hospitals and six health centres owned by three different sectors (Government, Non-Governmental Organiza- tion, and Private Sectors). All hospitals and one health centre under these sectors are currently admitting patients. These hospitals and health centres are listed in Table 2 with their respective sub-cities. The study used a retrospective study on the under-five patients admitted to hospitals during the period of January 1, 2012 to January 1, 2014. The data were extracted from patients’ cards that were confirmed by physician as a community acquired pneumonia patient. All hospitals and health centres, under Hawassa City Administration that admit patients of CAP during the period of January 1, 2012 to January 1, 2014 were taken into account. These Hospitals/Health Centres are; Hawassa Referral Hospital, Bushulo Health Centres, Adare Hospital, Kibru and Asher General Hospitals. Hawassa city administration has a population of 349,635 people, out of which 169,685 are females and 179,950 are males. Among the population of the city admin- istration, 223,821 people live in urban area, while the remaining 125,814 live in rural area of the administration. Population distributions across the sub-cities are given in Table 1. 2.2 Variables Considered in the Study Variables considered at individual/patient level in the study were selected based on some previous study [9,12–16] and those that are expected to be the fac- tors/determinants of mortality of under-five from hospitalized patients of CAP. 123 114 Ann. Data. Sci. (2018) 5(2):111–132 Table 2 Hospitals and health centers admits patients with respective sub-cities No. Hospitals/health centers Sub-cities Owned by 1 Hawassa referral hospital Tabor Government 2 Adare hospital Hayik Dar Government 3 Bushulo health centers Tulla NGO 4 Asher hospital Meneharia Private 5 Beta-Abrham hospital Bahil Adarash Private 6 Kibru general hospital Bahil Adarash Private Table 3 Independent variables Variable Description Patient level variables Sex Sex of patient Age Age of patient which lie between the interval (0,5] years Season of diagnosed It is the season at which the patients admitted to a hospital/health centre CAP complication If patients suffer other additional disease such as Acquired Immune Deficiency Syndrome (AIDS), Acute Faecal Incontinence (AFI), Acute Gastro Enteritis (AGE) and Malaria Time onset to diagnosis It is the time in which a particular symptom is observed till the diagnosis Length of hospital stay Total day in which patients undergone in the hospital Treatment Treatment taken by patients for CAP during diagnosis Patient refer status It is the status of patients if they are referred from one hospital to another Hospital level variables Bed Occupancy Rate (BOR) The percentage of official beds occupied by hospital inpatients for a given period of time; (Total length of hospital stay in a given month ) BOR = 30 × Number of beds in that month Patient: Physician Ratio (PPhR) Ratio of patient to physician counted in a given month; Number of patients admitted in a given month PPhR = Number of physician on service in a given month Patient: Nurse Ratio (PNR) Ratio of patient to nurse counted in a month Number of patients admitted in a given month PNR = Number of nurse on service in a given month * Length of Hospital stay is the number of colander days from the days of patient admission to the day of discharge Patients’ discharge status was dependent variable in this study. It is a binary, discharge alive or dead, response which was recorded during discharge of patients from hospital. The independent variables at patient’s and hospital levels which could possibly affect this variable are condensed in the following Table 3 with their description. 123 Ann. Data. Sci. (2018) 5(2):111–132 115 2.3 Data Analysis 2.3.1 Multilevel Regression Models The multilevel regression model often called hierarchical multilevel linear model (HLM) assumes that there is a hierarchical multilevel data set, with one single depen- dent variable that is measured at lowest level and explanatory variables at all existing levels [17]. Many kinds of data, including observational data collected in the human and biolog- ical sciences, have a hierarchical, nested, or clustered structure (Fig. 1a). Traditional linear regression techniques are not capable of taking into account data with a hierar- chical multilevel structure [18]. Data with hierarchical structure violates an important assumption of linear regression model. As a result, regression coefficients are not efficient and its standard errors are understated. 2.3.1.1 Two Level Multilevel Regression Models Multilevel regression model can be described as the extension of regression model, which adds random effects at multiple levels. For instance, in this study the interest is to know the risk of dying from CAP if one child is in a neonatal age. In this case, both variables are patient level variables; as a result, one can use simple logistic regression. Since, patients are nested with in hospital, using multilevel models allows us to divide the variance in the outcome variable at the group and individual levels. A two-level multilevel model in health area assigns patients to level 1 and hospitals to level 2. Consider a collection of r independent variables which will be characterized by X = (X , X2,...X ). Then the conditional probability of the outcome of interest 1 r in a study is present (in our case death of child due to CAP) can be denoted by P = P(Y = 1|X = x). The ratio of success probability to failure probability is ij ij known as odds of success. ij i.e, , i = 1, 2,..., n; j = 1, 2, ··· h 1 − P ij Suppose that we have a sample of n independent observations of (X , Y ) for r+1 ij ij variables, where Y denotes the value of dichotomous variable, and Y is the value ij ij Fig. 1 Patients nested to hospitals and hospitals to sub-city (a) and patients nested to cross classification of sub-city with hospital at level 2 (b). a Purely nested data structure. b Cross-classified data structure. Note This figure was adapted for the data analysed here from Fielding et al. [10] 123 116 Ann. Data. Sci. (2018) 5(2):111–132 of the independent variables for the ith individual (patients). Then, the logistic model can be written using odds of success as follows. ij = β X , i = 1, 2, 3, ··· n (2.1) l ijl 1 − P ij l=0 Thus, the data layout can be represented as: ⎛ ⎞ ⎛ ⎞ ⎛ ⎞ y 1x x ... x β 11 12 1k o ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ y 1x x ... x β 21 22 2k 1 ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ . .. . . . . ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ Y = , X = , β = ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ . .. . . . . ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎝ ⎠ ⎝ ⎠ ⎝ ⎠ . .. . . . . y 1x x ... x β n1 n2 nk k The relationship between success probability P and his/her characteristics are non- linear. Hence, by applying logit transformation, in order to have a meaningful interpretation, one would have the following logistic regression model equation. ij log = β X , i = 1, 2, 3,... , n l ijl 1 − P ij l=0 logit P = β X (2.2) ij l ijl l=0 where β = [β ,β ,...,β ] 0 1 k β is the intercept and β , .., ., ., ..β are coefficients of the p explanatory variables 0 1 p (which may be continuous or dummy variables). A two level logistic regression models allow to assess variation in a dependent variable at hospital and patient level simultaneously in which patients are nested within hospitals. The response variable is denoted by Y : ij 1 if child i is dead due CAP in hospital j Y = ij 0 if child i discharged alive from hospital j with probabilities P = P(Y = 1/X ), probability of dead for patient ‘i’ in hospital ij lij ij ‘j’, and 1-P , probability of discharged alive for patient i and hospital j. The standard ij assumption is that Y has a Bernoulli distribution. Let the P be modelled using logit ij ij link function. 123 Ann. Data. Sci. (2018) 5(2):111–132 117 The two-level model is given by: ij logit(P ) = log = β ij oj 1-P ij + β x ; l = 1, 2, 3, ..., r ; j = 1, 2,..., h and i = 1, 2, 3 ... , n (2.3) lj li j l=1 where β = β + u ,β = β + u ,...,β = β + u this becomes: 0 j 0 0 j 1 j 1 1 j rj r rj r r ij logit P = log = β + β x + u + u x (2.4) ij 0 l 0 j li j lij lj 1 − P ij l=1 l=1 X = (X , X ,..., X ) represents the first and the second covariates, (β ,β , ··· , ij 1ij 2ij rij 0 1 β ) are regression coefficients, u , u , u , ··· u are the random effect of model r 0j 1j 2j kj parameter at level two. It is assumed that these level two random effects distributed normally with mean 0 and variance σ . The Empty Model The empty two-level model for dichotomous outcome variable refers to population of groups (level-two units) and specifies the probability distribution for the group- dependent probabilities P , where P characterised by Y = P +e , without taking ij ij ij ij ij further explanatory variables into account. The logit transformed models, logit(P ), ij has a normal distribution. Then the empty model is expressed by formula: logit(P ) = γ + u (2.5) ij 0 0 j where γ is the population average of the transformed probabilities and u the random 0 0 j deviation from this average for hospital j. The residual term associated with deviation, u is a unique effect of hospital j on the outcome variable; it is assumed to be normally 0 j and independently distributed with mean of zero and constant variance, σ that is u ∼ N(0, σ ). 0 j The level 2 residual variance captures the variation across hospital means. With this model, the amount of variance in the outcome variable that is attributable to within group characteristics (here, patients) and to between group differences (hospitals) can be identified. This model (2.5) does not include a separate parameter for the level- one variance [19]. This is because the level one residual variance of the dichotomous outcome variable follows directly from the success probability is indicated below. var(e ) = P (1 − P ) (2.6) ij ij ij where e is individual-dependent residual. ij Define π - the probability corresponding to the average value by the γ , as follows: 0 0 f (π ) = γ 0 0 For the logit function, the logistic transformation of γ is given by: 123 118 Ann. Data. Sci. (2018) 5(2):111–132 exp(γ ) π = logistic(γ ) = (2.7) 0 0 1 + exp(γ ) Now the variance for probability of success (death of child from CAP in-hospital j) can be given by: 2 2 var(P ) = (π (1 − π )) σ (2.8) ij 0 0 Random Intercept Model The extension of equation 2.5 above in which it incorporates explanatory variables is termed as random intercept model. The success probability is not necessarily the same for all individual in a given group. Therefore, the success probability now depends on the individual as well as on the group, and is denoted by P ,[19]. Accordingly, the ij outcome is split into an expected value and a residual i.e.: Y = P + R (2.9) ij ij ij The logistic random intercept model expresses the log-odds, i.e., the logit of P is: ij logit(P ) = γ + γ x + u (2.10) ij 0 l li j 0 j l=1 As usual, the deviation u are assumed to be independently distributed with zero 0 j mean and variance σ . Equation (2.10) does not include a level one residual because it is an equation for the probability rather than the outcome Y . Thus, the level residual ij has already included in the equation. Random Slope Model Another model which is used to assess whether the slope of any of the explanatory variable has a significant variance component between the groups is called the random slope model. Assume that effect of variable X = (X , X , ··· , X ) across the j 1j 2j kj groups, and accordingly has a random slope. Equation (2.10) for the logit of the success probability is extended with the random effect u x (19), which leads to: 1 j 1ij, logit(P ) = β + β x ij 0j lj li j l=1 β = γ + u 0j 0 oj β = γ + u (2.11) lj l lj The characteristic that is unique to this model, compared to others discussed here, is that the random coefficients β are themselves dependent variables in a second regression equation. Of course, in RSM we have: logit(P ) = γ + γ x + u x + u (2.12) ij 0 j lj li j lj li j oj l=1 There are now r+1 random group effects, the random intercept u and the random oj slope u . It is assumed that both have a zero mean. Their variances are denoted by lj 123 Ann. Data. Sci. (2018) 5(2):111–132 119 2 2 σ ,...,σ respectively and their covariance is denoted by σ . This can be described 0r by the covariance matrix . ⎡ ⎤ σ σ ...σ 01 0r ⎢ ⎥ σ σ ...σ 10 1r ⎢ 1 ⎥ ⎣ ⎦ .. σ σ ....σ r0 r1 2.3.1.2 Two Level Cross-classified Multilevel Models Unlike hierarchical multilevel data with a purely nested structure, data that are cross- classified not only may be clustered into hierarchical multilevel ordered units but also may belong to more than one unit at a given level of a hierarchy [10]. Goldstein [20] labelled multilevel model for purely nested structure as hierarchical multilevel (HM) models. However, multilevel level models for cross-classified data structure is labelled as cross-classified multilevel (CCM) models. In a cross-classified design, patients at a given hospital might be from several different resident places/sub-cities and one sub-city might have patients who attend a number of different hospitals. In this type of scenario, hospital and sub-city are considered to be cross classified factors, and cross classified random effects modelling (CCREM) should be used to analyse these appropriately. The consequences of ignoring an important cross-classification are similar to those of ignoring an important hierarchical multilevel classification [21]. Therefore, it is very important to consider the classification at existing level (second level in this study). In Fig. 1 the first diagram illustrates a purely nested data structure. In this structure, there are three levels; patients, hospitals and sub-city. A typical hierarchical multilevel model has patients nested in hospitals, and hospitals nested within sub-cities. In a cross-classified multilevel structure, however, patients from a given sub-city might visited different hospital and hospital might serve patients from different sub-city. Subsequently the cross classified data structure occurs at level 2. In this example, we can take into account influences coming from two different factors: sub-cities and hospitals. This may improve the estimation of explanatory variable effects ensuring the structure is properly specified [10]. Assume one has patients nested within a cross-classification of hospital by sub-city, which is the case illustrated in Fig. 1b and P be the probability of child to i(hosp,subc) die from CAP in hospital ‘hosp’ and came from sub-city ‘subc’. logit(P ) = γ + γ x + u 0 l li(hosp, subc) hosp(0, j ) i(hosp, subc) l=1 + u x + v (2.13) hosp(l, j) i(hosp, subc) subc(i) l=1 v is the sub-city effect which are mutually independent random variables and subc(i ) also independent to u , which is hospital random effect. All v are normally hosp(0, j) subc(i ) distributed with mean 0 and variance τ . 123 120 Ann. Data. Sci. (2018) 5(2):111–132 Note the notation used in this paper for cross classified model is adapted from Fielding et al. [10]. Like HM model, in a CCM model, before introducing explanatory variables, the variation in outcomes could be examined by looking at the differences between hospi- tals, between sub-cities and between individual patients after controlling for sub-cities and hospital effects [22]. This gives a basis for extending the model to identify which sub-cities, hospitals, and patients’ characteristics might explain some part of these component variances. After adding explanatory variables, the residual components of variances can be estimated, this will provide more information about the variation in outcomes [21]. 2.3.2 Method of Parameter Estimation Estimation of CCM models by existing maximum likelihood approaches run into important computational limitations, especially when large numbers of units are involved [21]. As a result, the models used in this paper are fitted using Markov Chain Monte Carlo (MCMC) based algorithms as implemented in the MLwiN package [23]. Starting values for the fixed parameters are estimated from simpler models using a maximum likelihood approach, penalized quasi- likelihood (PQL) in MLwiN [23]. Markov Chain Monte Carlo(MCMC) is a simulation techniques that generate ran- dom samples from a complex posterior distribution [22]. The simulated posterior distribution is then used to compute a point estimate and a confidence interval. The posterior distribution of θ is defined from Bayes’ theorem as p(θ /data)/p(data/θ )p(θ ) Here p(θ) is the prior distribution for the parameter vector θ and should represent all knowledge we have about θ prior to obtaining the data. In multilevel models, a non-informative prior for both fixed effect and random effect parameters are recommended [18]. The default prior distribution applied in MLwiN when MCMC is used are ‘flat’ or ‘diffuse’ priors for all parameters (23). Thus, according to Browne et al. [23] the prior for a fixed parameter, β in MLwiN is an improper uniform distribution, p(β) ∝ 1, which is equivalent to normal prior with large variance. The prior for scalar variance, σ is specific as inverse gamma, p(σ ) ∼ invGamma(α, β). In MLwiN, the prior applied for covariance matrices, −1 is an inverse Wishart distribution, p( ) ∼ Wishart (p, p, ), where p is the number of rows in the variance matrix and  is the estimate for the true value of . Consider a two-level random slope binary logistic regression in equation (2.12), r r logit(P ) = γ + γ x + u x + u ij 0 j lj li j lj li j oj l−1 l=1 with Y = P + R is the outcome variables where  = σ I is level one residual ij ij ij ε nj is i.i.d.  is a level two covariance matrix. In this study, the prior distribution used for all fixed parameters is non-informative normal distribution with variance 10 and different values of α and β in gamma distribution for scalar variance prior was used. 123 Ann. Data. Sci. (2018) 5(2):111–132 121 Table 4 Summary result of discharge Status of Patients with their correspondent hospitals/health centers Hospitals/health centre Discharged status Total Number of alive (%) No. of death (%) Referral hospital 96 (92.3) 8 (7.7) 104 Bushulo health centre 127 (80.4) 31 (19.6) 158 Adare hospital 23 (71.9) 9 (28.1) 32 Asher and Kibru hospital 9 (81.8) 2 (18.2) 11 Total 255 (83.6) 50 (16.4) 305 (100) Consider Eq. (2.12) above, random slope model, in which each observation is considered as an outcome of a Bernoulli trial (y ∼ Bernoulli(p )), and hence for ij ij the ith patient in jth hospital: ij 1−y ij P(Y = y ) = p (1 − p ) (2.14) ij ij ij Assuming the n observations are independent, the likelihood function is given by: t 1−y y ij ij n n −X β ij 1 e ij 1−y ij L(X/θ) = p (1 − p ) = ij ij t t −X β −X β ij ij 1 + e 1 + e i =i i =1 (2.15) where θ stands for the joint parameters, u ,Σ ,Ω . j ε u The posterior distribution is obtained by multiplying the prior distribution over all parameters by the full likelihood functions. The posterior distribution is: f (θ/data) ∝ p(u )Xp( )Xp( )X f (data/θ) (2.16) j ε u 3 Results and Discussion 3.1 Results This study was carried out to model variability in mortality of under-five hospitalized pneumonia patients, using a retrospective data on the patients admitted between Jan- uary 2012 and January 2014. The study used a sample of 305 under-5 year patients out of which 50 (16.4%) died in the hospital and the rest of them were discharged alive from hospitals. A hierarchical two level logistic regression and cross-classified logistic regression with the classification of ‘sub-city’ and hospitals at second level was modelled. Then these two models are compared based on their DIC. It can be seen from Table 4, more death occurred in Bushulo Health Centre which accounted about 10.2% of the total death occurred during the retrospective interval time. Likewise, in Hawassa University Referral Hospital, about 2.6% of the patients of CAP from Hawassa city were expired. In the same manner, the distribution of patients discharge status is displayed in the following Table 5. 123 122 Ann. Data. Sci. (2018) 5(2):111–132 Table 5 Summary result of discharge status of patients with their resident Resident/sub-city Discharged status Total No. of alive (%) No. of death (%) Addis Ketema 21 (77.8) 6 (22.2) 27 Bahil Aderash 20 (100) 0 (0) 20 Hawalla-Tulla 60 (72.3) 23 (27.7) 77 Hayek Dar 22 (88) 3 (12) 25 Mehal Ketema 26 (86.7) 4 (13.3) 30 Menehariya 22 (95.7) 2 (4.3) 24 Misrak 18 (69.2) 8 (30.8) 22 Tabor 66 (93.0) 5 (7.0) 80 Total 255 (85.6) 50 (16.4) 305 (100) Table 6 DIC values of each models for comparison of parameters in prior distribution of single variance Models (α, β) (5, 5) (10, 5) (5, 10) (10, 10) (100, 100) Two level HM empty model 252.87 252.778 253.83 252.69 252.55 Two level HM random intercept model 181.61 180.62 181.11 181.87 179.82 Two level CCM empty model 265.10 269.26 256.08 264.44 264.32 Two level CCM random intercept model 179.75 180.69 181.04 179.93 179.26 3.1.1 Setting for Bayesian Inference In this study, non-informative priors have been used which express the complete igno- rance of the value of the parameter. Accordingly, as we mentioned on chapter three Sect. 3.5.3, we have used normal distribution with mean 0 and variance 10 (huge variance) for fixed parameters. However inverse gamma distribution with parameter α and β for single variance was used. The value of parameters for this distribution was chosen based on Bayesian deviance information criteria (BDIC). Table 6 shows the proposed parameter value for prior distribution of single variance. On Table 6, BDIC is smaller when we use α = 100 and β = 100 in each model except on cross classified empty model. Since, there was no significant change on the values of BDIC, the same prior parameter for these models were used too. The number of iteration used in the analysis for all of the parameters (fixed and random effect) was set up to be 50,000 with ‘thining’ number of 1; the first 5000 was discarded(burn-in) and the rest of the chains was used to summarize the posterior distribution. 3.1.2 Cross-Classified Multilevel Logistic Regression Results Now like the case of HM model, an empty model can be extended to random inter- cept CCM model using explanatory variables those were significantly associated with 123 Ann. Data. Sci. (2018) 5(2):111–132 123 discharge status of patients. In CCM random intercept model we vary the intercept across both hospitals and sub-city. In this analysis, the coefficients of Intercept (−10.323), number of patients attended (3.024), spring (2.021), AGE (1.893), AFI (2.017), Malaria (1.967), postnatal age group (−2.383), Child age group (−2.466), duration of onset to diagnosis from 4- 7days (1.556), duration of diagnosis more than a week (2.165), Patient to Physician ratio (0.014) coefficient of Patient to Nurse ratio (0.330) were statistically significant at 5% level of significance. In the model, more variation (19.3%) of discharge status of the patients (dead or alive) was due to hospital as compared to variation attributable to residents/sub-city (18.9%). The proportion of variation of discharge status explained by explanatory variables is 78.95% in random intercept of hierarchical multilevel logistic regression. In other words, season of diagnosed, CAP complicated disease, duration till diagnosed, age, patient refer status, patient to nurse ratio and patient to physician ration explains about 78.95% of variation in whether patients were dead or alive due to community acquired pneumonia (Tables 7, 8). 3.1.3 Model Diagnostic Check In MLwiN statistical Packages, MCMC estimation methods can be easily run by taking either PQL result or MQL results as an initial point for simulation. The packages finally display MCMC summery statistics and MCMC plots which is important to test the diagnostic convergence of MCMC chain. Under MCMC summery statistics the package computes chain means (posterior means), the chain standard deviation, etc. Raftry and Lewis diagnostic [24] is a diagnostic based on a particular quantile of the distribution. The diagnostic that is used to estimate the length of Markov chain required to estimate a particular quintile to a given accuracy. In MLwiN the diagnostic is calculated for the two quintiles (the defaults are the 2.5 and 97.5% quintiles) that will form a central interval estimate. The dependency among the Markov chain samples are measured by the autocorre- lation which is one method of model diagnostic of convergence. MCMC algorithms do not produce independent samples from the distribution. Rather, it generates samples that are auto correlated. As in time series analysis, we can compute the autocorrelation function at each lag (1+) and include the number of iterations we need to skip in order to have a non-significant autocorrelation. High correlations between long lags indicate poor mixing. Thus, low autocorrelation indicates good convergence. All the coefficients were tested using kernel density plot, Lewis and Reftry test PACF,and ACF. Thus, the results were obtained after all were satisfied which can be attained with increasing number of iteration. 3.1.4 Model Selection Model was selected based on deviance information criteria (DIC) found for each model, where model with small deviance information criteria was selected for inter- pretation of the results. We have used deviance information criteria to choose better model for this analysis. The DIC for cross-classified random intercept model was small as compared to the other models. 123 124 Ann. Data. Sci. (2018) 5(2):111–132 Table 7 Cross-classified and hierarchical multilevel logistic model-random intercept models Variables Cross-classified multilevel model Hierarchical multilevel model PQL estimates SE (PQl) MCMC SE (MCMC) Exp (β) p value PQL SE (PQl) MCMC SE Exp (β) p value estimates estimates estimates (MCMC) Fixed effect Intercept −7.192 1.863 −10.323 2.407 0.000 0.000* −6.162 1.830 −8.750 2.088 8.067 0.000* PREFSTS 0.997 0.864 3.024 1.165 20.57 0.004* 0.574 1.051 0.669 1.156 3.177 0.281 Season Autumn (ref) Summer 1.189 0.923 1.343 1.111 3.83 0.111 1.328 0.800 2.055 1.231 7.806 0.036* Winter 0.814 0.924 0.803 1.136 2.23 0.240 1.029 0.782 1.675 1.215 5.338 0.210 Spring 1.471 0.836 2.021 1.053 7.54 0.027* 1.285 0.735 2.007 1.184 7.441 0.022* Diseases complicated No (ref) AGE 1.801 0.586 1.893 0.671 6.64 0.002* 1.527 0.542 2.147 0.659 8.556 0.000* AFI 2.030 0.720 2.017 0.816 7.52 0.000* 1.847 0.687 2.388 0.879 10.892 0.000* Malaria 1.710 0.813 1.967 0.859 7.15 0.011* 1.557 0.749 2.184 0.895 8.882 0.001* Others 0.397 0.716 0.493 0.875 1.64 0.286 0.429 0.610 0.540 0.903 1.716 0.241 Age Neonatal (ref) Postnatal −1.734 0.713 −2.383 0.897 0.09 0.003* −1.797 0.677 −2.406 0.872 0.090 0.052 Child −2.062 0.754 −2.446 0.933 0.08 0.004* −1.971 0.711 −2.585 0.895 0.075 0.038* Ann. Data. Sci. (2018) 5(2):111–132 125 Table 7 continued Variables Cross-classified multilevel model Hierarchical multilevel model PQL estimates SE (PQl) MCMC SE (MCMC) Exp (β) p value PQL SE (PQl) MCMC SE Exp (β) p value estimates estimates estimates (MCMC) Duration to onset diagnosis ≥3 days (ref) 4–7 days 1.416 0.479 1.556 0.561 4.74 0.002* 1.164 0.420 1.651 0.566 5.212 0.000* >7 days 1.862 0.603 2.165 0.720 8.71 0.000* 1.635 0.543 2.307 0.731 10.044 0.000* Hospital level variables PPhR 0.019 0.005 0.014 0.008 1.01 0.003* 0.014 0.005 0.017 0.006 1.017 0.023* PNR 0.198 0.043 0.330 0.069 1.39 0.000* 0.235 0.051 0.330 0.060 1.391 0.000* BOR 1.734 1.906 −0.925 1.996 0.319 0.425 1.766 1.729 2.691 2.085 14.716 0.468 Random parameters σ = 0.108 0.599 1.026 0.102 0.001* 0.527 0.435 1.034 0.103 0.002* Var (u ) hosp (Between hosp) τ = 0.000 0.000 1.010 0.104 0.002* – – – – – Var (υ ) subc(i ) (Betweensub-c) ICC (Subc) 0.189 – ICC (Hosp) 0.193 0.187 R (%) 78.95 58.5 DIC 179.26 180.62 * Significant at 5% level of Significance, Ref Reference category, ICC intra class correlation coefficient, BDIC Bayesian deviance information criteria 126 Ann. Data. Sci. (2018) 5(2):111–132 Table 8 Bayesian deviance information criteria for all models Models DIC pD (number of R parameter) Hierarchical multilevel models Empty models 252.55 11.15 – Random intercept model 180.62 16.69 0.585 Cross-classified multilevel models Empty models 264.32 46.12 – Random intercept model 179.21 20.80 0.7895 All of the variables significantly predicted discharge status of patients in both ran- dom intercept models except referred status of patients. This variable was significant in CCM random intercept model. However, it was insignificant in HM random intercept model. In addition, the explained proportion of variation (R ) in outcome variable is large for CCM random intercept model. The model comparison result suggests CCM logistic model as the best fit for dis- charge status of hospitalized pneumonia patients. Thus, the effects of each covariate can be interpreted using the estimated odds observed under cross-classified multilevel logistic. The odds of dying due to community acquired pneumonia are the probability that the patient died divided by the probability that the patient discharged alive. Most of the patient level variables included in the model significantly affect the discharge status of hospitalized pneumonia patients. Season diagnosed, CAP compli- cation, duration of onset to diagnosis and age are patient level variable that significantly affect the discharge status of hospitalized pneumonia patients. Whereas, patient to physician ratio and patient to nurse ratio are hospital level variable that significantly affect the outcome of response variable. The value of Exp(β) for spring season is 7.54 which implies that the odds of being at risk to death during spring season were 7.54 times more likely than patients diagnosed in winter season. In CAP complication variable, there was high probability of death on patients with AGE, AFI, and malaria than patients only with CAP. The odds of these coefficients are 6.64, 7.52 and 7.15, respectively. There was high probability of dying from CAP if the patients are complicated with these diseases. There is less likely to die from CAP, if patients are postnatal and child age than those patients in neonatal age group. Accordingly, the risk of dying from CAP for postnatal age group and child age group patients was less by 91 and 92% respectively. On the result, it was found that patients who late diagnosis has high probability of being at risk than those admitted before 3 days. The odds of being at risk for patients who diagnosed after a week is 8.71 times than those patients diagnosed before three days. The odds of being at risk for patients who diagnosed with 4 to 7 days was also 4.74, which means the risk of dying from pneumonia if patients diagnosed between these days was 4.74 times than those diagnosed before three days keeping another risk factors constant. Patient to nurse ratio, patient to physician ratio were variables significantly predicts the discharge status of patients in the chosen CCM logistic model among hospital level variables. The odds of being at risk increases by 1.39 for a unit increases of patient to nurse ratio keeping the others risk factors constant. In the same manner, 123 Ann. Data. Sci. (2018) 5(2):111–132 127 the odds of being at risk increases by 1.01 for a unit increase of patient to physician ratio keeping the risk factors constant. These means number of patients increased by the number physician/nurses keeping number of physician/nurse constant. Generally, patients diagnosed during patient to nurse ratio and/or patient to physician ratio was large had high probability of dying from CAP. 3.2 Discussion CCM logistic random intercept model revealed that more of the variation of discharge status of patients (alive/dead) was due to hospitals as compared to variation attributable to resident/sub-city. During the period of our study, the pandemic strain of community acquired pneumonia resulted in the death of 16.34% patients in the sample. Some of previous study had found approximate result, for example, 18.79% in [12] 14.7% in Cuietal. [14], 23.8% in Kuo-Tung [25]. On the result, it was also observed that patients admitted due to the case (CAP) during spring season was high in all hospitals; this is may be due to the fact that there was high prevalence of CAP in that season. In this study discharge status of hospitalized patients due to CAP was not associated with sex as reported earlier by Teka [12] and Kuo-Tung [25]. There was also no associ- ation between treatment given by the physician and patients discharge status. Since all patients were hospitalized which means they were under the care of nurse and physi- cians there might be a proper usage of treatments. Thus, the non-significance of associ- ation between discharge status and treatment taken might be due to this proper usage. In the study, also it was found that length of hospital stay (LOS) is not significantly associated with discharge status of CAP under-five patients. The median LOS was 4 days which means 50% of patients admitted due pneumonia stayed in the hospital were less than 4 days. Similar study in Spain during 2003 had found that 9% of LOS due CAP,(16). In this model, some variation of discharge status of under-five age CAP patient due to hospitals and residence/sub-city were 19.3 and 18.9% respectively. The previous studies [12] that used some other statistical methods like binary logistic regression in which it is assumed that all patients were independent also found a significant association between the residence and discharge status of patients. Thus, this results in Type I error [26]. Finally, it was investigated that hospitals and sub-city also contributed to a variation among discharge status of CAP patients. The variation due to hospitals might be due to service given, quantity and quality of physician/nurse, in-efficient management, etc. In the same manner inaccessibility of health facility, neatness of the environment which results in other complicated disease such as malaria, high population density which results in high prevalence of pneumonia disease, could be the reason for the variation due to residence/sub-city. In the study, it was found that patients admitted in summer and spring season had high risk of dying from CAP as compared with other seasons. According to Lieberman’s [15] finding there is high incidence of CAP during spring and summer seasons. This could be the reason for the high risk of death from CAP in these seasons. 123 128 Ann. Data. Sci. (2018) 5(2):111–132 Result showed that patients who were delayed to be diagnosed had high risk of dying from CAP than those patients who diagnosed early. An appropriate usage of treatment as well as delaying to start treatment for one disease leads to high risk of dying. This is may be due to the fact that; the expansion of causing organisms increases through time in patients’ body. As a result, duration of onset to diagnosis was a significant predictor of discharge status of patients. This result match with results of Menendez et al. [16] and Teka [12]. Among hospital level variables patients to nurse ratio and patients to physician ratio significantly predict discharge status of CAP patients. Patients who admitted during patients to nurse ratio was high had high risk of dying from pneumonia. Since patients in hospital are nurtured by nurses and this has a positive impact on the recovery from their illness. Fortunately, patients who admitted during ratio of patient to nurse is high has less chance to survive as it is compared to others patients. Thus, the result obtained had a complete agreement with this logic. Another important hospital level variable which again significantly predicted the risk of dying from pneumonia was patient to physician ratio. Patients who admitted during patients to nurse ratio was high had high probability of dying from CAP. Bed occupancy rate which is one measure of quality of hospitals was insignificantly associated with discharge status of under patients. The reason for this result is not clear but it might be due to the fact that, in this study, there was no patients admitted during the beds were crowded. 4 Conclusions and Recommendations 4.1 Conclusions sThis study was aimed to model the mortality of hospitalized under-five pneumonia patients and identify risk factors associated to discharge status of patients. Hierarchi- cal multilevel and cross-classified multilevel were used. It was interesting to find out that cross-classified multilevel model was better than simple hierarchical multilevel modelling in fitting to the data and explore variability attributed to hospital and resi- dence. The variability attributable to hospital was found to be larger than variability attributable to residence. From the analysis, it was found that age of patients, season of diagnosis, duration of onset to diagnosis, complicated disease, patient to physician ratio, patient to nurse ratio. Hospitals and place of residence/sub-city were found to contribute significantly to the variability of patient’s status (dead/alive). 4.2 Recommendations Scaling up of interventions such as health promotions on appropriate and effective treatment in home or community based care should be emphasized across all sub-cities. And also, the awareness of early diagnosis to the community should be expanded via health extensions, clinicians or any concerned body in collaboration with governmen- tal and non-governmental organizations. Health facilities based on quality parameter of hospitals like bed occupancy rate, patient to nurse ratio, and patient to physician 123 Ann. Data. Sci. (2018) 5(2):111–132 129 ratio should be improved in hospitals and be handled with effective management. Finally, researchers who are interested on similar area are recommended to choose cross-classified models for the analysis in the case where patients are nested to the classification of hospitals and resident to explore variation of response variable at all existing levels. Acknowledgements I would like to acknowledge Bushulo Health Centre, Hawassa Referral Hospital, Adare Hospitals Kibru Hospital, Asher General Hospital Managers’ for providing me the data and reading materials used for this study. Thanks are also due Hawassa City administration and Central Statistical Agency-Hawassa Branch offices for honestly supplying me secondary data on population size of sub-cities of Hawassa city. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 Interna- tional License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. Appendix 1: MLwiN Results MLwiN Result for Equations of Hierarchical and Non-Hierarchical Multilevel Models See Figs. 2, 3. Fig. 2 MLwiN equation result for multilevel logistic regression-random intercept model 123 130 Ann. Data. Sci. (2018) 5(2):111–132 Fig. 3 MLwiN result equation for cross-classified multilevel logistic regression- random intercept model 123 Ann. Data. Sci. (2018) 5(2):111–132 131 Appendix 2: Cross-Classified Multilevel Model Coefficients’ Convergence Test Intercept Coefficient of Duration of Onset (>7 days) Coefficient of autumn Coefficient of Patient refer Status Coefficient of Postnatal Coefficient of winter Coefficient of Child Coefficient of spring Coefficient of patient-to-physician ratio Coefficient of AGE Coefficient of Patient to Nurse Ratio Coefficient of AFI Coefficient to Bed Occupancy Rate Coefficient of Malaria Random Effect Variance of Hospitals Coefficient of Others Random Effect Variance of Sub-cities Coefficient of Duration of Onset (4-7days) 123 132 Ann. Data. Sci. (2018) 5(2):111–132 References 1. Nga Tong, BA, M. (2013). Background Paper 6.22 Pneumonia. “A Public Health Approach to Inno- vation,” (May), 7–8. Retrieved from http://www.who.int/medicines/areas/priority_medicines/BP6_ 22Pneumo.pdf 2. UNICEF. (2015). Levels and trends in child mortality. New York: UNICEF (pp. 1–30). Retrieved from http://www.who.int/maternal_child_adolescent/documents/levels_trends_child_mortality_2013/en/ 3. World Health Organization (WHO) (2015) World Health Statistics 2015. World Health Organization. Luxembourg 4. International Vaccine Access Center (IVAC). (2016) Pneumonia and diarrhea progress report? reaching goals through action and innovation. Johns Hopkins Bloomberg School of Public Health (pp. 1–46) 5. Central Statistical Agency (CSA) [Ethiopia] I (2016) Ethiopia demographic and health survey 2016: key indicators report. Addis Ababa, Ethiopia, and Rockville, Meryland, CSA and ICF. Retrieved from https://www.usaid.gov/sites/default/files/documents/1860/ Ethiopia%20DHS%202016%20KIR%20-%20Final%2010-17-2016.pdf 6. UNICEF (2012) Pneumonia and diarrhoea: tackling the deadliest diseases for the world’s poorest children. New York 7. Legesse H, Degefie T, Hiluf M, Sime K, Tesfaye C, Abebe H et al (2014) National scale-up of integrated community case management in rural Ethiopia: implementation and early lessons learned. Ethiop Med J 52(Supplement 3):15–26 8. Crighton EJ, Elliott SJ, Moineddin R, Kanaroglou P, Upshur R (2007) A spatial analysis of the determi- nants of pneumonia and influenza hospitalizations in Ontario (1992–2001). Soc Sci Med 64:1636–50 9. Marrie TJ, Wu L (2005) Factors influencing in-hospital mortality in community-acquired pneumonia: a prospective study of patients not initially admitted to the ICU. Chest 127(4), 1260 – 1270. doi:10. 1378/chest.127.4.1260 10. Fielding A, Goldstein H (2006) Cross-classified and multiple membership structures in multilevel models: an introduction and review 11. Hawassa Facts and Figures [Internet]. [cited 2017 Mar 22]. Available from http://www.hawassaonline. com/facts.php 12. Teka Z (2011) Statistical Analysis of risk factors for mortality of in-hospital patients due to pneumonia: a case study at bushulo major health centre, Hawassa. Hawassa University, Ethiopia 13. Raza M, Ahmed a, Ahmed F, Ghani A, Rizvi N (2012) Seasonal incidence of community acquired pneu- monia and its mortality in Karachi-A multi-centric hospital based study. IpublishingCo.in 3(2):885–894 14. Cui W, Zhao H, Lu X, Wen Y, Zhou Y, Deng B, et al. (2010) Factors associated with death in hospitalized pneumonia patients with 2009 H1N1 influenza in Shenyang, China. BMC Infect Dis. 10(1):145 15. Lieberman D, Porath A (1996) Seasonal variation in community-acquired pneumonia. Eur Respir J 9(12):2630–2634 16. Menéndez R, Cremades MJ, Martínez-Moragón E, Soler JJ, Reyes S, Perpiña M (2003) Duration of length of stay in pneumonia: influence of clinical factors and hospital type. Eur Respir J 22(4):643–648 17. Hox J (1995) Applied multilevel analysis, vol 274. TT-Publikaties, Amsterdam 18. Gelman A (2006) Prior distribution for variance parameters in hierarchical models. Int Soc Bayesian Anal 1(3):515–533 19. Snijders TAB, Bosker RJ (1999) Multilevel analysis: an introduction to basic and advanced multilevel modeling. Sage, New York 20. Goldstein H (1999) Multilevel statistical models. Wiley, New York 21. Rasbash J, Browne W (2008) Non-hierarchical multilevel models. In: Handbook of multilevel analysis. pp 301–34 22. Alemdar M (2009) A monte carlo study: the impact of missing data in cross-classification random effects models. Georgia State University, Atlanta 23. Browne WJ, Charlton C, Rasbash J, Kelly M, Pillinger R (2016) MCMC estimation in MLwiN version 2.36. Centre for mutlilevel modelling. University of Bristol, Bristol 24. Raftery AE, Lewis S (1991) How many iterations in the Gibbs sampler?. Oxford University Press, Oxford 25. Huang K-T, Tseng C-C, Fang W-F, Lin M-C (2010) An early predictor of the outcome of patients with ventilator-associated pneumonia. Chang Gung Med J 33(3):274–82 26. Raudenbush SW, Bryk AS (2002) Hierarchical Linear models: applications and data analysis methods. Sage, New York http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Annals of Data Science Springer Journals

Modelling Under-Five Mortality among Hospitalized Pneumonia Patients in Hawassa City, Ethiopia: A Cross-Classified Multilevel Analysis

Annals of Data Science , Volume 5 (2) – Jul 28, 2017

Loading next page...
 
/lp/springer-journals/modelling-under-five-mortality-among-hospitalized-pneumonia-patients-6jdIwiFT8j
Publisher
Springer Journals
Copyright
Copyright © 2017 by The Author(s)
Subject
Business and Management; Business and Management, general; Statistics for Business/Economics/Mathematical Finance/Insurance; Computing Methodologies
ISSN
2198-5804
eISSN
2198-5812
DOI
10.1007/s40745-017-0121-4
Publisher site
See Article on Publisher Site

Abstract

Ann. Data. Sci. (2018) 5(2):111–132 https://doi.org/10.1007/s40745-017-0121-4 Modelling Under-Five Mortality among Hospitalized Pneumonia Patients in Hawassa City, Ethiopia: A Cross-Classified Multilevel Analysis Tariku Tessema Received: 11 April 2017 / Revised: 7 June 2017 / Accepted: 13 July 2017 / Published online: 28 July 2017 © The Author(s) 2017. This article is an open access publication Abstract Community acquired pneumonia refers to pneumonia acquired outside of hospitals or extended health facilities and it is a leading infectious disease. This study aims to model mortality of hospitalized under-5 year child pneumonia patients and investigate potential risk factors associated with child mortality due to pneumonia. The study was a retrospective study on 305 sampled under-five hospitalized patients of community acquired pneumonia. A cross-classified multilevel logistic regression was employed with resident and hospital classified at the second level. Bayesian estimation method was applied in which the posterior distribution was simulated via Markov Chain Monte Carlo. The variability attributable to hospital was found to be larger than variability attributable to residence. The odds of dying from the community acquired pneumonia was higher among patients who were; diagnosed in spring season, complicated with malaria, AGE and AFI, in a neonatal age group, diagnosed late (more than a week). The risk of mortality was also found high for lower nurse: patient and physician: patients’ ratios. Keywords Cross-classified · MCMC · Sub-city · Hospital · CAP · Child 1 Introduction Pneumonia is a form of acute respiratory tract infection (ARTI) that affects the lungs [1]. It is among the main killers of children under the age five worldwide [2]. Com- munity acquired pneumonia (CAP), which refers to pneumonia acquired outside of B Tariku Tessema tarikutessema1@gmail.com Department of Statistics, College of Computing and Informatics, Haramaya University, Haramaya, Ethiopia 123 112 Ann. Data. Sci. (2018) 5(2):111–132 hospitals or extended health facilities, is a leading infectious disease requiring hospital admission in both developed and developing countries including Ethiopia. According to World Health Organization (WHO), pneumonia is the second leading cause of death among the under-fives next to preterm birth complication [3]. It killed nearly to one million children under the age of five in 2015, accounting for 16% of all deaths of children under 5 years old [4]. Fifteen countries that are home for 55% of world’s children shares 72% of the global burdens of pneumonia and diahrrea child deaths [4]. WHO [2] estimated that, about half of the deaths in children under five occurred in 2015 were caused by infectious diseases and conditions such as pneumonia, diarrhoea, malaria, meningitis, tetanus, HIV and measles. Ethiopia was among 15 countries contributing to burden of child mortality due to pneumonia and diarrhea between 2015 and 2016 [4]. In Ethiopia, the child mortality rate is 67 deaths per 1000 live births [5]. Pneumonia has been the main killer of children under age of 5. In 2012 UNICEF [6] reported a 21% of the total deaths among children under five was due to pneumonia. A similar Pattern was observed in the 2000 Ethiopian Demographic and Health Survey (EDHS). The infant mortality rate in the 5 years preceding survey is 77 and under-five mortality is 123 deaths per 1000 live births for the same period. This means that one in every thirteen Ethiopian children dies before reaching age one, while one in every eight does not survive to the fifth birthday. However, these figures declined to 68 deaths per 1000 live births where pneumonia accounts for 18% of the total deaths [7]. Various studies have been expended to identify important risk factors of mortality and morbidity from the Pneumonia patients. Some of the literatures have used spatial and non-spatial analytic techniques to examine patterns of the illnesses, and the fac- tors that determine it [8]. There is also literature that have used multivariate logistic regression to determine the factors that predict in-hospital mortality among patients who require hospitalization for the treatment of CAP [9]. These statistical method- ologies are not capable to consider variability that could be observed in groups and does not take into account the dependency of observation in the same groups/cluster. Therefore, it is necessary to use different approach that may explore the important determinants of child mortality due to pneumonia at patient, hospital and patient’s residence level. This study is, therefore, intended to model the variability of under-five child mor- tality (U5M) of hospitalized pneumonia patients observed at resident or hospital, and identify risk factors associated with U5M due to pneumonia using cross-classified multilevel model. A cross-classified model is a multilevel model where individual or lower level units are nested to two different groups [10]. In this model individual is nested into a classification of two groups, that is, patients admitted to one hospital come from different sub-city, likewise patient from one sub-city admitted on different hospitals. Hence, source of variation at individual/patients and group/hospital, sub- city levels can be considered. This study also contributes to the existing literature by providing methodological demonstration of cross-classified multilevel models in hospitalized patients of community acquired pneumonia. 123 Ann. Data. Sci. (2018) 5(2):111–132 113 Table 1 Population distribution No. Sub-cities Male Female Total across sub-cities of Hawassa City administration. Source 1 Addis Ketema 13,162 13,158 26,320 Hawassa city administration (11) 2 Bahil Aderash 11,007 10,919 21,926 3 Hawalla-Tulla 65,080 61,809 126,889 4 Hayek Dar 13,221 12,442 25,663 5 Mehal Ketema 10,982 10,638 21,620 6 Menehariya 14,642 13,977 28,619 7 Misrak 17,979 16,278 34,257 8 Tabor 33,877 30,464 64,341 2 Materials and Methodology 2.1 Description of Study Area and Data This study had been carried out at Hawassa City, capital of Southern Nations Nation- alities and People’s Region (SNNPR), Ethiopia. The city, which is the economic and cultural hub of the region, has a total area of about 157.2km divided in to 8 sub- cities (Kifle-ketema). These sub-cities (see Table 1) are divided mainly based on the geographical structure. Under Hawassa City Administration, there are a total of five hospitals and six health centres owned by three different sectors (Government, Non-Governmental Organiza- tion, and Private Sectors). All hospitals and one health centre under these sectors are currently admitting patients. These hospitals and health centres are listed in Table 2 with their respective sub-cities. The study used a retrospective study on the under-five patients admitted to hospitals during the period of January 1, 2012 to January 1, 2014. The data were extracted from patients’ cards that were confirmed by physician as a community acquired pneumonia patient. All hospitals and health centres, under Hawassa City Administration that admit patients of CAP during the period of January 1, 2012 to January 1, 2014 were taken into account. These Hospitals/Health Centres are; Hawassa Referral Hospital, Bushulo Health Centres, Adare Hospital, Kibru and Asher General Hospitals. Hawassa city administration has a population of 349,635 people, out of which 169,685 are females and 179,950 are males. Among the population of the city admin- istration, 223,821 people live in urban area, while the remaining 125,814 live in rural area of the administration. Population distributions across the sub-cities are given in Table 1. 2.2 Variables Considered in the Study Variables considered at individual/patient level in the study were selected based on some previous study [9,12–16] and those that are expected to be the fac- tors/determinants of mortality of under-five from hospitalized patients of CAP. 123 114 Ann. Data. Sci. (2018) 5(2):111–132 Table 2 Hospitals and health centers admits patients with respective sub-cities No. Hospitals/health centers Sub-cities Owned by 1 Hawassa referral hospital Tabor Government 2 Adare hospital Hayik Dar Government 3 Bushulo health centers Tulla NGO 4 Asher hospital Meneharia Private 5 Beta-Abrham hospital Bahil Adarash Private 6 Kibru general hospital Bahil Adarash Private Table 3 Independent variables Variable Description Patient level variables Sex Sex of patient Age Age of patient which lie between the interval (0,5] years Season of diagnosed It is the season at which the patients admitted to a hospital/health centre CAP complication If patients suffer other additional disease such as Acquired Immune Deficiency Syndrome (AIDS), Acute Faecal Incontinence (AFI), Acute Gastro Enteritis (AGE) and Malaria Time onset to diagnosis It is the time in which a particular symptom is observed till the diagnosis Length of hospital stay Total day in which patients undergone in the hospital Treatment Treatment taken by patients for CAP during diagnosis Patient refer status It is the status of patients if they are referred from one hospital to another Hospital level variables Bed Occupancy Rate (BOR) The percentage of official beds occupied by hospital inpatients for a given period of time; (Total length of hospital stay in a given month ) BOR = 30 × Number of beds in that month Patient: Physician Ratio (PPhR) Ratio of patient to physician counted in a given month; Number of patients admitted in a given month PPhR = Number of physician on service in a given month Patient: Nurse Ratio (PNR) Ratio of patient to nurse counted in a month Number of patients admitted in a given month PNR = Number of nurse on service in a given month * Length of Hospital stay is the number of colander days from the days of patient admission to the day of discharge Patients’ discharge status was dependent variable in this study. It is a binary, discharge alive or dead, response which was recorded during discharge of patients from hospital. The independent variables at patient’s and hospital levels which could possibly affect this variable are condensed in the following Table 3 with their description. 123 Ann. Data. Sci. (2018) 5(2):111–132 115 2.3 Data Analysis 2.3.1 Multilevel Regression Models The multilevel regression model often called hierarchical multilevel linear model (HLM) assumes that there is a hierarchical multilevel data set, with one single depen- dent variable that is measured at lowest level and explanatory variables at all existing levels [17]. Many kinds of data, including observational data collected in the human and biolog- ical sciences, have a hierarchical, nested, or clustered structure (Fig. 1a). Traditional linear regression techniques are not capable of taking into account data with a hierar- chical multilevel structure [18]. Data with hierarchical structure violates an important assumption of linear regression model. As a result, regression coefficients are not efficient and its standard errors are understated. 2.3.1.1 Two Level Multilevel Regression Models Multilevel regression model can be described as the extension of regression model, which adds random effects at multiple levels. For instance, in this study the interest is to know the risk of dying from CAP if one child is in a neonatal age. In this case, both variables are patient level variables; as a result, one can use simple logistic regression. Since, patients are nested with in hospital, using multilevel models allows us to divide the variance in the outcome variable at the group and individual levels. A two-level multilevel model in health area assigns patients to level 1 and hospitals to level 2. Consider a collection of r independent variables which will be characterized by X = (X , X2,...X ). Then the conditional probability of the outcome of interest 1 r in a study is present (in our case death of child due to CAP) can be denoted by P = P(Y = 1|X = x). The ratio of success probability to failure probability is ij ij known as odds of success. ij i.e, , i = 1, 2,..., n; j = 1, 2, ··· h 1 − P ij Suppose that we have a sample of n independent observations of (X , Y ) for r+1 ij ij variables, where Y denotes the value of dichotomous variable, and Y is the value ij ij Fig. 1 Patients nested to hospitals and hospitals to sub-city (a) and patients nested to cross classification of sub-city with hospital at level 2 (b). a Purely nested data structure. b Cross-classified data structure. Note This figure was adapted for the data analysed here from Fielding et al. [10] 123 116 Ann. Data. Sci. (2018) 5(2):111–132 of the independent variables for the ith individual (patients). Then, the logistic model can be written using odds of success as follows. ij = β X , i = 1, 2, 3, ··· n (2.1) l ijl 1 − P ij l=0 Thus, the data layout can be represented as: ⎛ ⎞ ⎛ ⎞ ⎛ ⎞ y 1x x ... x β 11 12 1k o ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ y 1x x ... x β 21 22 2k 1 ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ . .. . . . . ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ Y = , X = , β = ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ . .. . . . . ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎝ ⎠ ⎝ ⎠ ⎝ ⎠ . .. . . . . y 1x x ... x β n1 n2 nk k The relationship between success probability P and his/her characteristics are non- linear. Hence, by applying logit transformation, in order to have a meaningful interpretation, one would have the following logistic regression model equation. ij log = β X , i = 1, 2, 3,... , n l ijl 1 − P ij l=0 logit P = β X (2.2) ij l ijl l=0 where β = [β ,β ,...,β ] 0 1 k β is the intercept and β , .., ., ., ..β are coefficients of the p explanatory variables 0 1 p (which may be continuous or dummy variables). A two level logistic regression models allow to assess variation in a dependent variable at hospital and patient level simultaneously in which patients are nested within hospitals. The response variable is denoted by Y : ij 1 if child i is dead due CAP in hospital j Y = ij 0 if child i discharged alive from hospital j with probabilities P = P(Y = 1/X ), probability of dead for patient ‘i’ in hospital ij lij ij ‘j’, and 1-P , probability of discharged alive for patient i and hospital j. The standard ij assumption is that Y has a Bernoulli distribution. Let the P be modelled using logit ij ij link function. 123 Ann. Data. Sci. (2018) 5(2):111–132 117 The two-level model is given by: ij logit(P ) = log = β ij oj 1-P ij + β x ; l = 1, 2, 3, ..., r ; j = 1, 2,..., h and i = 1, 2, 3 ... , n (2.3) lj li j l=1 where β = β + u ,β = β + u ,...,β = β + u this becomes: 0 j 0 0 j 1 j 1 1 j rj r rj r r ij logit P = log = β + β x + u + u x (2.4) ij 0 l 0 j li j lij lj 1 − P ij l=1 l=1 X = (X , X ,..., X ) represents the first and the second covariates, (β ,β , ··· , ij 1ij 2ij rij 0 1 β ) are regression coefficients, u , u , u , ··· u are the random effect of model r 0j 1j 2j kj parameter at level two. It is assumed that these level two random effects distributed normally with mean 0 and variance σ . The Empty Model The empty two-level model for dichotomous outcome variable refers to population of groups (level-two units) and specifies the probability distribution for the group- dependent probabilities P , where P characterised by Y = P +e , without taking ij ij ij ij ij further explanatory variables into account. The logit transformed models, logit(P ), ij has a normal distribution. Then the empty model is expressed by formula: logit(P ) = γ + u (2.5) ij 0 0 j where γ is the population average of the transformed probabilities and u the random 0 0 j deviation from this average for hospital j. The residual term associated with deviation, u is a unique effect of hospital j on the outcome variable; it is assumed to be normally 0 j and independently distributed with mean of zero and constant variance, σ that is u ∼ N(0, σ ). 0 j The level 2 residual variance captures the variation across hospital means. With this model, the amount of variance in the outcome variable that is attributable to within group characteristics (here, patients) and to between group differences (hospitals) can be identified. This model (2.5) does not include a separate parameter for the level- one variance [19]. This is because the level one residual variance of the dichotomous outcome variable follows directly from the success probability is indicated below. var(e ) = P (1 − P ) (2.6) ij ij ij where e is individual-dependent residual. ij Define π - the probability corresponding to the average value by the γ , as follows: 0 0 f (π ) = γ 0 0 For the logit function, the logistic transformation of γ is given by: 123 118 Ann. Data. Sci. (2018) 5(2):111–132 exp(γ ) π = logistic(γ ) = (2.7) 0 0 1 + exp(γ ) Now the variance for probability of success (death of child from CAP in-hospital j) can be given by: 2 2 var(P ) = (π (1 − π )) σ (2.8) ij 0 0 Random Intercept Model The extension of equation 2.5 above in which it incorporates explanatory variables is termed as random intercept model. The success probability is not necessarily the same for all individual in a given group. Therefore, the success probability now depends on the individual as well as on the group, and is denoted by P ,[19]. Accordingly, the ij outcome is split into an expected value and a residual i.e.: Y = P + R (2.9) ij ij ij The logistic random intercept model expresses the log-odds, i.e., the logit of P is: ij logit(P ) = γ + γ x + u (2.10) ij 0 l li j 0 j l=1 As usual, the deviation u are assumed to be independently distributed with zero 0 j mean and variance σ . Equation (2.10) does not include a level one residual because it is an equation for the probability rather than the outcome Y . Thus, the level residual ij has already included in the equation. Random Slope Model Another model which is used to assess whether the slope of any of the explanatory variable has a significant variance component between the groups is called the random slope model. Assume that effect of variable X = (X , X , ··· , X ) across the j 1j 2j kj groups, and accordingly has a random slope. Equation (2.10) for the logit of the success probability is extended with the random effect u x (19), which leads to: 1 j 1ij, logit(P ) = β + β x ij 0j lj li j l=1 β = γ + u 0j 0 oj β = γ + u (2.11) lj l lj The characteristic that is unique to this model, compared to others discussed here, is that the random coefficients β are themselves dependent variables in a second regression equation. Of course, in RSM we have: logit(P ) = γ + γ x + u x + u (2.12) ij 0 j lj li j lj li j oj l=1 There are now r+1 random group effects, the random intercept u and the random oj slope u . It is assumed that both have a zero mean. Their variances are denoted by lj 123 Ann. Data. Sci. (2018) 5(2):111–132 119 2 2 σ ,...,σ respectively and their covariance is denoted by σ . This can be described 0r by the covariance matrix . ⎡ ⎤ σ σ ...σ 01 0r ⎢ ⎥ σ σ ...σ 10 1r ⎢ 1 ⎥ ⎣ ⎦ .. σ σ ....σ r0 r1 2.3.1.2 Two Level Cross-classified Multilevel Models Unlike hierarchical multilevel data with a purely nested structure, data that are cross- classified not only may be clustered into hierarchical multilevel ordered units but also may belong to more than one unit at a given level of a hierarchy [10]. Goldstein [20] labelled multilevel model for purely nested structure as hierarchical multilevel (HM) models. However, multilevel level models for cross-classified data structure is labelled as cross-classified multilevel (CCM) models. In a cross-classified design, patients at a given hospital might be from several different resident places/sub-cities and one sub-city might have patients who attend a number of different hospitals. In this type of scenario, hospital and sub-city are considered to be cross classified factors, and cross classified random effects modelling (CCREM) should be used to analyse these appropriately. The consequences of ignoring an important cross-classification are similar to those of ignoring an important hierarchical multilevel classification [21]. Therefore, it is very important to consider the classification at existing level (second level in this study). In Fig. 1 the first diagram illustrates a purely nested data structure. In this structure, there are three levels; patients, hospitals and sub-city. A typical hierarchical multilevel model has patients nested in hospitals, and hospitals nested within sub-cities. In a cross-classified multilevel structure, however, patients from a given sub-city might visited different hospital and hospital might serve patients from different sub-city. Subsequently the cross classified data structure occurs at level 2. In this example, we can take into account influences coming from two different factors: sub-cities and hospitals. This may improve the estimation of explanatory variable effects ensuring the structure is properly specified [10]. Assume one has patients nested within a cross-classification of hospital by sub-city, which is the case illustrated in Fig. 1b and P be the probability of child to i(hosp,subc) die from CAP in hospital ‘hosp’ and came from sub-city ‘subc’. logit(P ) = γ + γ x + u 0 l li(hosp, subc) hosp(0, j ) i(hosp, subc) l=1 + u x + v (2.13) hosp(l, j) i(hosp, subc) subc(i) l=1 v is the sub-city effect which are mutually independent random variables and subc(i ) also independent to u , which is hospital random effect. All v are normally hosp(0, j) subc(i ) distributed with mean 0 and variance τ . 123 120 Ann. Data. Sci. (2018) 5(2):111–132 Note the notation used in this paper for cross classified model is adapted from Fielding et al. [10]. Like HM model, in a CCM model, before introducing explanatory variables, the variation in outcomes could be examined by looking at the differences between hospi- tals, between sub-cities and between individual patients after controlling for sub-cities and hospital effects [22]. This gives a basis for extending the model to identify which sub-cities, hospitals, and patients’ characteristics might explain some part of these component variances. After adding explanatory variables, the residual components of variances can be estimated, this will provide more information about the variation in outcomes [21]. 2.3.2 Method of Parameter Estimation Estimation of CCM models by existing maximum likelihood approaches run into important computational limitations, especially when large numbers of units are involved [21]. As a result, the models used in this paper are fitted using Markov Chain Monte Carlo (MCMC) based algorithms as implemented in the MLwiN package [23]. Starting values for the fixed parameters are estimated from simpler models using a maximum likelihood approach, penalized quasi- likelihood (PQL) in MLwiN [23]. Markov Chain Monte Carlo(MCMC) is a simulation techniques that generate ran- dom samples from a complex posterior distribution [22]. The simulated posterior distribution is then used to compute a point estimate and a confidence interval. The posterior distribution of θ is defined from Bayes’ theorem as p(θ /data)/p(data/θ )p(θ ) Here p(θ) is the prior distribution for the parameter vector θ and should represent all knowledge we have about θ prior to obtaining the data. In multilevel models, a non-informative prior for both fixed effect and random effect parameters are recommended [18]. The default prior distribution applied in MLwiN when MCMC is used are ‘flat’ or ‘diffuse’ priors for all parameters (23). Thus, according to Browne et al. [23] the prior for a fixed parameter, β in MLwiN is an improper uniform distribution, p(β) ∝ 1, which is equivalent to normal prior with large variance. The prior for scalar variance, σ is specific as inverse gamma, p(σ ) ∼ invGamma(α, β). In MLwiN, the prior applied for covariance matrices, −1 is an inverse Wishart distribution, p( ) ∼ Wishart (p, p, ), where p is the number of rows in the variance matrix and  is the estimate for the true value of . Consider a two-level random slope binary logistic regression in equation (2.12), r r logit(P ) = γ + γ x + u x + u ij 0 j lj li j lj li j oj l−1 l=1 with Y = P + R is the outcome variables where  = σ I is level one residual ij ij ij ε nj is i.i.d.  is a level two covariance matrix. In this study, the prior distribution used for all fixed parameters is non-informative normal distribution with variance 10 and different values of α and β in gamma distribution for scalar variance prior was used. 123 Ann. Data. Sci. (2018) 5(2):111–132 121 Table 4 Summary result of discharge Status of Patients with their correspondent hospitals/health centers Hospitals/health centre Discharged status Total Number of alive (%) No. of death (%) Referral hospital 96 (92.3) 8 (7.7) 104 Bushulo health centre 127 (80.4) 31 (19.6) 158 Adare hospital 23 (71.9) 9 (28.1) 32 Asher and Kibru hospital 9 (81.8) 2 (18.2) 11 Total 255 (83.6) 50 (16.4) 305 (100) Consider Eq. (2.12) above, random slope model, in which each observation is considered as an outcome of a Bernoulli trial (y ∼ Bernoulli(p )), and hence for ij ij the ith patient in jth hospital: ij 1−y ij P(Y = y ) = p (1 − p ) (2.14) ij ij ij Assuming the n observations are independent, the likelihood function is given by: t 1−y y ij ij n n −X β ij 1 e ij 1−y ij L(X/θ) = p (1 − p ) = ij ij t t −X β −X β ij ij 1 + e 1 + e i =i i =1 (2.15) where θ stands for the joint parameters, u ,Σ ,Ω . j ε u The posterior distribution is obtained by multiplying the prior distribution over all parameters by the full likelihood functions. The posterior distribution is: f (θ/data) ∝ p(u )Xp( )Xp( )X f (data/θ) (2.16) j ε u 3 Results and Discussion 3.1 Results This study was carried out to model variability in mortality of under-five hospitalized pneumonia patients, using a retrospective data on the patients admitted between Jan- uary 2012 and January 2014. The study used a sample of 305 under-5 year patients out of which 50 (16.4%) died in the hospital and the rest of them were discharged alive from hospitals. A hierarchical two level logistic regression and cross-classified logistic regression with the classification of ‘sub-city’ and hospitals at second level was modelled. Then these two models are compared based on their DIC. It can be seen from Table 4, more death occurred in Bushulo Health Centre which accounted about 10.2% of the total death occurred during the retrospective interval time. Likewise, in Hawassa University Referral Hospital, about 2.6% of the patients of CAP from Hawassa city were expired. In the same manner, the distribution of patients discharge status is displayed in the following Table 5. 123 122 Ann. Data. Sci. (2018) 5(2):111–132 Table 5 Summary result of discharge status of patients with their resident Resident/sub-city Discharged status Total No. of alive (%) No. of death (%) Addis Ketema 21 (77.8) 6 (22.2) 27 Bahil Aderash 20 (100) 0 (0) 20 Hawalla-Tulla 60 (72.3) 23 (27.7) 77 Hayek Dar 22 (88) 3 (12) 25 Mehal Ketema 26 (86.7) 4 (13.3) 30 Menehariya 22 (95.7) 2 (4.3) 24 Misrak 18 (69.2) 8 (30.8) 22 Tabor 66 (93.0) 5 (7.0) 80 Total 255 (85.6) 50 (16.4) 305 (100) Table 6 DIC values of each models for comparison of parameters in prior distribution of single variance Models (α, β) (5, 5) (10, 5) (5, 10) (10, 10) (100, 100) Two level HM empty model 252.87 252.778 253.83 252.69 252.55 Two level HM random intercept model 181.61 180.62 181.11 181.87 179.82 Two level CCM empty model 265.10 269.26 256.08 264.44 264.32 Two level CCM random intercept model 179.75 180.69 181.04 179.93 179.26 3.1.1 Setting for Bayesian Inference In this study, non-informative priors have been used which express the complete igno- rance of the value of the parameter. Accordingly, as we mentioned on chapter three Sect. 3.5.3, we have used normal distribution with mean 0 and variance 10 (huge variance) for fixed parameters. However inverse gamma distribution with parameter α and β for single variance was used. The value of parameters for this distribution was chosen based on Bayesian deviance information criteria (BDIC). Table 6 shows the proposed parameter value for prior distribution of single variance. On Table 6, BDIC is smaller when we use α = 100 and β = 100 in each model except on cross classified empty model. Since, there was no significant change on the values of BDIC, the same prior parameter for these models were used too. The number of iteration used in the analysis for all of the parameters (fixed and random effect) was set up to be 50,000 with ‘thining’ number of 1; the first 5000 was discarded(burn-in) and the rest of the chains was used to summarize the posterior distribution. 3.1.2 Cross-Classified Multilevel Logistic Regression Results Now like the case of HM model, an empty model can be extended to random inter- cept CCM model using explanatory variables those were significantly associated with 123 Ann. Data. Sci. (2018) 5(2):111–132 123 discharge status of patients. In CCM random intercept model we vary the intercept across both hospitals and sub-city. In this analysis, the coefficients of Intercept (−10.323), number of patients attended (3.024), spring (2.021), AGE (1.893), AFI (2.017), Malaria (1.967), postnatal age group (−2.383), Child age group (−2.466), duration of onset to diagnosis from 4- 7days (1.556), duration of diagnosis more than a week (2.165), Patient to Physician ratio (0.014) coefficient of Patient to Nurse ratio (0.330) were statistically significant at 5% level of significance. In the model, more variation (19.3%) of discharge status of the patients (dead or alive) was due to hospital as compared to variation attributable to residents/sub-city (18.9%). The proportion of variation of discharge status explained by explanatory variables is 78.95% in random intercept of hierarchical multilevel logistic regression. In other words, season of diagnosed, CAP complicated disease, duration till diagnosed, age, patient refer status, patient to nurse ratio and patient to physician ration explains about 78.95% of variation in whether patients were dead or alive due to community acquired pneumonia (Tables 7, 8). 3.1.3 Model Diagnostic Check In MLwiN statistical Packages, MCMC estimation methods can be easily run by taking either PQL result or MQL results as an initial point for simulation. The packages finally display MCMC summery statistics and MCMC plots which is important to test the diagnostic convergence of MCMC chain. Under MCMC summery statistics the package computes chain means (posterior means), the chain standard deviation, etc. Raftry and Lewis diagnostic [24] is a diagnostic based on a particular quantile of the distribution. The diagnostic that is used to estimate the length of Markov chain required to estimate a particular quintile to a given accuracy. In MLwiN the diagnostic is calculated for the two quintiles (the defaults are the 2.5 and 97.5% quintiles) that will form a central interval estimate. The dependency among the Markov chain samples are measured by the autocorre- lation which is one method of model diagnostic of convergence. MCMC algorithms do not produce independent samples from the distribution. Rather, it generates samples that are auto correlated. As in time series analysis, we can compute the autocorrelation function at each lag (1+) and include the number of iterations we need to skip in order to have a non-significant autocorrelation. High correlations between long lags indicate poor mixing. Thus, low autocorrelation indicates good convergence. All the coefficients were tested using kernel density plot, Lewis and Reftry test PACF,and ACF. Thus, the results were obtained after all were satisfied which can be attained with increasing number of iteration. 3.1.4 Model Selection Model was selected based on deviance information criteria (DIC) found for each model, where model with small deviance information criteria was selected for inter- pretation of the results. We have used deviance information criteria to choose better model for this analysis. The DIC for cross-classified random intercept model was small as compared to the other models. 123 124 Ann. Data. Sci. (2018) 5(2):111–132 Table 7 Cross-classified and hierarchical multilevel logistic model-random intercept models Variables Cross-classified multilevel model Hierarchical multilevel model PQL estimates SE (PQl) MCMC SE (MCMC) Exp (β) p value PQL SE (PQl) MCMC SE Exp (β) p value estimates estimates estimates (MCMC) Fixed effect Intercept −7.192 1.863 −10.323 2.407 0.000 0.000* −6.162 1.830 −8.750 2.088 8.067 0.000* PREFSTS 0.997 0.864 3.024 1.165 20.57 0.004* 0.574 1.051 0.669 1.156 3.177 0.281 Season Autumn (ref) Summer 1.189 0.923 1.343 1.111 3.83 0.111 1.328 0.800 2.055 1.231 7.806 0.036* Winter 0.814 0.924 0.803 1.136 2.23 0.240 1.029 0.782 1.675 1.215 5.338 0.210 Spring 1.471 0.836 2.021 1.053 7.54 0.027* 1.285 0.735 2.007 1.184 7.441 0.022* Diseases complicated No (ref) AGE 1.801 0.586 1.893 0.671 6.64 0.002* 1.527 0.542 2.147 0.659 8.556 0.000* AFI 2.030 0.720 2.017 0.816 7.52 0.000* 1.847 0.687 2.388 0.879 10.892 0.000* Malaria 1.710 0.813 1.967 0.859 7.15 0.011* 1.557 0.749 2.184 0.895 8.882 0.001* Others 0.397 0.716 0.493 0.875 1.64 0.286 0.429 0.610 0.540 0.903 1.716 0.241 Age Neonatal (ref) Postnatal −1.734 0.713 −2.383 0.897 0.09 0.003* −1.797 0.677 −2.406 0.872 0.090 0.052 Child −2.062 0.754 −2.446 0.933 0.08 0.004* −1.971 0.711 −2.585 0.895 0.075 0.038* Ann. Data. Sci. (2018) 5(2):111–132 125 Table 7 continued Variables Cross-classified multilevel model Hierarchical multilevel model PQL estimates SE (PQl) MCMC SE (MCMC) Exp (β) p value PQL SE (PQl) MCMC SE Exp (β) p value estimates estimates estimates (MCMC) Duration to onset diagnosis ≥3 days (ref) 4–7 days 1.416 0.479 1.556 0.561 4.74 0.002* 1.164 0.420 1.651 0.566 5.212 0.000* >7 days 1.862 0.603 2.165 0.720 8.71 0.000* 1.635 0.543 2.307 0.731 10.044 0.000* Hospital level variables PPhR 0.019 0.005 0.014 0.008 1.01 0.003* 0.014 0.005 0.017 0.006 1.017 0.023* PNR 0.198 0.043 0.330 0.069 1.39 0.000* 0.235 0.051 0.330 0.060 1.391 0.000* BOR 1.734 1.906 −0.925 1.996 0.319 0.425 1.766 1.729 2.691 2.085 14.716 0.468 Random parameters σ = 0.108 0.599 1.026 0.102 0.001* 0.527 0.435 1.034 0.103 0.002* Var (u ) hosp (Between hosp) τ = 0.000 0.000 1.010 0.104 0.002* – – – – – Var (υ ) subc(i ) (Betweensub-c) ICC (Subc) 0.189 – ICC (Hosp) 0.193 0.187 R (%) 78.95 58.5 DIC 179.26 180.62 * Significant at 5% level of Significance, Ref Reference category, ICC intra class correlation coefficient, BDIC Bayesian deviance information criteria 126 Ann. Data. Sci. (2018) 5(2):111–132 Table 8 Bayesian deviance information criteria for all models Models DIC pD (number of R parameter) Hierarchical multilevel models Empty models 252.55 11.15 – Random intercept model 180.62 16.69 0.585 Cross-classified multilevel models Empty models 264.32 46.12 – Random intercept model 179.21 20.80 0.7895 All of the variables significantly predicted discharge status of patients in both ran- dom intercept models except referred status of patients. This variable was significant in CCM random intercept model. However, it was insignificant in HM random intercept model. In addition, the explained proportion of variation (R ) in outcome variable is large for CCM random intercept model. The model comparison result suggests CCM logistic model as the best fit for dis- charge status of hospitalized pneumonia patients. Thus, the effects of each covariate can be interpreted using the estimated odds observed under cross-classified multilevel logistic. The odds of dying due to community acquired pneumonia are the probability that the patient died divided by the probability that the patient discharged alive. Most of the patient level variables included in the model significantly affect the discharge status of hospitalized pneumonia patients. Season diagnosed, CAP compli- cation, duration of onset to diagnosis and age are patient level variable that significantly affect the discharge status of hospitalized pneumonia patients. Whereas, patient to physician ratio and patient to nurse ratio are hospital level variable that significantly affect the outcome of response variable. The value of Exp(β) for spring season is 7.54 which implies that the odds of being at risk to death during spring season were 7.54 times more likely than patients diagnosed in winter season. In CAP complication variable, there was high probability of death on patients with AGE, AFI, and malaria than patients only with CAP. The odds of these coefficients are 6.64, 7.52 and 7.15, respectively. There was high probability of dying from CAP if the patients are complicated with these diseases. There is less likely to die from CAP, if patients are postnatal and child age than those patients in neonatal age group. Accordingly, the risk of dying from CAP for postnatal age group and child age group patients was less by 91 and 92% respectively. On the result, it was found that patients who late diagnosis has high probability of being at risk than those admitted before 3 days. The odds of being at risk for patients who diagnosed after a week is 8.71 times than those patients diagnosed before three days. The odds of being at risk for patients who diagnosed with 4 to 7 days was also 4.74, which means the risk of dying from pneumonia if patients diagnosed between these days was 4.74 times than those diagnosed before three days keeping another risk factors constant. Patient to nurse ratio, patient to physician ratio were variables significantly predicts the discharge status of patients in the chosen CCM logistic model among hospital level variables. The odds of being at risk increases by 1.39 for a unit increases of patient to nurse ratio keeping the others risk factors constant. In the same manner, 123 Ann. Data. Sci. (2018) 5(2):111–132 127 the odds of being at risk increases by 1.01 for a unit increase of patient to physician ratio keeping the risk factors constant. These means number of patients increased by the number physician/nurses keeping number of physician/nurse constant. Generally, patients diagnosed during patient to nurse ratio and/or patient to physician ratio was large had high probability of dying from CAP. 3.2 Discussion CCM logistic random intercept model revealed that more of the variation of discharge status of patients (alive/dead) was due to hospitals as compared to variation attributable to resident/sub-city. During the period of our study, the pandemic strain of community acquired pneumonia resulted in the death of 16.34% patients in the sample. Some of previous study had found approximate result, for example, 18.79% in [12] 14.7% in Cuietal. [14], 23.8% in Kuo-Tung [25]. On the result, it was also observed that patients admitted due to the case (CAP) during spring season was high in all hospitals; this is may be due to the fact that there was high prevalence of CAP in that season. In this study discharge status of hospitalized patients due to CAP was not associated with sex as reported earlier by Teka [12] and Kuo-Tung [25]. There was also no associ- ation between treatment given by the physician and patients discharge status. Since all patients were hospitalized which means they were under the care of nurse and physi- cians there might be a proper usage of treatments. Thus, the non-significance of associ- ation between discharge status and treatment taken might be due to this proper usage. In the study, also it was found that length of hospital stay (LOS) is not significantly associated with discharge status of CAP under-five patients. The median LOS was 4 days which means 50% of patients admitted due pneumonia stayed in the hospital were less than 4 days. Similar study in Spain during 2003 had found that 9% of LOS due CAP,(16). In this model, some variation of discharge status of under-five age CAP patient due to hospitals and residence/sub-city were 19.3 and 18.9% respectively. The previous studies [12] that used some other statistical methods like binary logistic regression in which it is assumed that all patients were independent also found a significant association between the residence and discharge status of patients. Thus, this results in Type I error [26]. Finally, it was investigated that hospitals and sub-city also contributed to a variation among discharge status of CAP patients. The variation due to hospitals might be due to service given, quantity and quality of physician/nurse, in-efficient management, etc. In the same manner inaccessibility of health facility, neatness of the environment which results in other complicated disease such as malaria, high population density which results in high prevalence of pneumonia disease, could be the reason for the variation due to residence/sub-city. In the study, it was found that patients admitted in summer and spring season had high risk of dying from CAP as compared with other seasons. According to Lieberman’s [15] finding there is high incidence of CAP during spring and summer seasons. This could be the reason for the high risk of death from CAP in these seasons. 123 128 Ann. Data. Sci. (2018) 5(2):111–132 Result showed that patients who were delayed to be diagnosed had high risk of dying from CAP than those patients who diagnosed early. An appropriate usage of treatment as well as delaying to start treatment for one disease leads to high risk of dying. This is may be due to the fact that; the expansion of causing organisms increases through time in patients’ body. As a result, duration of onset to diagnosis was a significant predictor of discharge status of patients. This result match with results of Menendez et al. [16] and Teka [12]. Among hospital level variables patients to nurse ratio and patients to physician ratio significantly predict discharge status of CAP patients. Patients who admitted during patients to nurse ratio was high had high risk of dying from pneumonia. Since patients in hospital are nurtured by nurses and this has a positive impact on the recovery from their illness. Fortunately, patients who admitted during ratio of patient to nurse is high has less chance to survive as it is compared to others patients. Thus, the result obtained had a complete agreement with this logic. Another important hospital level variable which again significantly predicted the risk of dying from pneumonia was patient to physician ratio. Patients who admitted during patients to nurse ratio was high had high probability of dying from CAP. Bed occupancy rate which is one measure of quality of hospitals was insignificantly associated with discharge status of under patients. The reason for this result is not clear but it might be due to the fact that, in this study, there was no patients admitted during the beds were crowded. 4 Conclusions and Recommendations 4.1 Conclusions sThis study was aimed to model the mortality of hospitalized under-five pneumonia patients and identify risk factors associated to discharge status of patients. Hierarchi- cal multilevel and cross-classified multilevel were used. It was interesting to find out that cross-classified multilevel model was better than simple hierarchical multilevel modelling in fitting to the data and explore variability attributed to hospital and resi- dence. The variability attributable to hospital was found to be larger than variability attributable to residence. From the analysis, it was found that age of patients, season of diagnosis, duration of onset to diagnosis, complicated disease, patient to physician ratio, patient to nurse ratio. Hospitals and place of residence/sub-city were found to contribute significantly to the variability of patient’s status (dead/alive). 4.2 Recommendations Scaling up of interventions such as health promotions on appropriate and effective treatment in home or community based care should be emphasized across all sub-cities. And also, the awareness of early diagnosis to the community should be expanded via health extensions, clinicians or any concerned body in collaboration with governmen- tal and non-governmental organizations. Health facilities based on quality parameter of hospitals like bed occupancy rate, patient to nurse ratio, and patient to physician 123 Ann. Data. Sci. (2018) 5(2):111–132 129 ratio should be improved in hospitals and be handled with effective management. Finally, researchers who are interested on similar area are recommended to choose cross-classified models for the analysis in the case where patients are nested to the classification of hospitals and resident to explore variation of response variable at all existing levels. Acknowledgements I would like to acknowledge Bushulo Health Centre, Hawassa Referral Hospital, Adare Hospitals Kibru Hospital, Asher General Hospital Managers’ for providing me the data and reading materials used for this study. Thanks are also due Hawassa City administration and Central Statistical Agency-Hawassa Branch offices for honestly supplying me secondary data on population size of sub-cities of Hawassa city. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 Interna- tional License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. Appendix 1: MLwiN Results MLwiN Result for Equations of Hierarchical and Non-Hierarchical Multilevel Models See Figs. 2, 3. Fig. 2 MLwiN equation result for multilevel logistic regression-random intercept model 123 130 Ann. Data. Sci. (2018) 5(2):111–132 Fig. 3 MLwiN result equation for cross-classified multilevel logistic regression- random intercept model 123 Ann. Data. Sci. (2018) 5(2):111–132 131 Appendix 2: Cross-Classified Multilevel Model Coefficients’ Convergence Test Intercept Coefficient of Duration of Onset (>7 days) Coefficient of autumn Coefficient of Patient refer Status Coefficient of Postnatal Coefficient of winter Coefficient of Child Coefficient of spring Coefficient of patient-to-physician ratio Coefficient of AGE Coefficient of Patient to Nurse Ratio Coefficient of AFI Coefficient to Bed Occupancy Rate Coefficient of Malaria Random Effect Variance of Hospitals Coefficient of Others Random Effect Variance of Sub-cities Coefficient of Duration of Onset (4-7days) 123 132 Ann. Data. Sci. (2018) 5(2):111–132 References 1. Nga Tong, BA, M. (2013). Background Paper 6.22 Pneumonia. “A Public Health Approach to Inno- vation,” (May), 7–8. Retrieved from http://www.who.int/medicines/areas/priority_medicines/BP6_ 22Pneumo.pdf 2. UNICEF. (2015). Levels and trends in child mortality. New York: UNICEF (pp. 1–30). Retrieved from http://www.who.int/maternal_child_adolescent/documents/levels_trends_child_mortality_2013/en/ 3. World Health Organization (WHO) (2015) World Health Statistics 2015. World Health Organization. Luxembourg 4. International Vaccine Access Center (IVAC). (2016) Pneumonia and diarrhea progress report? reaching goals through action and innovation. Johns Hopkins Bloomberg School of Public Health (pp. 1–46) 5. Central Statistical Agency (CSA) [Ethiopia] I (2016) Ethiopia demographic and health survey 2016: key indicators report. Addis Ababa, Ethiopia, and Rockville, Meryland, CSA and ICF. Retrieved from https://www.usaid.gov/sites/default/files/documents/1860/ Ethiopia%20DHS%202016%20KIR%20-%20Final%2010-17-2016.pdf 6. UNICEF (2012) Pneumonia and diarrhoea: tackling the deadliest diseases for the world’s poorest children. New York 7. Legesse H, Degefie T, Hiluf M, Sime K, Tesfaye C, Abebe H et al (2014) National scale-up of integrated community case management in rural Ethiopia: implementation and early lessons learned. Ethiop Med J 52(Supplement 3):15–26 8. Crighton EJ, Elliott SJ, Moineddin R, Kanaroglou P, Upshur R (2007) A spatial analysis of the determi- nants of pneumonia and influenza hospitalizations in Ontario (1992–2001). Soc Sci Med 64:1636–50 9. Marrie TJ, Wu L (2005) Factors influencing in-hospital mortality in community-acquired pneumonia: a prospective study of patients not initially admitted to the ICU. Chest 127(4), 1260 – 1270. doi:10. 1378/chest.127.4.1260 10. Fielding A, Goldstein H (2006) Cross-classified and multiple membership structures in multilevel models: an introduction and review 11. Hawassa Facts and Figures [Internet]. [cited 2017 Mar 22]. Available from http://www.hawassaonline. com/facts.php 12. Teka Z (2011) Statistical Analysis of risk factors for mortality of in-hospital patients due to pneumonia: a case study at bushulo major health centre, Hawassa. Hawassa University, Ethiopia 13. Raza M, Ahmed a, Ahmed F, Ghani A, Rizvi N (2012) Seasonal incidence of community acquired pneu- monia and its mortality in Karachi-A multi-centric hospital based study. IpublishingCo.in 3(2):885–894 14. Cui W, Zhao H, Lu X, Wen Y, Zhou Y, Deng B, et al. (2010) Factors associated with death in hospitalized pneumonia patients with 2009 H1N1 influenza in Shenyang, China. BMC Infect Dis. 10(1):145 15. Lieberman D, Porath A (1996) Seasonal variation in community-acquired pneumonia. Eur Respir J 9(12):2630–2634 16. Menéndez R, Cremades MJ, Martínez-Moragón E, Soler JJ, Reyes S, Perpiña M (2003) Duration of length of stay in pneumonia: influence of clinical factors and hospital type. Eur Respir J 22(4):643–648 17. Hox J (1995) Applied multilevel analysis, vol 274. TT-Publikaties, Amsterdam 18. Gelman A (2006) Prior distribution for variance parameters in hierarchical models. Int Soc Bayesian Anal 1(3):515–533 19. Snijders TAB, Bosker RJ (1999) Multilevel analysis: an introduction to basic and advanced multilevel modeling. Sage, New York 20. Goldstein H (1999) Multilevel statistical models. Wiley, New York 21. Rasbash J, Browne W (2008) Non-hierarchical multilevel models. In: Handbook of multilevel analysis. pp 301–34 22. Alemdar M (2009) A monte carlo study: the impact of missing data in cross-classification random effects models. Georgia State University, Atlanta 23. Browne WJ, Charlton C, Rasbash J, Kelly M, Pillinger R (2016) MCMC estimation in MLwiN version 2.36. Centre for mutlilevel modelling. University of Bristol, Bristol 24. Raftery AE, Lewis S (1991) How many iterations in the Gibbs sampler?. Oxford University Press, Oxford 25. Huang K-T, Tseng C-C, Fang W-F, Lin M-C (2010) An early predictor of the outcome of patients with ventilator-associated pneumonia. Chang Gung Med J 33(3):274–82 26. Raudenbush SW, Bryk AS (2002) Hierarchical Linear models: applications and data analysis methods. Sage, New York

Journal

Annals of Data ScienceSpringer Journals

Published: Jul 28, 2017

References