A novel size biased distribution: Regression model, INAR(1) process and Applications in Environmental and Medical Sciences
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
A new statistical distribution, termed as Size-Biased Poisson Xgamma distribution, is introduced in this study to enhance modeling and interpretation of data arising from size-biased sampling schemes. In such settings, the probability of selecting a unit is proportional to its size, resulting in the overrepresentation of larger observations. In this work, we consider the specific case where the size of a unit is defined to be the observed value of the response variable, thereby focusing on size-biased distributions where selection is directly linked to outcome magnitude. We derive some structural features of the model and a thorough reliability analysis is also carried out. For parameter estimation, Maximum likelihood estimation (MLE) and method of moments (MoM) are used. As the MLE is not available in closed form, the likelihood function is maximized numerically and a simulation analysis based on MLE further validates the robustness of the model. Additionally, we extend the usefulness of the SBPXG model in predictive analytics by developing an associated regression model for applications with relevant covariate data. Furthermore, we present an INAR(1) process under the SBPXG model to handle count data scenarios. We investigate its special characteristics including conditional maximum likelihood (CML) method. By examining three different datasets, we show that the SBPXG model continuously performs better in terms of goodness-of-fit than other models. Additionally, the Vuong’s Likelihood Ratio Test is applied to formally compare non-nested models, and the results confirm that the SBPXG regression model offers a significantly better fit than conventional alternatives. The findings demonstrate the model’s flexibility and strong potential for analyzing size-biased and overdispersed count data, particularly in healthcare and environmental domains.