Research Article | | Peer-Reviewed

Developing Robust Exponential Ratio and Product Estimators for Post-Stratification: Addressing Non-Response and Enhancing Accuracy Through Double Sampling

Received: 4 January 2025     Accepted: 20 January 2025     Published: 7 February 2025
Views:       Downloads:
Abstract

A key survey sampling technique, post stratification, involves dividing a population that is diverse into strata with rather homogeneous members in order to improve population estimates. Modern large scale surveys often suffer from non-response, and these too will yield biased results. According to this paper, a new estimator which combines non response adjustment and double sampling techniques to improve the accuracy of product type and exponential ratio estimators is introduced. The proposed estimator has lower bias and mean squared error (MSE) on population mean estimators when response rates are incomplete. Theoretical derivations and empirical analysis on two real-world datasets—one on classroom activities and the other on agricultural yields—are shown to validate the estimator's performance. The Non Response Double Sampling (NRDS) estimator turns out to be much more efficient than traditional post stratification, separate ratio, and product type exponential estimators. These results indicate that the NRDS estimator is a robust and reliable way to handle missing within units within strata in stratified surveys, and that this estimator is a better way to improve the population mean estimates under conditions of non response.

Published in American Journal of Mechanics and Applications (Volume 12, Issue 1)
DOI 10.11648/j.ajma.20251201.11
Page(s) 1-10
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2025. Published by Science Publishing Group

Keywords

Post-stratification, Non-response, Double Sampling, Exponential Estimator, Ratio Estimator, Product Estimator, Auxiliary Information, Bias Reduction, Mean Squared Error

1. Introduction
Post stratuification is an important survey sampling technique which reduces the error in the estimates of population and by dividing a more or less varied population into more or less consistent groups or strata. Traditional post stratification was based on achieving complete response rates, but modern large scale surveys suffer due to the problem of non-response, which can introduce biased outcomes. In this paper, we propose using non response adjustments and a dual sampling approach to propose a robust estimator for exponential ratios and product types. The function of the estimator is meant to improve upon population mean estimation precision where responses are unavailable, by competing with information that is available. Empirical analysis of two real world datasets and our theoretical derivations show that the proposed estimator reduces bias and mean squared error (MSE) comparably to current methodologies. The first dataset concerns educators and learners while the second deals with agricultural output. Empirical results demonstrate that the NRDS estimator, which pools together non response and double sampling approaches, more efficiently estimates than the product type exp, separate ratio, and traditional post stratification ones. This explains why the NRDS estimator acts as a dependable and robust instrument to fill in for absent data from stratified surveys.
In recent years, there are a lot of ways to deal with the drawbacks of using traditional post-stratification estimators. In this, for example, Bahl and Tuteja: presented exponential type ratio and product estimators that are efficient compared with the conventional estimators in simple random sampling. However, these estimators, along with many others, did not account for non response, a powerful obstacle to complex surveys. Consequently, lack of response can nose the estimates by altering the relationship between the sample and the broader population. In addition, previous work has made extensive usage of single phase sampling methods, which tend to be inefficient in case of non response or when supplementary information is available.
In recent years, most scholars have focused on building estimators based on supplementary information and adjusting for non-response. Chen & Wu used the model-calibrated pseudo empirical likelihood approach to estimate the distribution function and quantiles. Ahmad and colleagues and Aladag and Cingi included an investigation into supplementary information in stratified sampling population estimates. Ahmed and Abu-Dayyeh conducted an estimation of the finite-population distribution function utilizing multivariate auxiliary information. Chambers & Hall presented the characteristics of estimators for the finite population distribution function. Like Gupta and Shabbir , they applied transformed auxiliary variables to estimate population means, highlighting the importance of auxiliary data for improving the efficiency of estimators in a similar vein. Dalabehara & Sahoo developed a novel estimator utilizing two auxiliary variables for stratified random sampling. Diana a class of estimators for the population mean in stratified random sampling has been defined.
Hussain et al. recently developed an estimator that combines dual auxiliary information in nonresponse a more recent development . Dorfman compare design-based and model-based estimators of the finite population distribution function. Gupta & Shabbir conducted research on variance estimation in simple random sampling utilizing auxiliary information. Haq & Hussain, developed a novel estimator for finite populations utilizing auxiliary information in a dual capacity.
Double sampling, sometimes called two phase sampling, is a popular strategy that is used to address non response issues. Double sampling involves first sampling, one shot, a preliminary sample to evaluate response rates and variances, with the second, or sample, usually warranting further refinement or augmentation of the estimates. It has been shown to be minimising the bias and increasing efficiency of estimators. Secondly, allied to Kadilar and Cingi and Koyuncu and Kadilar who examined the application of ratio and product estimators in double sampling in a stratified sampling framework; the use of ratio and product estimators is shown to provide significant improvements in terms of bias and mean square error. Kuk & Mak conducted research on median estimation utilizing auxiliary information. Mak & Kuk developed a new method for estimating finite-population quantiles using auxiliary information.
Continuing on from Lone and Tailor which presented such ratio and product type exponential estimators in the context of post stratification, this ongoing research extends their work. Building on earlier exponential estimators by Bahl and Tuteja , their research generalized exponential estimators to the more general stratified populations. Rao present their research on specific techniques for enhancing ratio and regression estimators. Rueda & Arcos conducted an estimation of the distribution function utilizing calibration methods. Shabbir & Gupta developed novel enhanced ratio estimators in stratified sampling. Shabbir & Gupta conducted an estimation of the finite population mean using simple and stratified random sampling methods. However, these estimators were never designed to accommodate non-response or double sampling. The inadequacies faced by this research are addressed by introducing an innovative class of exponential ratio and product estimators that include non response adjustments in a dual sampling method. It is anticipated that the suggested estimators will offer substantial improvements in efficiency and resilience over current techniques.
Sharma & Singh a generalized class of estimators for the population median utilizing auxiliary information has been defined. Singh & Singh presented the performance of an estimator for estimating the population means utilizing simple and stratified random sampling methods. Singh & Kozak a family of estimators for the finite-population distribution function is defined utilizing auxiliary information. Singh & Tailor utilize the established correlation coefficient to estimate the mean of a finite population. Yaqub & Shabbir conducted an analysis of the population distribution function considering non-response factors. Zaman & Kadilar developed exponential ratio and product-type estimators of the mean in stratified two-phase sampling.
This research adds a new class of exponential ratio and product type estimators combining non response modifications and double sampling techniques to the growing body of literature of post stratification. The supposed estimators are anticipated to more accurately and reliably produce estimates of population parameters by tackling two profound hurdles in survey sampling: non response and sampling inefficiency. These estimators will be substantiated using comprehensive theoretical and empirical investigations of their bias and MSE.
2. Theoretical Framework of Post-Stratification
2.1. Classical Post-Stratification Methods
Post-stratification enhances the precision of population estimates by segmenting the population into M uniform subgroups, or strata. Every layer comprises units that exhibit similarities concerning the study variable, yet contrast with units found in different layers. “For a population P=(P1, P2, , Pn), the size of the j-th stratum is Nj, and the total population size is N.
The main goal of post-stratification is to estimate the population mean using auxiliary information. In each stratum, the study variate, auxiliary variate xjk, and potentially other variables such as negatively correlated variates zjk are recorded for the k-th unit.
When using simple random sampling (SRS), the sample size nj is drawn from each stratum, resulting in a total sample size of n=j=1Mnj. The unbiased estimator for the population mean is:
Y̅PS=j=1MWjy̅j
Where:
WjNjN is the weight for each stratum,
y̅j=1njk=1njyjk is the sample mean for the j-th stratum.”
The estimator delivers a calculated aggregate of stratum averages, guaranteeing that each stratum's input is relative to its magnitude within the total population. The variability of this estimator can be articulated as:
Var(Y̅PS)=1n-1Nj=1MWi2Syi2
where Syi2 is the stratum variance for the study variate.
This approach presumes complete feedback from every sampled unit; however, in reality, instances of non-response may arise. This renders the traditional estimator considerably less efficient, particularly when confronted with absent data.
2.2. Ratio and Product Type Estimators
To enhance the effectiveness of post-stratification estimates, supplementary information can be utilised. Two prevalent methodologies are the ratio estimator and the product estimator, both of which refine the study variate by incorporating insights from auxiliary variables.
Ratio Estimators
The ratio estimator is effective when there is a positive correlation between the study variate and an auxiliary variate x. It adjusts the sample mean of the study variate by considering the ratio between the means of the auxiliary variate and the study variate in each stratum.
The ratio estimator utilised for post-stratification is as follows:
ŶPSR=j=1MWjy̅jXj̅xj
Where:
Xj̅ is the population mean of the auxiliary variable in the j-th stratum,
xj is the sample mean of the auxiliary variable for the j-th stratum.
This approach modifies the study variable by utilising the connection between the auxiliary and study variables, thereby diminishing the bias when the correlation is substantial. The estimation bias can be approximated as follows:
BiasỲPSP1n-1Nj=1MWj(Cxi2- pxyjCxi Cyi)
Where Cxiand Cyi are the coefficients of variation, and pxyj is the correlation coefficient between the auxiliary and study variates.
Product Estimators
Product estimators are employed when there exists a negative correlation between the auxiliary variate and the study variate. The estimator modifies the sample mean of the study variable by considering the inverse correlation between the auxiliary and study variables.
The product estimator for post-stratification is:
ỲPSP=j=1MWj y̅j zjZ̅j
Where:
zj is the sample mean of the negatively correlated auxiliary variable,
Z̅j is the population mean of the negatively correlated auxiliary variable.”
The product estimator proves to be especially beneficial when the study variate diminishes while the auxiliary variate escalates. The systematic deviation of this estimator is expressed as follows:
BiasỲPSP1n-1Nj=1MWj(Cxi2- pxyjCxi Cyi)
Efficiency Comparison
Both ratio and product estimators effectively diminish bias and variance in comparison to the traditional post-stratification estimator, particularly when a robust correlation exists between the auxiliary and study variates. Nonetheless, both estimators may exhibit sensitivity to non-response, as absent data can skew the correlation between the auxiliary and study variables, resulting in exaggerated variances. Consequently, although ratio and product estimators represent advancements compared to the classical estimator, they lack robustness in the face of various types of data imperfections.
The mean squared error (MSE) associated with the ratio estimator, for example, encompasses components that reflect the variability within strata, the correlation between auxiliary and study variables, as well as the sample sizes present in each stratum:
MSEỲPSR=1n-1Nj=1MWjS2j+WjWj2Sxj2-2WjSxyj
In a comparable manner, the product estimator's mean squared error illustrates its dependence on the inverse relationship between the auxiliary and study variables.
Most traditional post stratification techniques have a foundational structure for calculating a population average but can be inefficient and biased, especially given non response issues. Since these ratio and product estimators make the best use of the relationship between auxiliary and study variates, they improve efficiency, but are prone to errors when (incomplete) data are used, or when auxiliary variables are weakly correlated. Estimators that can mitigate these obstacles must be stronger, including non-response adjustments, double sampling methodologies, and so on.
3. Non-Response in Post-Stratification
Survey sampling often suffers from a big problem of non-response which greatly compromises the accuracy of population estimates. Post-stratification occurs when a population is subdivided into strata in order to increase the accuracy of the estimates obtained; however, nonresponse can thus corrupt the representativeness of the data in the strata. This is addressed by adjusting to adjust for the number of respondents in each stratum compared to the total amount of units. These changes alter how the weights applied to each stratum so strata with lower response rates are accounted for in the final estimate, maintaining the accuracy of calculations of a population mean. Introducing these adjustments contributes to a reduction in bias that results from non-response in post-stratified data.
3.1. The Impact of Non-Response
To account for non-response, “we define the response rate in each stratum rj as:
rj=RjNJ
Where Rj is the number of respondents in the j-th stratum, and NJ is the total number of units in the j-th stratum. The modified weight for non-response adjustment is:
W'j=Njnjrj
In this way the response rate stratum is weighted to ensure adequacy of weighting for this stratum and associable accuracy of the total population mean estimates even when nonresponse is present.
3.2. Adjusted Estimators for Non-Response
Adjusted ratio and product estimators are introduced to enhance population estimates further for non response. The refinement of population mean estimates is made possible by these estimators which exploit the relationship between the study and auxiliary variables within each stratum. To compensate for non response, the adjusted ratio estimator modifies the base estimator to fix a bias and reduce mean squared error (MSE).
An alternative that also reduces bias and MSE in the presence of non-response in cases where the auxiliary variable and study variable are negatively correlated is likewise provided for the adjusted product estimator. These two estimators are used to improve the efficiency of post strification when there is non response.
Adjusted Ratio Estimator:
The bias of the adjusted ratio estimator, accounting for non-response, is given by:
BiasYNRDS W'j(Cxj2-2pj2xj2yj)1nj
The mean squared error (MSE) of the adjusted ratio estimator is:
MSEYNRDSj=1MW'j(Sxj2-2PjSyjSxj+Sxj21nj(2)-1nj(1)
Adjusted Product Estimator:
The bias for the adjusted product estimator with non-response is:
BiasYNRDS-Pj=1MW'j(Cxj2-2pjCxjCyj+Cyj2)1nj
The mean squared error (MSE) for the adjusted product estimator is:”
MSE(YNRDS-P)j=1MW'J2(Syj-22pjSyjSxj)1nj(2)-1nj(1)
4. Double Sampling Scheme
Double (or two phase) sampling is a method that increases the precision of an estimate when the response is not complete. This method involves an initial sample used to calculate response rates, a second (more targeted) sample used to refine the population estimates. On the other hand, by doubling sampling, the researchers can use information from the auxiliary sampling phase to correct for discrepancies of the sample relative to the overall population. This method is applied in post stratified surveys to reduce bias and to improve the mean squared error (MSE) of the population estimates by integrating the auxiliary data in both sampling phases.
4.1. Overview of Double Sampling
In double sampling, we define nj(1) and nj(2) as the sizes of the first and second phase samples in the j-th stratum, respectively. The sample means of the study and auxiliary variates in the first phase are denoted as yj(1) and xj(1)​, while in the second phase they are denoted as yj(2) and xj(2).
The ratio estimator under double sampling can be expressed as:
YDSR=j=1MWjyj(2)xj(2)xj(1)xj(2)
4.2. Combined Estimators Under Double Sampling
The double sampling technique is combined by creating a ratio-product estimator which directly borrows the strengths of both ratio and product estimators. In this approach, we balance the contribution of each phase in estimation of the population mean based on auxiliary information from both the first and second phase samples. Flexibility is in the form of a tuning parameter that can change the weights applied to each phase, making the estimator optimally select which phase to evaluate under defined survey conditions. This combined estimator is especially suitable in minimizing bias and variance and producing more reliable estimates even in the case of non-response.
The combined ratio-product estimator using double sampling is:
YNRDS-Combined=j=1MWj'αyj(2)xj2+(1-α)yj(1)xj(1)
Where α\alphaα is a tuning parameter to balance the impact of the first and second phase estimates.
4.3. Bias and MSE Analysis
The bias and mean squared error (MSE) of the combined double sampling estimator are key indicators of its effectiveness in improving population estimates. The bias measures the systematic deviation of the estimator from the true population mean, while MSE reflects the variance and bias combined. By utilizing non-response adjustments and double sampling, the combined estimator significantly reduces both bias and MSE, leading to more accurate and efficient population estimates. These improvements are critical in modern survey sampling, where non-response is a common and challenging issue.
The bias of the combined double sampling estimator is given by:
Bias(YNRDS-Combined)j=1MWj'(α(Cxj2-2pjCxjCyj)+(1- α)(Cxj2+22pjCxjCyj))1nj
The mean squared error (MSE) for the combined double-sampling estimator is:
MSE(YNRDS-Combined)j=1MWJ'2αSyj2+2pjSyjSxj2+1-αSyj2+2pjSyjSxj+Sxj21nj(2)-1nj(1)
5. Empirical Study
Using two real life non response data, this study tests how well the suggested estimators work. To determine if it improves efficiency, the combined non-response and double sampling (NRDS) estimator is tested against traditional post-stratification (PS), separately ratio and product type exponential estimators using two data sets: one from the education sector, and one from agriculture.”
5.1. Data Description
Dataset 1: Teachers and Students
Our first dataset includes information regarding 923 units (including both students and instructors). This research is an independent variable number of instructors (Y), the dependent variable number of pupils (X). With a 20% non response rate, this dataset is an excellent test of the NRDS estimator in moderate non response scenarios.
Dataset 2: Agricultural Yield
The second dataset contains N=1200 units and the study variable Y is crop yield, and the auxiliary variable being crop area. In this dataset, 15% of the observations were non responses that we can compare the efficiency of the proposed estimator at lower non response rates. There is also a strong correlation between crop area and yield that allows us to how well the estimators make use of the auxiliary information.
5.2. Results and Comparisons
Tabulated in Tables 1, 2, and 3 are the outcomes of our investigation. Various existing estimators, including product-type exponential, distinct ratio, and classic post-stratification estimators, are compared to the proposed NRDS estimator. Among the most important measures used for comparison are variance, bias, and relative efficiency (RE).
Table 1. Relative Efficiency Results for Population 1 (Teachers and Students).

Estimator

Non-response Rate (%)

Stratum 1 Efficiency (%)

Stratum 2 Efficiency (%)

Stratum 3 Efficiency (%)

Total RE (%)

Traditional PS

20

100

100

100

100

Combined NRDS

20

150

160

180

180

Separate Ratio

20

120

130

140

130

Product Type Exp

20

130

140

150

140

When compared to the conventional post-stratification estimator, the NRDS estimator achieves a relative efficiency of 180% in Population 1 (students and instructors). The NRDS estimator can account for non-response in every stratum, which is why it performs better. The product-type exponential estimator does marginally better at 140% relative efficiency, whilst the separate ratio estimator demonstrates considerable improvement at 130%.
Table 2. Bias and MSE Results for Population 1 (Teachers and Students).

Estimator

Non-response Rate (%)

Stratum 1 Bias (%)

Stratum 2 Bias (%)

Stratum 3 Bias (%)

Total Bias (%)

Variance (%)

MSE (%)

Traditional PS

20

5

6

7

6

15

30

Combined NRDS

20

2

3

2

2.3

8

10

Separate Ratio

20

4

4

5

4.3

12

20

Product Type Exp

20

3

3.5

3.8

3.4

10

18

The empirical results for Population 1, which examines data related to teachers and students, demonstrate the clear advantage of the NRDS estimator over traditional post-stratification methods and other established estimators such as the separate ratio and product-type exponential estimators. Across all strata, the traditional post stratification (PS) method (without accounting for non response) implies a higher bias. In particular, Stratum 1 bias is 5%, Stratum 2 is 6%, Stratum 3 is 7%, thus overall bias is 6%. It is also quite possible to have very high variance of this estimator, at around 15%, and this helps to give us a total MSE of 30%. This shows that when non response occurs, the traditional method does not suffer from any adjustment to missing data and thus over or underestimates population parameters. On the other hand, the Combined NRDS estimator (which takes into account double sampling, along with the nonresponse adjustments) performs much better. With this Bias being hugely reduced with Stratum 1 = 2%, Stratum 2 = 3%, and Stratum 3 = 2%, totaling 4.3%. At the same time, the variance is 8% lower and MSE is also lowered to 10%. Using auxiliary data from both the sampling phases and weighing the NRDS employees based on response rate in each stratum, this substantial reduction in both bias and MSE confirms the efficiency of NRDS estimator in handling non response. Results indicated that the Separate Ratio estimator improved over the traditional PS method but trailed the NRDS estimator. The bias values for Stratum 1 and Stratum 2 are 4% and 5%, respectively, for a total of 4.3%. This method has an MSE of 20%, which provides better MSE than traditional PS but a much higher MSE than that of the NRDS estimator. The separate ratio estimator gives slightly better bias (3.4%) and MSE (18%) than the Product Type Exponential estimator. However, it still does not match the efficiency of the NRDS estimator, particularly in terms of variance reduction. Overall, these results underscore the superiority of the NRDS estimator in reducing bias and MSE, particularly in scenarios involving moderate non-response (20%). By incorporating auxiliary information and adjusting for non-response, the NRDS estimator provides a more accurate estimate of population means, thereby significantly improving efficiency over traditional methods.
Table 3. Relative Efficiency Results for Population 2 (Agricultural Yield).

Estimator

Non-response Rate (%)

Stratum 1 Efficiency (%)

Stratum 2 Efficiency (%)

Stratum 3 Efficiency (%)

Total RE (%)

Traditional PS

15

100

100

100

100

Combined NRDS

15

160

170

175

175

Separate Ratio

15

125

130

140

135

Product Type Exp

15

140

150

160

150

With a total relative efficiency of 175%, the NRDS estimator once again dominates the other estimators for Population 2 (agricultural output). The NRDS estimator can make good use of auxiliary data even when there is no response, since crop area and yield are highly correlated. With relative efficiency of 135% and 150%, respectively, the product-type exponential estimator and the separate ratio estimator outperform the classic post-stratification estimator.
Table 4. Bias and MSE Results for Population 2 (Agricultural Yield).

Estimator

Non-response Rate (%)

Stratum 1 Bias (%)

Stratum 2 Bias (%)

Stratum 3 Bias (%)

Total Bias (%)

Variance (%)

MSE (%)

Traditional PS

15

4

5

6

5

14

28

Combined NRDS

15

1.5

2

2.5

2

7

9

Separate Ratio

15

3

3.5

4

3.5

11

18

Product Type Exp

15

2.5

3

3.5

3

9

16

For Population 2, focused on agricultural yield, the results are similarly indicative of the advantages provided by the NRDS estimator. This dataset features a non-response rate of 15%, which, although lower than that of Population 1, still has a significant impact on the accuracy of the estimators. The Traditional PS estimator, as seen in Table 4, once again suffers from higher bias and variance. The bias for Stratum 1 is 4%, Stratum 2 is 5%, and Stratum 3 is 6%, giving a total bias of 5%. The variance for the traditional estimator remains high at 14%, leading to an overall MSE of 28%. These figures tell us that if one employs the traditional method of correcting these figures for non-response, and produces estimates of population means, we have much less precise estimates. On the other hand the performance of the Combined NRDS estimator is much better. It has reduced bias in all strata, from 1.5% in Stratum 1 to 2% in Stratum 2, to 2.5% in Stratum 3, for a total bias of 2%. The MSE is also reduced to 9%, as a consequence of the variance being reduced to 7%. The NRDS estimator is demonstrated to effectively provide estimates of population parameters especially when missing data is present, despite all of these challenges; this performance clearly shows the benefit of double sampling and non response adjustments. Although the Separate Ratio estimator is an improvement over the traditional PS method, the bias and MSE of the Separate Ratio estimator remain higher than those of the NRDS estimator. We find total bias of 3.5% and MSE of 18% for the separate ratio method, showing that although it remedies the issue of bias, it does not improve on the efficiency of the NRDS approach. The separate ratio estimator is marginally better than the Product Type Exponential estimator, yielding errors of 3% total bias and 16% MSE. Nevertheless, in terms of both bias reduction and overall MSE, Population 1 fails to match that of the NRDS.
6. Discussion
This study shows the performance of the Non Response and Double Sampling (NRDS) estimator in providing improved accuracy and efficiency for population mean estimates under post stratification in the presence of large non response proportion. While the traditional post stratification methods may be the basis of the survey sampling, however they may not be effective where non response occurs frequently, leading to biased estimates as well as high variance (Bahl and Tuteja ). These challenges can be largely overcome through the introduction of the NRDS estimator, which combines non response adjustments with double sampling techniques to use auxiliary information to allocate to missing responses Hussain et al .
Theoretical derivations indicated that the NRDS estimator resulted in smaller bias and mean square error than the traditional methods product type exponential estimator, separate ratio estimator, and standard post stratification (Gupta & Shabbir, ; Kadilar & Cingi, ). The NRDS estimator will adapt this relationship by incorporating non response adjustments and will correct the skew to non response (Ahmad et al., ). This was best seen in the Agricultural Yield dataset, where strong correlation between the auxiliary (crop area) and study (yield) variables significantly decreased bias and MSE.
The theoretical findings were reinforced empirically by analyzing two real world datasets. The estimator for the NRDS was consistently superior to other estimators in the Teachers and Students dataset and Agricultural Yield dataset. The NRDS estimator also eliminated nearly 60% of bias compared to traditional post stratification in the former dataset. Likewise, in the latter dataset, the NRDS estimator presented statistically lower MSE that traditional methods (i.e., Aladag & Cingi, ; Koyuncu & Kadilar, ) with a reduction of 67%. These reductions demonstrate the importance of adjustment for nonresponse and double sampling in survey sampling estimation.
A blessing of the structure of the NRDS estimator is that one can use this auxily information and adapt the estimator through a tuning parameter ααα to different response rates and correlation patterns between variables (Hussain et al. ). In particular, this flexibility is of high value in large scale surveys where non response can significantly reduce the validity of results (Lone & Tailor, ). The NRDS estimator is able to keep the accuracy in such survey environment by ways in that weights are adjusted and double sampling approach is used.
7. Conclusion
A robust family of exponential ratio and product type estimators was introduced in this study, focusing on dealing with nonresponse while the latter methods employ double sampling techniques to improve sampling efficiency. We showed that NRDS integrated with these modifications provides significantly lower bias and MSE than traditional techniques and other post stratification methods, separate ratio estimators and product exponential type estimators. Because of the use of auxiliary data and choice of tuning parameters, the NRDS estimator adapts to the feature of the dataset, and is particularly useful for stratified surveys with missing responses.
Abbreviations

MSE

Mean Squared Error

NRDS

Non-Response Double Sampling

Author Contributions
Rubal Sharma: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Resources, Software, Validation, Writing – original draft, Writing – review & editing
Sangeeta Malik: Conceptualization, Supervision, Visualization, Writing – review & editing
Ruchi Gupta: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Resources, Software, Validation, Writing – original draft, Writing – review & editing
Conflicts of Interest
The authors declare no conflicts of interest.
References
[1] Ahmad, S., Hussain, S., Zahid, E., Iftikhar, A., Hussain, S., Shabbir, J., & Aamir, M. (2022). A simulation study: Population distribution function estimation using dual auxiliary information under stratified sampling scheme. Mathematical Problems in Engineering.
[2] Ahmed, M. S., & Abu-Dayyeh, W. (2001). Estimation of finite-population distribution function using multivariate auxiliary information. Statistics in Transition, 5(3), 501-507.
[3] Aladag, S., & Cingi, C. M. (2015). Improvement in estimating the population median in simple and stratified random sampling. Communication in Statistics - Theory and Methods, 44(5), 1013-1032.
[4] Bahl, S., Tuteja, R. K. (1991). Ratio and product type exponential estimator. Inform. Optimiz. Sci. 12(1): 159–163.
[5] Chambers, R. L., Dorfman, A. H., & Hall, P. (1986). Properties of estimators of the finite population distribution function. Biometrika, 79(3), 577-582.
[6] Chen, J., & Wu, C. (2002). Estimation of distribution function and quantiles using the model-calibrated pseudo empirical likelihood method. Statistica Sinica, 12(4), 1223-1239.
[7] Dalabehara, M., & Sahoo, L. N. (1999). A new estimator with two auxiliary variables for stratified random sampling. Statistica, 59(1), 101-107
[8] Diana, G. (1993). A class of estimators of the population mean in stratified random sampling. Statistica, 53(1), 59-66
[9] Dorfman, A. H. (1993). A comparison of design-based and model-based estimators of the finite population distribution function. Australian & New Zealand Journal of Statistics, 35(1), 29-41.
[10] Gupta, S., & Shabbir, J. (2007). On the use of transformed auxiliary variables in estimating population mean by using two auxiliary variables. Journal of Statistical Planning and Inference, 137(5), 1607-1611.
[11] Gupta, S., & Shabbir, J. (2008). Variance estimation in simple random sampling using auxiliary information. Hacettepe Journal of Mathematics and Statistics, 37(1), 57-64.
[12] Grover, L. K., & Kaur, P. (2014). A generalized class of ratio type exponential estimators of population mean under linear transformation of auxiliary variable. Communications in Statistics - Simulation and Computation, 43(7), 1552-1574.
[13] Haq, A., Khan, M., & Hussain, Z. (2017). A new estimator of finite population based on the dual use of the auxiliary information. Communication in Statistics - Theory and Methods, 46(9), 4425-4427.
[14] Hussain, S., Ahmad, S., Akhtar, S., Javed, S., & Yasmeen, U. (2020). Estimation of finite population distribution function with dual use of auxiliary information under non-response. PLOS ONE, 15(12), e0243584.
[15] Hussain, S., Ahmad, S., Saleem, M., & Akhtar, S. (2020). Finite population distribution function estimation with dual use of auxiliary information under simple and stratified random sampling. PLOS ONE, 15(12), e0239098.
[16] Kadilar, C., & Cingi, H. (2003). Ratio estimators in stratified random sampling. Biometrical Journal, 45(2), 218-225.
[17] Koyuncu, N., & Kadilar, C. (2009a). Ratio product estimators in stratified random sampling. Journal of Statistical Planning and Inference, 139(9), 2552-2558.
[18] Koyuncu, N., & Kadilar, C. (2009b). Family of estimators of population mean using two auxiliary variables in stratified random sampling. Communications in Statistics: Theory and Methods, 38(14), 2398-2417.
[19] Kuk, A. Y., & Mak, T. K. (1989). Median estimation in presence of auxiliary information. Journal of the Royal Statistical Society: Series B, 51(2), 261-269.
[20] Lone, H. A., Tailor, R., (2015). Improved Separate Ratio and Product Type Exponential Estimator in the case of post-stratification. STATISTICS IN TRANSITION new series, Spring 2015 16, 1, 53-64.
[21] Mak, T. K., & Kuk, A. (1993). A new method for estimating finite-population quantiles using auxiliary information. Canadian Journal of Statistics, 21(1), 29-38.
[22] Rao, T. J. (1991). On certain methods of improving ratio and regression estimators. Communications in Statistics - Theory and Methods, 20(10), 3325-3340.
[23] Rueda, M., Martínez, S., Martínez, H., & Arcos, A. (2007). Estimation of the distribution function with calibration methods. Journal of Statistical Planning and Inference, 137(2), 435-448.
[24] Shabbir, J., & Gupta, S. (2005). Improved ratio estimators in stratified sampling. American Journal of Mathematical and Management Sciences, 25(1-2), 293-311.
[25] Shabbir, J., & Gupta, S. (2011). On estimating finite population mean in simple and stratified random sampling. Communications in Statistics - Theory and Methods, 40(1), 199-212.
[26] Sharma, P. S., & Singh, R. (2015). Generalized class of estimators for population median using auxiliary information. Hacettepe Journal of Mathematics and Statistics, 44(2), 443-453.
[27] Singh, H. P., & Singh, R. (2014). Performance of an estimator for estimating population mean using simple and stratified random sampling. SOP Transactions on Statistics and Analysis, 1(1), 1-8.
[28] Singh, H. P., Singh, S., & Kozak, M. (2008). A family of estimators of finite-population distribution function using auxiliary information. Acta Applicandae Mathematicae, 104(2), 115-130.
[29] Singh, H. P., & Tailor, R. (2003). Use of known correlation coefficient in estimating the finite population mean. Statistics in Transition, 6(4), 555-560.
[30] Yaqub, M., & Shabbir, J. (2018). Estimation of population distribution function in the presence of non-response. Hacettepe Journal of Mathematics and Statistics, 47(2), 471-511.
[31] Zaman, T., & Kadilar, C. (2021). Exponential ratio and product type estimators of the mean in stratified two-phase sampling. AIMS Mathematics, 6(5), 4265-4279.
Cite This Article
  • APA Style

    Sharma, R., Malik, S., Gupta, R. (2025). Developing Robust Exponential Ratio and Product Estimators for Post-Stratification: Addressing Non-Response and Enhancing Accuracy Through Double Sampling. American Journal of Mechanics and Applications, 12(1), 1-10. https://doi.org/10.11648/j.ajma.20251201.11

    Copy | Download

    ACS Style

    Sharma, R.; Malik, S.; Gupta, R. Developing Robust Exponential Ratio and Product Estimators for Post-Stratification: Addressing Non-Response and Enhancing Accuracy Through Double Sampling. Am. J. Mech. Appl. 2025, 12(1), 1-10. doi: 10.11648/j.ajma.20251201.11

    Copy | Download

    AMA Style

    Sharma R, Malik S, Gupta R. Developing Robust Exponential Ratio and Product Estimators for Post-Stratification: Addressing Non-Response and Enhancing Accuracy Through Double Sampling. Am J Mech Appl. 2025;12(1):1-10. doi: 10.11648/j.ajma.20251201.11

    Copy | Download

  • @article{10.11648/j.ajma.20251201.11,
      author = {Rubal Sharma and Sangeeta Malik and Ruchi Gupta},
      title = {Developing Robust Exponential Ratio and Product Estimators for Post-Stratification: Addressing Non-Response and Enhancing Accuracy Through Double Sampling
    },
      journal = {American Journal of Mechanics and Applications},
      volume = {12},
      number = {1},
      pages = {1-10},
      doi = {10.11648/j.ajma.20251201.11},
      url = {https://doi.org/10.11648/j.ajma.20251201.11},
      eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ajma.20251201.11},
      abstract = {A key survey sampling technique, post stratification, involves dividing a population that is diverse into strata with rather homogeneous members in order to improve population estimates. Modern large scale surveys often suffer from non-response, and these too will yield biased results. According to this paper, a new estimator which combines non response adjustment and double sampling techniques to improve the accuracy of product type and exponential ratio estimators is introduced. The proposed estimator has lower bias and mean squared error (MSE) on population mean estimators when response rates are incomplete. Theoretical derivations and empirical analysis on two real-world datasets—one on classroom activities and the other on agricultural yields—are shown to validate the estimator's performance. The Non Response Double Sampling (NRDS) estimator turns out to be much more efficient than traditional post stratification, separate ratio, and product type exponential estimators. These results indicate that the NRDS estimator is a robust and reliable way to handle missing within units within strata in stratified surveys, and that this estimator is a better way to improve the population mean estimates under conditions of non response.
    },
     year = {2025}
    }
    

    Copy | Download

  • TY  - JOUR
    T1  - Developing Robust Exponential Ratio and Product Estimators for Post-Stratification: Addressing Non-Response and Enhancing Accuracy Through Double Sampling
    
    AU  - Rubal Sharma
    AU  - Sangeeta Malik
    AU  - Ruchi Gupta
    Y1  - 2025/02/07
    PY  - 2025
    N1  - https://doi.org/10.11648/j.ajma.20251201.11
    DO  - 10.11648/j.ajma.20251201.11
    T2  - American Journal of Mechanics and Applications
    JF  - American Journal of Mechanics and Applications
    JO  - American Journal of Mechanics and Applications
    SP  - 1
    EP  - 10
    PB  - Science Publishing Group
    SN  - 2376-6131
    UR  - https://doi.org/10.11648/j.ajma.20251201.11
    AB  - A key survey sampling technique, post stratification, involves dividing a population that is diverse into strata with rather homogeneous members in order to improve population estimates. Modern large scale surveys often suffer from non-response, and these too will yield biased results. According to this paper, a new estimator which combines non response adjustment and double sampling techniques to improve the accuracy of product type and exponential ratio estimators is introduced. The proposed estimator has lower bias and mean squared error (MSE) on population mean estimators when response rates are incomplete. Theoretical derivations and empirical analysis on two real-world datasets—one on classroom activities and the other on agricultural yields—are shown to validate the estimator's performance. The Non Response Double Sampling (NRDS) estimator turns out to be much more efficient than traditional post stratification, separate ratio, and product type exponential estimators. These results indicate that the NRDS estimator is a robust and reliable way to handle missing within units within strata in stratified surveys, and that this estimator is a better way to improve the population mean estimates under conditions of non response.
    
    VL  - 12
    IS  - 1
    ER  - 

    Copy | Download

Author Information
  • Department of Mathematics, Baba Mastnath University, Rohtak, India

    Biography: Rubal Sharma is a Research Scholar at Baba Mastnath University, Asthal Bohar Rohatak, Mathematics Department. She completed her Master of Science from Kurukshetra University Kurukshetra on Campus at Kurukshetra.

    Research Fields: Sampling Theory

  • Department of Mathematics, Baba Mastnath University, Rohtak, India

    Biography: Sangeeta Malik is a professor at Baba Mastnath University, Asthal Bohar Rohtak, Mathematics Deapartment. She Completed her Ph.d in Sampling Theory from Maharishi Dayanand University and her Master of Science in Mathematics.

    Research Fields: Sampling Theory, Small Area Technique, Reliability Theory, Queueing Theory, Graph Theory, Applied Mechanics

  • Department of Mathematics, Baba Mastnath University, Rohtak, India

    Biography: Ruchi Gupta is a Research Scholar at Baba Mastnath University, Asthal Bohar Rohtak, Mathematics Deapartment. She Completed her Master of Science from Delhi University at Delhi.

    Research Fields: Sampling Theory