Non-Robustness in Log-Like Specifications

Jack Fitzgerald
Joop Adema
Lenka Fiala
Essi Kujansuu
David Valenta

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Recent literature shows that when regression models are estimated on variables transformed with 'log-like' functions such as the inverse hyperbolic sine or ln(Z + 1) transformations, one can obtain (semi-)elasticity estimates of any magnitude by linearly re-scaling the input variable(s) before transformation. We systematically re-analyze the replication data of 46 papers whose main conclusions are defended by log-like specifications. Our replication findings motivate new theoretical and simulation results showing that in log-like specifications, unit scale can be used to overfit data, creating an uncontrolled multiple hypothesis testing problem that frequently yields spuriously significant results. In particular, 38% of the estimates we re-analyze sit in a 'sweet spot', where both upward and downward re-scalings of variables' units before transformation shrink test statistics. Consequently, published estimates in this literature are statistically significant over 40% more frequently than in the general economics literature. We find that modest changes to model specification yield different statistical significance conclusions for 14-37% of estimates defending papers' main claims. We also show that for 99.8% of estimates, variables transformed with log-like functions do not meet data requirements for log-like specifications from a methodological recommendation cited by all papers in our replication sample. We synthesize and harmonize methodological guidelines and advocate for more robust alternative specifications, including normalized estimands, Poisson regression, and quantile regression.

Version published to 10.31222/osf.io/juda7_v1 on OSF Preprints
Mar 16, 2026

On semiparametric generalized linear models with nonparametric canonical link

This article has 3 authors:
1. Busayasachee Puang-Ngern
2. Wandee Wanishsakpong
3. Jun Ma
This article has no evaluationsLatest version Mar 8, 2026
Testing Martingale Difference Hypothesis for Functional Time Series

This article has 3 authors:
1. Kai Lou
2. Guochang Wang
3. Pengfei Liu
This article has no evaluationsLatest version Mar 24, 2026
Heterogeneity in Statistics: A Conceptual and Methodological Review

This article has 3 authors:
1. Zhanshan (Sam) Ma
2. Shu Liu
3. Aaron Ellison
This article has no evaluationsLatest version Apr 14, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

On semiparametric generalized linear models with nonparametric canonical link

Testing Martingale Difference Hypothesis for Functional Time Series

Heterogeneity in Statistics: A Conceptual and Methodological Review