Sampling bias obscures biodiversity patterns, reveals data gaps in priority conservation areas: a call for improved documentation

Kier Mitchel E. Pitogo
Camila G. Meneses
Syrus Cesar P. Decena
Christian E. Supsup
Hannah E. Som
Justin M. Bernstein
Kin Onn Chan
Mark W. Herr
Rafe M. Brown

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Where and how species are sampled can shape biodiversity knowledge, spatial patterns, and data-driven conservation. In many Global South biodiversity hotspots, sampling remains uneven, and available data often lack the synthesis needed to assess region-wide gaps for effective conservation planning and priority-setting. This shortfall is common within conserved areas and key biodiversity areas (hereafter ‘priority conservation areas’ or PCAs). We demonstrate this case in the Philippines, one of the most biodiverse countries in the world, where longstanding biodiversity research and growing policy momentum support efforts to expand coverage of conserved areas. Drawing on over a century of species occurrence records made digitally accessible, we compiled and manually curated these data to assemble and analyze information on Philippine amphibians and squamate reptiles from multiple sources, assessing the spatial distribution of observed diversity in relation to PCAs. Results reveal strong spatial biases, with preserved specimens comprising the majority of records and largely shaping observed diversity patterns. Citizen-science data complement already well-sampled regions, while records from peer-reviewed literature contribute valuable documentation in poorly sampled areas. PCAs are proportionally well-sampled, although gaps and biases remain. Sampling effort and observed diversity were higher in larger PCAs, but this positive area effect diminishes with increasing topographic relief, highlighting large mountain ranges as persistent blind spots in biodiversity documentation. Notably, some areas of higher diversity occur outside established PCAs. We discuss implications of these biases and propose enabling mechanisms to improve primary biodiversity data collection. This study affirms the importance of integrating digitally accessible biodiversity data from multiple sources in revealing sampling gaps and biases, guiding future studies towards poorly sampled areas and informing conservation priorities.

Version published to 10.1101/2025.09.13.676052 on bioRxiv
Sep 17, 2025

Data availability impacts the predictive accuracy of pressure-based biodiversity models

This article has 6 authors:
1. Jakob Nyström
2. Jeffrey R. Smith
3. Lisa Mandle
4. Andrew Gonzalez
5. Thomas B. Schön
6. Tobias Andermann
This article has no evaluationsLatest version Dec 23, 2025
Deciphering the patterns and drivers of tardigrade diversity along altitudinal gradients

This article has 4 authors:
1. Bartłomiej Surmacz
2. Diego Fontaneto
3. Grzegorz Vončina
4. Daniel Stec
This article has no evaluationsLatest version Dec 15, 2025
A new analysis of biodiversity and conservation knowledge products to support environmental assessments

This article has 16 authors:
1. Thomas Starnes
2. Laure Denos
3. Lewis Kramer
4. Francesca Ridley
5. Tom Scott
6. Simon Tarr
7. Rosamunde Almond
8. Stuart Butchart
9. Heather Bingham
10. Neil Burgess
11. Craig Hilton-Taylor
12. Louise Mair
13. Philip McGowan
14. Aidin Niamir
15. Andrew Plumptre
16. Thomas Brooks
This article has no evaluationsLatest version Feb 4, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Data availability impacts the predictive accuracy of pressure-based biodiversity models

Deciphering the patterns and drivers of tardigrade diversity along altitudinal gradients

A new analysis of biodiversity and conservation knowledge products to support environmental assessments