tidypopgen : Tidy Population Genetics in R

Evelyn J. Carter
Eirlys E. Tysall
Jason Hodgson
Andrea Manica

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

As genome-wide data has become increasingly available, software libraries for their analysis have proliferated. While new tools for downstream analyses are constantly emerging, existing workflows are hindered by inefficiencies. Switching between coding languages and object types in the early stages of pipelines wastes researchers’ time, impedes reproducibility, and creates opportunity for error. To confront these obstacles, we introduce tidypopgen , a comprehensive R package for population genetic analysis of biallelic SNP data. Genotype data can be read, filtered, and analysed within a single environment, without the need for prior data cleaning or setup with other software. tidypopgen ’s gen_tibble object structure makes analysis efficient and intuitive, while standardised tidy grammar makes data manipulation clear.

Functionality within tidypopgen supports cleaning and merging datasets, basic descriptive statistics, multivariate analysis, clustering algorithms, and F-statistics, as well as integrating with existing tools for population genetic analyses in R. We use the Human Genome Diversity Project SNP dataset (Li et al., 2008) to show that a basic population genetic workflow can be executed in under 25 lines of code in a single environment using one file set, without the need to write superfluous outputs or change directories. By supporting data assembly through to data analysis, tidypopgen significantly streamlines workflows without compromising speed or functionality.

Version published to 10.1101/2025.06.06.658325v1 on bioRxiv
Jun 8, 2025

2Pipe: It Starts with a Question. Matching You with the Correct Pipeline for MAG Reconstruction

This article has 2 authors:
1. Jeferyd Yepes Garcí
2. Laurent Falquet
This article has no evaluationsLatest version Jun 9, 2025
Gener anno : A Genomic Foundation Model for Metagenomic Annotation

This article has 6 authors:
1. Qiuyi Li
2. Wei Wu
3. Yiheng Zhu
4. Fuli Feng
5. Jieping Ye
6. Zheng Wang
This article has no evaluationsLatest version Jun 5, 2025
SNiPgenie: A tool for microbial SNP site detection from whole genome sequencing data

This article has 3 authors:
1. Damien Farrell
2. Viktor Perets
3. Stephen V Gordon
Reviewed by Access Microbiology

This article has 5 evaluationsLatest version May 19, 2025Latest activity Jun 9, 2025

tidypopgen : Tidy Population Genetics in R

Listed in

Abstract

Article activity feed

2Pipe: It Starts with a Question. Matching You with the Correct Pipeline for MAG Reconstruction

Gener anno : A Genomic Foundation Model for Metagenomic Annotation

SNiPgenie: A tool for microbial SNP site detection from whole genome sequencing data

Listed in

Abstract

Article activity feed

Related articles

2Pipe: It Starts with a Question. Matching You with the Correct Pipeline for MAG Reconstruction

Gener anno : A Genomic Foundation Model for Metagenomic Annotation

SNiPgenie: A tool for microbial SNP site detection from whole genome sequencing data