Latest preprint reviews

  1. Chromosome-level genome assembly and annotation of the crested gecko, Correlophus ciliatus, a lizard incapable of tail regeneration

    This article has 3 authors:
    1. Marc A. Gumangan
    2. Zheyu Pan
    3. Thomas P. Lozito
    This article has been curated by 1 group:
    • Curated by GigaByte

      Editors Assessment:

      The crested gecko (Correlophus ciliatus), is a lizard species endemic to New Caledonia, and a potentially interesting model organism due to its unusual (for a gecko) inability to regenerate amputated tails. With that in mind here is presented a new reference genome for the species, assembled using PacBio Sequel II platform and Dovetail Omni-C libraries. Producing a genome with a total size of 1.65 Gb, 152 scaffolds, a L50 of 6, and N50 of 109 Mb. Peer review making sure more detail was added on data acquisition and processing to enhance reproducibility. In the end producing potentially useful data for studying the genetic mechanisms involved in loss of tail regeneration.

      This evaluation refers to version 1 of the preprint

    Reviewed by GigaByte

    This article has 2 evaluationsAppears in 2 listsLatest version Latest activity
  2. SMARTER-database: a tool to integrate SNP array datasets for sheep and goat breeds

    This article has 18 authors:
    1. Paolo Cozzi
    2. Arianna Manunza
    3. Johanna Ramirez-Diaz
    4. Valentina Tsartsianidou
    5. Konstantinos Gkagkavouzis
    6. Pablo Peraza
    7. Anna Maria Johansson
    8. Juan José Arranz
    9. Fernando Freire
    10. Szilvia Kusza
    11. Filippo Biscarini
    12. Lucy Peters
    13. Gwenola Tosser-Klopp
    14. Gabriel Ciappesoni
    15. Alexandros Triantafyllidis
    16. Rachel Rupp
    17. Bertrand Servin
    18. Alessandra Stella
    This article has been curated by 1 group:
    • Curated by GigaByte

      Editors Assessment:

      This paper presents the SMARTER database, a collection of tools and scripts to gather, standardize, and share with the scientific community a comprehensive dataset of genomic data and metadata information on worldwide small ruminant populations. Which has come out of the EU multi-actor (12 country) H2020 project called SMARTER: SMAll RuminanTs breeding for Efficiency and Resilience. This bringing together genotypes for about 12,000 sheep and 6,000 goats, alongside phenotypic and geographic information. The paper providing insight into how the database was put together, presenting the code for the SMARTER—frontend, backend and API, alongside instructions for users. Peer review tested the platform and provided suggestions on improving the metadata. Demonstrating the project provides valuable information on sheep and goat populations around the world, that can be an essential tool for ruminant researchers. Enabling them to generate new insights and offer the possibility to store new genotypes and drive progress in the field.

      This evaluation refers to version 1 of the preprint

    Reviewed by GigaByte

    This article has 2 evaluationsAppears in 2 listsLatest version Latest activity
  3. NucBalancer: streamlining barcode sequence selection for optimal sample pooling for sequencing

    This article has 2 authors:
    1. Saurabh Gupta
    2. Ankur Sharma
    This article has been curated by 1 group:
    • Curated by GigaByte

      Editors Assessment:

      This paper presents NucBalancer, a R-pipeline and Shiny app designed for the optimal selection of barcode sequences for sample multiplexing in sequencing. Providing a user-friendly interface aiming to make this process accessible to both bioinformaticians and experimental researchers, enhancing its utility in adapting libraries prepared for one sequencing platform to be compatible with others. Important now with the introduction of additional sequencing platforms by Element Biosciences (AVITI System) and Ultima Genomics (UG100) increasing the diversity and capability of genomic research tools available. NucBalancer’s incorporation of dynamic parameters, including customizable red flag thresholds, allows for precise and practical barcode sequencing strategies. This adaptability is key in ensuring uniform nucleotide distribution, particularly in MGI sequencing and single-cell genomic studies, leading to more reliable and cost-effective sequencing outcomes across various experimental conditions. All the code is available under an open source license, and upon review the authors have also shared the code for the Shiny app.

      This evaluation refers to version 1 of the preprint

    Reviewed by GigaByte

    This article has 2 evaluationsAppears in 2 listsLatest version Latest activity
  4. V-pipe 3.0: a sustainable pipeline for within-sample viral genetic diversity estimation

    This article has 19 authors:
    1. Lara Fuhrmann
    2. Kim Philipp Jablonski
    3. Ivan Topolsky
    4. Aashil A Batavia
    5. Nico Borgsmüller
    6. Pelin Icer Baykal
    7. Matteo Carrara
    8. Chaoran Chen
    9. Arthur Dondi
    10. Monica Dragan
    11. David Dreifuss
    12. Anika John
    13. Benjamin Langer
    14. Michal Okoniewski
    15. Louis du Plessis
    16. Uwe Schmitt
    17. Franziska Singer
    18. Tanja Stadler
    19. Niko Beerenwinkel

    Reviewed by GigaScience, GigaByte

    This article has 4 evaluationsAppears in 2 listsLatest version Latest activity
  5. Building a community-driven bioinformatics platform to facilitate Cannabis sativa multi-omics research

    This article has 4 authors:
    1. Locedie Mansueto
    2. Tobias Kretzschmar
    3. Ramil Mauleon
    4. Graham J. King
    This article has been curated by 1 group:
    • Curated by GigaByte

      Editors Assessment:

      This paper reports the establishment of the International Cannabis Genomics Research Consortium (ICGRC) web portal leveraging the open source Tripal platform to enhance data accessibility and integration for Cannabis sativa (Cannabis) multi-omics research. With the aim of bringing together the wealth of publicly available genomic, transcriptomic, proteomic, and metabolomic data sets to improve cannabis for food, fiber and medicinal traits. Tripal is a content management system for genomics data, presenting a ready-to-use specialized ‘omics modules for loading, visualization, and analysis, and is GMOD (Generic Model Organism Database) standards-compliant. The paper explaining how this was put together, what data and features are available, and providing a case study for other communities wanting to create their own Tripal platform. Covering their setup and customizations of the Tripal platform, and how they re-engineered modules for multi-omics data integration, and addition of many other custom features that can be reused. Peer review fixed a few minor bugs and added clarifications on how the platform will be updated.

      *This evaluation refers to version 1 of the preprint

    Reviewed by GigaByte

    This article has 2 evaluationsAppears in 2 listsLatest version Latest activity
  6. PhysiMeSS - a new physiCell addon for extracellular matrix modelling

    This article has 4 authors:
    1. Vincent Noël
    2. Marco Ruscone
    3. Robyn Shuttleworth
    4. Cicely K. Macnamara
    This article has been curated by 1 group:
    • Curated by GigaByte

      Editors Assessment:

      PhysiCell is an open source multicellular systems simulator for studying many interacting cells in dynamic tissue microenvironments. As part of the PhysiCell ecosystem of tools and modules this paper presents a PhysiCell addon, PhysiMeSS (MicroEnvironment Structures Simulation) which allows the user to accurately represent the extracellular matrix (ECM) as a network of fibres. This can specify rod-shaped microenvironment elements such as the matrix fibres (e.g. collagen) of the ECM, allowing the PhysiCell user the ability to investigate physical interactions with cells and other fibres. Reviewers asked for additional clarification on a number of features. And the paper now clear future releases will provide full 3D compatibility and include working on fibrogenesis, i.e. the creation of new ECM fibres by cells.

      This evaluation refers to version 1 of the preprint

    Reviewed by GigaByte

    This article has 2 evaluationsAppears in 2 listsLatest version Latest activity
  7. High-speed whole-genome sequencing of a Whippet: Rapid chromosome-level assembly and annotation of an extremely fast dog’s genome

    This article has 8 authors:
    1. Marcel Nebenführ
    2. David Prochotta
    3. Alexander Ben Hamadou
    4. Axel Janke
    5. Charlotte Gerheim
    6. Christian Betz
    7. Carola Greve
    8. Hanno Jörn Bolz
    This article has been curated by 1 group:
    • Curated by GigaByte

      Editors Assessment:

      This Data Release paper presents the genome of the whippet breed of dog. Demonstrating a streamlined laboratory and bioinformatics workflows with PacBio HiFi long-read whole-genome sequencing that enables the generation of a high-quality reference genome within one week. The genome study being a collaboration between an academic biodiversity institute and a medical diagnostic company. The presented method of working and workflow providing examples that can be used for a wide range of future human and non-human genome projects. The final is 2.47 Gbp assembly being of high quality - with a contig N50 of 55 Mbp and a scaffold N50 of 65.7 Mbp. This reference being scaffolded into 39 chromosome-length scaffolds and the annotation resulting in 28,383 transcripts. The results also looked at the Myostatin gene which can be used for breeding purposes, as these heterozygous animals can have an advantage in dog races. The reviewers making the authors clarify this part a little better with additional results. Overall this study demonstrating how rapidly animal genome research can be carried out through close and streamlined time management and collaboration.

      This evaluation refers to version 1 of the preprint

    Reviewed by GigaByte

    This article has 2 evaluationsAppears in 2 listsLatest version Latest activity
  8. RiboSnake – a user-friendly, robust, reproducible, multipurpose and documentation-extensive pipeline for 16S rRNA gene microbiome analysis

    This article has 9 authors:
    1. Ann-Kathrin Dörr
    2. Josefa Welling
    3. Adrian Dörr
    4. Jule Gosch
    5. Hannah Möhlen
    6. Ricarda Schmithausen
    7. Jan Kehrmann
    8. Folker Meyer
    9. Ivana Kraiselburd
    This article has been curated by 1 group:
    • Curated by GigaByte

      Editors Assessment:

      This new software paper presents RiboSnake, a validated, automated, reproducible analysis pipeline implemented in the popular Snakemake workflow management system for microbiome analysis. Analysing16S rRNA gene amplicon sequencing data, this uses the widely used oQIIME2 [ tool as the basis of the workflow as it offers a wide range of functionality. Users of QIIME2 can be overwhelmed by the number of options at their disposal, and this workflow provides a fully automated and fully reproducible pipeline that can be easily installed and maintained. Providing an easy-to-navigate output accessible to non bioinformatics experts, alongside sets of already validated parameters for different types of samples. Reviewers requested some clarification for testing, worked examples and documentation, and this was improved to produce a convincingly easy-to-use workflow. Hopefully opening up an already very established technique to a new group of users and assisting them with reproducible science.

      This evaluation refers to version 1 of the preprint

    Reviewed by GigaByte

    This article has 2 evaluationsAppears in 2 listsLatest version Latest activity
  9. Kinship analysis and pedigree reconstruction by RAD sequencing in cattle

    This article has 8 authors:
    1. Yiming Xu
    2. Wanqiu Wang
    3. Jiefeng Huang
    4. Minjie Xu
    5. Binhu Wang
    6. Yingsong Wu
    7. Yongzhong Xie
    8. Jianbo Jian
    This article has been curated by 1 group:
    • Curated by GigaByte

      Editors Assessment:

      RAD-Seq (Restriction-site-associated DNA sequencing) is a cost-effective method for single nucleotide polymorphism (SNP) discovery and genotyping. In this study the authors performed a kinship analysis and pedigree reconstruction for two different cattle breeds (Angus and Xiangxi yellow cattle). A total of 975 cattle, including 923 offspring with 24 known sires and 28 known dams, were sampled and subjected to SNP discovery and genotyping using RAD-Seq. Producing a SNP panel with 7305 SNPs capturing the maximum difference between paternal and maternal genome information, and being able to distinguish between the F1 and F2 generation with 90% accuracy. Peer review helped highlight better the practical applications of this work. The combination of the efficiency of RNA-seq and advances in kinship analysis here can helpfully help improve breed management, local resource utilization, and conservation of livestock.

      This evaluation refers to version 1 of the preprint

    Reviewed by GigaByte

    This article has 2 evaluationsAppears in 2 listsLatest version Latest activity
  10. Chromosomal-level genome assembly and single-nucleotide polymorphism sites of black-faced spoonbill Platalea minor

    This article has 20 authors:
    1. Hong Kong Biodiversity Genomics Consortium
    2. Jerome H.L. Hui
    3. Ting Fung Chan
    4. Leo L. Chan
    5. Siu Gin Cheung
    6. Chi Chiu Cheang
    7. James K.H. Fang
    8. Juan Diego Gaitan-Espitia
    9. Stanley C.K. Lau
    10. Yik Hei Sung
    11. Chris K.C. Wong
    12. Kevin Y.L. Yip
    13. Yingying Wei
    14. Wai Lok So
    15. Wenyan Nong
    16. Sean T.S. Law
    17. Paul Crow
    18. Aiko Leong
    19. Liz Rose-Jeffreys
    20. Ho Yin Yip
    This article has been curated by 1 group:
    • Curated by GigaByte

      Editors Assessment: This work is part of a series of papers from the Hong Kong Biodiversity Genomics Consortium sequencing the rich biodiversity of species in Hong Kong (see https://doi.org/10.46471/GIGABYTE_SERIES_0006). This example assembles the genome of the black-faced spoonbill (Platalea minor), an emblematic wading bird from East Asia that is classified as globally endangered by the IUCN. This Data Release reporting a 1.24Gb chromosomal-level genome assembly produced using a combination of PacBio SMRT and Omni-C scaffolding technologies. BUSCO and Merqury validation were carried out, gene models created, and peer reviewers also requested MCscan synteny analysis. This showed the genome assembly had high sequence continuity with scaffold length N50=53 Mb. Presenting data from 14 individuals this will hopefully be a useful and valuable resources for future population genomic studies aimed at better understanding spoonbill species numbers and conservation.

      *This evaluation refers to version 1 of the preprint

    Reviewed by GigaByte

    This article has 2 evaluationsAppears in 2 listsLatest version Latest activity
Newer Page 2 of 11 Older