Endorsed by GigaByte

Showing page 1 of 3 pages of list content

EMImR: a Shiny application for identifying transcriptomic and epigenomic changes

This article has 5 authors:
1. Hiba Ben Aribi
2. Careen Naitore
3. Farah Ayadi
4. Souheila Guerbouj
5. Olaitan I. Awe
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  Coded and written up as part of the African Society for Bioinformatics and Computational Biology (ASBCB) Omicscodeathons, EMImR is a novel Shiny application for transcriptomic and epigenomic change identification and correlation wrapped up using a combination of Bioconductor and CRAN packages. Case studies are on publicly available GEO data corresponding to sequencing data of human blood cell samples of multiple sclerosis patients to demonstrate how the tool works. And a documentation and videos are provided. Peer review and the study highlighting the usefulness of the developed tool for analyzing transcriptomic and epigenomic data.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Oct 30, 2025Latest activity Nov 28, 2025
Aedes mosquito distribution across urban and peri-urban areas of Kinshasa city, Democratic Republic of Congo

This article has 15 authors:
1. Victoire Nsabatien
2. Josue Zanga
3. Nono Mvuama
4. Arsene Bokulu
5. Hyacinthe Lukoki
6. Glodie Diza
7. Dorcas Kantin
8. Leon Mbashi
9. Christelle Bosulu
10. Narcisse Basosila
11. Erick Bukaka
12. Fiacre Agossa
13. Jonas Nagahuedi
14. Jean-Claude Palata
15. Emery Metelo
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  In the Democratic Republic of Congo (DRC) Aedes mosquitoes are principal vectors of the arboviruses that cause yellow fever, chikungunya and dengue in the human population. However systematic surveillance data on these species remains limited, hindering for entomological and modelling research and control strategies. This paper is one of a series of Data Release papers in GigaByte supported by TDR and the WHO describing datasets hosted in GBIF to tackle these data gaps in vectors of human disease data. To address this data deficiency this paper presents a geo-referenced dataset of 6,577 entomological occurrence records collected in 2024 throughout urban and peri-urban areas of Kinshasa in the Democratic Republic of Congo. The data collected using Larval dipping, Human landing catches, Prokopack aspirator, and BG-Sentinel traps. Data auditing and peer review found the data well validated, but requested some additional fields and methodological details. This work and the extremely useful data provided representing an important step towards building a pan-African resource for Aedes mosquito data collection.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Oct 7, 2025Latest activity Sep 17, 2025
Whole genome sequencing and assembly of the house sparrow, Passer domesticus

This article has 13 authors:
1. Vikas Kumar
2. Gopesh Sharma
3. Sankalp Sharma
4. Samvrutha Prasad
5. Shailesh Desai
6. Toral Vaishnani
7. Dalia Vishnudasan
8. Gopinathan Maheswaran
9. Kaomud Tyagi
10. Inderjeet Tyagi
11. Polavarapu B Kavi Kishor
12. Gyaneshwer Chaubey
13. Prashanth Suravajhala
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  This paper presents present the genome sequencing of the house sparrow (Passer domesticus) carrying out genome assembly and annotation using in silico approaches with tools that could be a valuable resource for understanding passerine evolution, biology, ethnology, geography, and demography. The final genome assembly was generated using short read sequencing and a computational workflow that included Shovill, SPAdes, MaSuRCA, and BUSCO benchmarking. Producing a 922 MB reference genome with 24,152 genes. The first draft was significantly smaller than this but peer review provided suggestions on how to improve the assembly quality. And after a few attempts and assembly with a reasonable size and BUSCO score was achieved. This openly available data potentially serving as a valuable resource for checking adaptation, divergence, and speciation of birds.
  
  This evaluation refers to version 2 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Jul 21, 2025Latest activity Sep 2, 2025
Chevreul: an R bioconductor package for exploratory analysis of full-length single cell sequencing

This article has 3 authors:
1. Kevin Stachelek
2. Bhavana Bhat
3. David Cobrinik
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  This paper presents Chevreul, a new open-source R Bioconductor (meta-)package for processing and integration of scRNA-seq data from cDNA end-counting, full-length short-read or long-read protocols. Alongside a R Shiny app for easy visualization, formatting, and analysis for exploratory analyses of scRNA-seq data processed in the SingleCellExperiment Bioconductor or Seurat formats. The name of the tool is inspired by the colour theorist Michel-Eugène Chevreul and the optical illusion of the same name. To demonstrate the use of Chevreul, the authors provide a sample analysis, which helps to demonstrate how users can visualize a wide range of parameters, enabling transparent and reproducible scRNA-seq analyses. Peer review also pushing the author to provide extensive guidance materials to assist with use. Being implemented in R, the R package and integrated Shiny application are freely available under an open-source MIT license in Bioconductor and their GitHub page here: https://github.com/cobriniklab/chevreul
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Jun 24, 2025Latest activity Jul 30, 2025
Chromosome-level genome assembly of the lemon sole, Microstomus kitt (Pleuronectiformes: Pleuronectidae)

This article has 15 authors:
1. Marcel Nebenführ
2. David Prochotta
3. Maria A. Nilsson
4. Menno J. de Jong
5. Tunca D. Yazici
6. Fabienne Langefeld
7. Malambo Muloongo
8. Helena Woköck
9. Jakob Jilg
10. Sina C. Bender
11. Marvin M. Zangl
12. Juan-Manuel Ortega Guatame
13. Kimberley Williams
14. Moritz Sonnewald
15. Axel Janke
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  This Data Release paper presents the first genome assembly of the lemon sole (Microstomus kitt), a commercially important flatfish found in European coastal waters. It is also interesting that this work was carried out in a University course setting involving the students. The resulting chromosome-level genome was assembled using long-read PacBio HiFi sequencing and the Hi-C technique. The 628 Mbp reference (which is consistent with other Pleuronectidae fish species) is assembled into 24 chromosome-length scaffolds with high completeness, achieving a scaffold N50 of 27.2 Mbp. Peer review and data curation made the author clarify a few points and share all of the data and results in an open and well curated manner. The annotated genome of the lemon sole, with its high continuity, should therefore provide important reference data for future population genetic analyses and conservation strategies of this organism.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version May 27, 2025Latest activity May 30, 2025
Chromosome-level genome assemblies of five Sinocyclocheilus species

This article has 6 authors:
1. Chao Bian
2. Ruihan Li
3. Yuqian Ouyang
4. Junxing Yang
5. Xidong Mu
6. Qiong Shi
This article has been curated by 1 group:
- Curated by GigaByte
  
  **Editors Assessment: ** Sinocyclocheilus are a genus of freshwater cavefish fish that are endemic to the Karst regions of Southwest China. Having diverse traits in morphology, behavior, and physiology typical of cavefish, that make them interesting models for studying cave adaptation and phylogenetic evolution. The manuscript assembled chromosomal-level genomes of five Sinocyclocheilus species, and conducted allotetraploid origin analysis on these species. Assembling S. grahami (the golden-line barbel), using PacBio and Hi-C sequencing technologies, a final chromosome-level genome assembly was 1.6 Gb in size with a contig N50 of 738.5 kb and a scaffold N50 of 30.7 Mb. With 93.1% of the assembled genome sequences and 93.8% of the predicted genes anchored onto 48 chromosomes. Subsequently the authors conducted a homologous comparison to obtain chromosome-level genome assemblies for four other Sinocyclocheilus species: S. maitianheensis, S. rhinocerous, S. anshuiensis, and S. Anophthalmus. With over 82% of the genome sequences anchored on these constructed chromosomes. Peer review provided clarification on the assembly strategy and provided more benchmarking. This data having the potential to contribute to species conservation and the exploitation of potential economic and ecological values of diverse Sinocyclocheilus members.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version May 9, 2025Latest activity May 14, 2025
Efficiently constructing complete genomes with CycloneSEQ to fill gaps in bacterial draft assemblies

This article has 19 authors:
1. Hewei Liang
2. Yuanqiang Zou
3. Mengmeng Wang
4. Tongyuan Hu
5. Haoyu Wang
6. Wenxin He
7. Yanmei Ju
8. Ruijin Guo
9. Junyi Chen
10. Fei Guo
11. Tao Zeng
12. Yuliang Dong
13. Yuning Zhang
14. Bo Wang
15. Chuanyu Liu
16. Xin Jin
17. Wenwei Zhang
18. Xun Xu
19. Liang Xiao
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  With the recent official launch of BGI’s new CycloneSEQ sequencing platform that delivers long-reads using novel nanpores, this paper presents benchmarking data and validation studies comparing short, long-rea data from other platforms and hybrid assemblies. This study tests the performance of the new platform in sequencing diverse microbial genomes, presenting raw and processed data to enable others to scrutinise and verify the work. Being openly peer-reviewed, and having scripts and protocols also shared for the first time helps provide transparency in this benchmarking process to increase trust in this new technology. On top of benchmarking typed strains, the technology also was tested with complex microbial communities. Yielding complete metagenome-assembled genomes (MAGs) which were not achieved by short- or long-read assemblies alone. By directly reading DNA molecules without fragmentation, the study demonstrating CycloneSEQ delivers long-read data with impressive length and accuracy, unlocking gaps that short-read technologies alone cannot bridge. Future work is expanding to real samples, with and fine-tuning the balance between short-read and long-read data for even faster, higher-quality assemblies.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 3 evaluationsAppears in 2 listsLatest version Apr 25, 2025Latest activity Apr 28, 2025
Scott C Edmunds

See more on the transparent benchmarking of this new technology here:

http://gigasciencejournal.com/blog/open-cycloneseq-benchmarking-for-complete-bacterial-genomes/

As well as the protocol collection in protocols.io: https://dx.doi.org/10.17504/protocols.io.kqdg3k3kev25/v1
Genome assembly and annotation of Acropora pulchra from Mo’orea French Polynesia

This article has 4 authors:
1. Trinity Conn
2. Jill Ashey
3. Ross Cunning
4. Hollie M. Putnam
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  Acropora pulchra is a species small polyped stony corals in the family Acroporidae from the the Indo-Pacific. This Data Release is the first study in stony corals to present the DNA methylome in tandem with a high-quality genome assembled utilizing PacBio long-read HiFi sequencing. Sequencing an A. pulchra specimen from Mo’orea, French Polynesia. From this single molecule sequencing data DNA methylation data was also called and quantified, and additional short-read Illumina RNASeq data was used for gene annotation. This producing an assembly size is 518 Mbp, with 174 scaffolds, and a scaffold N50 of 17 Mbp, and 40,518 protein-coding genes called. Peer review requested some improved benchmarking, and it is impressive to see from the results that the genome assembly represents the most complete and contiguous stony coral genome assembly to date. As an important indicator species and this data will hopefully serve as a resource to the coral and wider scientific community. Further quantification of the genome-wide methylation is needed aid the study epigenetics of non-model organisms, and specifically future analyses on methylation in coral.
  
  *This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Apr 10, 2025Latest activity Apr 13, 2025
CompactTree: a lightweight header-only C++ library and Python wrapper for ultra-large phylogenetics

This article has 1 author:
1. Niema Moshiri
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  As volumes of viral and bacterial sequence data grow exponentially, the field of computational phylogenetics now demands resources to manage the burgeoning scale of this input data. This study introduces CompactTree, a C++ library designed for ultra-large phylogenetic trees with millions of tips. To address these scalability issues while maintaining ease of incorporation into external code bases, CompactTree is a header-only library with enhanced performance utilizing minimal dependencies, optimized node representation, and memory-efficient tree structure schemes. Resulting in significantly reduced memory footprints and improved processing times. Peer review requested some more detail on the functionality and some real-world examples, demonstrating the current utility of the tool. Although primarily supporting the (text-based) Newick format, the increased and extensibility scalability holds promise for multiple biological and epidemiological applications supporting more complex formats such as Nexus and NeXML. The tool is open source (GPLv3 licensed) and available in GitHub: https://niema.net/CompactTree
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Mar 7, 2025Latest activity Mar 23, 2025
Draft genome of the endangered visayan spotted deer (Rusa alfredi), a Philippine endemic species

This article has 8 authors:
1. Ma. Carmel F. Javier
2. Albert C. Noblezada
3. Persie Mark Q. Sienes
4. Robert S. Guino-o
5. Nadia Palomar-Abesamis
6. Maria Celia D. Malay
7. Carmelo S. del Castillo
8. Victor Marco Emmanuel N. Ferriols
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  The Visayan spotted deer (Rusa alfredi), is a small, endangered, primarily nocturnal species of deer found in the rainforests of the Visayan Islands in the Philippines. The present study reports the first draft genome assembly for the species, addressing a critical gap in genomic data for this IUCN-redlisted cervid. Using Illumina sequencing, the resulting genome assembly spans 2.52 Gb in size with a BUSCO completeness score of 95.5% and encompasses 24,531 annotated genes. Phylogenetic analysis suggests a close evolutionary relationship between R. alfredi and Cervus species suggesting that the genus Rusa is sister to Cervus. Peer-review teased out more benchmarking results and the annotation files, demonstrating this genomic resource is useful and usable for advancing population genetics and evolutionary studies, thereby informing conservation strategies and enhancing breeding programs for the critically threatened species. Providing whole genome sequences for other native species of Rusa could further provide genomic resources for detecting hybrids, which will also help the management and monitoring of these species, especially for the reintroduction of captive populations in the wild.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Feb 24, 2025Latest activity Mar 17, 2025
The assembly and annotation of two teinturier grapevine varieties, Dakapo and Rubired

This article has 6 authors:
1. Eleanore J. Ritter
2. Noé Cochetel
3. Andrea Minio
4. Peter Cousins
5. Dario Cantu
6. Chad Niederhuth
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  Teinturier grapes produce berries with pigmented skin and flesh, and are used in red wine blends, as they provide a deeper colour. This paper presents the genomes of two popular teinturier varieties (Dakapo and Rubired); sequenced, assembled, and annotated to provide additional resources for their use in breeding. Combining Nanopore and Illumina sequencing for Dakapo, scaffolding to the existing grapevine assembly to generate a final assembly of 508.5 Mbp and 36,940 gene annotations. For Rubired PacBio HiFi reads were assembled, scaffolded, and phased to generate a diploid assembly with two haplotypes 474.7-476.0 Mbp long and 56,681 genes annotated. Peer review has helped validate their high quality, these genomes hopefully enabling more insight into the genetics of grapevine berry colour and their other traits like frost and mildew-resistance.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Feb 27, 2025Latest activity Mar 17, 2025
SqueezeCall: nanopore basecalling using a Squeezeformer network

This article has 1 author:
1. Zhongxu Zhu
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  The accuracy of basecalling of nanopore sequencing still needs to be improved. With recent advances in deep learning this paper introduces SqueezeCall, a novel end-to-end tool for accurate basecalling. This uses Squeezeformer-achitecture which integrates local context extraction through convolutional layers and long-range dependency modeling via global context acquisition. Testing and peer review demonstrated that SqueezeCall outperformed traditional RNN and Transformer-based basecallers across multiple datasets, indicating its potential to refine genomic assembly and facilitate direct detection of modified bases in future genomic analytics. Future work is ongoing that will focus on training on highly curated datasets, including known modifications, to further increase research value. SqueezeCall is MIT licensed and available from GitHub here: https://github.com/labcbb/SqueezeCall
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Feb 14, 2025Latest activity Mar 15, 2025
A practical DNA data storage using an expanded alphabet introducing 5-methylcytosine

This article has 11 authors:
1. Deruilin Liu
2. Demin Xu
3. Liuxin Shi
4. Jiayuan Zhang
5. Kewei Bi
6. Bei Luo
7. Chen Liu
8. Yuxiang Li
9. Guangyi Fan
10. Wen Wang
11. Zhi Ping
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  DNA has huge potential as a data storage medium because of its incredibly high storage density and stability. This work addresses the potential of modified bases, specifically 5-methylcytosine (5mC), in enhancing DNA data storage systems. This paper introduces a transcoding scheme named R+, which incorporates this modified 5mC base to increase information density beyond the standard limits. By encoding various file types into DNA sequences of between 1.3 to 1.6 kb in size, this method achieves an average recovery rate of 98.97% (with reference), validating the effectiveness of the method. On top of a wet-lab protocol (hosted in protocols.io) for the experimental validation of the transcoding scheme, it also includes open source code for in-silico simulation tests. Peer review scruitinising the protocols and validation are reusable and provide convincing results. As nanopore sequencing has enabled reading of these modified bases, it is timely making them applicable as extra letters in the molecular alphabet for DNA data storage
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Jan 24, 2025Latest activity Feb 4, 2025
Scott C Edmunds

This is GigaByte's first example of a paper with a peer-reviewed protocols.io protocol, and the reviews shared here include a protocol peer review. See the blog covering this: http://gigasciencejournal.com/blog/publish-peer-reviewed-protocol-papers/

This is the associated protocol in protocols.io:

Liu D et al., Experimental validation for transcoding scheme name R+ in DNA data storage. Protocols.io. 2024; https://dx.doi.org/10.17504/protocols.io.q26g7mr78gwz/v1

The genome of the sapphire damselfish Chrysiptera cyanea: a new resource to support further investigation of the evolution of Pomacentrids

Emma Gairin
Saori Miura
Hiroki Takamiyagi
Marcela Herrera
Vincent Laudet

Curated by GigaByte

Editors Assessment:

Among hot topics in coral reef research, the difference between anemonefish and other damselfish is currently a popular area of research. In this study the authors provide a new high-quality non-anemonefish genome, which will be of high relevance to further the depth of such analyses. In this case of the sapphire damselfish Chrysiptera cyanea, a widely distributed damselfish in the Indo-Pacific area, often studied to elucidate the roles of various environmental controls on their reproduction, and investigate related hormonal processes To further the potential of biomolecular analyses based on this species, this study generated the first genome of a Chrysiptera fish from a male individual collected in Okinawa, Japan. Using PacBio and HiFI long-read sequencing with 94.5x coverage, a chromosome-scale genome was assembled and 28,173 genes identified and annotated. Peer review gathered more parameters and details on the quality, and the final assembly comprised of 896 Mb pairs across 91 contigs, and a BUSCO completeness of 97.6%. This reference genome should therefore be of high value for future genetic-based approaches, from population structure to gene expression analyses.

This evaluation refers to version 1 of the preprint

Reviewed by GigaByte

Whole-genome re-sequencing of the Baikal seal and other phocid seals for a glimpse into their genetic diversity, demographic history, and phylogeny

This article has 3 authors:
1. Marcel Nebenführ
2. Ulfur Arnason
3. Axel Janke
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  Due to them being found in the landlocked, isolated habitat of Lake Baikal makes the Baikal Seal (Pusa sibirica) unique among all pinnipeds as the only freshwater seal. This paper presents reference-based assemblies of six newly sequenced Baikal seal individuals, one individual of the ringed seal, as well as the first short-read data of the harbor seal and the Caspian seal . This data aiding the study of the genomic diversity of the Baikal seal and to contribute baseline data to the limited genomic data available for seals. Peer review extended the description of the used tools and parameters in the revised manuscript, and provided some more information on the methods..This newly generated sequencing data hopefully now helps to extend the phylogeny of the Phoca/Pusa group on genome-wide data and can also broaden the view into the genetic structure and diversity of the Baikal seal
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Nov 20, 2024Latest activity Nov 21, 2024
TSTA: thread and SIMD-based trapezoidal pairwise/multiple sequence-alignment method

This article has 4 authors:
1. Peiyu Zong
2. Wenpeng Deng
3. Jian Liu
4. Jue Ruan
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  The article presents strategies for accelerating sequence alignment using multithreading and SIMD (Single Instruction, Multiple Data) techniques, and introduces a new algorithm called TSTA (Thread and SIMD-Based Trapezoidal Pairwise/Multiple Sequence-Alignment). The Technical Release write-up presenting a detailed description of TSTA's performance in pairwise sequence alignment (PSA) and multiple sequence alignment (MSA), and compares it with various existing alignment algorithms. Demonstrating the performance gains achieved by vectorized SIMD technology and the application of threading. Testing and debugging a few errors, and adding some more background detail, demonstrating it can achieve faster comparison speed. Demonstrating TSTA's efficacy in pairwise sequence alignment and multiple sequence alignment, particularly with long reads, and showcasing considerable speed enhancements compared to existing tools.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Nov 5, 2024Latest activity Nov 21, 2024
Chromosome-level genome assembly and annotation of the crested gecko, Correlophus ciliatus, a lizard incapable of tail regeneration

This article has 3 authors:
1. Marc A. Gumangan
2. Zheyu Pan
3. Thomas P. Lozito
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  The crested gecko (Correlophus ciliatus), is a lizard species endemic to New Caledonia, and a potentially interesting model organism due to its unusual (for a gecko) inability to regenerate amputated tails. With that in mind here is presented a new reference genome for the species, assembled using PacBio Sequel II platform and Dovetail Omni-C libraries. Producing a genome with a total size of 1.65 Gb, 152 scaffolds, a L50 of 6, and N50 of 109 Mb. Peer review making sure more detail was added on data acquisition and processing to enhance reproducibility. In the end producing potentially useful data for studying the genetic mechanisms involved in loss of tail regeneration.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Nov 6, 2024Latest activity Nov 21, 2024
SMARTER-database: a tool to integrate SNP array datasets for sheep and goat breeds

This article has 18 authors:
1. Paolo Cozzi
2. Arianna Manunza
3. Johanna Ramirez-Diaz
4. Valentina Tsartsianidou
5. Konstantinos Gkagkavouzis
6. Pablo Peraza
7. Anna Maria Johansson
8. Juan José Arranz
9. Fernando Freire
10. Szilvia Kusza
11. Filippo Biscarini
12. Lucy Peters
13. Gwenola Tosser-Klopp
14. Gabriel Ciappesoni
15. Alexandros Triantafyllidis
16. Rachel Rupp
17. Bertrand Servin
18. Alessandra Stella
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  This paper presents the SMARTER database, a collection of tools and scripts to gather, standardize, and share with the scientific community a comprehensive dataset of genomic data and metadata information on worldwide small ruminant populations. Which has come out of the EU multi-actor (12 country) H2020 project called SMARTER: SMAll RuminanTs breeding for Efficiency and Resilience. This bringing together genotypes for about 12,000 sheep and 6,000 goats, alongside phenotypic and geographic information. The paper providing insight into how the database was put together, presenting the code for the SMARTER—frontend, backend and API, alongside instructions for users. Peer review tested the platform and provided suggestions on improving the metadata. Demonstrating the project provides valuable information on sheep and goat populations around the world, that can be an essential tool for ruminant researchers. Enabling them to generate new insights and offer the possibility to store new genotypes and drive progress in the field.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Oct 21, 2024Latest activity Nov 21, 2024
NucBalancer: streamlining barcode sequence selection for optimal sample pooling for sequencing

This article has 2 authors:
1. Saurabh Gupta
2. Ankur Sharma
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  This paper presents NucBalancer, a R-pipeline and Shiny app designed for the optimal selection of barcode sequences for sample multiplexing in sequencing. Providing a user-friendly interface aiming to make this process accessible to both bioinformaticians and experimental researchers, enhancing its utility in adapting libraries prepared for one sequencing platform to be compatible with others. Important now with the introduction of additional sequencing platforms by Element Biosciences (AVITI System) and Ultima Genomics (UG100) increasing the diversity and capability of genomic research tools available. NucBalancer’s incorporation of dynamic parameters, including customizable red flag thresholds, allows for precise and practical barcode sequencing strategies. This adaptability is key in ensuring uniform nucleotide distribution, particularly in MGI sequencing and single-cell genomic studies, leading to more reliable and cost-effective sequencing outcomes across various experimental conditions. All the code is available under an open source license, and upon review the authors have also shared the code for the Shiny app.
  
  This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Oct 11, 2024Latest activity Nov 21, 2024
Building a community-driven bioinformatics platform to facilitate Cannabis sativa multi-omics research

This article has 4 authors:
1. Locedie Mansueto
2. Tobias Kretzschmar
3. Ramil Mauleon
4. Graham J. King
This article has been curated by 1 group:
- Curated by GigaByte
  
  Editors Assessment:
  
  This paper reports the establishment of the International Cannabis Genomics Research Consortium (ICGRC) web portal leveraging the open source Tripal platform to enhance data accessibility and integration for Cannabis sativa (Cannabis) multi-omics research. With the aim of bringing together the wealth of publicly available genomic, transcriptomic, proteomic, and metabolomic data sets to improve cannabis for food, fiber and medicinal traits. Tripal is a content management system for genomics data, presenting a ready-to-use specialized ‘omics modules for loading, visualization, and analysis, and is GMOD (Generic Model Organism Database) standards-compliant. The paper explaining how this was put together, what data and features are available, and providing a case study for other communities wanting to create their own Tripal platform. Covering their setup and customizations of the Tripal platform, and how they re-engineered modules for multi-omics data integration, and addition of many other custom features that can be reused. Peer review fixed a few minor bugs and added clarifications on how the platform will be updated.
  
  *This evaluation refers to version 1 of the preprint
Reviewed by GigaByte

This article has 2 evaluationsAppears in 2 listsLatest version Oct 18, 2024Latest activity Oct 20, 2024
Scott C Edmunds

There is a Cassyni webinar from the first author of this preprint which presents on this work here https://doi.org/10.52843/cassyni.y1p61f