Automatic identification and annotation of MYB gene family members in plants


Abstract

Background

MYBs form one of the largest transcription factor families in plants. Consequently, members of this family are involved in a plethora of processes including development and specialized metabolism. The MYB families of many plant species have been investigated over the last two decades, following the first investigation in Arabidopsis thaliana. This body of knowledge and the characterized sequences provide the basis for the identification, classification, and functional annotation of candidate sequences in new genome and transcriptome assemblies.

Results

A pipeline for the automatic identification and functional annotation of MYBs in a given sequence data set was implemented in Python. MYB candidates are identified, screened for the presence of a MYB domain and other motifs, and finally placed in a phylogenetic context with well-characterized sequences. In addition to technical benchmarking based on existing annotation, the transcriptome assembly of Croton tiglium and the annotated genome sequence of Castanea crenata were screened for MYBs. Results of both analyses are presented in this study to illustrate the potential of this application. The analysis of one species takes only a few minutes, depending on the number of predicted sequences and the size of the MYB gene family. The pipeline, the required bait sequences, and the reference sequences for classification are freely available on GitHub at https://github.com/bpucker/MYB_annotator.
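
The domain-screening step described above can be illustrated with a minimal Python sketch. This is not the code of MYB_annotator and does not reproduce its method; it is a toy stand-in that assumes a MYB repeat can be approximated by three tryptophan residues spaced 18 to 21 residues apart. The pattern, the function names, and the `min_repeats` parameter are all hypothetical and chosen only for illustration.

```python
import re
from typing import Dict

# Hypothetical, simplified pattern for one MYB repeat: three tryptophans (W)
# separated by 18-21 residues. Real MYB repeats are more variable (for example,
# the first tryptophan of R3 is often substituted), so this is an assumption
# made for illustration only.
MYB_REPEAT = re.compile(r"W.{18,21}W.{18,21}W")


def count_myb_repeats(protein: str) -> int:
    """Count non-overlapping repeat-like matches in a protein sequence."""
    return len(MYB_REPEAT.findall(protein.upper()))


def screen_candidates(candidates: Dict[str, str], min_repeats: int = 1) -> Dict[str, int]:
    """Return the candidates that carry at least `min_repeats` repeat-like matches."""
    hits = {}
    for name, sequence in candidates.items():
        n_repeats = count_myb_repeats(sequence)
        if n_repeats >= min_repeats:
            hits[name] = n_repeats
    return hits
```

Calling `screen_candidates(candidates, min_repeats=2)` on a dictionary of peptide sequences would keep only candidates with at least two repeat-like matches, loosely mimicking a filter for R2R3-type MYBs. A production pipeline would more plausibly rely on sequence similarity searches or profile-based domain detection than on a single regular expression.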

Conclusions

This automatic annotation of the MYB gene family in novel assemblies makes genome-wide investigations consistent and paves the way for comparative studies in the future. Candidate genes for in-depth analyses are presented based on their orthology to previously characterized sequences, which allows the functional annotation of the newly identified MYBs with high confidence. The identification of orthologs can also be harnessed to detect duplication and deletion events.

Article activity feed

  1. Peer review report

    Reviewer: Deyou Qiu
    Institution: Chinese Academy of Forestry
    Email: qiudy@caf.ac.cn


    Section 1 – Serious concerns

    • Do you have any serious concerns about the manuscript such as fraud, plagiarism, unethical or unsafe practices? No
    • Have the authors provided the necessary ethics approval (from the authors’ institution or an ethics committee)? Not applicable

    Section 2 – Language quality

    • How would you rate the English language quality? High quality

    Section 3 – validity and reproducibility

    • Does the work cite relevant and sufficient literature? Yes
    • Is the study design appropriate and are the methods used valid? Yes
    • Are the methods documented and analysis provided so that the study can be replicated? Yes
    • Is the source data that underlies the result available so that the study can be replicated? Yes
    • Are the statistical analysis and its interpretation appropriate? Yes
    • Is the quality of the figures and tables satisfactory? Yes
    • Are the conclusions adequately supported by the results? Yes
    • Are there any objective errors or fundamental flaws that make the research invalid? Please describe these thoroughly. No

    Section 4 – Suggestions

    • In your opinion how could the author improve the study? No
    • Do you have any other feedback or comments for the Author?

    The resolution of Fig. 3 is not good. Could you please improve it?


    Section 5 – Decision

    Requires revisions