Improved functions for non-linear sequence comparison using SEEKR

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

SEquence Evaluation through k -mer Representation (SEEKR) is a method of sequence comparison that utilizes sequence substrings called k -mers to quantify non-linear similarity between nucleic acid species. We describe the development of new functions within SEEKR that enable end-users to estimate p-values that ascribe statistical significance to SEEKR-derived similarities as well as visualize different aspects of k -mer similarity. We apply the new functions to identify chromatin-enriched long noncoding RNAs (lncRNAs) that harbor XIST -like sequence fragments and show that several of these fragments are bound by XIST -associated proteins. We also highlight the best practice of using RNA-Seq data to evaluate support for lncRNA annotations prior to their in-depth study in cell types of interest.

Article activity feed