Giter Site home page Giter Site logo

rnastructureidentifier's Introduction

RNAStructureIdentifier

A Mathematica package for identifying the target secondary structure from RNA structure ensemble.

Amongst other things, our implementations allow you to:

  • compute the ensemble tree, a hierarchical bi-partition of the ensemble of structures
  • calculate the base pairs having maximum information entropy
  • predict the target structure via recursively querying about whether or not a base pair of maximum entropy is contained in the target
  • visualize the ensemble tree, together with base pairs of maximum entropy
  • draw a single secondary structure as a diagram
  • present an ensemble of secondary structures as a greyscale diagram

Introduction

The package focuses on how to identify a target from a Boltzmann ensemble of secondary structures. The key idea is to employ an information-theoretic approach to solve the problem, via considering a variant of the Rényi-Ulam game. Our framework is centered around the ensemble tree, a hierarchical bi-partition of the input ensemble, that is constructed by recursively querying about whether or not a base pair of maximum information entropy is contained in the target. These queries are answered via relating local with global chemical probing data, employing the modularity in RNA secondary structures (see References).

In 1, we present that leaves of the tree are comprised of sub-samples exhibiting a distinguished structure with high probability. In particular, for a Boltzmann ensemble incorporating probing data, which is well established in the literature, the probability of our framework correctly identifying the target in the leaf is greater than 90%.

Demonstration

We present a demo of the ensemble trees for 5S rRNA, riboswitch and long non-coding RNA.

Installation

To use the package, the only thing you need to install is Wolfram Mathematica Free Trial.

We have tested our package using Mathematica versions 11.3.0 and 12.0.0 under UNIX and Windows system.

Usage

Download the package to your Notebook directory and load the pacakge with

Get[NotebookDirectory[] <> "RNAStructureIdentifier.wl"]

Quick Start

We provide a quick tutorial (tutorial.nb) to demonstrate functions by examples.

References

If you use our package, you may want to cite the follwing publications:

  1. Thomas J.X. Li and Christian M. Reidys (2020) "On an enhancement of RNA probing data using Information Theory", Algorithms for Molecular Biology, 15: 15.

Contact

We need your feedback! Send your comments, suggestions, and questions to [email protected]

Thomas Li, Autumn 2019

rnastructureidentifier's People

Contributors

gaussbackyard avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.