Giter Site home page Giter Site logo

embo_iclip_course's Introduction

iCLIP2 EMBO course 2024

Welcome to the bioinformatics part of the course!

We have two practical sessions for you.

Practical 1 - Exploring CLIP data with the UCSC genome browser

In this task, you will explore crosslink profiles and binding sites defined by different tools. We uploaded crosslink profiles of CSTF2 and SF3B1 to the UCSC genome browser for you. Have a look at https://genome.ucsc.edu/s/melinak/How_to_define_binding_iCLIP2_EMBO_course_bioinformatics. Go through the tasks below and make screenshots of the examples you find, so we can discuss your examples in the end.

  • CSTF2 is a polyadenylation factor. Find some examples of crosslink peaks at polyadenylation sites and alternative polyadenylation sites. Are these sites also annotated in the genome annotation?

  • For SF3B1 we uploaded the crosslinks from two different experiments: A very deep iCLIP2 experiment from Pacholewska et al. 2024 (https://doi.org/10.1101/2024.01.26.576051) and a less deep eCLIP experiment from the ENCODE database. Find some examples of overlaps and discrepancies in the crosslink profiles from both experiments.

  • There are different methods to define binding sites CLIP data. We included in the session also crosslink peaks from PureCLIP, binding sites from BindingSiteFinder and enriched binding regions from DEW-seq. Compare the binding sites from the different methods.

  • Another tool often used for binding site definition is clipper. Clipper binding sites are also provided in the ENCODE database for all ENCODE data sets. Add the clipper binding sites for CSTF2 to the UCSC genome browser session (see institutions below) and compare the binding sites to the other methods.

How to load ENCODE data to the UCSC genome browser

  • Go to https://www.encodeproject.org/ and click Data > Experiment search.
  • Type your protein of interest in the Search field in the top right corner.
  • On the left, you can choose Data type > Dataset to reduce the number of results. Then select the CSTF2 HepG2 eCLIP data set. You will now see a summary of this experiment.
  • Scroll down to Files and click on Output type > peaks in the bar on the left to select the clipper peaks.
  • Now you can click Genome browser to directly see the clipper peaks in a genome browser.
  • To load the clipper peaks to the UCSC session with the other tracks, go to File details and click Visualize.

Practical 2 - Finding binding motifs with XSTREME

In this task, you will predict binding motifs based on the sequences of binding sites. For this, we will use the XSTREME algorithm, which is part of the MEME suite. We provide you with fastq files from CSTF2, RBFOX2 and PUM2 binding sites.

embo_iclip_course's People

Contributors

melinaklostermann avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.