samhuairen / re-searcher

This project forked from labbandsb/re-searcher

"re-Searcher" is a toolbox that simplifies genomic data mining from large VCF files.

License: GNU General Public License v3.0

Python 100.00%

re-Searcher

re-Searcher is a tool that simplifies genomic data mining from VCF files. Common tasks no longer require writing Python or R scripts in an IDE. re-Searcher can work with any flavor of VCF, for instance a VCF annotated with ANNOVAR.

🚀 Quick Start

  1. Click Releases on the right side of this page and download re-Searcher.zip from the latest release.

  2. To unzip the archive, right-click it and select Extract All (or use another unzipper such as 7-Zip).

  3. Execute the **re-searcher** file to run the tool.

✅ Features

  1. Browse and open VCF files.

  2. Extract the header from a VCF to a new VCF.

  3. Extract lines containing user-supplied keywords, typed into the input entry or read from a Text Document (.txt) file, to a new VCF file. For example, you can filter your VCF file to keep only rows containing the keyword 'INDEL'.

  4. Extract the columns for user-supplied samples, typed into the input entry or read from a Text Document (.txt) file, to a new VCF file. Samples that were not requested are cut from the original VCF, and only the requested samples remain. For example, you can filter your VCF file to keep only two samples: WE0001 and WE006.

  5. Convert the genotype (GT) format from numeric GT to letter GT. The original GT format is numeric (0/0, 0/1, 1/0 or 1/1, etc.), where 0 is the reference (REF) allele and 1 is the alternative (ALT) allele. Multiallelic rows (0/3, 3/2) are converted by inserting the respective ALT allele.

⚒ Usage

● To open a VCF file, click the ‘Browse’ button and navigate to your file's location.

● To extract the header from a VCF, click the ‘Extract Header’ button.

The program will ask you where to save the output file with the extracted header.

This is useful when the file is too big to open in Excel but you need some information from the header (for instance, meta-lines or sample names).
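For readers who want to see what header extraction amounts to, here is a minimal sketch. It is not re-Searcher's actual code: the function name `extract_header` and the in-memory demo file are assumptions made for illustration. It relies only on the VCF convention that header lines start with `#` and come before the first data record.

```python
import io
from typing import Iterable, List


def extract_header(lines: Iterable[str]) -> List[str]:
    """Collect the leading header lines of a VCF (those starting with '#')."""
    header = []
    for line in lines:
        if not line.startswith("#"):
            break  # first data record reached; the rest of the file is not read
        header.append(line)
    return header


# Tiny in-memory VCF used only for illustration (hypothetical data).
demo_vcf = io.StringIO(
    "##fileformat=VCFv4.2\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tWE0001\n"
    "1\t100\t.\tA\tG\t50\tPASS\t.\tGT\t0/1\n"
)
header_lines = extract_header(demo_vcf)
print("".join(header_lines), end="")
```

Because the loop stops at the first data record, the same approach works on a real file opened with `open(path)` without loading the whole file into memory.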

● To search for and extract lines containing certain keywords, first type your keywords into the ‘Keywords’ entry, then press the ‘Extract’ button.

This is useful when only the rows containing particular keywords are needed.

● When there are too many keywords to type in manually, create a Text Document and paste all the necessary keywords into it in one column (one keyword per line).

Then press the ‘Extract from File’ button, and the program will ask you to open the Text Document with the keywords that you created.
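The keyword search described above can be sketched in a few lines. This is an illustration under stated assumptions, not the tool's implementation: the names `load_keywords` and `filter_by_keywords` are hypothetical, matching is assumed to be a case-sensitive substring test, a row is kept if it contains any one keyword, and header lines are assumed to be carried over unchanged.

```python
from typing import Iterable, Iterator, List


def load_keywords(text: str) -> List[str]:
    """Parse one-keyword-per-line text into a list, skipping blank lines."""
    return [kw.strip() for kw in text.splitlines() if kw.strip()]


def filter_by_keywords(lines: Iterable[str], keywords: List[str]) -> Iterator[str]:
    """Yield header lines plus every data line containing at least one keyword."""
    for line in lines:
        # Assumption: header lines are always kept so the output stays a valid VCF.
        if line.startswith("#") or any(kw in line for kw in keywords):
            yield line


# Hypothetical demo data: two data rows, only one of which mentions INDEL.
demo_lines = [
    "##fileformat=VCFv4.2\n",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n",
    "1\t100\t.\tA\tG\t50\tPASS\tDP=10\n",
    "1\t200\t.\tAT\tA\t50\tPASS\tINDEL;DP=12\n",
]
keywords = load_keywords("INDEL\n\n")  # same shape as a one-column .txt file
kept = list(filter_by_keywords(demo_lines, keywords))
print("".join(kept), end="")
```

Substring matching is the simplest possible rule; a keyword such as `DP=1` would also match `DP=10`, so stricter matching may be preferable for short keywords.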

● To search for and extract particular samples (columns), first type the sample names into the ‘Sample’ entry, then press the ‘Extract’ button. The program will cut off the samples that were not requested.

This is useful when only particular samples are needed.

● As with the multiple-keyword search from a file, you can supply a file with sample names. Create a .txt file and paste all the necessary sample names into it in one column (one sample per line).

Then press the ‘Extract from File’ button, and the program will ask you to open the .txt file with the sample names that you created.
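Sample extraction boils down to selecting columns. The sketch below is one way to do it and is not taken from re-Searcher's source: `select_samples` is a hypothetical name, and the code assumes a tab-delimited VCF with the nine standard fixed columns (CHROM through FORMAT) followed by one column per sample.

```python
from typing import Iterable, Iterator, List, Optional

FIXED_COLUMNS = 9  # CHROM, POS, ID, REF, ALT, QUAL, FILTER, INFO, FORMAT


def select_samples(lines: Iterable[str], wanted: List[str]) -> Iterator[str]:
    """Keep the fixed VCF columns plus only the requested sample columns."""
    keep: Optional[List[int]] = None
    for line in lines:
        if line.startswith("##"):
            yield line  # meta-lines pass through unchanged
            continue
        fields = line.rstrip("\n").split("\t")
        if line.startswith("#CHROM"):
            names = fields[FIXED_COLUMNS:]
            missing = [s for s in wanted if s not in names]
            if missing:
                raise ValueError(f"samples not found in VCF: {missing}")
            # Column indices to keep, with samples in the order they were requested.
            keep = list(range(FIXED_COLUMNS)) + [
                FIXED_COLUMNS + names.index(s) for s in wanted
            ]
        if keep is None:
            raise ValueError("data line seen before the #CHROM header line")
        yield "\t".join(fields[i] for i in keep) + "\n"


# Hypothetical demo data with three samples; two are requested.
demo_lines = [
    "##fileformat=VCFv4.2\n",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tWE0001\tWE0002\tWE006\n",
    "1\t100\t.\tA\tG\t50\tPASS\t.\tGT\t0/1\t0/0\t1/1\n",
]
subset = list(select_samples(demo_lines, ["WE0001", "WE006"]))
print("".join(subset), end="")
```

Raising an error for an unknown sample name is a design choice of this sketch; silently skipping unknown names is the other obvious option.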

● To convert the numeric GT format to the letter GT format, press the ‘Convert GT’ button.
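The numeric-to-letter conversion can be illustrated for a single genotype. This is a sketch under assumptions rather than the tool's own logic: `gt_to_letters` is a hypothetical helper, it handles only the bare GT value (not the other colon-separated sample subfields), it keeps the original `/` or `|` separator, and it leaves a missing allele (`.`) untouched. Index 0 maps to REF and index n maps to the n-th comma-separated ALT allele, which covers the multiallelic case.

```python
import re


def gt_to_letters(gt: str, ref: str, alt: str) -> str:
    """Translate a numeric genotype such as '0/1' into allele letters."""
    alleles = [ref] + alt.split(",")  # index 0 = REF, 1..n = ALT alleles
    parts = re.split(r"([/|])", gt)   # the capture group keeps the separators
    converted = []
    for part in parts:
        if part.isdigit():
            converted.append(alleles[int(part)])
        else:
            converted.append(part)  # separator or missing allele '.' stays as-is
    return "".join(converted)


print(gt_to_letters("0/1", "A", "G"))      # A/G
print(gt_to_letters("3/2", "A", "C,G,T"))  # T/G
print(gt_to_letters("1|0", "AT", "A"))     # A|AT
print(gt_to_letters("./.", "A", "G"))      # ./.
```

An allele index that exceeds the number of ALT alleles raises `IndexError` here, which flags a malformed row instead of producing a wrong letter.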

🤝 Contributing

● Contributions, issues and feature requests are welcome. Feel free to check the issues page if you want to contribute.

Credits

● re-Searcher was created at the Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University.

📝 License

● Copyright 2019 © LBSB.

Contributors

● imdanique, labbandsb