Giter Site home page Giter Site logo

corona_mutations_analysis's Introduction

Analysis of COVID-19 virus sequences

Data Exploration:

Imported necessary libraries and read a CSV file containing metadata about COVID-19 sequences continuing with data cleaning and preprocessing on the metadata. Filtered the metadata to identify specific sequence types, such as RefSeq (reference sequences), Delta variant, and Omicron variant.

Downloading and Parsing Sequences

Defined a function download_sequence to retrieve nucleotide sequences from the NCBI database using accession numbers. The downloaded sequences were then parsed using the BioPython library.

Sequence Alignment

Used the Align module from BioPython to perform pairwise sequence alignment between the reference sequence and other variants and calculated percentage of similarity between the reference sequence and the first non-mutated sequence, Afterwards created a matrix to store alignment scores between all pairs of sequences to finall store them in a pandas DataFrame.

Mutations

Focused on the Omicron variant and performs a detailed alignment with the reference sequence. Mutations (insertions, deletions, and substitutions) between the two sequences were identified and highlighted using color-coded HTML formatting and visualised with the IPython.display module.

Translation

Finally, translated the Reference genome and the Omicron genome to their Amino-acid sequences with my https://bioinfotools.streamlit.app/ web-app, and stored them in txt file by eiting the Stop codons out, and separating the Amino-acides between them

corona_mutations_analysis's People

Contributors

kris96tian avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.