Giter Site home page Giter Site logo

Data Scientist

PhD in Computational Physics. 5+ years of experience in Data Science.

Data Science Skills

• Programming Languages: Python, R

• Statistical Analysis: Generalized linear models, multivariate regression, time-series analysis

• Machine Learning: Neural networks, support vector machines, random forests, boosting methods

• Data Integration and Management: SQL, handling multi-omic datasets (genomics, proteomics, transcriptomic)

• Data Visualization: ggplot2, Matplotlib, Seaborn

• Big Data and High-Performance Computing: Use of HPC clusters for large-scale data analysis

• Bioinformatics Tools: Bioconductor, Galaxy

• Natural Language Processing: Text mining, sentiment analysis

Analytical Skills

• Data preprocessing, normalization, and transformation

• Predictive modeling and algorithm development

• Network and pathway analysis, Differential gene expression analysis

Research and Project Experience

• Developed and implemented predictive models for clinical trial data analysis, improving early-phase trial insights.

• Conducted exploratory data analysis and visualized complex datasets to identify trends and patterns.

• Designed and executed experiments to test hypotheses and validate models.

Soft Skills

• Excellent written and verbal communication skills in English

• Collaboration in interdisciplinary and multicultural teams

• Independent project management and leadership

Additional Skills

• Linux systems, command-line tools

• Version control (Git)

• Deep learning frameworks (TensorFlow, Keras, PyTorch)

• Experience with relational databases and big data technologies

BNTechie's Projects

bioinformatics icon bioinformatics

Differential Expression Analysis of protein, Gene set enrichment analysis, Multi-omic factor analysis, Pathway analysis, WGCNA

data-preprocessing icon data-preprocessing

Creating empty dataframe, data normalization, Dimensionality reduction, Outlier detection, Overfitting of model and its solution, Remove column with zero values, Replace NA with zeros.

data-visualization icon data-visualization

Manhattan plot, Scatter plot, Venn diagram, Waterfall plot, histogram, Upsetplot, Correlation plot, etc

nlp icon nlp

Basic NLP tasks with Python, End-to-end sentiment analysis, Fake news detection, personality prediction, Span Filtering, Text classification, Topic modeling, Twitter sentiment analysis, Yelp review, etc with NLP

predictive-modeling icon predictive-modeling

Credit card fraud detection, Breast cancer prediction, Wine quality prediction, Bank note authentication, prediction of attrition of employees, Stock prediction, etc

regression_analysis icon regression_analysis

house price prediction, Comparison of Ml algorithm, Logistic regression, Multicollinearity, Multivariate regression analysis, Linear model with random effects, Robust regression

statistical-tests icon statistical-tests

Permutational Multivariate Analysis of Variance, Causal mediation analysis, PCA, adjusted p_values, correlation amon distance matrix, power analysis, exat t_test, Fourier transform

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.