Data Scientist (7+ years). Master's in Big Data Analytics and PhD in Computational Physics.
Data Science Skills
• Programming Languages: Python, R
• Statistical Analysis: Generalized linear models, multivariate regression, time-series analysis (scikit-learn, statsmodels, pandas, NumPy)
• Machine Learning: Neural networks, support vector machines, random forests, boosting methods (scikit-learn, PyTorch, Keras)
• Data Integration and Management: SQL, handling multi-omic datasets (genomics, proteomics, transcriptomics)
• Data Visualization: ggplot2, Matplotlib, Seaborn
• Big Data and High-Performance Computing: Use of HPC clusters for large-scale data analysis
• Bioinformatics Tools: Bioconductor, Galaxy
• Natural Language Processing: Text mining, sentiment analysis, topic modelling (NLTK, spaCy, BERT)
Analytical Skills
• Data preprocessing, normalization, and transformation
• Predictive modeling and algorithm development
• Network and pathway analysis, differential gene expression analysis
Research and Project Experience
• Developed and implemented predictive models for clinical trial data analysis, improving early-phase trial insights.
• Conducted exploratory data analysis and visualized complex datasets to identify trends and patterns.
• Designed and executed experiments to test hypotheses and validate models.
Soft Skills
• Excellent written and verbal communication skills in English
• Collaboration in interdisciplinary and multicultural teams
• Independent project management and leadership
Additional Skills
• Linux systems, command-line tools
• Version control (Git)
• Deep learning frameworks (TensorFlow, Keras, PyTorch)
• Experience with relational databases and big data technologies