[go: nahoru, domu]

Skip to content
View danon6868's full-sized avatar
Block or Report

Block or report danon6868

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
danon6868/README.md

Daniil Litvinov

Linkedin Badge Website Badge GitHub followers Gmail Badge Telegram Badge

I am a bioinformatician with an academic and commercial experience in python and R programming and in machine learning. I am really passionate about applying machine learning methods and statistics for analyzing Omics data and computer vision methods for cryo‑EM data analysis. In addition, I teach statistics and machine learning to students of biology and medicine. My primary goal is to continue developing my skills in the field of AI/ML/DL applied to biology and medicine.

Key projects 📝

LoReNNMap

Application. Source

Developed an algorithm for cryo-EM maps resolution estimation using deep learning.

  • Collected and prepared training data, pairs of electron density maps, and local resolution maps.
  • Developed and trained neural network model to estimate local resolution map based on electron density map using 3D‑UNet model architecture.
  • Based on the trained model, created a CLI tool as well as a web application for cryo‑EM maps resolution estimation using deep learning.
  • Skills: Python, Django, PyTorch, Keras, EMAN2, RELION-3, Scikit-learn, Bash.

Portfolio website

Application. Source

Python Django-based personal portfolio website. All content: personal information, portfolio projects, social media links, etc. can be adjusted in Wagtail admin.

  • Created a Django‑based personal portfolio website.
  • The website uses Wagtail CMS. Wagtail is a Django Content Management System.
  • Code from the attached repository can be used as a starting point for developing a production‑ready Django personal website with deployment to Heroku.
  • Skills: Python, Docker, JavaScript, CSS, HTML, Django, Wagtail, SQL.

Recommender system

Results. Source

Content-based recommender system API based on the text of the post and user data.

  • Created a content‑based recommendation system based on user data and post texts.
  • Built a web API using FastAPI to predict the most relevant posts for a user.
  • Added an A/B testing system to select the most appropriate recommendation model.
  • Skills: Python, Docker, SQL, PyTorch, Scikit-learn, CatBoost, FastAPI, Optuna, NLP, Transformers.

Developing best practices for single-cell analysis: data integration

Results. Source

The single-cell transcriptomics analysis has multiple steps, but we have focused on data integration — a crucial step when working with clinical data coming from patients.

  • Researched libraries for scRNA‑seq data analysis and batch effect removal.
  • Tested 8 algorithms on different data types and select the most suitable one for each case.
  • Created software for batch effect removal from scRNA‑seq data using the best algorithm for the current case.
  • Skills: Python, R, Scikit-learn, Scanpy, BBKNN, MNN, Scanorama, Cell Ranger, Bash.

Study of the transcriptome during intense exercises in highlands

Results. Source

This project aims to study differential genes expression of 19 sportsmen during physical and psychological stress before and after running in extreme highlands conditions.

  • Processed and evaluated the quality of raw reads.
  • Aligned processed reads on the human reference genome and counted gene and isoform expression levels.
  • Identify the list of differential expressed genes and their functional analysis as well as cluster analysis.
  • Skills: Python, R, DESeq2, FastQC, Bash, STAR, RSEM, MSigDB, GeneQuery.
My skills 🛠️

Programming

  • Python (Numpy, Pandas, Matplotlib, Seaborn, Sklearn, PyTorch, Keras, FastAPI, Django)
  • R (ggplot2, Seurat, DeSeq2, dplyr)
  • Linux, Bash, git, GitHub, Bitbucket, Docker, Kubernetes, Airflow, Jira
  • JavaScript

Machine Learning Methods

  • Classical Machine Learning (linear models, tree-based approaches, Catboost, LightGBM, XGBoost, Bayesian methods)
  • Deep learning (MLP, CNN, image segmentation, detection, RNN, LSTM, Transformers, AE, VAE, GAN, TabNet)
  • Model tuning (Optuna, genetic algorithm, Boruta)
  • Interpretable machine learning (SHAP, LIME, Pixel Attribution)

Statistics

  • Hypothesis testing, ANOVA
  • Survival analysis (Kaplan-Meier curves, Log rank test, Cox regression)
  • Casual inference (Propensity score, Instrumental variables, Regression discontinuity)

Bioinformatics

  • Databases (NCBI, UniProt, PDB, MsigDB, SILVA)
  • Command-line tools (Cellranger, cellSNP, Picard, BLAST, GATK, STAR, SPAdes)
  • Protein sequence analysis tools (MAFFT, MUSCLE, HMMER, ESM)
  • Protein structure analysis tools (Rosetta, Phenix, Coot, AlphaFold)

Languages

  • Russian – Native
  • English – Full professional proficiency
  • German – Elementary proficiency

Life Sciences

  • Biological education, which helps me to understand specialized biological and medical literature
  • Work experience in molecular, microbiological, and biochemical labs

Soft skills

  • Agile software development methods
  • Presentation skills

GitHub Streak

Pinned Loading

  1. CryoEM_Resolution_Estimation CryoEM_Resolution_Estimation Public

    This repository contains source code for cryoEM maps resolution estimation using neural networks (LoReNNMap).

    Jupyter Notebook 1

  2. portfolio-website portfolio-website Public

    Portfolio website built with Wagtail CMS

    Python 2

  3. immunomind/bi2021spring immunomind/bi2021spring Public

    A project for students from Bioinformatics Institute Spring 2021 class.

    2

  4. sky_runners sky_runners Public

    Project on the analysis of differential gene expression in athletes during periods of intense exercise.

    R 1

  5. recommender-system-api recommender-system-api Public

    Recommender system web API for text posts.

    Jupyter Notebook 2

  6. BI_Stat_2020 BI_Stat_2020 Public

    The repository contains projects that were carried out during the course "Statistics and Data Analysis in R" in the Bioinformatics Institute.

    HTML 1