My interest is in the area of large scale data analysis and containerization approaches in the areas of DNA microarrays, exomes, genomes, and transcriptomes for human patient and patient derived cell lines as well as model organisms such as mouse, rat, guinea pig and bacteria. Detailed breakdown of different analysis themes are below:
1) Complete somatic/germline variant calling and Copy number variation pipeline creation and analysis of Whole Genome, whole exome, targeted panels analysis, single cell DNA (scDNA) data.
2) Complete differential gene expression and pathways analysis pipeline for RNA-seq transcriptome, proteome and microarray data.
3) ChIP seq analysis to verify transcription factor binding to designated pathways.
4) Alignment and quality control long non-coding RNA (lncRNA) data from Pac-Bio long read sequencing.
5) Differential gene expression analysis of single cell RNA (scRNA) data from 10x method.