sk.sahu
  • About
  • Projects
  • Blog
← Back to Blog

Tag: Bioinformatics

9 posts tagged with Bioinformatics

November 15, 2023
variant-callingGATKDeepVariantgenomicsVCFbioinformaticsWGS

Variant Calling Pipelines: From Raw Reads to Annotated VCF

A complete walkthrough of short-variant calling using GATK4 HaplotypeCaller and DeepVariant — covering alignment, BQSR, variant filtering, and functional annotation with code throughout.

Read more
September 22, 2023
single-cellmulti-omicsscATAC-seqCITE-seqSeuratbioinformaticschromatin

Single-Cell Multi-omics Integration: Linking RNA, ATAC, and Protein Data

How to integrate scRNA-seq, scATAC-seq, and CITE-seq protein data using Seurat v5 and Muon — with a worked example covering alignment, joint embedding, and regulatory interpretation.

Read more
June 27, 2023
deep-learninggenomicspytorchgene-expressionbioinformaticsmachine-learning

Deep Learning for Genomics: Predicting Gene Expression with PyTorch

A practical walkthrough of building a neural network to predict gene expression from DNA sequence — covering data preparation, model architecture, training, and interpretation.

Read more
April 3, 2023
FAIRopen-sciencedata-managementbioinformaticsreproducibilitymetadata

FAIR Data Principles in Practice: A Bioinformatics Engineer's Guide

How to actually implement FAIR (Findable, Accessible, Interoperable, Reusable) data principles in bioinformatics projects — with concrete tools, metadata standards, and code.

Read more
February 14, 2023
single-cellRNA-seqRbioinformatics

Single-Cell RNA-seq: A Practical Overview

An accessible introduction to scRNA-seq analysis — from raw reads to cell clusters — with notes on tooling and common pitfalls.

Read more
December 19, 2022
gene-regulatory-networksGRNsingle-cellSCENICbioinformaticstranscriptomics

Gene Regulatory Network Inference: Methods, Tools, and Pitfalls

A practical guide to inferring gene regulatory networks from single-cell and bulk RNA-seq data — comparing GENIE3, SCENIC, and GRNBoost2 with real code and honest caveats.

Read more
November 8, 2022
snakemakebioinformaticsworkflowsreproducibility

Getting Started with Snakemake for Bioinformatics Workflows

A practical introduction to Snakemake — why it matters for reproducible bioinformatics and how to structure your first workflow.

Read more
September 5, 2022
nextflowbioinformaticspipelinesdockerreproducibilityHPC

Nextflow DSL2: Building Modular, Scalable Bioinformatics Pipelines

A deep dive into Nextflow DSL2 — how to design modular, reusable pipeline components that scale from a laptop to an HPC cluster or the cloud.

Read more
July 11, 2022
dockersingularitycontainersbioinformaticsreproducibilityCI/CDHPC

Containerising Bioinformatics Tools: Docker and Singularity Best Practices

A practical guide to building minimal, reproducible Docker images for bioinformatics tools — and converting them to Singularity for HPC use — with real Dockerfiles and CI/CD integration.

Read more

© 2026 Sangram Keshari Sahu