All Stories

Zero trust data pipeline framework

We have all experienced the pain of data loss or data incorrectness in our batch data pipelines. To address this, I designed a framework for ensuring data correctness at every stage o...

In journal, Feb 06, 2026

asyncio.to_thread

Optimizing Python Asyncio Performance with asyncio.to_thread

Sep 17, 2025

Model inference in seldon kubernetes cluster

In this blog post, I’m sharing my journey of installing Seldon Core from scratch on my Mac. I started by setting up a local Kubernetes cluster and installing all the necessary depende...

In journal, Mar 09, 2025

Synthetic data generation

In my current organization, I am working on a model to sort post-consumer clothes to make them easier to recycle into products of different grades.

In journal, Dec 27, 2024

Dockerize whisper model

Introduction: What is a Whisper model? Whisper is an open-source speech recognition model trained by OpenAI. It enables transcription in multiple languages, as well as translation from t...

In journal, Apr 13, 2024

tsai model inference

In journal, Apr 03, 2024

Training LLM

In journal, Oct 07, 2023

Tunable loss functions for binary classification problems

Paper: Xtreme Margin: A Tunable Loss Function for Binary Classification Problems

In journal, Jul 16, 2023

Image augmentation with imgaug

This is a code snippet for applying image augmentation to your training data.

In journal, Jul 04, 2023

Featured