Model inference in seldon kubernetes cluster

In this blog post, I’m sharing my journey of installing Seldon Core from scratch on my Mac. I started by setting up a local Kubernetes cluster and installing all the necessary dependencies, then tested everything with a scikit-learn Iris classification model. Although I followed the official guide, I ran into a few bumps along the way. Initiall... Read more 09 Mar 2025 - 5 minute read

Synthetic data generation

In my current organization, I am working on a model to sort post-consumer clothes to make them easier to recycle into products of different grades. For example, if a post-consumer cloth has buttons and zippers and its quality is not great, we use it to make products like table cloths etc. Since sorting is a critical step in the process, achiev... Read more 27 Dec 2024 - 7 minute read

Dockerize whisper model

Introduction What is a whisper model? Whisper is an open source speech recognition model trained by OpenAI. It enables transcription in multiple languages, as well as translation from those languages into English. Because Whisper was trained on a large and diverse dataset and was not fine-tuned to any specific one, it does not beat models that... Read more 13 Apr 2024 - 2 minute read

tsai model inference

Introduction: tsai is an open source deep learning package built on top of Pytorch and fastai focused on tasks like classification, regression, forecasting, imputation and many more. The aim of the blog post is to provide solution in performing inference of tsai trained classification model. The training method is simple, but I was facing pr... Read more 03 Apr 2024 - 2 minute read

Training LLM

Hi everyone, I have written this blog post to share steps and my knowledge on how to fine tune a Large language model at ease. Introduction I am using the huggingface library to fine tune the “bigscience/bloom-7b1” model using LoRA (Low rank adaptation of large language models). bigscience/bloom-7b1 is a BigScience Large Open science op... Read more 07 Oct 2023 - 7 minute read

Tunable loss functions for binary classification problems

Paper: Xtreme Margin: A Tunable Loss Function for Binary Classification Problems This is a paper summary generated from summarizepaper.com. I edited for better understanding. Introduction Loss functions are crucial in optimizing machine learning algorithms. The choice of loss function impacts the training process and model learning. Bi... Read more 16 Jul 2023 - 3 minute read

Image augmentation with imgaug

This is a code snippet to generate image augmentation on your training data. from imgaug import augmenters as iaa from skimage.io import imread_collection import cv2 import numpy as np # list .PNG files from the folder IMAGE_FOLDER_PATH = "/content/*.PNG" seq = iaa.Sequential([ iaa.Crop(px=(0, 16)), # crop images from each side by 0 to 1... Read more 04 Jul 2023 - less than 1 minute read

Google pegasus summarization model installation

In this blog post, I explain the steps that I followed to install google pegasus summatization model from the officical github page. I failed to complete the installation, since the pip package “tensorflow_text” failed to import, I will mention the error it threw. I cloned the github repository. I downloaded the pretrained model weights. ... Read more 25 Jun 2023 - 10 minute read

Convert decision tree to decision table

I was working on a project, where I wanted to analyze the decisions from the decision tree. I thought, converting a decision tree to decision table helps in the interpretation and analysis. There was just one stack-overflow solution for converting a decision tree to decision table, but it failed for my use case. Hence I wrote a parsing code to... Read more 16 Jan 2023 - 2 minute read