Blogs | Prashant Pandey

Jan 10, 2026

Dynamic Vision Transformer: Faster Inference

Vision Transformers usually run at a fixed image resolution. Easy images still go through the full high-resolution pipeline, wasting time and compute...

Nov 20, 2025

Implementing DeepSeek-OCR on Google Colab

DeepSeek recently released DeepSeek-OCR, the research paper focuses on vision text compression, the model can decode thousands of text tokens from few hundred vision tokens...

Sep 20, 2025

How Do LLMs Decide the Next Token?

Large Language Models (LLMs) like ChatGPT, Gemini, or Claude generate text one piece at a time. They don't write full sentences in one go...

Jan 30, 2025

Understanding Artificial Neural Networks (ANNs): A Beginner's Guide

Artificial Neural Networks (ANNs) are one of the most important concepts in machine learning and artificial intelligence...