blog work papers
Dynamic Vision Transformer
Jan 10, 2026

Dynamic Vision Transformer: Faster Inference

Vision Transformers usually run at a fixed image resolution. Easy images still go through the full high-resolution pipeline, wasting time and compute...

DeepSeek OCR
Nov 20, 2025

Implementing DeepSeek-OCR on Google Colab

DeepSeek recently released DeepSeek-OCR, the research paper focuses on vision text compression, the model can decode thousands of text tokens from few hundred vision tokens...

LLM Next Token
Sep 20, 2025

How Do LLMs Decide the Next Token?

Large Language Models (LLMs) like ChatGPT, Gemini, or Claude generate text one piece at a time. They don't write full sentences in one go...

Understanding ANN
Jan 30, 2025

Understanding Artificial Neural Networks (ANNs): A Beginner's Guide

Artificial Neural Networks (ANNs) are one of the most important concepts in machine learning and artificial intelligence...