Skip to content
View Sai21112000's full-sized avatar
:shipit:
…
:shipit:
…

Highlights

  • Pro

Block or report Sai21112000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Sai21112000/README.md

Hi, I'm Sai Teja Vaidya πŸ‘‹

Computer Vision Β· Deep Learning Β· Agentic AI Β· Edge ML
M.Eng. in ICT β€” Asian Institute of Technology, Thailand


I work at the intersection of computer vision and physical systemsβ€”where a model's output has real-world consequence. My thesis detected and measured individual oil palm trees from drone imagery across 8 altitude levels using YOLOv8, YOLOv11, SAM 2.1, and Mask R-CNN. I built an agent-in-the-loop annotation pipeline that reduced labeling time by 80% and a generative tiling algorithm that simulates multi-altitude flights from a single drone pass.

Recent focus: agentic AI systems that are auditable and constrained, edge vision models under strict latency and power budgets, efficient training of LLMs and small models, and local-first tools for knowledge work.

Open to AI Engineer, Computer Vision, Remote Sensing ML, and Agentic AI roles in Thailand, India or remote.


🌱 Currently Learning

  • Deep Reinforcement Learning (HuggingFace RL Course)
  • Langchain Academy Course
  • Kaggle Competetions / Hackathons

πŸ› οΈ Tech Stack

Python PyTorch TypeScript OpenCV HuggingFace ONNX QGIS Jekyll

🎯 Featured Projects

Dopamine.Diet

I built Dopamine.Diet: a local-first productivity system that hijacks your brain's love for checkboxes, streaks, and green squares β€” and redirects it toward deep work.

AI Thesis Agent Kit

Multi-agent orchestration system with 9 specialized agents, 6 immutable writing laws, and 90% confidence gate for hallucination control. Built during thesis at AIT to automate research documentation and inference.

OpenAI Parameter Golf Reproduction

Reproduced OpenAI's Parameter Golf challenge: H100 training on RunPod with torchrun, FineWeb dataset, and continuous val_bpb tracking. Focus on reproducibility under strict wall-clock constraints.

Qualcomm AI Hub LPCVC 2026 β€” Track 1

Image-to-text retrieval on XR2 Gen 2 proxy (edge device). Achieved Recall@10 β‰ˆ 0.73.

  • Full pipeline: model selection, ONNX export, hardware profiling, dataset curation, inference optimization
  • Demonstrates end-to-end edge ML deployment

Minimalist Blog

Zero-framework Jekyll blog with dark mode, image lightbox, giscus comments, RSS, keyboard navigation, and reading progress indicator. Built via AI pair programming; MIT licensed, open source.

Obsidian Plugin: Jekyll Publisher

One-click Obsidian-to-Jekyll converter with automated frontmatter, image copying, and intelligent filename generation. ~8KB, zero dependencies.

HTML Table Structured Data Scraper

Browser console utility for extracting structured data from HTML tables and forms. Zero external dependencies; designed for rapid prototyping and data collection.

Oil Palm Instance Segmentation (Thesis)

Detection, counting, and canopy biometry of individual oil palms from UAV imagery at multiple ground sample distances (0.03–0.20m).

  • Multi-model comparison: YOLOv8, YOLOv11, Mask R-CNN, SAM hybrids
  • Metrics: precision, recall, F1, IoU, crown geometry errors across 8 altitude levels
  • Agent-in-the-loop annotation (80% labeling time reduction) + generative tiling for synthetic multi-altitude data

πŸ”¬ Oil Palm AI Series β€” Thesis Blog

A technical series documenting the full research and deployment journey:

# Post Companion Repo
B1 The Drone That Couldn't See uav-gsd-scale-invariance
B2 Building the Dataset Nobody Had oil-palm-dataset-pipeline
B3 Six Models Enter, One Problem Wins oil-palm-instance-segmentation
B4 The Hybrid Paradox hybrid-yolo-sam-pipeline
B5 From Pixels to Meters canopy-biometry-calculator
B6 A Framework for Flying Smarter uav-deployment-guide

πŸš€ What I'm Optimizing For Next

  • Production-grade AI systems: Converting research prototypes into testable, deployable, and maintainable systems with clear evaluation metrics
  • Edge & on-device vision: Geospatial ML and vision models that run under strict latency, power, and memory constraints
  • Building Projects: Building productivity systems utilising OCR/TTS/RAG-Chatbots that keep data local and enable fast context-switching between projects
  • Reliable, auditable agents: Agentic AI with constraint-based reasoning, explicit control flow, and verifiable outputs (not just creative generation)

"The tension you feel is not a problem to solve. It is the process."

Pinned Loading

  1. sai21112000.github.io sai21112000.github.io Public

    Personal Blog - Documenting my journey

    HTML

  2. oil-palm-instance-segmentation oil-palm-instance-segmentation Public

    6-model comparison: YOLOv8, YOLOv11, Mask R-CNN, Hybrid-SAM on UAV imagery

    Jupyter Notebook