bharath03-a/README.md

Bharath Velamala

Data Engineer | Distributed Systems & AI Data Platforms


🚀 Executive Summary

I am a Data Engineer specializing in architecting massive-scale data infrastructure, streaming pipelines, and AI-integrated systems. I focus on building robust, fault-tolerant platforms using Rust, Python, and Google Cloud.

  • 🏗️ Currently: Architecting multi-tenant clinical data platforms with end-to-end ETL ownership, processing 1M+ weekly records at Centauri Health Solutions.
  • 🏆 Recently: Won 2nd Place at MLSys 2026 (Track B) for building a neurosymbolic multi-agent pipeline for tensor DAG scheduling, achieving a 7.77x speedup over baselines.
  • 🎓 Background: MS in Data Science from the University of Arizona | 2x Google Cloud Certified.

📊 Engineering Focus

| Domain | Key Technologies | Active Projects |
| --- | --- | --- |
| Distributed Systems | Rust, gRPC, Tokio | WAL Consensus, GitCortex |
| AI Data Platforms | Python, LLMs, GCP | Agentic Scheduler, MCP Server |
| Streaming Pipelines | Apache Beam, Spark | Clinical Data Platform (CLARION) |

🔬 Featured Systems & Architecture

GitCortex

A branch-aware code knowledge graph built with Rust, tree-sitter, KuzuDB, and gRPC. An MCP server exposes the graph to AI assistants so they can perform blast-radius analysis.
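As a rough illustration of what blast-radius analysis over a code graph involves, here is a minimal Python sketch. It is not the project's implementation (which is described as Rust with KuzuDB); the `blast_radius` function, the edge format, and the symbol names are all hypothetical. It walks "depends on" edges in reverse to find everything a change could affect.

```python
from collections import defaultdict, deque

def blast_radius(edges, changed):
    """Return every symbol that transitively depends on `changed`.

    `edges` is an iterable of (src, dst) pairs meaning "src depends on dst".
    """
    # Invert the edges so we can walk from a symbol to its dependents.
    dependents = defaultdict(set)
    for src, dst in edges:
        dependents[dst].add(src)

    affected = set()
    queue = deque([changed])
    while queue:
        node = queue.popleft()
        for dep in dependents[node]:
            # Skip symbols already visited and the changed symbol itself.
            if dep not in affected and dep != changed:
                affected.add(dep)
                queue.append(dep)
    return affected

# Hypothetical dependency edges, for illustration only.
EDGES = [
    ("api.handler", "service.load"),
    ("cli.main", "service.load"),
    ("service.load", "db.query"),
    ("db.query", "db.connect"),
]

if __name__ == "__main__":
    print(sorted(blast_radius(EDGES, "db.query")))
```

A breadth-first walk keeps the sketch simple; a graph database would typically express the same traversal as a variable-length path query.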

Agentic Scratchpad Scheduler (MLSys 2026 - 2nd Place)

Neurosymbolic multi-agent pipeline for tensor DAG scheduling on memory-limited hardware. An LLM proposes op fusion and traversal structures, while a deterministic solver enforces constraints to achieve a 7.77x aggregate speedup.
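The propose-then-verify split can be sketched in a few lines of Python. This is an assumption-laden toy, not the competition entry: the candidate orders stand in for LLM proposals, the memory model (an output stays live until its last consumer has run) is a simplification, and `is_valid_schedule` and `first_valid` are hypothetical names. It only checks proposals deterministically; it does not model op fusion.

```python
def is_valid_schedule(order, deps, sizes, memory_limit):
    """Deterministically check one proposed op order.

    deps:  op -> set of ops whose outputs it consumes
    sizes: op -> size of the op's output
    """
    # The proposal must schedule every op exactly once.
    if sorted(order) != sorted(deps):
        return False

    # Count how many consumers each output still has.
    consumers_left = {op: 0 for op in deps}
    for inputs in deps.values():
        for producer in inputs:
            consumers_left[producer] += 1

    done = set()
    live = 0  # total size of outputs currently held in memory
    for op in order:
        if not deps[op] <= done:
            return False  # an input has not been computed yet
        live += sizes[op]
        if live > memory_limit:
            return False  # peak memory constraint violated
        done.add(op)
        # Free inputs whose last consumer has just run.
        for producer in deps[op]:
            consumers_left[producer] -= 1
            if consumers_left[producer] == 0:
                live -= sizes[producer]
    return True

def first_valid(proposals, deps, sizes, memory_limit):
    """Return the first proposal that passes the checker, or None."""
    for order in proposals:
        if is_valid_schedule(order, deps, sizes, memory_limit):
            return order
    return None

# Hypothetical five-op DAG and output sizes.
DEPS = {"a": set(), "b": set(), "c": {"a"}, "d": {"b"}, "e": {"c", "d"}}
SIZES = {"a": 4, "b": 4, "c": 1, "d": 1, "e": 1}
```

The point of the split is that a proposer can be creative and occasionally wrong, because nothing it suggests is accepted without passing the deterministic check.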

Distributed Write-Ahead Log (WAL)

Engineered a Write-Ahead Log with full Raft consensus in Rust and Tokio. Features a crash-safe segment engine running at 429 MiB/s, log compaction, and Prometheus metrics tracking.
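One common way to make a log segment crash-safe is to frame each record with its length and a checksum, then stop replay at the first torn or corrupt record. The Python sketch below illustrates that general technique only; the record layout, `encode_record`, and `recover` are assumptions for illustration and say nothing about the actual Rust segment format.

```python
import struct
import zlib

# Assumed header layout: little-endian payload length, then CRC32 of the payload.
HEADER = struct.Struct("<II")

def encode_record(payload):
    """Frame a payload as header + payload bytes."""
    return HEADER.pack(len(payload), zlib.crc32(payload)) + payload

def recover(segment):
    """Replay a segment, returning the payloads of all intact records.

    Replay stops at the first record that is truncated or fails its
    checksum, which is how a partially written tail is discarded.
    """
    records = []
    offset = 0
    while offset + HEADER.size <= len(segment):
        length, crc = HEADER.unpack_from(segment, offset)
        start = offset + HEADER.size
        payload = segment[start:start + length]
        if len(payload) < length or zlib.crc32(payload) != crc:
            break  # torn write or corruption: ignore the rest
        records.append(payload)
        offset = start + length
    return records

if __name__ == "__main__":
    segment = encode_record(b"set x=1") + encode_record(b"set y=2")
    # Simulate a crash that cut off the last byte of the final record.
    print(recover(segment[:-1]))
```

A real engine would also fsync at defined points and handle segment rollover, which this sketch leaves out.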

Dataflow Template MCP Server

An MCP server and CLI allowing AI assistants to generate ready-to-extend Apache Beam pipeline templates with a single command, standardizing CI/CD and deployment for GCP Dataflow.


⚙️ Core Engineering Stack

Languages & Frameworks: Python, Rust, Java, SQL

Data Infrastructure & Streaming: Google Cloud, Apache Spark, Apache Kafka, Apache Airflow

Systems & Deployment: PyTorch, Kubernetes, Docker, Git

Pinned

  1. Gun-Violence-Incidents (Public)

    Jupyter Notebook 2

  2. MultilabelImageClassification (Public)

    Deep Learning

    Jupyter Notebook 1

  3. OCR2LaTeX (Public)

    Python 1

  4. SpotifyDataWarehousing (Public)

    Python