Open to new opportunities

Sami Koneru-Ansari

Building ML systems from the ground up.

EECS at UC Berkeley. Currently leading engineering at DreamIt and researching low-resource language models at TokenWorks — with CUDA kernels and from-scratch paper reimplementations on the side.

Python · C++ · CUDA · PyTorch · Rust · TypeScript · React · Node.js · SQL · LangChain · Docker · AWS

Education

University of California, Berkeley

B.S. Electrical Engineering and Computer Science · 4.0 GPA

Relevant coursework

  • Data Structures and Algorithms
  • Machine Learning
  • Machine Structures
  • Signal Processing
  • Computer Networks
  • Discrete Mathematics
  • Probability Theory
  • Linear Algebra

Work experience

Founding Software Engineer
DreamIt · Berkeley, CA
Feb 2026 — Present
  • Directing a 3-person engineering team to architect and ship end-to-end Rust features for an early-stage no-code gaming app, scaling the platform to support 1,400+ users and public game deployments.
  • Architected an async Claude API pipeline for end-to-end game generation from prompts in under 2 minutes.
Rust · Claude API
Machine Learning Research Intern
TokenWorks · Remote
Feb 2026 — Present
  • Fine-tuning open-source models such as the ByT5 transformer to detect and translate low-resource languages in 200,000+ OCR-processed documents using aligned corpora from Wikidata and previous OCR runs.
  • Building a containerized Wikibase with Docker to serve a 1,400-item knowledge graph of the Sumerian lexicon.
PyTorch · ByT5 · Docker · Wikibase
Machine Learning Engineer
GetGreen (Contract) · Remote
Sept 2025 — Dec 2025
  • Engineered a Retrieval-Augmented Generation chatbot using LangChain for LLM orchestration and ChromaDB for vector storage and semantic search to deliver environmental answers in under 3 seconds.
  • Scraped and processed 1,000+ articles with BeautifulSoup, Pandas, and SQL for a model retrieval corpus.
LangChain · ChromaDB · Pandas · SQL
Software Development Intern
California Department of Technology · Rancho Cordova, CA
May 2024 — July 2024
  • Built full-stack web apps with HTML/CSS/JS for the frontend, Node.js for the backend, PostgreSQL for relational data, and JWT authentication to manage scheduling and file sharing for 100+ employees.
  • Managed datasets of 500+ employees and 70+ clients using SQL and Excel to drive operational reporting.
Node.js · PostgreSQL · JWT · SQL

Featured projects

Custom Digit Classifier

Machine Learning

CNN built from scratch in C++/CUDA with hand-written backprop, im2col, and GEMM kernels. 99%+ accuracy on MNIST at roughly 10% of the throughput of an equivalent PyTorch/cuDNN model on T4 GPUs.

C++ · CUDA

GPT-2 from scratch

Machine Learning

124M-parameter decoder-only transformer with causal self-attention and learned positional encodings. Optimized for multi-GPU CUDA with minimal CPU-GPU synchronization, yielding an 11× speedup over the unoptimized baseline.

PyTorch · CUDA · Python

Currently

  • Building: End-to-end Rust features at DreamIt
  • Researching: Low-resource language models with ByT5 at TokenWorks
  • Writing: GPU kernels in C++/CUDA on the side

Beyond code

  • Chess
    Built my own engine — negamax + alpha-beta search.
  • Game engines
    Procedural world generation in Rust + Bevy ECS.
  • From-scratch ML
    Reimplementing GPT-2, transformers, and CNNs by hand.

Get in touch

Always happy to chat about ML, RL, game engines, or anything you're building. Drop me a line and I'll get back to you.

samikoneruansari@gmail.com