Nowadays I focus on low-level deep learning - ML compilers and systems
My fav tech: C++, CUDA, Python, PyTorch, Linux, ML compilers, LLMs, Rust, WebGPU and TypeScript
I'm a regular author for Paged Out! magazine, which is a cool nerdy zine for things like hacking, programming and everything computer-related
I like math and I try to find time every week to learn something new. I'm currently slowly working through "How to Prove It" by Daniel Velleman (an introductory undergrad math book)
I want to be a full-time research scientist and do research at the intersection of math, AI and low-level systems
My name is spelled yenjay in English and 延杰 in Chinese
Night job:
tiny-vllm - High-performance LLM inference engine in C++ and CUDA, a younger sibling of vLLM, with continuous batching and paged attention
torch-webgpu - PyTorch compiler and WebGPU runtime
I got a bachelor's degree in computer science from Wrocław University of Science and Technology. I'd like to get back to university one day and do a master's/PhD in math or computer science! :D