Writing - Zhenlin Wang

2026.06.07

10 min

Agents Do Not Need To Train To Learn

The model weights can stay frozen while an agent learns through memory, rules, commands, skills, logs, and the workspace around it.

2026.06.07

8 min

The Missing Fence Between AI Plans And PR Stacks

AI agents can produce plausible plans quickly, but teams need a small review surface before those plans harden into a stack of PRs.

2026.05.25

7 min

Code Is Cheap, Show Me The Idea

AI made the first demo cheaper. The scarce thing now is the idea the model would not have found on its own.

2026.05.09

3 min

The AI-Era Newspaper Is A Feed You Have To Interpret

I never became a newspaper person, but AI gave me a stranger version of the habit: reading releases, benchmarks, replies, and changelogs until the field starts to become legible.

2026.05.08

4 min

Personalized AI Agents And The Visual Novel Design Stack

Personalized agents do not just need a warmer prompt. They need earned continuity: memory, callbacks, pacing, and boundaries the user can see.

2026.05.07

4 min

Vibe Researching Is Not Outsourcing Your Thinking

Claude made parts of a multi-agent RL project faster, but the hard part stayed mine: deciding where agents synchronize and what claims were earned.

2026.05.06

4 min

Writing Technical Blogs After AI

AI can generate explanations on demand. The posts I still want to publish need to carry judgment, contact with reality, and a way of thinking.

2026.05.05

1 min

Heron Event-Driven MARL

A domain-agnostic MARL framework for evaluating trained policies under heterogeneous event-driven execution and realistic observability constraints

2024.10.24

1 min

ResumeAssist

A private, configurable AI resume assistant with LangChain agents, CV rendering, and a full-stack review workflow

2024.09.20

1 min

LLM Validator

A configurable LLM benchmarking template for repeatable model, prompt, dataset, and metric validation

2024.07.27

5 min

Model Iteration Series: Validating Model Infra

Infrastructure checks for LLM model changes before QA: compatibility, latency, and cost.

2024.07.11

8 min

Model Iteration Series: Validating Model Research

How to validate LLM model-change proposals before they move into infra, QA, and product testing.

2024.06.23

1 min

Deployable AI

A lightweight local inference-serving toolkit for registering models and exposing prediction endpoints quickly

2024.06.01

5 min

Model Iteration Series: Intro

A practical overview of how LLM model changes move from research validation to staging and production.

2024.04.19

6 min

Testing in Machine Learning

A practical checklist for testing data, models, ML systems, and CI/CD pipelines.

2024.03.20

7 min

Prompt Engineering Whitebook

A practical handbook for designing, testing, and debugging prompts for LLM applications.

2024.03.14

4 min

A Good Python Project Template to Use as a Starting Point

A practical Python project scaffold for packaging, testing, linting, documentation, and CI.

2024.03.09

5 min

Writing Quality Code for Machine Learning

Practical notes on turning ML code from proof-of-concept scripts into maintainable systems.

2024.03.08

7 min

MLOps Post-Training Considerations

A practical overview of experiment tracking, model registries, serving, and monitoring after model training.

2024.03.04

6 min

Understanding Distributed Training in Deep Learning

A practical map of data parallelism, model sharding, pipeline parallelism, launch tools, and the bottlenecks that usually decide whether distributed training is worth it.

2024.03.02

3 min

Some Tricks in Real-World Machine Learning Engineering

Practical notes on moving from notebooks to pipelines, handling missing values, scaling features, encoding categories, and keeping ML code production-friendly.

2024.02.23

4 min

Quantization in Deep Learning

A practical guide to post-training quantization, quantization-aware training, mixed precision, and low-bit model deployment.

2024.02.19

6 min

Deep Learning Training: A Practical Guide

A practical guide to optimizer choice, learning-rate schedules, stability, memory pressure, throughput, checkpointing, and experiment management during deep learning training.

2024.02.19

4 min

Fine-Tuning in LLMs

A practical overview of supervised fine-tuning, LoRA, prompt tuning, adapters, RLHF, DPO, data quality, and evaluation for large language models.

2024.02.17

5 min

Starting Your AI/ML Project: From Research to Engineering

A practical checklist for turning an AI or ML idea into an engineering project with clear goals, data contracts, evaluation, infrastructure, and operational risk management.

2024.02.17

2 min

Testing Machine Learning Systems

A compact guide to unit tests, data tests, model behavior tests, evaluation, regression tests, and production checks for machine learning systems.

2024.02.10

4 min

Deep Learning System Design: A Checklist, Part II

A practical checklist for the production side of deep learning systems: packaging, deployment, serving, monitoring, logging, and model operations.

2024.02.09

5 min

Deep Learning System Design: A Checklist, Part I

A practical checklist for the early stages of a deep learning system: data, modeling, evaluation, training, and experiment tracking.

2023.12.15

4 min

Neural Network Applied: Optimizer Selection

A practical guide to choosing SGD, momentum, RMSProp, Adam, AdamW, and related optimizers for neural network training.

2023.07.29

1 min

Needle: High-performance DL System

A Deep Learning framework with customized GPU and CPU backend in C++ and Python

2023.03.04

1 min

Motion Prediction with Guided Diffusion

Researched and developed a classifier-free guidance-based latent diffusion model for autonomous vehicle motion forecasting using UNet and Transformer as backbones

2022.11.13

1 min

Starlink Tracking

A small D3.js-powered Satellite Tracking visualization

2022.08.30

1 min

Travel Planner

A GPT-powered web application to help users automate travel plan suggestion, generation and archiving

2022.08.30

1 min

Twitch+

A Search & Recommendation Engine for Twitch Streaming Video Resources

2022.08.20

2 min

About

About Me I\'m Zhenlin Wang Criss . I graduated as an MS student from Machine Learning Department @ Carnegie Mellon University https://www.ml.cmu.edu/ . I have a strong passion for ...

2022.01.25

1 min

Code Analyzer

C++ based code static program analyzer

2021.11.01

2 min

More on Model Deployment

A practical overview of model deployment patterns, artifact promotion, online and batch serving, rollout strategies, rollback, and production monitoring.

2021.10.14

2 min

Variational Inference

A practical overview of variational inference, approximate posteriors, KL divergence, ELBO, mean-field assumptions, and the connection to VAEs.

2021.07.21

2 min

Stats in ML: Dirichlet Distribution

A practical explanation of the Dirichlet distribution, its relationship to categorical probabilities, conjugacy with the multinomial, and why it appears in topic models.

2021.06.22

2 min

An Overview of Hidden Markov Models and Their Algorithms

A practical overview of hidden Markov models, including states, observations, transition probabilities, emission probabilities, forward-backward, Viterbi, and Baum-Welch.

2021.05.03

1 min

SmartMall Discounted Electronic Shopping

A robust online shopping app with various middlewares serving the microservices architecture

2021.02.04

2 min

Ensemble Models: Boosting Techniques

A practical explanation of boosting, including AdaBoost, gradient boosting, XGBoost-style regularization, learning rate, tree depth, and common pitfalls.

2021.02.03

2 min

Variational Autoencoder (VAE)

A practical explanation of variational autoencoders, latent variables, the encoder-decoder structure, reconstruction loss, KL regularization, and sampling.

2021.02.01

1 min

Ensemble Models: Bagging Techniques

A practical explanation of bagging, bootstrap sampling, random forests, out-of-bag evaluation, variance reduction, and when bagging helps.

2021.01.28

1 min

Reinforcement Learning: Theoretical Foundations, Part IV

A practical introduction to policy-gradient methods, stochastic policies, the policy-gradient objective, baselines, and variance reduction.

2021.01.20

1 min

Reinforcement Learning: Theoretical Foundations, Part V

A practical overview of actor-critic methods, deep reinforcement learning, replay buffers, target networks, PPO-style updates, and evaluation concerns.

2021.01.18

2 min

Ensemble Models: Overview

A practical overview of ensemble learning, including bagging, boosting, stacking, voting, variance reduction, bias reduction, and common tradeoffs.

2021.01.07

1 min

Reinforcement Learning: Theoretical Foundations, Part III

A practical guide to dynamic programming, policy evaluation, policy improvement, value iteration, and Q-learning.

2021.01.05

1 min

Reinforcement Learning: Theoretical Foundations, Part II

A practical explanation of Markov decision processes, transition dynamics, rewards, policies, value functions, and Bellman equations.

2021.01.04

1 min

Reinforcement Learning: Theoretical Foundations, Part I

A practical introduction to reinforcement learning concepts: agent, environment, state, action, reward, policy, return, and the exploration-exploitation tradeoff.

2020.11.13

3 min

Gradient Descent Algorithm and Its Variants

A practical explanation of gradient descent, stochastic gradient descent, mini-batch training, momentum, adaptive learning rates, and common optimization issues.

2020.09.04

1 min

SQL: Going Into Applications With MySQL and MongoDB

A practical comparison of relational and document databases in application development, with notes on schemas, queries, transactions, and data modeling.

2020.09.03

2 min

The Data Mining Trilogy III: Analysis

A practical guide to exploratory data analysis: distributions, relationships, missingness, outliers, leakage checks, segmentation, and communicating findings.

2020.08.31

1 min

SQL: Index and Optimization

A concise guide to SQL indexes, query plans, filtering, joins, aggregation, and practical optimization habits.

2020.08.15

1 min

SQL: Pick Up the Basics Within a Day

A concise SQL primer covering SELECT, filtering, joins, aggregation, subqueries, inserts, updates, deletes, and practical query habits.

2020.08.07

1 min

CLI-nic

Java-based medical resource management application

2020.07.01

2 min

Database Intro

A compact introduction to databases, covering relational systems, NoSQL systems, transactions, schemas, indexes, and when each storage pattern fits.

2020.06.13

2 min

A Regex Tutorial

Learn to process string operations in an efficient way

2020.05.01

2 min

A Fundamental Course for Data Engineering

A practical introduction to data engineering fundamentals: ingestion, storage, batch and streaming processing, orchestration, data quality, governance, and serving.

2020.04.13

2 min

Apache Spark: Only the Simple Answer

A simple explanation of Apache Spark: distributed data processing, DataFrames, lazy evaluation, transformations, actions, partitioning, and common performance mistakes.

2020.04.05

2 min

Recommender Systems III: Deep Learning Methods

A practical overview of deep learning methods for recommender systems, including embeddings, two-tower retrieval, ranking models, sequence models, and evaluation.

2020.04.04

1 min

Recommender Systems II: Factorization Machines

A practical explanation of factorization machines, feature interactions, sparse data, and why they are useful in recommendation and click prediction.

2020.04.01

2 min

Recommender Systems I: Content-Based and Collaborative Filtering

A practical introduction to content-based recommendation, collaborative filtering, user-item matrices, similarity, cold start, and evaluation.

2020.03.04

3 min

Dimensionality Reduction: Life Savers

A practical guide to dimensionality reduction, including PCA, t-SNE, UMAP, autoencoders, feature selection, and how to choose the right method.

2020.02.15

1 min

Unsupervised Learning: Measures About Clustering

A practical guide to evaluating clustering with silhouette score, Davies-Bouldin index, Calinski-Harabasz score, external labels, stability, and business usefulness.

2020.02.11

1 min

Clustering: Apriori

A practical introduction to Apriori and association rule mining, including support, confidence, lift, frequent itemsets, and market-basket analysis.

2020.02.09

1 min

Clustering: Affinity Propagation

A concise explanation of affinity propagation, exemplars, similarity messages, preferences, damping, and practical limitations.

2020.02.06

1 min

Clustering: DBSCAN

A practical guide to DBSCAN, density-based clustering, epsilon, minimum samples, noise points, and when density clustering is useful.

2020.02.05

1 min

Clustering: Hierarchical, BIRCH, and Spectral

A practical overview of hierarchical clustering, BIRCH, and spectral clustering, with guidance on when each method is useful.

2020.02.01

1 min

Clustering: K-Means and Gaussian Mixture Models

A practical comparison of K-means and Gaussian mixture models, including assumptions, distance, soft assignments, initialization, and evaluation.

2020.01.02

2 min

An Overview of Big Data Analytics

A concise overview of big data analytics: batch and streaming workloads, descriptive and predictive analysis, data quality, storage, compute, and communication.

2019.12.20

3 min

A Brief Intro to A/B Testing

A practical introduction to A/B testing: hypotheses, metrics, randomization, sample size, statistical significance, and common experiment pitfalls.

2019.09.01

2 min

The Data Mining Trilogy II: Cleaning

A practical guide to cleaning data: missing values, duplicates, outliers, inconsistent categories, schema checks, and reproducible cleaning pipelines.

2019.08.25

2 min

The Data Mining Trilogy I: Preparation

A practical overview of data preparation: defining the problem, collecting data, building schemas, splitting datasets, and preventing leakage.

2019.07.21

1 min

Topic Modeling With Latent Dirichlet Allocation

A practical introduction to topic modeling with LDA, including bag-of-words, document-topic distributions, topic-word distributions, preprocessing, and evaluation.

2019.06.25

3 min

Hyperparameter Tuning

A practical guide to hyperparameter tuning with search spaces, validation design, random search, Bayesian optimization, early stopping, and experiment tracking.

2019.06.04

2 min

Feature Selection and Model Selection

A practical guide to selecting features, choosing models, avoiding leakage, comparing validation results, and balancing accuracy with complexity.

2019.06.01

1 min

Some Supervised Learning Models

A practical overview of supervised learning models, including linear models, trees, ensembles, support vector machines, nearest neighbors, and neural networks.

2019.05.30

1 min

Regression Models: GAM, GLM, and GLMM

A concise explanation of generalized linear models, generalized additive models, and generalized linear mixed models, with guidance on when to use each.

2019.05.29

2 min