Model Iteration Series: Validating Model Infra
The system and product side of concerns
A searchable archive of 72 posts on machine learning, data systems, software engineering, and projects.
The system and product side of concerns
The first line of defense to robust model iteration
An overview of industry-level model iteration procedure
A holistic approach to evaluate machine learning code
A copy of what I learned and gathered about prompting, both from online and from work
A summary of many masterminds' projects style
Draft notes on proof-of-concept ML code, separation of concerns, configuration, legacy models, and type/documentation hygiene.
An Overview of how to design a full-stack Deep Learning System
What's important in industry AND research, not taught in school???
Draft notes on feature engineering, notebook conversion, missing values, scaling, and categorical encoding.
A must-know technique to save your computation resources
Training can be hard a lot of time. But the bottlenecks vary
Draft notes on LoRA, prompt tuning, adapter tuning, RLHF, and DPO.
Things you need to do before starting a real-world AI/ML engineering project
Draft notes on software tests, model evaluation, and model behavior tests.
An Overview of how to design a full-stack Deep Learning System
An Overview of how to design a full-stack Deep Learning System
Choose a good optimization strategy is as important as selecting the right model
A Deep Learning framework with customized GPU and CPU backend in C++ and Python
Researched and developed a classifier-free guidance-based latent diffusion model for autonomous vehicle motion forecasting using UNet and Transformer as backbones
A small D3.js-powered Satellite Tracking visualization
A GPT-powered web application to help users automate travel plan suggestion, generation and archiving
A Search & Recommendation Engine for Twitch Streaming Video Resources
About Me I\'m Zhenlin Wang Criss . I graduated as an MS student from Machine Learning Department @ Carnegie Mellon University https://www.ml.cmu.edu/ . I have a strong passion for ...
C++ based code static program analyzer
Placeholder draft note preserved from the previous repository.
A useful inference method for distributional approximation
A 'popular' distribution not widely known
Learning stochastic process with more details
A robust online shopping app with various middlewares serving the microservices architecture
You know XGBoost, but do you KNOW XGBoost?
Combine variational inference with representation learning
Gather all we have!
RL Continued - Value Function Approximation
RL Continued - Policy Gradient
Using the power of public
RL Continued - Model-free algorithms
RL Continued - Dynamic Programming
Finally the most intriguing part (to me)
The most important technique in neural networks (as of now)
Going back to the technical details of what and how for popular SQL and NoSQL dbms
Let the plot fly ~
A short guide on optimizing query performances
The best way to get the memory back is by looking at the code themselves
Java-based medical resource management application
A brief introduction to database system
Learn to process string operations in an efficient way
All you need for understanding industrial-level data
Some conceptual understanding of spark and big data. Honestly, this just scratches the surface
Applications of Deep learning in recommender systems
Matrix factorization will save us from sparsity
Welcome to the world the recommenders
We must dispel the curse of dimensionality!
We have clusters, and then?
Association Rule realized via inference
Affinity: voting with democracy
Graphical methods continued...
The nitty-gritty of 'brute-force'
The basic models for clustering
Know what data analysis is and for
A Recap of Fundamental Concept: A/B Testing
The first step when you get some data: clean it up!
Know how to get data from various sources and load them successfully
LDA using a great yet less popular distrbution
Unicorn is when your horn outshines others
A rudimentary method forgotten by many practitioners nowadays
So, give me a taste of supervised learning
A brief introduction to generalized regression
How to determine if a model is truly good
Logit Model: the simple becomes the powerful
Shallow and Deep Linear Regression
A Relationship Management System