从梯度下降到 Semantic ID:理解 RQ-VAE 所需的全部前序知识
一篇文章补齐从机器学习基础、深度学习、Transformer/LLM、搜广推系统架构到向量量化的全部知识链——读完后直接进入 RQ-VAE Semantic ID 训练那篇文章,不会有任何理解断裂。
Algorithm Engineer. System Builder. AI Explorer.
Interested in
I'm an algorithm engineer at a leading digital marketing group, where I design and build real-time bidding systems, ML model serving pipelines, and budget optimization algorithms for programmatic advertising at scale. My day-to-day involves Go and TensorFlow Serving — turning ad auction math into production models that handle millions of bid requests.
On the side, I run an AI infrastructure project: an LLM API gateway aggregating 40+ model providers, a lightweight agent framework, and a service quality monitoring system built on real-token probing. I care about systems that actually work under load — not just demos.
My path: from search-ads-rec system architecture to algorithm research. Currently exploring LLM4Rec and unified sequence modeling for large-scale recommendation — where transformer architectures meet feature interaction in conversion prediction. I believe the best way to understand a system is to build it yourself.
Agent Harness Observability — detect errors, context rot, and regressions in AI agent systems.
A native macOS voice-to-text app — press Fn, speak, and polished text lands at your cursor in any app.
A production-ready multi-agent platform with sandboxed execution, budget control, and observability.
A Claude Code skill that generates daily AI/tech intelligence reports from Hacker News and HuggingFace Papers.
A Claude Code skill that generates importable Excalidraw architecture diagrams from source code.
一篇文章补齐从机器学习基础、深度学习、Transformer/LLM、搜广推系统架构到向量量化的全部知识链——读完后直接进入 RQ-VAE Semantic ID 训练那篇文章,不会有任何理解断裂。
把码本向量维度改小,利用率飙到 90%+——这说明模型学好了吗?从码本崩塌到 Scaling Law,深入拆解 RQ-VAE Semantic ID 训练中的七个核心工程权衡。
给你 16GB 显存的 GPU,你能部署多大的模型?从显存计算、存储层级、Roofline Model 到量化策略,按 Bloom 认知分类法建立 GPU 推理部署的完整认知框架。