Skip to content

Reference

Lookup material — abbreviations, bibliography, and reading order suggestions.

Pages

  • abbreviations — quick lookup for acronyms and shorthand used throughout
  • bibliography — citations to papers, NVIDIA docs, kernel-library source, and notable blog posts

For a glossary of terms (full prose definitions), see overview/glossary.

Suggested reading orders

"I want to understand why my MoE model is slow on workstation Blackwell"

  1. overview/architecture — the central thesis
  2. blackwell/sm100-vs-sm120 — the hardware difference
  3. interconnect/nvlink-vs-pcie — the bandwidth difference
  4. interconnect/moe-parallelism — how MoE expects to use that bandwidth
  5. compatibility/ep-to-tp-rewriting — the fix

"I want to understand the GPU itself, end-to-end"

  1. fundamentals/gpu-execution-model
  2. fundamentals/memory-hierarchy
  3. fundamentals/cuda-pipeline
  4. fundamentals/tensor-cores
  5. fundamentals/number-formats
  6. blackwell/ (entire section)

"I want to learn about the kernel libraries"

  1. fundamentals/ (entire section)
  2. kernels/ (entire section, in order)
  3. compatibility/translating-tcgen05 — when you need to port one
  4. compatibility/smem-budget-management — the constraint side

"I just want a TL;DR"

overview/architecture. Read that page; it summarizes the whole wiki in 5 minutes.