← tt-awesome / 🪐 Planet Tenstorrent
June 2026
May 2026
🎥 video affiliated May 13, 2026
FreeCiv powered by Tenstorrent hardware
tt-tinkering — YouTube

Kernels written by and for tt-lang are loaded to Blackhole chips on the Tenstorrent Quietbox 2, powering all forms of "AI" in the open source Civilization clone, FreeCiv. Look at all those fish!

📝 article community May 13, 2026
Tenstorrent Blackhole Architecture Guide
anuraagw.me — February 2026

A 6,500-word community deep dive into the Blackhole p100a architecture: the tile model (Tensix, DRAM, SiFive x280 L2CPU, Ethernet, PCIe, NoC arc), firmware startup sequence, MOP micro-op processor, replay buffer, FPU/SFPU sync, and the anatomy of a kernel. From the author of blackhole-py.

🎥 video community May 13, 2026
Tenstorrent Architecture — W&M CSCI654 Advanced Computer Architecture
Lecture 20 — Tenstorrent Architecture (YouTube)

Lecture 20 from William & Mary's graduate Computer Architecture course. Frames Tenstorrent in the landscape between GPUs and TPUs, draws comparisons to Cerebras and SambaNova, then dives deep into the Wormhole chip and Tensix core: the 5 RISC-V core design, SFPU, NoC, and dataflow execution model.

📝 article official May 13, 2026
tt-llk
Top-level architecture overview

Tenstorrent Low-Level Kernels: the C++ library that directly programs the RISC-V cores inside each Tensix compute engine. TRISC0 (unpack), TRISC1 (math/FPU/SFPU), and TRISC2 (pack) are all programmed through this layer — it is the interface between TT-Metal kernel code and bare silicon.

📄 paper community May 12, 2026
Stencil Computations on Tenstorrent Wormhole
arXiv:2605.07599

Maps 2D 5-point stencil computations onto the Tenstorrent Wormhole RISC-V AI dataflow accelerator via two implementations: element-wise decomposition (Axpy) and matrix-multiplication reformulation (MatMul). Profiling shows the isolated Wormhole kernel is competitive with CPU execution, with PCIe transfers and initialization driving end-to-end overhead; Axpy achieves lower energy than the CPU baseline at large scales. Identifies architectural and software directions for making AI accelerators viable for HPC stencil workloads. 2025.

🎤 talk community May 8, 2026
tt-boltz
FOSDEM 2026 — Drug Discovery on Tenstorrent Hardware

Boltz-2 biomolecular model for drug discovery on Tenstorrent Blackhole. Supports single-card and multi-card configurations — QuietBox (4×) and Galaxy (32×). Approaches physics-based FEP accuracy at 1000× the speed.

📝 article community May 8, 2026
Tenstorrent SFPU Kernel Series — Jason Davies
Optimal "where" on Tenstorrent

Sponsored series of deep technical articles on implementing optimal SFPU kernels for the Tenstorrent Wormhole and Blackhole vector units. Covers where, typecasting, 16/32-bit integer multiplication, cube root, and accurate sin/cos/tan — with cycle counts, assembly walkthroughs, and Blackhole vs Wormhole comparisons throughout.

📝 article community May 8, 2026
Tenstorrent SFPU Kernel Series — Jason Davies
32-bit Integer Multiplication on Tenstorrent

Sponsored series of deep technical articles on implementing optimal SFPU kernels for the Tenstorrent Wormhole and Blackhole vector units. Covers where, typecasting, 16/32-bit integer multiplication, cube root, and accurate sin/cos/tan — with cycle counts, assembly walkthroughs, and Blackhole vs Wormhole comparisons throughout.

📝 article community May 8, 2026
Tenstorrent SFPU Kernel Series — Jason Davies
16-bit Integer Multiplication on Tenstorrent

Sponsored series of deep technical articles on implementing optimal SFPU kernels for the Tenstorrent Wormhole and Blackhole vector units. Covers where, typecasting, 16/32-bit integer multiplication, cube root, and accurate sin/cos/tan — with cycle counts, assembly walkthroughs, and Blackhole vs Wormhole comparisons throughout.

📝 article community May 8, 2026
Tenstorrent SFPU Kernel Series — Jason Davies
Accurate sin/cos/tan on Tenstorrent

Sponsored series of deep technical articles on implementing optimal SFPU kernels for the Tenstorrent Wormhole and Blackhole vector units. Covers where, typecasting, 16/32-bit integer multiplication, cube root, and accurate sin/cos/tan — with cycle counts, assembly walkthroughs, and Blackhole vs Wormhole comparisons throughout.

📖 lesson affiliated May 8, 2026
Local AI Agents on Tenstorrent
Local AI Agents on QuietBox 2

Three agentic projects running fully on-device: local AI agents on QuietBox 2, a coding assistant powered by Aider against a local inference server, and the OpenClaw AI assistant on QuietBox 2. No cloud APIs — all inference runs on TT hardware.

📖 lesson affiliated May 8, 2026
Local AI Agents on Tenstorrent
Coding Assistant with Aider

Three agentic projects running fully on-device: local AI agents on QuietBox 2, a coding assistant powered by Aider against a local inference server, and the OpenClaw AI assistant on QuietBox 2. No cloud APIs — all inference runs on TT hardware.

📖 lesson affiliated May 8, 2026
Video Generation on Tenstorrent
Video Generation via Frame-by-Frame Diffusion

Three lesson-projects covering on-device video synthesis: frame-by-frame diffusion with tt-local-generator, native AnimateDiff video animation, and video generation on QuietBox 2. All run entirely on TT hardware with no cloud dependency.

📖 lesson affiliated May 8, 2026
Video Generation on Tenstorrent
Native Video Animation with AnimateDiff

Three lesson-projects covering on-device video synthesis: frame-by-frame diffusion with tt-local-generator, native AnimateDiff video animation, and video generation on QuietBox 2. All run entirely on TT hardware with no cloud dependency.

📖 lesson affiliated May 8, 2026
Video Generation on Tenstorrent
Video Generation on QuietBox 2

Three lesson-projects covering on-device video synthesis: frame-by-frame diffusion with tt-local-generator, native AnimateDiff video animation, and video generation on QuietBox 2. All run entirely on TT hardware with no cloud dependency.

📖 lesson affiliated May 8, 2026
CS Fundamentals on Tenstorrent Hardware
Module 1: RISC-V & Computer Architecture

Seven-module computer science curriculum taught on real Tenstorrent hardware. Covers RISC-V architecture, memory hierarchy, parallel computing, networks and NoC, synchronization, abstraction layers, and computational complexity — all grounded in what is physically happening on the chip.

📖 lesson affiliated May 8, 2026
CS Fundamentals on Tenstorrent Hardware
Module 2: The Memory Hierarchy

Seven-module computer science curriculum taught on real Tenstorrent hardware. Covers RISC-V architecture, memory hierarchy, parallel computing, networks and NoC, synchronization, abstraction layers, and computational complexity — all grounded in what is physically happening on the chip.

📖 lesson affiliated May 8, 2026
CS Fundamentals on Tenstorrent Hardware
Module 3: Parallel Computing

Seven-module computer science curriculum taught on real Tenstorrent hardware. Covers RISC-V architecture, memory hierarchy, parallel computing, networks and NoC, synchronization, abstraction layers, and computational complexity — all grounded in what is physically happening on the chip.

📖 lesson affiliated May 8, 2026
CS Fundamentals on Tenstorrent Hardware
Module 4: Networks and Communication

Seven-module computer science curriculum taught on real Tenstorrent hardware. Covers RISC-V architecture, memory hierarchy, parallel computing, networks and NoC, synchronization, abstraction layers, and computational complexity — all grounded in what is physically happening on the chip.

📖 lesson affiliated May 8, 2026
CS Fundamentals on Tenstorrent Hardware
Module 5: Synchronization

Seven-module computer science curriculum taught on real Tenstorrent hardware. Covers RISC-V architecture, memory hierarchy, parallel computing, networks and NoC, synchronization, abstraction layers, and computational complexity — all grounded in what is physically happening on the chip.

📖 lesson affiliated May 8, 2026
CS Fundamentals on Tenstorrent Hardware
Module 6: Abstraction Layers

Seven-module computer science curriculum taught on real Tenstorrent hardware. Covers RISC-V architecture, memory hierarchy, parallel computing, networks and NoC, synchronization, abstraction layers, and computational complexity — all grounded in what is physically happening on the chip.

📖 lesson affiliated May 8, 2026
CS Fundamentals on Tenstorrent Hardware
Module 7: Computational Complexity in Practice

Seven-module computer science curriculum taught on real Tenstorrent hardware. Covers RISC-V architecture, memory hierarchy, parallel computing, networks and NoC, synchronization, abstraction layers, and computational complexity — all grounded in what is physically happening on the chip.

📖 lesson affiliated May 8, 2026
tt-claw
OpenClaw AI Assistant on QuietBox 2

A Tenstorrent-powered claw machine that rewards players with real prizes. The QuietBox 2 runs local AI inference to act as an agent controlling the claw hardware — the OpenClaw AI assistant lesson builds directly on this project.

🚀 demo affiliated May 8, 2026
Tensix Grid Playground
Tensix Grid Playground (interactive)

Interactive browser-based visualizer of the Tenstorrent Tensix grid architecture. Explore the NoC, core layout, and dataflow patterns without hardware — a great companion for learning kernel programming.

📖 lesson official May 8, 2026
tt-xla
JAX and PyTorch/XLA on Tenstorrent

PJRT device plugin for Tenstorrent hardware. Enables JAX, PyTorch/XLA, and other XLA-based frameworks to target TT accelerators.

📖 lesson official May 8, 2026
tt-inference-server
Production Inference lesson (VSCode Toolkit)

Production-ready model serving for Tenstorrent hardware with OpenAI-compatible REST API. Supports continuous batching, multiple models, and all TT hardware configurations.

📖 lesson official May 8, 2026
tt-lang
Introduction to tt-lang

Python-based DSL that sits between TT-NN and TT-Metalium — expresses custom fused kernels with progressive disclosure, compiling directly to Tensix. Ships an integrated functional simulator (no hardware needed), line-by-line performance metrics, and AI-agent-friendly tooling. Two packages: tt-lang (compiler + hardware, requires ttnn) and tt-lang-sim (simulator only, works on Linux/macOS without Tenstorrent hardware).

📖 lesson official May 8, 2026
tt-installer
Modern Setup lesson (VSCode Toolkit)

Install the complete Tenstorrent software stack with one command. Handles drivers, firmware, Python environment, and SDK setup automatically.

📖 lesson official May 8, 2026
tt-vscode-toolkit
All 48 lessons

48 interactive lessons covering the full Tenstorrent developer path — from hardware detection to custom training — with click-to-run commands and hardware auto-detection. Available in VSCode and code-server.

📖 lesson official May 8, 2026
tt-vscode-toolkit
RISC-V Programming Guide

48 interactive lessons covering the full Tenstorrent developer path — from hardware detection to custom training — with click-to-run commands and hardware auto-detection. Available in VSCode and code-server.

🎥 video official May 4, 2026
TT-Deploy | Deploy at Scale, Customer Panel
Tenstorrent — YouTube

TT-Deploy, May 1st, 2026 – Customers running Tenstorrent in production — David Bennett (AI&), Alex Nataros (Cirrascale), Sanchayan Sinha (Turiyam), and Mike Gorbinski (Virtu Financial) — share why they chose our hardware and what they're building on it.

🎥 video official May 4, 2026
TT-Deploy | Where AI Runs, Jim Keller
Tenstorrent — YouTube

TT-Deploy, May 1st, 2026 – Jim Keller reads our love letter to DeepSeek v4, reviews how Tenstorrent achieves unlimited scale, and shows off our video gen benchmark on Artificial Analysis. He closes with Tenstorrent is where AI runs.

🎥 video official May 4, 2026
TT-Deploy | Run Anything, Stan Sokorac
Tenstorrent — YouTube

TT-Deploy, May 1st, 2026 – Stan Sokorac, Sr. Fellow, Software, reveals the results of an agentic pipeline that continuously tests random Hugging Face models on Tenstorrent hardware: a 90% pass rate, projecting to roughly 2.5 million models. Plus a deep dive on TT-Lang, TT-Forge, and the only 100% open-source high-performance AI software stack.

🎥 video official May 4, 2026
TT-Deploy | Deploy in Depth, Partner Panel
Tenstorrent — YouTube

TT-Deploy, May 1st, 2026 – Tenstorrent's Amr Elashmawi sits down with Justen Aguillon (Equinix), Sheng Yeo (OrionVM), and Abhishek Bhargava (BetterBrain) to walk through the Equinix Distributed AI Hub — a full-stack sovereign agentic AI platform now live in Ashburn.

🎥 video official May 4, 2026
TT-Deploy | Run Fast, Jasmina Vasiljevic
Tenstorrent — YouTube

TT-Deploy, May 1st, 2026 – Jasmina Vasilović, Senior Fellow of ML Frameworks & Programming Models, walks through the Tenstorrent software stack and how Blackhole's switch-free architecture extends the low-cost serving curve where GPU economics collapse — plus a partner spotlight on Prodia's faster-than-real-time WAN 2.2 video generation.

🎥 video official May 4, 2026
TT-Deploy | Welcome to TT-Deploy, Jim Keller
Tenstorrent — YouTube

TT-Deploy, May 1st, 2026 – Jim Keller walks through the fundamentals – scale, general-purpose, and lower cost – that enable Tenstorrent to be built for the constant changing landscape of AI.

🎥 video official May 2, 2026
TT-Deploy | Full Event Overview
Tenstorrent — YouTube

Join as we unveil Tenstorrent’s AI solutions deployed at scale. See the full breadth of what we've built — validated by real architecture, benchmarks, and customer deployments.

April 2026
March 2026
February 2026
December 2025
October 2025
🎥 video official Oct 23, 2025
TT-Blueprint | Robotics and Automotive | Thaddeus Fortenberry
Tenstorrent — YouTube

Thaddeus Fortenberry, VP of Robotics + Automotive at Tenstorrent, discusses how we're paving the open road to chiplet interoperability in robotics. Open Chiplet Atlas | https://www.openchipletatlas.org Tenstorrent Robotics IP | https://tenstorrent.com/ip X https://twitter.com/tenstorrent Discord https://discord.com/invite/tenstorrent

🎥 video official Oct 23, 2025
TT-Blueprint | Welcome and CPU IP Update | Jim Keller and Miles Dooley
Tenstorrent — YouTube

Watch Jim Keller and Miles Dooley kick off Tenstorrent's IP launch event in San Francisco, TT-Blueprint. Tenstorrent IP | https://tenstorrent.com/ip RISC-V CPU | https://tenstorrent.com/ip/risc-v-cpu Open Chiplet Atlas | https://www.openchipletatlas.org/ X https://twitter.com/tenstorrent Discord https://discord.com/invite/tenstorrent

🎥 video official Oct 23, 2025
TT-Blueprint | Empowering the Chiplet Ecosystem | Wei-Han Lien
Tenstorrent — YouTube

Wei-Han Lien, Chief Architect at Tenstorrent, discusses how Tenstorrent is pushing the chiplet ecosystem forward with Open Chiplet Atlas. Open Chiplet Atlas | https://www.openchipletatlas.org/ Tenstorrent IP | https://tenstorrent.com/ip RISC-V CPU | https://tenstorrent.com/ip/risc-v-cpu X https://twitter.com/tenstorrent Discord https://discord.com/invite/tenstorrent

🎥 video official Oct 23, 2025
TT-Blueprint | An Open IP Future | Aniket Saha
Tenstorrent — YouTube

Aniket Saha, VP of Product Strategy at Tenstorrent, discusses Tenstorrent's new, open business model for high-performance IP and why the old, closed-ecosystem approach isn't working. Tenstorrent IP | https://tenstorrent.com/ip X https://twitter.com/tenstorrent Discord https://discord.com/invite/tenstorrent

July 2025
🎥 video official Jul 9, 2025
Modeling Multi-Device in Tenstorrent's Compiler
Tenstorrent — YouTube

This session, led by Tapasvi Patel, Sr. Engineer at Tenstorrent, covers how Tenstorrent's compiler (TT-Forge) and runtime (TT-NN) manage device meshes, tensor sharding, and collective communications, with a focus on using JAX. Also, see how Shardy helps with automatic parallelization. 0:00 Introduction 0:25 Modeling multi-device in Tenstorrent's compiler & runtime 5:33 Parallelism strategies and multi-chip hardware 8:38 Core TT-NN runtime features for multi-device (meshes, tensors, mappers)…

June 2025
📄 paper community Jun 19, 2025
HetGPU: The pursuit of making binary compatibility towards GPUs
arXiv:2506.15993

Heterogeneous GPU infrastructures present a binary compatibility challenge: code compiled for one vendor's GPU will not run on another due to divergent instruction sets, execution models, and driver stacks . We propose hetGPU, a new system comprising a compiler, runtime, and abstraction layer that together enable a single GPU binary to execute on NVIDIA, AMD, Intel, and Tenstorrent hardware. The hetGPU compiler emits an architecture-agnostic GPU intermediate representation (IR) and inserts…

📄 paper community Jun 13, 2025
Topology-Aware Virtualization over Inter-Core Connected Neural Processing Units
arXiv:2506.11446

With the rapid development of artificial intelligence (AI) applications, an emerging class of AI accelerators, termed Inter-core Connected Neural Processing Units (NPU), has been adopted in both cloud and edge computing environments, like Graphcore IPU, Tenstorrent, etc. Despite their innovative design, these NPUs often demand substantial hardware resources, leading to suboptimal resource utilization due to the imbalance of hardware requirements across various tasks. To address this issue,…

May 2025
🎥 video official May 6, 2025
TT-Forge Architecture, Tools, Front-End Support
Tenstorrent — YouTube

TT-Forge is Tenstorrent’s MLIR-based compiler. This Q&A covers its architecture, front-end support (Torch, JAX), tools like TT-Explorer, and key discussions on runtime customization, debugging, production timelines, quantization, model training, the PiKernel DSL, and how to contribute. 0:00 What is TT-Forge? 1:09 Accessing Tenstorrent hardware on Koyeb 4:39 Compiler production timeline and manual model implementation 7:24 TT-Explorer for quantization schemes 8:26 Model training framework with…

April 2025
🎥 video official Apr 3, 2025
Intro to TT-Forge
Tenstorrent — YouTube

TT-Forge is Tenstorrent’s MLIR-based compiler. Learn how TT-Forge integrates with our AI software stack, why we’re building on MLIR, and the features that make TT-Forge flexible and adaptable. 0:00 Introduction 1:36 Integration with key ML frameworks 2:05 Why MLIR? 3:10 Driving principles of TT-Forge 7:48 Recap tt-forge: https://github.com/tenstorrent/tt-forge tt-mlir: https://github.com/tenstorrent/tt-mlir Follow Tenstorrent on X at https://x.com/tenstorrent Join our Discord at…

September 2024