WWDC26 · 29 min · AI & Machine Learning

Dive into Core AI model authoring and optimization

Dive into the complete custom model deployment workflow for Apple silicon with the new Core AI framework. Discover powerful techniques for authoring models using custom Metal kernels, alongside platform-aware compression strategies. The new Core AI Debugger offers deep intrinsic analysis, and AI-assisted workflows guide you from initial concept to optimized on-device execution.

Watch at developer.apple.com ↗

Transcript all transcripts

Chapters

0:00 — Introduction
1:49 — Models and skills
3:27 — Python workflow
5:54 — Model optimization
10:40 — Core AI Debugger
19:27 — Advanced authoring
20:43 — Custom Metal kernels
23:01 — Model re-authoring
28:46 — Next steps

Code shown on screen · 4 snippets

Define and export a PyTorch model python · at 3:27 ↗

import torch
import torch.nn as nn

# Define a simple model
class MLP(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(256, 512)
        self.fc2 = nn.Linear(512, 10)

    def forward(self, x):
        return self.fc2(torch.relu(self.fc1(x)))

# Export with torch.export
model = MLP().eval()
example_input = (torch.randn(1, 256),)
exported_program = torch.export.export(model, example_input)

Convert, optimize and run inference with Core AI python · at 4:02 ↗

import coreai
import coreai_torch
from coreai.runtime import NDArray

# Convert to Core AI
converter = coreai_torch.TorchConverter()
converter.add_exported_program(
    exported_program,
    input_names=["features"], output_names=["logits"])
core_ai_program = converter.to_coreai()

# Optimize and save to .aimodel
core_ai_program.optimize()
asset = core_ai_program.save_asset("mlp.aimodel")

# Run inference
specialized_model = await AIModel.load("mlp.aimodel")
specialized_function = specialized_model.load_function("main")
result = await specialized_function({"features": NDArray(example[0].numpy())})

Define a SiLU Metal kernel with PyTorch reference python · at 21:12 ↗

import torch
from coreai_torch.dsl import TorchMetalKernel, MetalParameter

def silu_torch(x):
    return x * torch.sigmoid(x)

SILU_MSL = """
float val = float(x[gid]);
float sig = 1.0f / (1.0f + exp(-val));
y[gid] = TYPE(val * sig);
"""

silu_kernel = TorchMetalKernel(
    name="fused_silu",
    input_names=["x"],
    result_names=["y"],
    src=SILU_MSL,
    torch_defn=silu_torch,
    metal_params=[MetalParameter("gid", "uint", "thread_position_in_grid")],
    template_dtypes={"x": "TYPE"},
)

Use a custom Metal kernel and convert with TorchConverter python · at 22:09 ↗

class MyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(256, 256)

    def forward(self, x):
        h = self.linear(x)
        n = h.numel()
        return silu_kernel(
            h,
            threads_per_grid_size=(n, 1, 1),
            threads_per_thread_group=(min(n, 256), 1, 1),
            result_shapes=[h.shape],
        )

exported_program = torch.export.export(MyModel(), (torch.randn(1, 256),))

converter = coreai_torch.TorchConverter()
converter.register_custom_kernels([silu_kernel])
converter.add_exported_program(exported_program,
                               input_names=["x"], output_names=["y"])
deployable = converter.to_coreai()  # MSL integrated into asset

Resources

Documentation Core AI PyTorch Extensions
Documentation Core AI Python
Documentation Core AI Optimization
Documentation Inspecting, debugging, and profiling Core AI models
Documentation Inspecting Core AI models with Core AI Debugger
Documentation Core AI

Explore distributed inference and training with MLX

WWDC26 · 9 snippets

22 min
Run local agentic AI on the Mac using MLX

WWDC26 · 3 snippets

14 min
Explore numerical computing in Swift with MLX

WWDC26 · 6 snippets

15 min

Chapters

Code shown on screen · 4 snippets

Resources

Related sessions

Explore distributed inference and training with MLX

Run local agentic AI on the Mac using MLX

Explore numerical computing in Swift with MLX