Harris Notes

Apr 14, 2026

entity
tool
general

llama.cpp

llama.cpp is an open-source C/C++ inference engine for large language models (LLMs). It loads models in the GGUF format, including quantised ones, and supports Apple’s Metal backend, which enables GPU-accelerated local inference on Apple Silicon Macs.

Details

Services: LLM inference, GGUF model support, Metal acceleration
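
As a rough illustration of what GGUF model support involves on the file side, the sketch below reads the fixed-size header at the start of a GGUF file. It assumes a little-endian file at GGUF version 2 or later, where the 4-byte magic `GGUF` is followed by a 32-bit version and two 64-bit counts. The function name `read_gguf_header` and the path `model.gguf` are hypothetical and are not part of llama.cpp.

```python
import struct

GGUF_MAGIC = b"GGUF"   # first four bytes of every GGUF file
HEADER_SIZE = 24       # magic (4) + version (4) + two 64-bit counts (16)


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF file.

    Assumes a little-endian file at GGUF version 2 or later.
    """
    with open(path, "rb") as f:
        raw = f.read(HEADER_SIZE)
    if len(raw) < HEADER_SIZE or raw[:4] != GGUF_MAGIC:
        raise ValueError(f"{path} does not look like a GGUF file")
    # "<IQQ": little-endian uint32 version, uint64 tensor count, uint64 KV count
    version, tensor_count, kv_count = struct.unpack("<IQQ", raw[4:HEADER_SIZE])
    return version, tensor_count, kv_count
```

Calling `read_gguf_header("model.gguf")` on a downloaded model is a quick sanity check that the file is GGUF rather than an older or unrelated format before handing it to llama.cpp.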
