Files
2025-01-30 22:55:11 +00:00

4 lines
139 B
Plaintext

Inference of Meta's LLaMA model (and others) in pure C/C++ with
minimal setup and state-of-the-art performance on a wide range
of hardware