Pre-optimized and pre-compield models in Hugging Face Hub

hyunsik · May 19, 2025, 10:34pm

Hi folks,

Since the 2025.2 release, we’ve uploaded the pre-optimized and pre-compield models in Hugging Face Hub.

For example, without an artifact compilation step, you can quickly run as follows:

furiosa-llm serve furiosa-ai/Llama-3.1-8B-Instruct-FP8

You can find more available pre-compiled models at furiosa-ai (FuriosaAI).

Topic		Replies	Views
Furiosa SDK 2024.2.0 Release Announcements release	0	31	January 13, 2025
Furiosa SDK 2025.2.0 release Announcements release , sdk , rngd	5	63	May 20, 2025
Furiosa SDK 2025.1.0 release Announcements release	0	98	February 24, 2025
Furiosa SDK 2024.2.0 릴리즈 공지사항 release	0	82	January 13, 2025
Furiosa SDK 2024.2.1 release (Up to 32k context length support) Announcements release	0	25	January 13, 2025