Hello! We are excited to announce that Furiosa SDK 2025.2.0 has officially been published today. This is the fourth major release for RNGD, and it provides a streamlined stack to enable LLMs on RNGD, spanning the driver, SoC firmware, PERT, HAL, Model Compressor, Furiosa Compiler, and all Furiosa SDK components including Furiosa-LLM.
Here are the release notes and documents:
- Release Note (2025.2.0)
- Upgrading FuriosaAI’s Software (2025.2.0)
- Furiosa Docs (2025.2.0)
- Furiosa SDK 2025.2 (Beta 2) Supplementary Guide
Key features and improvements in the 2025.2.0 release:
- Introduce the `LLM.chat()` API to support chat-based models (see the first sketch after this list).
- Add support for the `/v1/models` and `/v1/models/{model_id}` endpoints in furiosa-llm (see the server sketch after this list).
- Add support for the chunked prefill feature in furiosa-llm.
- Enable direct building of `bfloat16`/`float16`/`float32` models without a quantization step.
- Add support for the reasoning model parser in the OpenAI-Compatible Server.
- The LLM API, furiosa-mlperf, and furiosa-llm serve now support loading artifacts from the Hugging Face Hub.
- furiosa-llm now supports Python 3.11 and 3.12.
- Optimize NPU DRAM stack usage in furiosa-llm.
- Support Ubuntu 24.04 (Noble Numbat).
- Add support for `abort()` in the `LLMEngine` and `AsyncLLMEngine` APIs (see the abort sketch after this list).
- Add support for the metrics endpoint (`/metrics`) used to monitor the health of the OpenAI-Compatible Server.
- Support the sampling parameter `logprobs` in Furiosa-LLM.
- Add support for the Container Device Interface (CDI) for container runtimes (e.g., Docker, containerd, and CRI-O).
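To make the list above concrete, here is a minimal sketch of the new `LLM.chat()` API together with the `logprobs` sampling parameter and Hugging Face Hub artifact loading. It assumes a vLLM-style interface where `LLM(...)` accepts an artifact identifier and `chat()` takes OpenAI-style messages; the model id, constructor shape, and output structure below are assumptions, so consult the Furiosa-LLM API reference for the exact signatures.

```python
from furiosa_llm import LLM, SamplingParams

# Load a pre-compiled artifact; 2025.2.0 lets this come straight from the
# Hugging Face Hub. The model id below is hypothetical.
llm = LLM("furiosa-ai/Llama-3.1-8B-Instruct")

# The new "logprobs" sampling parameter requests per-token log
# probabilities alongside the generated text.
params = SamplingParams(temperature=0.7, max_tokens=128, logprobs=5)

# chat() accepts OpenAI-style message dicts for chat-tuned models.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize the 2025.2.0 release in one line."},
]

# The return shape here assumes a vLLM-style list of request outputs.
outputs = llm.chat(messages, params)
print(outputs[0].outputs[0].text)
```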
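The new server endpoints can be tried with plain HTTP against a running `furiosa-llm serve` instance. A minimal sketch, assuming the server listens on `localhost:8000` (the host, port, and returned model ids are illustrative); `/v1/models` follows the OpenAI list-models response shape, and `/metrics` emits Prometheus-style text.

```python
import requests

BASE = "http://localhost:8000"  # assumed address of a running `furiosa-llm serve`

# List the models the server exposes (OpenAI-compatible endpoint).
models = requests.get(f"{BASE}/v1/models").json()
print([m["id"] for m in models["data"]])

# Fetch details for a single model via /v1/models/{model_id}.
model_id = models["data"][0]["id"]
detail = requests.get(f"{BASE}/v1/models/{model_id}").json()
print(detail)

# Scrape the new metrics endpoint to monitor server health.
print(requests.get(f"{BASE}/metrics").text[:500])
```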
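And a hedged sketch of the new `abort()` support: cancelling an in-flight request in `AsyncLLMEngine` when it exceeds a deadline. The `generate()`/`abort()` shapes below mirror vLLM's async engine API and are an assumption about furiosa-llm's compatible interface; note that `asyncio.timeout()` requires Python 3.11+, which this release now supports.

```python
import asyncio

async def generate_with_timeout(engine, prompt, params, request_id, timeout_s):
    """Stream tokens from an AsyncLLMEngine, aborting the request on timeout.

    `engine` is an already-constructed furiosa_llm AsyncLLMEngine; the
    generate()/abort() calls below follow vLLM's async API and are an
    assumption about the compatible interface.
    """
    try:
        async with asyncio.timeout(timeout_s):
            final = None
            # generate() is assumed to yield partial outputs as they arrive.
            async for output in engine.generate(prompt, params, request_id):
                final = output
            return final
    except TimeoutError:
        # The new abort() API cancels the in-flight request on the engine
        # so the NPU stops spending cycles on it.
        await engine.abort(request_id)
        return None
```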
You can find more details in the Release Note (2025.2.0).
Also, please check out the available pre-compiled models on the Hugging Face Hub.