Skymizer Announces HTX301 — Reinventing On-Prem AI Inference
Skymizer Announces HTX301 — Reinventing On-Prem AI Inference

“For the first time, ultra-large models can run on a single PCIe card. Powered by six HTX301 chips and 384GB of memory, enterprises can now execute 700B-parameter LLM inference locally at just ~240W — eliminating the need for massive GPU clusters, NVLink/NVSwitch interconnects, and complex cooling infrastructure.
Built for the new era of inference-dominant AI, HyperThought™ introduces a fundamentally different approach. By disaggregating prefill and decode workloads and pairing decode-first silicon with an intelligent software orchestration stack, HTX301 enables higher utilization, lower latency, and significantly improved power efficiency across real-world deployments…”
Source: skymizer.ai/skymizer-announces-htx301-reinventing-on-prem-ai-inference/