{"id":54838,"date":"2026-05-02T07:48:00","date_gmt":"2026-05-02T00:48:00","guid":{"rendered":"https:\/\/thaipropertynews.com\/feeds\/?p=54838"},"modified":"2026-05-02T07:48:00","modified_gmt":"2026-05-02T00:48:00","slug":"moreh-demonstrates-production-ready-llm-inference-on-tenstorrent-galaxy-achieving-dgx-a100-class-performance-with-improved-cost-efficiency","status":"publish","type":"post","link":"https:\/\/thaipropertynews.com\/feeds\/?p=54838","title":{"rendered":"MOREH Demonstrates Production-Ready LLM Inference on Tenstorrent Galaxy, Achieving DGX A100-Class Performance with Improved Cost Efficiency"},"content":{"rendered":"<p class=\"prntac\"><b><i><span>R<\/span><span>educes HBM Costs with GPU\u2013Tenstorrent Heterogeneous Distributed Serving<br \/><\/span><\/i><\/b><b><i><span>First unveiled at Tenstorrent&#8217;s launch event, TT-Deploy, in San Francisco on May 1<\/span><\/i><\/b><\/p>\n<p><span class=\"legendSpanClass\">SANTA CLARA, Calif.<\/span>, May 2, 2026 \/PRNewswire\/ &#8212; <a href=\"https:\/\/moreh.io\/\" target=\"_blank\" rel=\"nofollow\">Moreh<\/a>, an AI infrastructure software company, led by CEO Gangwon Jo, announced that it has successfully validated LLM inference performance on the Tenstorrent Galaxy Wormhole system using its proprietary &#8216;MoAI Inference Framework<span>.&#8217;<\/span><\/p>\n<div class=\"PRN_ImbeddedAssetReference\">\n<p> <a href=\"https:\/\/mma.prnasia.com\/media2\/2971313\/Photo.html\" target=\"_blank\" rel=\"nofollow\"> <img decoding=\"async\" src=\"https:\/\/mma.prnasia.com\/media2\/2971313\/Photo.jpg?p=medium600\" title=\"\" alt=\"\" \/> <\/a> <br \/><span><\/span><\/p>\n<\/div>\n<p>Based on tests across leading Mixture-of-Experts (MoE) models\u2014including GPT-OSS, Qwen, GLM, and DeepSeek\u2014Moreh achieved LLM inference performance on Tenstorrent Galaxy Wormhole matching or surpassing NVIDIA DGX A100-class systems, demonstrating a compelling alternative to conventional GPU-centric AI infrastructure.<\/p>\n<p>Moreh also improved cost efficiency by implementing a disaggregated serving architecture that combines GPUs with Tenstorrent Wormhole chips. By utilizing Tenstorrent processors as dedicated prefill accelerators, the company reduced reliance on high-cost HBM and lowered overall infrastructure costs.<\/p>\n<p>The results were first unveiled at Tenstorrent&#8217;s launch event, TT-Deploy, held on May 1 in San Francisco.<\/p>\n<p>As a strategic partner of Tenstorrent and a major external contributor to Metalium, Moreh showcased a live LLM inference demo at the event. Building on its experience operating AMD GPU-based production environments in real-world data centers, the company presented its latest technical achievements in &#8216;Production-Ready LLM Inference on Tenstorrent Galaxy.&#8217;<\/p>\n<p>MoAI Inference Framework is a disaggregated inference solution that enables unified operation of heterogeneous GPUs and NPUs\u2014including NVIDIA, AMD, and Tenstorrent\u2014within a single cluster. This allows enterprises to build flexible AI infrastructure strategies without vendor lock-in.<\/p>\n<p>Moreh CEO Gangwon Jo stated, &#8220;Achieving production-grade LLM inference performance and stability on Tenstorrent-based systems marks a significant milestone,&#8221; and added, &#8220;We will continue to enhance performance through deeper optimization across heterogeneous architectures and closer integration with Tenstorrent NPUs.&#8221;<\/p>\n<p>Moreh is developing its own core AI infrastructure engine and, through its foundation LLM subsidiary Motif Technologies, is building end-to-end capabilities spanning both infrastructure and model domains. Simultaneously, the company is making its mark in the global market through collaborations with key partners such as AMD, Tenstorrent, and SGLang.<\/p>","protected":false},"excerpt":{"rendered":"<p><!-- wp:html --><\/p>\n<p class=\"prntac\"><b><i><span>R<\/span><span>educes HBM Costs with GPU\u2013Tenstorrent Heterogeneous Distributed Serving<br \/><\/span><\/i><\/b><b><i><span>First unveiled at Tenstorrent&#8217;s launch event, TT-Deploy, in San Francisco on May 1<\/span><\/i><\/b><\/p>\n<p><span class=\"legendSpanClass\">SANTA CLARA, Calif.<\/span>, May 2, 2026 \/PRNewswire\/ &#8212; <a href=\"https:\/\/moreh.io\/\" target=\"_blank\" rel=\"nofollow\">Moreh<\/a>, an AI infrastructure software company, led by CEO Gangwon Jo, announced that it has successfully validated LLM inference performance on the Tenstorrent Galaxy Wormhole system using its proprietary &#8216;MoAI Inference Framework<span>.&#8217;<\/span><\/p>\n<div class=\"PRN_ImbeddedAssetReference\">\n<p> <a href=\"https:\/\/mma.prnasia.com\/media2\/2971313\/Photo.html\" target=\"_blank\" rel=\"nofollow\"> <img decoding=\"async\" src=\"https:\/\/mma.prnasia.com\/media2\/2971313\/Photo.jpg?p=medium600\" title=\"\" alt=\"\" \/> <\/a> <br \/><span><\/span><\/p>\n<\/div>\n<p>Based on tests across leading Mixture-of-Experts (MoE) models\u2014including GPT-OSS, Qwen, GLM, and DeepSeek\u2014Moreh achieved LLM inference performance on Tenstorrent Galaxy Wormhole matching or surpassing NVIDIA DGX A100-class systems, demonstrating a compelling alternative to conventional GPU-centric AI infrastructure.<\/p>\n<p>Moreh also improved cost efficiency by implementing a disaggregated serving architecture that combines GPUs with Tenstorrent Wormhole chips. By utilizing Tenstorrent processors as dedicated prefill accelerators, the company reduced reliance on high-cost HBM and lowered overall infrastructure costs.<\/p>\n<p>The results were first unveiled at Tenstorrent&#8217;s launch event, TT-Deploy, held on May 1 in San Francisco.<\/p>\n<p>As a strategic partner of Tenstorrent and a major external contributor to Metalium, Moreh showcased a live LLM inference demo at the event. Building on its experience operating AMD GPU-based production environments in real-world data centers, the company presented its latest technical achievements in &#8216;Production-Ready LLM Inference on Tenstorrent Galaxy.&#8217;<\/p>\n<p>MoAI Inference Framework is a disaggregated inference solution that enables unified operation of heterogeneous GPUs and NPUs\u2014including NVIDIA, AMD, and Tenstorrent\u2014within a single cluster. This allows enterprises to build flexible AI infrastructure strategies without vendor lock-in.<\/p>\n<p>Moreh CEO Gangwon Jo stated, &#8220;Achieving production-grade LLM inference performance and stability on Tenstorrent-based systems marks a significant milestone,&#8221; and added, &#8220;We will continue to enhance performance through deeper optimization across heterogeneous architectures and closer integration with Tenstorrent NPUs.&#8221;<\/p>\n<p>Moreh is developing its own core AI infrastructure engine and, through its foundation LLM subsidiary Motif Technologies, is building end-to-end capabilities spanning both infrastructure and model domains. Simultaneously, the company is making its mark in the global market through collaborations with key partners such as AMD, Tenstorrent, and SGLang.<\/p>\n<p><!-- \/wp:html --><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"rop_custom_images_group":[],"rop_custom_messages_group":[],"rop_publish_now":"initial","rop_publish_now_accounts":[],"rop_publish_now_history":[],"rop_publish_now_status":"pending","footnotes":""},"categories":[5,7],"tags":[],"class_list":["post-54838","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cision-pr-newswire","category-cision-pr-newswire-en"],"_links":{"self":[{"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/posts\/54838","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=54838"}],"version-history":[{"count":0,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/posts\/54838\/revisions"}],"wp:attachment":[{"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=54838"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=54838"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=54838"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}