The battle between AMD (AMD.US) and Nvidia has spread to the field of AI! Meta and Oracle Civil Servants Decry Huge Investments to Buy AMD AI Chips

Zhitong Finance · Dec 6, 2023 20:30

AMD宣布正式推出其旗舰款AI GPU加速器MI300X，意味着AMD与英伟达之间的激烈竞争从PC领域全面延伸至AI领域。

智通财经APP获悉，CPU与GPU双产业巨头、英伟达(NVDA.US)竞争对手之一AMD(AMD.US)于美东时间周三举行“Advancing AI”发布会，AMD宣布正式推出其旗舰款AI GPU加速器MI300X，意味着AMD与英伟达之间的激烈竞争从PC领域全面延伸至AI领域，AMD测试数据显示其整体性能比英伟达H100高出60%。除了如预期发布诸如Instinct MI300X、MI300A等新产品外，在AI领域的一些全球领军者也来到现场，比如OpenAI、微软以及Meta，并表示他们后续将大量配置AMD Instinct MI300X。

“Advancing AI”发布会上，首先登场的是Instinct MI300X加速器，由于MI300系列AI芯片的参数在半年前的产品宣传活动上就已经公布，本次“Advancing AI”发布会更多聚焦于整套系统在实际应用中的表现，以及与AI训练/推理领域热度最高的英伟达H100 AI芯片的全方位性能对比。此外，AMD将截至2027年的全球AI芯片市场规模预期，从1500亿美元猛然上修至4000亿美元。

AI领域的全球领军者OpenAI、微软(MSFT.US)以及Meta(META.US)周三在AMD活动上表示，他们将使用AMD最新的AI芯片Instinct MI300X。这是迄今为止全球科技公司正在寻找昂贵的英伟达H100 AI芯片替代品的最明显迹象。英伟达H100对于创建和部署OpenAI旗下的ChatGPT等生成式人工智能(生成式AI)应用程序至关重要，如今AI芯片领域几乎处于垄断地位的英伟达H100终于有了强力竞争对手，那就是AMD Instinct MI300X。

如果AMD最新的高端AI芯片在明年初开始出货时，足以满足构建人工智能大模型的科技公司和云服务提供商的算力需求，同时降低科技公司开发人工智能模型的成本，势必将对英伟达旗下销售额不断飙升的AI芯片构成巨大的竞争压力。

AMD首席执行官苏姿丰(Lisa Su)周三表示:“潜在大客户的所有兴趣基本上集中在云计算领域的大型处理器和大型GPU上。”

AMD表示，MI300X基于一种全新的架构，这种架构通常会带来显著的性能提升。AMD 全新AI芯片最显著的特点是拥有192GB的尖端高性能HBM3内存，传输数据速度更快，可以适应更大规模的人工智能模型。

被AMD粉丝们亲切称为“苏妈”的苏姿丰在发布会上直接将MI300X及其构建的系统与英伟达的主要AI芯片H100进行了比较。“这种性能直接转化为更好的用户体验，”苏姿丰表示。“当你向模型提出要求时，你希望它能更快地回复，尤其是在反应变得更复杂的情况下。”

AMD面临的主要问题在于，一直以英伟达软硬件为基础的公司是否会投入时间和金钱来信任另一家AI芯片供应商。AMD周三告诉投资者和合作伙伴，该公司已经改进了名为ROCm的配套软件套件，以与英伟达的行业标准CUDA软件相竞争，一定程度上解决了AMD在AI芯片领域面临的一个关键缺陷——那就是软硬件生态系统，而这个缺陷一直是人工智能开发者目前更喜欢英伟达旗下产品的主要原因之一。

AMD在周三并未透露MI300X的定价，但英伟达AI芯片每块售价高达4万美元，苏姿丰表示，AMD的芯片必须比英伟达的芯片购买和运营成本更低，才能说服潜在大客户购买。

虽然发布会的内容基本上符合市场预期，但似乎未提振AMD股价。在周三美股科技股集体走弱的背景下，AMD发布会还没开完，股价就由涨转跌，最终收跌超1%，但是盘后涨幅超过1%。

由于AMD即将推出与英伟达H100相抗衡的高性能AI 芯片，华尔街分析师们对于AMD股价普遍看涨，Seeking Alpha汇编的华尔街分析师共识评级以及目标价显示，华尔街分析师们对AMD的共识评级为“买入”，平均目标价预期达132.24美元，意味着未来12个月潜在涨幅高达13%，最高目标价则为200美元。

性能强于英伟达H100! AMD大幅上调市场规模预期

与AI训练/推理领域热度最高的英伟达H100 AI芯片的最新性能对比数据显示，在一般LLM内核TFLOP中，MI300X在FlashAttention-2和Llama 2 70B中提供较英伟达H100高达20%的性能提升幅度。从平台的角度来看，将8x MI300X解决方案与8x 英伟达H100解决方案进行比较，AMD发现Llama 2 70B的增益程度要大得多，高达40%，Bloom 176B基准下的增益程度则高达60%。

具体的AMD Instinct MI300X与英伟达H100性能对比数据显示：

在1v1比较中，整体性能比H100 (Llama 2 70B)提高20%

在1v1比较中，整体性能比H100 (FlashAttention 2)提高20%

8v8 服务器中的整体性能比 H100 (Llama 2 70B) 提高40%

在8v8服务器中，整体性能比 H100 (Bloom 176B) 提高60%

最新 MI300 AI芯片背后的驱动力是AMD ROCm 6.0。该软件堆栈已更新到最新版本，具有强大的新功能，包括支持各种人工智能工作负载任务量，例如生成式人工智能和大语言模型(LLM)。

内存是AMD另一个巨大的升级领域，MI300X的HBM3容量比其前身MI250X(128 GB)增加了50%。为了实现高达192GB的内存池，AMD为MI300X配备了8个HBM3堆栈，每个堆栈都是12-Hi，同时整合了16Gb IC，每个IC具有2GB容量，或每个堆栈具有高达24GB的容量。

该级别的内存规模将提供高达5.3TB/s的带宽和896GB/s的Infinity Fabric带宽。相比之下，英伟达即将推出的H200 AI芯片提供141GB容量，而英特尔Gaudi3 将提供141GB容量。功耗方面，AMD Instinct MI300X的额定功率为750W，比Instinct MI250X的500W提升了50%，比NVIDIA H200多了50W。

AMD在周三对AI芯片领域的未来市场规模给出了大胆的预测数据，认为AI芯片市场将迅猛扩张。具体来说，AMD预计AI芯片市场整体规模到2027年将达到超过4000亿美元，较该公司几个月前提供的1500亿美元上调将近两倍，凸显全球各大企业对人工智能硬件的预期快速变化，各企业正在迅速布局全新的AI产品。

有哪些科技巨头将使用MI300X?

AMD在周三的发布会上表示，该公司已经与一些最需要GPU的科技公司签订了使用该芯片的协议。根据研究公司Omidia最近的一份报告，Meta和微软是2023年英伟达H100 AI芯片的最大规模买家。

在发布会上，Facebook和Instagram母公司Meta公开表示，该公司未来将大量使用MI300X GPU处理人工智能推理工作负载，如处理人工智能贴纸、图像编辑和操作其AI助手，并表示结合ROCm 软件堆栈来支持AI推理工作负载。

微软首席技术官Kevin Scott公开表示，微软将通过其Azure网络服务提供对MI300X芯片的技术访问通道。此外，同日消息显示，微软未来将评估对AMD的AI芯片产品的需求，评估采用该新品的可行性。

ChatGPT开发商OpenAI表示，该公司将在一款名为Triton的重要软件产品中支持AMD MI300等GPU。Triton不是像GPT-4那样的大语言模型，但对于人工智能研究领域来说是非常重要的产品。

英伟达最大规模客户之一甲骨文表示，将在自己的云计算服务体系中使用Instinct MI300X加速器，并计划基于AMD Instinct MI300X开发生成式AI服务。

AMD目前还没有预测该AI芯片的长期销售额，目前只给出了2024年预期，预计2024年数据中心GPU带来的总营收规模约为20亿美元。仅在最近一个季度，英伟达的数据中心业务营收就超过140亿美元，不过这一数据还包括GPU以外的业务。然而，AMD表示，未来四年，AI芯片领域的总市场规模可能将攀升至4000亿美元，是该公司此前预测值的两倍。

苏姿丰还向记者表示，AMD并不认为它需要击败英伟达才能在市场上取得好成绩。在谈到人工智能芯片市场时，苏姿丰对记者表示:“我认为很明显，英伟达现在占据绝大多数。”“我们认为，到2027年，AI芯片市场规模可能会超过4000亿美元，而我们将扮演重要角色。”

AMD announced the official launch of its flagship AI GPU accelerator, the MI300X, which means that the intense competition between AMD and Nvidia extends from the PC field to the AI field across the board.

The Zhitong Finance App learned that AMD (AMD.US), a giant in both the CPU and GPU industries and one of Nvidia's (NVDA.US) rivals, held a “NVDA.US” press conference on Wednesday EST. AMD announced the official launch of its flagship AI GPU accelerator MI300X, which means that the intense competition between AMD and Nvidia extends from the PC field to the AI field. AMD test data shows that its overall performance is 60% higher than Nvidia's H100. In addition to releasing new products such as the Instinct MI300X and MI300A as expected, some global leaders in the AI field also came to the scene, such as OpenAI, Microsoft, and Meta, and indicated that they will deploy a large number of AMD Instinct MI300X in the future.

The Instinct MI300X accelerator appeared first at the “Impaired AI” press conference. Since the parameters of the MI300 series AI chips were announced at the product promotion campaign half a year ago, this “Incurable AI” press conference focused more on the performance of the entire system in actual applications and the comprehensive performance comparison with the Nvidia H100 AI chip, which is the most popular in the field of AI training/reasoning. Furthermore, AMD has abruptly revised its global AI chip market size forecast up to 2027 from 150 billion US dollars to 400 billion US dollars.

Global leaders in the AI field OpenAI, Microsoft (MSFT.US), and Meta (META.US) said at an AMD event on Wednesday that they will use AMD's latest AI chip, the Instinct MI300X. This is the clearest sign that global tech companies are looking for alternatives to the expensive Nvidia H100 AI chip so far. The Nvidia H100 is essential for the creation and deployment of generative artificial intelligence (generative AI) applications such as ChatGPT under OpenAI. Today, the Nvidia H100, which has almost a monopoly in the AI chip field, finally has a strong competitor, and that is the AMD Instinct MI300X.

If AMD's latest high-end AI chips start shipping early next year, they will be sufficient to meet the computing power needs of technology companies and cloud service providers that build large models of artificial intelligence, while at the same time reducing the cost for technology companies to develop artificial intelligence models, it will inevitably put tremendous competitive pressure on Nvidia's AI chips, whose sales continue to soar.

AMD CEO Lisa Su (Lisa Su) said on Wednesday: “Basically, all interest from potential big customers is focused on large processors and large GPUs in the cloud computing sector.”

According to AMD, the MI300X is based on a brand new architecture, which usually brings significant performance improvements. The most prominent feature of AMD's new AI chip is that it has 192 GB of cutting-edge high-performance HBM3 memory, which transmits data faster and can adapt to larger artificial intelligence models.

Su Zifeng, affectionately known as “Su Ma” by AMD fans, directly compared the MI300X and the system it built with Nvidia's main AI chip H100 at the press conference. “This performance directly translates into a better user experience,” Su Zifeng said. “When you make a request to the model, you want it to respond more quickly, especially if the response becomes more complex.”

The main question facing AMD is whether the company that has always been based on Nvidia software and hardware will invest time and money to trust another AI chip supplier. AMD told investors and partners on Wednesday that the company has improved the supporting software suite called ROCm to compete with Nvidia's industry-standard CUDA software and to some extent solved one of the key flaws AMD faces in the field of AI chips — that is, the hardware and software ecosystem, and this flaw has always been one of the main reasons why AI developers currently prefer Nvidia's products.

AMD did not disclose the price of the MI300X on Wednesday, but each Nvidia AI chip sells for up to 40,000 US dollars. Su Zifeng said that AMD's chips must have lower purchasing and operating costs than Nvidia's chips in order to convince potential major customers to buy them.

Although the content of the press conference is generally in line with market expectations, it does not seem to have boosted AMD's stock price. Against the backdrop of the collective weakening of US technology stocks on Wednesday, before the AMD press conference was over, the stock price changed from rising to falling. In the end, it closed down more than 1%, but the after-hours increase was more than 1%.

Since AMD is about to launch a high-performance AI chip to compete with Nvidia's H100, Wall Street analysts are generally bullish on AMD's stock price. According to Wall Street analysts' consensus ratings and target prices compiled by Seeking Alpha, Wall Street analysts' consensus rating for AMD is “buy,” and the average target price is expected to reach 132.24 US dollars, which means a potential increase of up to 13% over the next 12 months. The highest target price is $200.

Better performance than Nvidia H100! AMD drastically raised market size expectations

The latest performance comparison data with the Nvidia H100 AI chip, which is the most popular in the field of AI training/inference, shows that in the general LLM core TFLOP, the MI300X provides a performance increase of up to 20% compared to the Nvidia H100 in the FlashAttention-2 and LLAMA 270B. From a platform perspective, comparing the 8x Mi300x solution with the 8x Nvidia H100 solution, AMD found that the LLAMA 270B had a much greater gain, up to 40%, while the gain under the Bloom 176B benchmark was as high as 60%.

The specific performance comparison data of AMD Instinct MI300X and Nvidia H100 shows:

In the 1v1 comparison, the overall performance is 20% higher than the H100 (Llama 2 70B)

Overall performance is 20% higher than H100 (FlashAttention 2) in the 1v1 comparison

Overall performance in 8v8 servers is 40% higher than H100 (Llama 2 70B)

In an 8v8 server, overall performance is 60% higher than H100 (Bloom 176B)

The driving force behind the latest MI300 AI chip is AMD ROCm 6.0. The software stack has been updated to the latest version with powerful new features, including support for a variety of AI workload workloads, such as generative artificial intelligence and large language models (LLM).

Memory is another huge upgrade area for AMD. The HBM3 capacity of the MI300X has increased 50% compared to its predecessor, the MI250X (128 GB). To achieve a memory pool of up to 192GB, AMD equipped the MI300X with 8 HBM3 stacks, each 12-Hi, and integrated 16Gb ICs. Each IC has 2GB capacity, or each stack has a capacity of up to 24GB.

This level of memory scale will provide up to 5.3 Tb/s of bandwidth and 896 Gb/s of Infinity Fabric bandwidth. In contrast, Nvidia's upcoming H200 AI chip offers 141GB of capacity, while Intel Gaudi3 will provide 141GB of capacity. In terms of power consumption, the rated power of the AMD Instinct MI300X is 750W, which is 50% higher than the Instinct MI250X's 500W, and 50W more than the NVIDIA H200.

On Wednesday, AMD gave bold predictions on the future market size of the AI chip field, believing that the AI chip market will expand rapidly. Specifically, AMD expects the overall size of the AI chip market to reach more than 400 billion US dollars by 2027, an increase of nearly two times from the 150 billion US dollars provided by the company a few months ago, highlighting the rapid changes in expectations of the world's major companies for artificial intelligence hardware, and various companies are rapidly deploying new AI products.

Which tech giants will use the MI300X?

At a press conference on Wednesday, AMD said that the company has signed agreements with some of the technology companies that need GPUs the most to use the chip. Meta and Microsoft were the biggest buyers of Nvidia's H100 AI chip in 2023, according to a recent report by research firm Omidia.

At the press conference, Facebook and Instagram parent company Meta publicly stated that in the future, the company will make extensive use of the MI300X GPU to process artificial intelligence inference workloads, such as processing artificial intelligence stickers, image editing, and operating its AI assistant, and said that it will combine the ROCm software stack to support AI inference workloads.

Microsoft Chief Technology Officer Kevin Scott publicly stated that Microsoft will provide technical access to the Mi300x chip through its Azure network service. Furthermore, news on the same day showed that Microsoft will evaluate demand for AMD's AI chip products in the future and evaluate the feasibility of adopting this new product.

ChatGPT developer OpenAI said the company will support GPUs such as the AMD MI300 in an important software product called Triton. Triton is not a large language model like GPT-4, but it is a very important product for the field of artificial intelligence research.

Oracle, one of Nvidia's largest customers, said it will use the Instinct MI300X accelerator in its cloud computing service system and plans to develop generative AI services based on the AMD Instinct MI300X.

At present, AMD has not predicted the long-term sales volume of this AI chip. Currently, it has only given an forecast for 2024. It is estimated that the total revenue generated by data center GPUs in 2024 will be about 2 billion US dollars. In the most recent quarter alone, Nvidia's data center business revenue exceeded $14 billion, but this figure also includes businesses other than GPUs. However, AMD said that in the next four years, the total market size of the AI chip field may rise to 400 billion US dollars, double the company's previous forecast.

Su Zifeng also told reporters that AMD does not think it needs to beat Nvidia to achieve good results in the market. When talking about the AI chip market, Su Zifeng told reporters: “I think it is obvious that Nvidia now occupies the vast majority.” “We think the AI chip market may exceed $400 billion by 2027, and we will play an important role.”

Disclaimer: This content is for informational and educational purposes only and does not constitute a recommendation or endorsement of any specific investment or investment strategy. Read more

AMD(AMD.US)与英伟达战火蔓延至AI领域! Meta与甲骨文官宣斥巨资购AMD AI芯片

The battle between AMD (AMD.US) and Nvidia has spread to the field of AI! Meta and Oracle Civil Servants Decry Huge Investments to Buy AMD AI Chips

Risk Disclaimer

Statement