AMD Releases AMD-135M: An Open-Source Small Language Model
AMD today announced "AMD-135M", the company's first publicly released small language model (SLM). AMD-135M is fully open-source: the training code, dataset, and model weights are all available to help in the development of other SLMs and LLMs.
AMD-135M supports speculative decoding and was trained from scratch on AMD Instinct MI250 accelerators with 670 billion tokens; training across four MI250 nodes took six days. There is also an AMD-Llama-135M-code variant trained on an additional 20 billion tokens of code data. AMD-135M is based on the LLaMA2 model architecture.
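Because AMD-135M follows the LLaMA2 architecture and tokenizer, a model this size can act as a draft model for speculative decoding against a larger LLaMA2-family model: the small model proposes several tokens cheaply and the large model verifies them in parallel. Below is a minimal sketch using the HuggingFace transformers assisted-generation API; the repo IDs (amd/AMD-Llama-135m as the draft, meta-llama/Llama-2-7b-hf as the target) and this particular pairing are illustrative assumptions, not details confirmed by the announcement.

```python
# Sketch: speculative (assisted) decoding with a small draft model.
# Repo IDs below are assumptions for illustration; check AMD's
# HuggingFace pages for the actual published model names.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

draft_id = "amd/AMD-Llama-135m"         # assumed small draft model
target_id = "meta-llama/Llama-2-7b-hf"  # example larger target model

tokenizer = AutoTokenizer.from_pretrained(target_id)
target = AutoModelForCausalLM.from_pretrained(
    target_id, torch_dtype=torch.float16, device_map="auto"
)
draft = AutoModelForCausalLM.from_pretrained(
    draft_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "Explain speculative decoding in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(target.device)

# assistant_model enables assisted generation: the draft model proposes
# candidate tokens and the target model accepts or rejects them, which
# can speed up decoding without changing the target model's output
# distribution.
outputs = target.generate(**inputs, assistant_model=draft, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Note that assisted generation requires the draft and target models to share a tokenizer vocabulary, which is why a LLaMA2-based draft model pairs naturally with a LLaMA2-family target.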
AMD is making all of the AMD-135M model assets open-source in hopes of aiding broader AI development -- and, for AMD's part, in hopes that the resulting training and inferencing happens on AMD hardware.
More details on the AMD-135M SLM via the AMD blog. AMD-135M is available via HuggingFace and GitHub.