Script for testing PyTorch support with AMD GPUs using ROCM

[https://gist.github.com/damico/484f7b0a148a0c5f707054cf9c0a0533] - 2024-03-07 17:01:10 - public:mzimmerm

amd, gpu, pytorch, rocm, test - 5 | id:1489851 -

Test for PyTorch and ROCm after installing ROCm.

The mid-range graphics card market heats up as AMD and Nvidia get aggressive | PC Gamer

[https://www.pcgamer.com/hardware/graphics-cards/the-mid-range-graphics-card-market-heats-up-as-amd-and-nvidia-get-aggressive/] - 2024-03-07 12:10:13 - public:mzimmerm

buy, gpu, price - 3 | id:1489850 -

Follow falling prices

Free Artificial Intelligence (AI) Courses Online with Certificates [2024]

[https://www.mygreatlearning.com/ai/free-courses] - 2024-03-07 11:26:50 - public:mzimmerm

ai, course, free, online - 4 | id:1489849 -

Our free, hands-on data science courses - milan.zimmermann@gmail.com - Gmail

[https://mail.google.com/mail/u/0/#inbox/FMfcgzGxRxLVHXWdZFksqSXGwSgFzwhQ] - 2024-03-07 11:22:48 - public:mzimmerm

ai, course, free, kaggle - 4 | id:1489848 -

SageMaker Studio Lab

[https://studiolab.sagemaker.aws/users/mzimmerm] - 2024-03-07 11:18:42 - public:mzimmerm

account, ai, lab, online, sagemarker, studio - 6 | id:1489847 -

My account on SageMaker studio. The give out 4 hours of GPU a day!

Increase VRAM on Ryzen APU / Kernel & Hardware / Arch Linux Forums

[https://bbs.archlinux.org/viewtopic.php?id=283308] - 2024-03-06 20:06:16 - public:mzimmerm

amd, bios, gpu, laptop, memory - 5 | id:1489839 -

A program that changes VRAM / UMA size in BIOS on AMD APU, even if VRAM / UMA does not show in BIOS

bash - How can I fetch VRAM and GPU cache size in Linux? - Stack Overflow

[https://stackoverflow.com/questions/77708142/how-can-i-fetch-vram-and-gpu-cache-size-in-linux] - 2024-03-06 19:42:36 - public:mzimmerm

amd, apu, command, linux, memory - 5 | id:1489838 -

If you have AMD GPU as I do then you can grab PCI ID for the device with lspci command executed with -D flag (shows PCI doamin) and read the following file cat /sys/bus/pci/devices/${pci_slot}/mem_info_vram_total, it contains GPU VRAM size in bytes.

AMD Unveils Ryzen 8000G Series Processors: Zen 4 APUs For Desktop with Ryzen AI

[https://www.anandtech.com/show/21208/amd-unveils-ryzen-8000g-series-processors-zen-4-apus-for-desktop-with-ryzen-ai] - 2024-03-06 18:06:10 - public:mzimmerm

ai, amd, cpu - 3 | id:1489837 -

8000G is the APU series for AI

ROCm installation for Linux — ROCm installation (Linux)

[https://rocm.docs.amd.com/projects/install-on-linux/en/develop/index.html] - 2024-03-05 22:05:16 - public:mzimmerm

amd, apu, doc, good, guide, install, rocm - 7 | id:1489834 -

Top of the guide describing ROCm on Linux. There are 2 core approaches: Using RPM (Package manager), or using AMD installer. I should use Package manager. Also single-version vs. multi-version. I should use single-version, latest.

AMD Extends Support for PyTorch Machine Learning Development on Select RDNA™ 3 GPUs with ROCm™ 5.7 | PyTorch

[https://pytorch.org/blog/amd-extends-support-for-pt-ml/] - 2024-03-05 21:53:10 - public:mzimmerm

amd, apu, install, pytorch, rocm - 5 | id:1489832 -

The links in “How to guide“ provide instructions that are hopeful. Maybe start with those instructions!

how to install rocm on opensuse/suse | Civitai

[https://civitai.com/articles/2296/how-to-install-rocm-on-opensusesuse] - 2024-03-05 21:38:23 - public:mzimmerm

amd, gpu, rocm - 3 | id:1489831 -

Another rocm installation claim on Opensuse. Interesting note: I realize this is a bit old, but you don't really need amdgpu from the repository: it comes for free with the kernel. amdgpu-dkms is only needed if you're stuck on an older kernel version and you can't upgrade for some reason. For example, Tumbleweed users will not need it..

Installing ROCM in TW and Leap - English / Hardware - openSUSE Forums

[https://forums.opensuse.org/t/installing-rocm-in-tw-and-leap/151618/14] - 2024-03-05 21:30:54 - public:mzimmerm

amd, apu, cpu, gpu, install, opensuse, rocm, tumbleweed - 8 | id:1489830 -

This guy seems to claim ROCM can run on Tumbleweed using Distrobox. But what is distrobox?

Doesn't ROCm support AMD's integrated GPU (APU)? · Issue #2216 · ROCm/ROCm

[https://github.com/ROCm/ROCm/issues/2216] - 2024-03-05 21:25:48 - public:mzimmerm

amd, apu, best, gfx902, good, gpu, install, pytorch, rocm - 9 | id:1489829 -

This guy claims successful installation of ROCm on Ubuntu - this seems to be workable for Tumbleweed as well. See the comment “nav9 commented on Jul 16, 2023“

How To Use Virtual Environment inside Jupyter lab

[https://www.linkedin.com/pulse/how-use-virtual-environment-inside-jupyter-lab-sina-khoshgoftar] - 2024-03-05 19:25:42 - public:mzimmerm

config, jupiter, python, venv - 4 | id:1489828 -

Describes how to force Jupyter lab to use a venv for it's kernels!!

Kaggle’s New 29GB RAM GPUs: The Power You Need, Absolutely Free! | by Fareed Khan | Medium

[https://medium.com/@fareedkhandev/kaggles-new-29gb-ram-gpus-the-power-you-need-absolutely-free-b458c3c501ba] - 2024-03-05 16:47:27 - public:mzimmerm

company, cpu, kaggle - 3 | id:1489826 -

Describes the GPU Kaggle is giving 30h a month on.

Solving Transformer by Hand: A Step-by-Step Math Example | by Fareed Khan | Level Up Coding

[https://levelup.gitconnected.com/understanding-transformers-from-start-to-end-a-step-by-step-math-example-16d4e64e6eb1] - 2024-03-05 16:46:01 - public:mzimmerm

ai, example, math, principle, todo, transformer - 6 | id:1489825 -

Doing what a transformer is doing, by hand

Kaggle: Your Home for Data Science

[https://www.kaggle.com/] - 2024-03-05 16:41:23 - public:mzimmerm

ai, company, kaggle, notebook - 4 | id:1489824 -

Kaggle is like huggingface. They can run notebooks, and give GPU power to notebooks

Statistical Foundations of Machine Learning | Kaggle

[https://www.kaggle.com/code/alexandrelemercier/statistical-foundations-of-machine-learning] - 2024-03-05 16:37:34 - public:mzimmerm

ai, course, kaggle, learn, machine, statistics, todo - 7 | id:1489823 -

Mini course of statistical foundations of ML

stabilityai (Stability AI)

[https://huggingface.co/stabilityai?utm_source=button&utm_medium=email&utm_campaign=welcome_non-commercial] - 2024-03-05 16:31:58 - public:mzimmerm

account, ai, company, huggingface, stabilityai - 5 | id:1489822 -

My account on Stability AI - it is just a link to huggingface

Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4

[https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard] - 2024-03-05 15:50:45 - public:mzimmerm

ai, compare, huggingface, llm, model - 5 | id:1489821 -

Comparison of efficiency of all LLM models on hugging face

6 Ways to Run LLMs Locally (also how to use HuggingFace)

[https://semaphoreci.com/blog/local-llm] - 2024-03-05 13:45:35 - public:mzimmerm

ai, good, huggingface, llm, local - 5 | id:1489820 -

Various methods to run LLM models locally hugging face is only one of them.

Install packages in a virtual environment using pip and venv - Python Packaging User Guide

[https://packaging.python.org/en/latest/guides/installing-using-pip-and-virtual-environments/] - 2024-03-05 03:03:00 - public:mzimmerm

environment, python, venv, virtual - 4 | id:1489816 -

Training Bert on Yelp - Copy of training.ipynb - Colaboratory

[https://colab.research.google.com/drive/1FhwrZ05umMvj4cshnEMUOLxjD9ynvCy9#scrollTo=nCFiAJ55LcLt] - 2024-03-04 23:57:07 - public:mzimmerm

ai, bert, huggingface, model, notebook, progress, yelp - 7 | id:1489813 -

(203) Asus adds 3GB vram option for APUs (Ryzen 5 2400G, Ryzen 3 2200G ect) - YouTube

[https://www.youtube.com/watch?v=0fKXfC35hIA] - 2024-03-04 20:49:43 - public:mzimmerm

amd, asus, bios, memory, motherboard - 5 | id:1489812 -

(1) Interesting cheap GPU option: Instinct Mi50 : LocalLLaMA

[https://www.reddit.com/r/LocalLLaMA/comments/1b5ie1t/interesting_cheap_gpu_option_instinct_mi50/] - 2024-03-04 20:02:23 - public:mzimmerm

ai, amd, card, video - 4 | id:1489811 -

AMD seems to sell these accelerators, which are like video cards.

Ditching CUDA for AMD ROCm for more accessible LLM training and inference. | by Rafael Manzano Masapanta | Medium

[https://medium.com/@rafaelmanzanom/ditching-cuda-for-amd-rocm-for-more-accessible-llm-inference-ryzen-apus-edition-92c3649f8f7d] - 2024-03-04 19:55:11 - public:mzimmerm

ai, amd, apu, compile, gfx902, install, pytorch, rocm - 8 | id:1489810 -

Train LLM on AMD APU. In this scenario, we’ll use an APU because most laptops with a Ryzen CPU include an iGPU; specifically, this post should work with iGPUs based on the “GCN 5.0” architecture, or “Vega” for friends. We’ll use an AMD Ryzen 2200G in this post, an entry-level processor equipped with 4C/4T and an integrated GPU.

(203) ASUS PRIME X570-PRO BIOS Overview - YouTube

[https://www.youtube.com/watch?v=7ZbPjV_OddI] - 2024-03-04 19:33:02 - public:mzimmerm

asus, bios, motherboard, video - 4 | id:1489809 -

Configuring UMA Frame Buffer Size on Desktop Systems with Integrated Graphics | AMD

[https://www.amd.com/en/support/kb/faq/pa-280] - 2024-03-04 17:40:09 - public:mzimmerm

apu, card, cpu, memory, motherboard, video - 6 | id:1489806 -

UMA buffer size is the size of memory used by APU. It is set on the motherboard, often limited to 2GB. But LLM AI could use 16GB or more.

ASUS Account

[https://account.asus.com/overview.aspx] - 2024-03-04 17:18:27 - public:mzimmerm

account, asus, motherboard - 3 | id:1489805 -

My Account for Motherboerd Asus PRIME X570-P, registered here.

(1) Most cost effective GPU for local LLMs? : LocalLLaMA

[https://www.reddit.com/r/LocalLLaMA/comments/12vxxze/most_cost_effective_gpu_for_local_llms/] - 2024-03-04 16:49:23 - public:mzimmerm

ai, doc, llm, model, optimize, perform - 6 | id:1489804 -

GGML quantized models. They would let you leverage CPU and system RAM, instead of having to rely on a GPU’s. This could save you a fortune, especially if go for some used AMD Epyc platforms. This could be more viable for the larger models, especially the 30B/65B parameters models which would still press or exceed the VRAM on the P40.

Optimizing LLMs for Speed and Memory

[https://huggingface.co/docs/transformers/v4.35.2/en/llm_tutorial_optimization] - 2024-03-04 16:46:21 - public:mzimmerm

ai, doc, huggingface, llm, model, optimize, perform - 7 | id:1489803 -

7 Steps to Mastering Large Language Models (LLMs) - KDnuggets

[https://www.kdnuggets.com/7-steps-to-mastering-large-language-models-llms] - 2024-03-04 11:35:57 - public:mzimmerm

ai, doc, highlevel, llm, train - 5 | id:1489799 -

A Step-by-Step Guide to Training Your Own Large Language Models (LLMs). | by Sanjay Singh | GoPenAI

[https://blog.gopenai.com/a-step-by-step-guide-to-training-your-own-llm-2d81ff810695] - 2024-03-04 11:34:25 - public:mzimmerm

ai, doc, highlevel, llm, train - 5 | id:1489798 -

GenAI Stack Exchange

[https://genai.stackexchange.com/] - 2024-03-04 11:32:40 - public:mzimmerm

account, ai, doc, forum, stack, stackexchange - 6 | id:1489797 -

7 steps to master large language models (LLMs) | Data Science Dojo

[https://datasciencedojo.com/blog/master-large-language-models/#] - 2024-03-04 11:25:57 - public:mzimmerm

ai, doc, highlevel, llm, model, train - 6 | id:1489796 -

Home - Replit

[https://replit.com/~] - 2024-03-04 11:22:14 - public:mzimmerm

account, code, good, online, repl - 5 | id:1489795 -

Replit is a site where I can run any REPL online. Can be used for AI

LLM for a new language : MachineLearning

[https://www.reddit.com/r/MachineLearning/comments/12xu5ls/p_llm_for_a_new_language/] - 2024-03-04 11:15:48 - public:mzimmerm

ai, highlevel, llm, model, train - 5 | id:1489794 -

High level how to train a model

Up to date List of LLM Models

[https://docs.google.com/spreadsheets/d/1kT4or6b0Fedd-W_jMwYpb63e1ZR3aePczz3zlbJW-Y4/edit#gid=741531996] - 2024-03-04 11:13:58 - public:mzimmerm

ai, doc, list, llm, model - 5 | id:1489793 -

OSCAR dataset

[https://oscar-project.org/] - 2024-03-04 10:46:43 - public:mzimmerm

ai, dataset, opensource - 3 | id:1489792 -

The OSCAR project (Open Super-large Crawled Aggregated coRpus) is an Open Source project aiming to provide web-based multilingual resources and datasets for Machine Learning (ML) and Artificial Intelligence (AI) applications.

Newspeak-test-dataset

[https://www.kaggle.com/datasets/mzimmerm/newspeak-test-dataset?select=NewspeakGrammar.ns] - 2024-03-04 03:54:57 - public:mzimmerm

ai, dataset, kaggle, newspeak - 4 | id:1489791 -

Dataset is just a zip of files

Introduction to Constructing Your Dataset | Machine Learning | Google for Developers

[https://developers.google.com/machine-learning/data-prep/construct/construct-intro] - 2024-03-04 03:38:48 - public:mzimmerm

ai, dataset, train - 3 | id:1489790 -

(2) Are there any tiny (1-3b) models finetuned for coding available in GGUF format? : LocalLLaMA

[https://www.reddit.com/r/LocalLLaMA/comments/16csdq6/are_there_any_tiny_13b_models_finetuned_for/] - 2024-03-04 02:56:19 - public:mzimmerm

ai, code, generate, llm, model, newspeak, small - 7 | id:1489789 -

bigcode (BigCode)

[https://huggingface.co/bigcode] - 2024-03-04 02:50:02 - public:mzimmerm

ai, code, generate, huggingface, llm, model, newspeak, santacoder, small, starcoder - 10 | id:1489788 -

Research community developing various code models, small and big. Models may not be instruct

WizardLM (WizardLM)

[https://huggingface.co/WizardLM] - 2024-03-04 02:42:44 - public:mzimmerm

ai, code, generate, huggingface, llm, model, newspeak, small, wizardcoder - 9 | id:1489787 -

Another open source small (1B) model.

deepseek-ai (DeepSeek)

[https://huggingface.co/deepseek-ai] - 2024-03-04 02:24:32 - public:mzimmerm

ai, best, code, deepseek, good, huggingface, instruct, llm, model, newspeak, small - 11 | id:1489786 -

They have the 1.3B version!!! This may be the best to start with Newspeak. Should work train even on huggingcface

DeepSeek

[https://chat.deepseek.com/] - 2024-03-04 02:17:09 - public:mzimmerm

aichat, aicode, aigen, deepseek - 4 | id:1489785 -

Chat uses model geared for coding.

deepseek-ai/deepseek-coder-6.7b-instruct · Hugging Face

[https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct] - 2024-03-04 02:13:20 - public:mzimmerm

ai, code, generate, good, llm, model, newspeak, opensource - 8 | id:1489783 -

Another possible model. For coding capabilities, Deepseek Coder achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks.

LLaMA 7B GPU Memory Requirement - Transformers - Hugging Face Forums

[https://discuss.huggingface.co/t/llama-7b-gpu-memory-requirement/34323/6] - 2024-03-04 02:10:38 - public:mzimmerm

ai, code, generate, llama, llm, model, newspeak, train - 8 | id:1489782 -

With the optimizers of bitsandbytes (like 8 bit AdamW), you would need 2 bytes per parameter, or 14 GB of GPU memory.

stabilityai/stable-code-3b · Hugging Face

[https://huggingface.co/stabilityai/stable-code-3b] - 2024-03-04 02:05:36 - public:mzimmerm

ai, code, generate, llm, model, newspeak - 6 | id:1489781 -

Another potential model to use for Newspeak, but it is NOT open source. Adventage: 2.5B params, so should be usable in small GPUs

Large Language Models for Domain-Specific Language Generation: How to Train Your Dragon | by Andreas Mülder | Medium

[https://medium.com/@andreasmuelder/large-language-models-for-domain-specific-language-generation-how-to-train-your-dragon-0b5360e8ed76] - 2024-03-04 01:45:59 - public:mzimmerm

ai, article, code, doc, generate, llm, train - 7 | id:1489780 -

training a model like Llama with 2.7 billion parameters outperformed a larger model like Vicuna with 13 billion parameters. Especially when considering resource consumption, this might be a good alternative to using a 7B Foundation model instead of a full-blown ChatGPT. The best price-to-performance base model for our use case turned out to be Mistral 7b. The model is compact enough to fit into an affordable GPU with 24GB VRAM and outperforms the other models with 7B parameters.

Viewing mzimmerm's Bookmarks