DICTA - Hebrew LLM model

[https://dicta.org.il/dicta-lm-3] - 2025-12-06 06:31:29 - public:xxx

ai, artificial, download, hebrew, intelligence, llm, model - 7 | id:1536585 -

מודל שפה בעברית

Why your AI prompt is a behavioral anchor: 3 powerful resets

[https://www.thehuntingdynasty.com/2025/11/why-your-ai-prompt-is-a-behavioral-anchor-3-powerful-resets/] - 2025-11-24 05:45:48 - public:OPAYNE

AI, Ai prompts, anchor, behavioral science, behavioural science, generative AI, LLM, prompt architect - 8 | id:1536504 -

Your AI prompt is a behavioral anchor, and your prompt is the only guardrail: Here's 3 powerful resets to get brilliant answers. not just safe answers.

A small number of samples can poison LLMs of any size \ Anthropic

[https://www.anthropic.com/research/small-samples-poison] - 2025-11-17 19:07:39 - public:mzimmerm

ai, attack, good, hack, llm, poison, train - 7 | id:1536406 -

Cover article to the Poison article

Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples

[https://arxiv.org/html/2510.07192v1] - 2025-11-17 18:47:39 - public:mzimmerm

ai, attack, good, hack, llm, poison - 6 | id:1536405 -

karpathy/nanochat: The best ChatGPT that $100 can buy.

[https://github.com/karpathy/nanochat] - 2025-10-17 13:41:13 - public:xxx

ai, artificial, chat, chatgpt, intelligence, lib, library, llm, local, nano, nanochat - 11 | id:1523789 -

LLMs are in trouble - YouTube

[https://www.youtube.com/watch?v=o2s8I6yBrxE] - 2025-10-15 19:03:21 - public:mzimmerm

halucinate, llm, poison, train - 4 | id:1523780 -

A super small amount of data (350 documents with SUDO) poison a LLM model of any size

Michael-A-Kuykendall/shimmy: ⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

[https://github.com/Michael-A-Kuykendall/shimmy] - 2025-10-03 06:06:49 - public:xxx

ai, download, free, llm, model, ollama - 6 | id:1523096 -

Announcing Genkit Go 1.0 and Enhanced AI-Assisted Development - Google Developers Blog

[https://developers.googleblog.com/en/announcing-genkit-go-10-and-enhanced-ai-assisted-development/] - 2025-09-10 10:39:58 - public:xxx

ai, artificial, genai, go, golang, google, intelligence, lib, library, llm, modules - 11 | id:1522792 -

Bring your own brain? Why local LLMs are taking off • The Register

[https://www.theregister.com/2025/08/31/local_llm_opinion_column/] - 2025-08-31 13:02:55 - public:mzimmerm

ai, good, llm, local, todo - 5 | id:1521591 -

Open models by OpenAI | OpenAI

[https://openai.com/open-models/] - 2025-08-08 07:54:01 - public:xxx

ai, downloads, free, llm, models, openai - 6 | id:1521423 -

My 2.5 year old laptop can write Space Invaders in JavaScript now, using GLM-4.5 Air and MLX

[https://simonwillison.net/2025/Jul/29/space-invaders/] - 2025-08-01 06:38:00 - public:xxx

ai, artificial, intelligence, laptop, llm, local, model - 7 | id:1521380 -

Entering AI Autumn: Why LLMs Are Nearing Their Limit - The New Stack

[https://thenewstack.io/entering-ai-autumn-why-llms-are-nearing-their-limit/] - 2025-06-16 12:06:50 - public:mzimmerm

ai, llm, todo - 3 | id:1521027 -

Train Your Own LLM – Tutorial - YouTube

[https://www.youtube.com/watch?v=9Ge0sMm65jo] - 2025-04-19 05:59:04 - public:xxx

ai, artificial, intelligence, llm, train, tutorial, video - 7 | id:1518158 -

The 2025 AI Engineering Reading List - Latent Space

[https://www.latent.space/p/2025-papers] - 2025-01-21 13:13:26 - public:xxx

ai, artificial, intelligence, learn, llm, paper, read, to_read - 8 | id:1514474 -

Things we learned about LLMs in 2024

[https://simonwillison.net/2024/Dec/31/llms-in-2024/] - 2025-01-03 08:03:47 - public:xxx

2024, ai, artificial, chatgpt, gpt, intelligence, learn, llm - 8 | id:1513163 -

llama.cpp guide - Running LLMs locally, on any hardware, from scratch ::

[https://steelph0enix.github.io/posts/llama-cpp-guide/] - 2024-12-06 12:32:59 - public:xxx

cpp, guide, llama, LLM, local, scratch, tutorial - 7 | id:1512549 -

mlabonne/llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

[https://github.com/mlabonne/llm-course] - 2024-09-02 17:09:48 - public:isaac

ai, llm, python, tutorial - 4 | id:1509353 -

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. - mlabonne/llm-course

AIML API

[https://aimlapi.com/] - 2024-08-26 08:30:12 - public:NIkolayMO

AI API, AI models, LLM, text to music, text to speech, text to video - 6 | id:1492806 -

Supercharge Your Dev Stack: 200+ AI Models, One API What is AI/ML API? AI/ML API is a game-changing platform for developers and SaaS entrepreneurs looking to integrate cutting-edge AI capabilities into their products. It offers a single point of access to over 200 state-of-the-art AI models, covering everything from NLP to computer vision. Key Features for Developers: Extensive Model Library: 200+ pre-trained models for rapid prototyping and deployment Customization Options: Fine-tune models to fit your specific use case Developer-Friendly Integration: RESTful APIs and SDKs for seamless incorporation into your stack Serverless Architecture: Focus on coding, not infrastructure management Advantages for SaaS Entrepreneurs: Rapid Time-to-Market: Leverage advanced AI without building from scratch Scalability: From MVP to enterprise-grade solutions, AI/ML API grows with your business Cost-Efficiency: Pay-as-you-go pricing model reduces upfront investment Competitive Edge: Stay ahead with continuously updated AI models

lyogavin/godmodeanimation: 2D Game Animation in God Mode

[https://github.com/lyogavin/godmodeanimation] - 2024-08-23 12:57:40 - public:xxx

2d, ai, animation, generator, images, LLM, model, sprite - 8 | id:1492797 -

How Language Models Work

[https://every.to/chain-of-thought/how-language-models-work] - 2024-07-13 09:59:58 - public:xxx

ai, articles, artificial, how, intelligence, learn, LLM, works - 8 | id:1492534 -

nomic-ai/gpt4all: GPT4All: Chat with Local LLMs on Any Device

[https://github.com/nomic-ai/gpt4all] - 2024-07-12 05:38:03 - public:xxx

ai, chat, chatgpt, gpt, LLM, local, tools - 7 | id:1492530 -

How LLMs Work, Explained Without Math - miguelgrinberg.com

[https://blog.miguelgrinberg.com/post/how-llms-work-explained-without-math] - 2024-05-13 11:48:29 - public:xxx

ai, algorithms, artificial, code, explained, intelligence, learn, llama, llm - 9 | id:1492049 -

abi/secret-llama: Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.

[https://github.com/abi/secret-llama] - 2024-05-12 12:53:14 - public:xxx

ai, artificial, bot, browser, chat, intelligence, llama, llm, model - 9 | id:1492037 -

Introducing DBRX: A New State-of-the-Art Open LLM | Databricks

[https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm] - 2024-03-29 08:32:34 - public:xxx

ai, artificial, intelligence, llm, opensource - 5 | id:1490813 -

The Best GPUs for Deep Learning in 2023 — An In-depth Analysis

[https://timdettmers.com/2023/01/30/which-gpu-for-deep-learning/] - 2024-03-21 19:50:52 - public:mzimmerm

ai, good, gpu, learn, llm, todo, train - 7 | id:1490076 -

LM Studio - Discover, download, and run local LLMs

[https://lmstudio.ai/] - 2024-03-11 22:12:00 - public:mzimmerm

amd, home, llm, lmstudio, studio - 5 | id:1489897 -

Run an LLM Locally with LM Studio - KDnuggets

[https://www.kdnuggets.com/run-an-llm-locally-with-lm-studio] - 2024-03-11 19:29:38 - public:mzimmerm

ai, doc, llm, lmstudio - 4 | id:1489896 -

Document about LM Studio

Optimum

[https://huggingface.co/docs/optimum/index] - 2024-03-11 12:44:39 - public:mzimmerm

ai, doc, huggingface, llm, model, optimum, repo, small, transformer - 9 | id:1489894 -

Optimum is an extension of Transformers that provides a set of performance optimization tools to train and run models on targeted hardware with maximum efficiency. It is also the repository of small, mini, tiny models.

google-research/bert: TensorFlow code and pre-trained models for BERT

[https://github.com/google-research/bert/] - 2024-03-10 21:44:09 - public:mzimmerm

ai, bert, github, home, llm, mini, model, tiny, transformer - 9 | id:1489883 -

BERT model home on github

BERT Transformers – How Do They Work? | Exxact Blog

[https://www.exxactcorp.com/blog/Deep-Learning/how-do-bert-transformers-work] - 2024-03-10 21:39:00 - public:mzimmerm

ai, bert, doc, good, llm, parameter, progress, todo, transformer - 9 | id:1489882 -

Excellent document about BERT transformers / models and their parameters: - L=number of layers. - H=size of the hidden layer = number of vectors for each word in the sentence. - A = Number of self-attention heads - Total parameters.

google/bert_uncased_L-4_H-256_A-4 · Hugging Face

[https://huggingface.co/google/bert_uncased_L-4_H-256_A-4] - 2024-03-10 21:19:21 - public:mzimmerm

ai, bert, huggingface, llm, model, parameter, small, todo - 8 | id:1489880 -

Repository of all Bert models, including small. Start using this model for testing.

Generative pre-trained transformer - Wikipedia

[https://en.wikipedia.org/wiki/Generative_pre-trained_transformer] - 2024-03-10 21:14:03 - public:mzimmerm

ai, doc, llm, todo - 4 | id:1489879 -

Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4

[https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard] - 2024-03-05 15:50:45 - public:mzimmerm

ai, compare, huggingface, llm, model - 5 | id:1489821 -

Comparison of efficiency of all LLM models on hugging face

6 Ways to Run LLMs Locally (also how to use HuggingFace)

[https://semaphoreci.com/blog/local-llm] - 2024-03-05 13:45:35 - public:mzimmerm

ai, good, huggingface, llm, local - 5 | id:1489820 -

Various methods to run LLM models locally hugging face is only one of them.

(1) Most cost effective GPU for local LLMs? : LocalLLaMA

[https://www.reddit.com/r/LocalLLaMA/comments/12vxxze/most_cost_effective_gpu_for_local_llms/] - 2024-03-04 16:49:23 - public:mzimmerm

ai, doc, llm, model, optimize, perform - 6 | id:1489804 -

GGML quantized models. They would let you leverage CPU and system RAM, instead of having to rely on a GPU’s. This could save you a fortune, especially if go for some used AMD Epyc platforms. This could be more viable for the larger models, especially the 30B/65B parameters models which would still press or exceed the VRAM on the P40.

Optimizing LLMs for Speed and Memory

[https://huggingface.co/docs/transformers/v4.35.2/en/llm_tutorial_optimization] - 2024-03-04 16:46:21 - public:mzimmerm

ai, doc, huggingface, llm, model, optimize, perform - 7 | id:1489803 -

7 Steps to Mastering Large Language Models (LLMs) - KDnuggets

[https://www.kdnuggets.com/7-steps-to-mastering-large-language-models-llms] - 2024-03-04 11:35:57 - public:mzimmerm

ai, doc, highlevel, llm, train - 5 | id:1489799 -

A Step-by-Step Guide to Training Your Own Large Language Models (LLMs). | by Sanjay Singh | GoPenAI

[https://blog.gopenai.com/a-step-by-step-guide-to-training-your-own-llm-2d81ff810695] - 2024-03-04 11:34:25 - public:mzimmerm

ai, doc, highlevel, llm, train - 5 | id:1489798 -

7 steps to master large language models (LLMs) | Data Science Dojo

[https://datasciencedojo.com/blog/master-large-language-models/#] - 2024-03-04 11:25:57 - public:mzimmerm

ai, doc, highlevel, llm, model, train - 6 | id:1489796 -

LLM for a new language : MachineLearning

[https://www.reddit.com/r/MachineLearning/comments/12xu5ls/p_llm_for_a_new_language/] - 2024-03-04 11:15:48 - public:mzimmerm

ai, highlevel, llm, model, train - 5 | id:1489794 -

High level how to train a model

Up to date List of LLM Models

[https://docs.google.com/spreadsheets/d/1kT4or6b0Fedd-W_jMwYpb63e1ZR3aePczz3zlbJW-Y4/edit#gid=741531996] - 2024-03-04 11:13:58 - public:mzimmerm

ai, doc, list, llm, model - 5 | id:1489793 -

(2) Are there any tiny (1-3b) models finetuned for coding available in GGUF format? : LocalLLaMA

[https://www.reddit.com/r/LocalLLaMA/comments/16csdq6/are_there_any_tiny_13b_models_finetuned_for/] - 2024-03-04 02:56:19 - public:mzimmerm

ai, code, generate, llm, model, newspeak, small - 7 | id:1489789 -

bigcode (BigCode)

[https://huggingface.co/bigcode] - 2024-03-04 02:50:02 - public:mzimmerm

ai, code, generate, huggingface, llm, model, newspeak, santacoder, small, starcoder - 10 | id:1489788 -

Research community developing various code models, small and big. Models may not be instruct

WizardLM (WizardLM)

[https://huggingface.co/WizardLM] - 2024-03-04 02:42:44 - public:mzimmerm

ai, code, generate, huggingface, llm, model, newspeak, small, wizardcoder - 9 | id:1489787 -

Another open source small (1B) model.

deepseek-ai (DeepSeek)

[https://huggingface.co/deepseek-ai] - 2024-03-04 02:24:32 - public:mzimmerm

ai, best, code, deepseek, good, huggingface, instruct, llm, model, newspeak, small - 11 | id:1489786 -

They have the 1.3B version!!! This may be the best to start with Newspeak. Should work train even on huggingcface

deepseek-ai/deepseek-coder-6.7b-instruct · Hugging Face

[https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct] - 2024-03-04 02:13:20 - public:mzimmerm

ai, code, generate, good, llm, model, newspeak, opensource - 8 | id:1489783 -

Another possible model. For coding capabilities, Deepseek Coder achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks.

LLaMA 7B GPU Memory Requirement - Transformers - Hugging Face Forums

[https://discuss.huggingface.co/t/llama-7b-gpu-memory-requirement/34323/6] - 2024-03-04 02:10:38 - public:mzimmerm

ai, code, generate, llama, llm, model, newspeak, train - 8 | id:1489782 -

With the optimizers of bitsandbytes (like 8 bit AdamW), you would need 2 bytes per parameter, or 14 GB of GPU memory.

stabilityai/stable-code-3b · Hugging Face

[https://huggingface.co/stabilityai/stable-code-3b] - 2024-03-04 02:05:36 - public:mzimmerm

ai, code, generate, llm, model, newspeak - 6 | id:1489781 -

Another potential model to use for Newspeak, but it is NOT open source. Adventage: 2.5B params, so should be usable in small GPUs

Large Language Models for Domain-Specific Language Generation: How to Train Your Dragon | by Andreas Mülder | Medium

[https://medium.com/@andreasmuelder/large-language-models-for-domain-specific-language-generation-how-to-train-your-dragon-0b5360e8ed76] - 2024-03-04 01:45:59 - public:mzimmerm

ai, article, code, doc, generate, llm, train - 7 | id:1489780 -

training a model like Llama with 2.7 billion parameters outperformed a larger model like Vicuna with 13 billion parameters. Especially when considering resource consumption, this might be a good alternative to using a 7B Foundation model instead of a full-blown ChatGPT. The best price-to-performance base model for our use case turned out to be Mistral 7b. The model is compact enough to fit into an affordable GPU with 24GB VRAM and outperforms the other models with 7B parameters.

Can Ai Code Results - a Hugging Face Space by mike-ravkine

[https://huggingface.co/spaces/mike-ravkine/can-ai-code-results] - 2024-03-04 01:38:45 - public:mzimmerm

ai, code, generate, huggingface, llm, model, summary - 7 | id:1489779 -

Comparison of LLM models for coding

Search

Results