(2) Are there any tiny (1-3b) models finetuned for coding available in GGUF format? : LocalLLaMA

[https://www.reddit.com/r/LocalLLaMA/comments/16csdq6/are_there_any_tiny_13b_models_finetuned_for/] - 2024-03-04 02:56:19 - public:mzimmerm

ai, code, generate, llm, model, newspeak, small - 7 | id:1489789 -

bigcode (BigCode)

[https://huggingface.co/bigcode] - 2024-03-04 02:50:02 - public:mzimmerm

ai, code, generate, huggingface, llm, model, newspeak, santacoder, small, starcoder - 10 | id:1489788 -

Research community developing various code models, small and big. Models may not be instruct

WizardLM (WizardLM)

[https://huggingface.co/WizardLM] - 2024-03-04 02:42:44 - public:mzimmerm

ai, code, generate, huggingface, llm, model, newspeak, small, wizardcoder - 9 | id:1489787 -

Another open source small (1B) model.

deepseek-ai/deepseek-coder-6.7b-instruct · Hugging Face

[https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct] - 2024-03-04 02:13:20 - public:mzimmerm

ai, code, generate, good, llm, model, newspeak, opensource - 8 | id:1489783 -

Another possible model. For coding capabilities, Deepseek Coder achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks.

LLaMA 7B GPU Memory Requirement - Transformers - Hugging Face Forums

[https://discuss.huggingface.co/t/llama-7b-gpu-memory-requirement/34323/6] - 2024-03-04 02:10:38 - public:mzimmerm

ai, code, generate, llama, llm, model, newspeak, train - 8 | id:1489782 -

With the optimizers of bitsandbytes (like 8 bit AdamW), you would need 2 bytes per parameter, or 14 GB of GPU memory.

stabilityai/stable-code-3b · Hugging Face

[https://huggingface.co/stabilityai/stable-code-3b] - 2024-03-04 02:05:36 - public:mzimmerm

ai, code, generate, llm, model, newspeak - 6 | id:1489781 -

Another potential model to use for Newspeak, but it is NOT open source. Adventage: 2.5B params, so should be usable in small GPUs

Large Language Models for Domain-Specific Language Generation: How to Train Your Dragon | by Andreas Mülder | Medium

[https://medium.com/@andreasmuelder/large-language-models-for-domain-specific-language-generation-how-to-train-your-dragon-0b5360e8ed76] - 2024-03-04 01:45:59 - public:mzimmerm

ai, article, code, doc, generate, llm, train - 7 | id:1489780 -

training a model like Llama with 2.7 billion parameters outperformed a larger model like Vicuna with 13 billion parameters. Especially when considering resource consumption, this might be a good alternative to using a 7B Foundation model instead of a full-blown ChatGPT. The best price-to-performance base model for our use case turned out to be Mistral 7b. The model is compact enough to fit into an affordable GPU with 24GB VRAM and outperforms the other models with 7B parameters.

Can Ai Code Results - a Hugging Face Space by mike-ravkine

[https://huggingface.co/spaces/mike-ravkine/can-ai-code-results] - 2024-03-04 01:38:45 - public:mzimmerm

ai, code, generate, huggingface, llm, model, summary - 7 | id:1489779 -

Comparison of LLM models for coding

openchat/openchat-3.5-0106 · Hugging Face

[https://huggingface.co/openchat/openchat-3.5-0106] - 2024-03-04 00:41:50 - public:mzimmerm

ai, code, generate, huggingface, llm, model, openchat - 7 | id:1489775 -

Open source with lots of information. Uses Multiple undrelying models. Not sure how I would train for it

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

[https://huggingface.co/blog/mixtral] - 2024-03-04 00:24:33 - public:mzimmerm

ai, code, generate, huggingface, llm, mixtral, model, newspeak - 8 | id:1489774 -

The Mixtral model is new, and seems to be good. Click on “Demo“ to test it

StarCoder: A State-of-the-Art LLM for Code

[https://huggingface.co/blog/starcoder] - 2024-03-03 23:43:17 - public:mzimmerm

ai, code, generate, good, huggingface, llm, model, newspeak - 8 | id:1489773 -

Article has comparison with other code-LLM models

huybery/Awesome-Code-LLM: An awesome and curated list of best code-LLM for research.

[https://github.com/huybery/Awesome-Code-LLM] - 2024-03-03 23:33:15 - public:mzimmerm

ai, code, generate, list, llm, model - 6 | id:1489772 -

Large language models and the rise of the AI code generators | InfoWorld

[https://www.infoworld.com/article/3696970/llms-and-the-rise-of-the-ai-code-generators.html] - 2024-03-03 23:14:23 - public:mzimmerm

ai, code, generate, language, model, program, review - 7 | id:1489770 -

Review of LLM specialized for code generation

OpenAI Codex - Wikipedia

[https://en.wikipedia.org/wiki/OpenAI_Codex] - 2024-03-03 20:38:12 - public:mzimmerm

ai, code, codex, generate, language, model, program - 7 | id:1489759 -

Model which generates code for Python, Javascript, Go, Shell, Perl, Swifg, Ruby, PHP

codellama (Code Llama) - Huggingface model for generating programs. Maybe can be used for Newspeak?

[https://huggingface.co/codellama] - 2024-03-03 00:48:06 - public:mzimmerm

ai, code, generate, huggingface, language, llama, model, newspeak, program - 9 | id:1489750 -

AI Code Tools: The Ultimate Guide in 2024

[https://codesubmit.io/blog/ai-code-tools/] - 2024-03-03 00:19:57 - public:mzimmerm

ai, code, generate, good, model, tool - 6 | id:1489745 -

AI Code tools : Good summary. Does not talk about which pre-trained model they use. One is gemini (bard) -> alphacode2

FOAF-a-matic -- Describe yourself in RDF

[http://ldodds.com/foaf/foaf-a-matic.en.html] - 2023-12-15 16:54:40 - public:mzimmerm

foaf, generate, identity, online, webid - 5 | id:1486981 -

GEnerates WebID

BigCode - Playground - a Hugging Face Space by bigcode

[https://huggingface.co/spaces/bigcode/bigcode-playground] - 2023-12-09 16:38:55 - public:mzimmerm

ai, bigcode, code, generate, good, model, newspeak, playground, software, starcoder - 10 | id:1485780 -

Look for models that could be used in Newspeak

Coroutine - Wikipedia

[https://en.wikipedia.org/wiki/Coroutine] - 2023-11-29 15:24:16 - public:mzimmerm

computer, coroutine, generate, good, multi, program, science, software, thread, yield - 10 | id:1485369 -

Coroutine is a routine which can yield. Coroutines are typically scheduled cooperatively (=non-preemptively). Coroutines are similar to threads, although threads are typically scheduled preemptively (scheduler pre-empts=forces execution to pause and yield, even without yield in the language)

yabs.io

Yet Another Bookmarks Service

Search

Results