Building-llama3-from-scratch

TLDR: This repository contains Python code for building the LLaMA 3 language model from scratch. It explains the model's main components: pre-normalization with RMSNorm, the SwiGLU activation function, rotary position embeddings (RoPE), and byte pair encoding (BPE) tokenization. The code shows how to tokenize input data, create an embedding for each token, implement attention heads, self-attention, multi-head attention, and the SwiGLU activation function, and generate the output.

2024-05-27 Github

LLMs-from-scratch

38,600
@rasbt

TLDR: This repository contains code for developing, pretraining, and finetuning a GPT-like LLM. It is the official code repository for the book 'Build a Large Language Model (From Scratch)'. The code is designed to run on conventional laptops and automatically uses a GPU when one is available. It also includes bonus materials and has specific hardware requirements.

2023-07-23 Github

swarms

TLDR: The swarms repository provides an enterprise-grade, production-ready multi-agent orchestration framework. It offers a variety of agent architectures and tools for tasks such as financial analysis, healthcare diagnosis, and task routing. The framework is highly customizable and supports sequential and parallel processing, long-term memory integration, and multi-modal capabilities.

2023-05-11 Github

aider

TLDR: Aider is an AI pair programming tool that runs in your terminal and edits code in local git repositories. It works best with GPT-4o and Claude 3.5 Sonnet and can connect to almost any LLM. It commits changes to git automatically, supports multiple programming languages, and can edit several files at once. It has top-tier performance on SWE Bench and has received praise from users.

2023-05-09 Github