Latest Open Source Projects
opensource_notebooklm
opensource_notebooklm
TLDR: An open-source implementation of NotebookLM using Deepseek-V3 and PlayHT TTS. Features include AI-powered conversations, text-to-speech conversion, interactive podcast-style conversations, and educational content generation. Requires Python 3.x, FAL API key, and OpenRouter API key.
ai-engineering-hub
llm_engineering
RAG_Techniques
RAG_Techniques
TLDR: This repository is a comprehensive collection of advanced Retrieval-Augmented Generation (RAG) techniques. It includes various methods for enhancing RAG systems such as query enhancement, context enrichment, advanced retrieval methods, iterative and adaptive techniques, evaluation, explainability, and advanced architectures.
Building-llama3-from-scratch
Building-llama3-from-scratch
TLDR: This repository contains code to build the LLaMA 3 language model from scratch using Python. It explains the components of LLaMA 3 such as pre-normalization using RMSNorm, SwiGLU activation function, Rotary Embeddings (RoPE), and Byte Pair Encoding (BPE) Algorithm. The code shows how to tokenize input data, create embeddings for each token, implement attention heads, self-attention, multi-head attention, SwiGLU activation function, and generate the output.
multiagent-systems-with-autogen
multiagent-systems-with-autogen
TLDR: This repository contains code examples for building multi-agent applications powered by generative AI models based on the AutoGen framework. It is the official code repository for the book 'Multi-Agent Systems with AutoGen' and provides instructions for setting up Jupyter Notebooks and is organized into chapters with code for various concepts discussed in the book.
PraisonAI
PraisonAI
TLDR: PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simple tasks to complex challenges. It provides a low-code solution to streamline the building and management of multi-agent LLM systems, emphasising simplicity, customisation, and effective human-agent collaboration.
data-engineer-handbook
data-engineer-handbook
TLDR: This is a repo with links to everything you'd ever want to learn about data engineering
applied-ai-engineering-samples
applied-ai-engineering-samples
TLDR: This repository contains reference guides, blueprints, code samples, and hands-on labs related to Applied AI Engineering developed by the Google Cloud Applied AI Engineering team. It includes sections on Generative AI on Vertex AI, Google Cloud AI/ML infrastructure, Research Operationalization, and various solutions catalogs.
LLMs-from-scratch
LLMs-from-scratch
TLDR: This repository contains code for developing, pretraining, and finetuning a GPT-like LLM. It is the official code repository for the book 'Build a Large Language Model (From Scratch)'. The code is designed to run on conventional laptops and automatically utilizes GPUs if available. It also includes bonus materials and has specific hardware requirements.