Latest Open Source Projects
applied-ai-engineering-samples
TLDR: This repository contains reference guides, blueprints, code samples, and hands-on labs related to Applied AI Engineering developed by the Google Cloud Applied AI Engineering team. It includes sections on Generative AI on Vertex AI, Google Cloud AI/ML infrastructure, Research Operationalization, and various solutions catalogs.
FlagEmbedding
TLDR: FlagEmbedding focuses on retrieval-augmented LLMs and consists of multiple projects covering inference, fine-tuning, evaluation, datasets, tutorials, and research. It offers various embedding and reranker models for different languages and tasks.
ms-swift
TLDR: Use PEFT or full-parameter training to fine-tune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
LLMs-from-scratch
TLDR: This repository contains code for developing, pretraining, and fine-tuning a GPT-like LLM. It is the official code repository for the book 'Build a Large Language Model (From Scratch)'. The code is designed to run on conventional laptops without specialized hardware and automatically uses GPUs if available. It also includes bonus materials.
llm-app
TLDR: Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with SharePoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
generative-ai-for-beginners
TLDR: 21 lessons to get started building with Generative AI. 🔗 https://microsoft.github.io/generative-ai-for-beginners/
llm-course
TLDR: This repository contains a comprehensive course on large language models, divided into three parts: LLM Fundamentals, The LLM Scientist, and The LLM Engineer. It includes notebooks and articles related to various aspects of large language models such as fine-tuning, quantization, and building applications.
gpt-researcher
TLDR: GPT Researcher is an autonomous agent for comprehensive web and local research. It produces detailed research reports with citations, addresses misinformation and token limitations in LLMs, and offers features like smart image scraping, long report generation, and multiple source aggregation. It can be installed in several ways and includes a multi-agent assistant and enhanced frontend applications.
swarms
TLDR: The swarms repository provides an enterprise-grade, production-ready multi-agent orchestration framework. It offers a variety of agent architectures and tools for tasks such as financial analysis, healthcare diagnosis, and task routing. The framework is highly customizable and includes features like sequential and parallel processing, long-term memory integration, and multi-modal capabilities.
aider
TLDR: Aider is an AI pair programming tool that works in your terminal and edits code in local git repositories. It works best with GPT-4o and Claude 3.5 Sonnet and can connect to almost any LLM. It makes automatic git commits, works with multiple programming languages, and can edit multiple files at once. It reports top-tier performance on SWE Bench and has received kind words from users.