Latest Open Source Projects

applied-ai-engineering-samples

661

applied-ai-engineering-samples

TLDR: This repository contains reference guides, blueprints, code samples, and hands-on labs related to Applied AI Engineering developed by the Google Cloud Applied AI Engineering team. It includes sections on Generative AI on Vertex AI, Google Cloud AI/ML infrastructure, Research Operationalization, and various solutions catalogs.

generative-ai google-cloud-platform llms vertex-ai Jupyter Notebook

2023-08-25 Github

FlagEmbedding

8,300

@FlagOpen

FlagEmbedding

TLDR: FlagEmbedding focuses on retrieval-augmented LLMs and consists of multiple projects including inference, finetune, evaluation, dataset, tutorials, and research. It offers various embedding and reranker models for different languages and tasks.

embeddings information-retrieval llm retrieval-augmented-generation sentence-embeddings text-semantic-similarity Python

2023-08-02 Github

ms-swift

5,400

@modelscope

ms-swift

TLDR: Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).

Python

2023-08-01 Github

LLMs-from-scratch

38,600

@rasbt

LLMs-from-scratch

TLDR: This repository contains code for developing, pretraining, and finetuning a GPT-like LLM. It is the official code repository for the book 'Build a Large Language Model (From Scratch)'. The code is designed to run on conventional laptops and automatically utilizes GPUs if available. It also includes bonus materials and has specific hardware requirements.

chatgpt gpt large-language-models llm python pytorch Jupyter Notebook

2023-07-23 Github

llm-app

12,100

@pathwaycom

llm-app

TLDR: Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

AI Pipelines LLM App Templates RAG Jupyter Notebook

2023-07-19 Github

generative-ai-for-beginners

68,500

@microsoft

generative-ai-for-beginners

TLDR: 21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Application Building Beginner Learning Generative AI Jupyter Notebook

2023-06-19 Github

llm-course

44,500

@mlabonne

llm-course

TLDR: This repository contains a comprehensive course on large language models, divided into three parts: LLM Fundamentals, The LLM Scientist, and The LLM Engineer. It includes notebooks and articles related to various aspects of large language models such as fine-tuning, quantization, and building applications.

course large-language-models llm machine-learning roadmap Jupyter Notebook

2023-06-17 Github

gpt-researcher

15,900

@assafelovic

gpt-researcher

TLDR: GPT Researcher is an autonomous agent for comprehensive web and local research. It produces detailed research reports with citations, addresses issues of misinformation and token limitations in LLMs, and offers features like smart image scraping, long report generation, and multiple source aggregation. It can be installed via various methods and has a multi-agent assistant and enhanced frontend applications.

agent ai automation llms openai python research search webscraping Python

2023-05-12 Github

swarms

4,200

@kyegomez

swarms

TLDR: The swarms repository provides an enterprise-grade production-ready multi-agent orchestration framework. It offers a variety of agent architectures and tools for tasks such as financial analysis, healthcare diagnosis, and task routing. The framework is highly customizable and includes features like sequential and parallel processing, long-term memory integration, and multi-modal capabilities.

agents ai artificial-intelligence attention-mechanism chatgpt gpt4 gpt4all huggingface langchain langchain-python machine-learning multi-modal-imaging multi-modality multimodal prompt-engineering prompt-toolkit prompting swarms transformer-models tree-of-thoughts Python

2023-05-11 Github

aider

25,600

@Aider-AI

aider

TLDR: Aider is an AI pair programming tool that works in your terminal and edits code in local git repositories. It works best with GPT-4o and Claude 3.5 Sonnet and can connect to almost any LLM. It offers features like automatic git commits, works with multiple languages, and can edit multiple files at once. It has top tier performance on SWE Bench and received kind words from users.

anthropic chatgpt claude-3 cli command-line gemini gpt-3 gpt-35-turbo gpt-4 gpt-4o llama openai sonnet Python

2023-05-09 Github